Skip to main content

Showing 1–13 of 13 results for author: Hayder, Z

  1. arXiv:2405.15439  [pdf, other

    cs.CV cs.AI

    Text-guided 3D Human Motion Generation with Keyframe-based Parallel Skip Transformer

    Authors: Zichen Geng, Caren Han, Zeeshan Hayder, Jian Liu, Mubarak Shah, Ajmal Mian

    Abstract: Text-driven human motion generation is an emerging task in animation and humanoid robot design. Existing algorithms directly generate the full sequence which is computationally expensive and prone to errors as it does not pay special attention to key poses, a process that has been the cornerstone of animation for decades. We propose KeyMotion, that generates plausible human motion sequences corres… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2404.11256  [pdf, other

    cs.CV

    MMCBE: Multi-modality Dataset for Crop Biomass Estimation and Beyond

    Authors: Xuesong Li, Zeeshan Hayder, Ali Zia, Connor Cassidy, Shiming Liu, Warwick Stiller, Eric Stone, Warren Conaty, Lars Petersson, Vivien Rolland

    Abstract: Crop biomass, a critical indicator of plant growth, health, and productivity, is invaluable for crop breeding programs and agronomic research. However, the accurate and scalable quantification of crop biomass remains inaccessible due to limitations in existing measurement methods. One of the obstacles impeding the advancement of current crop biomass prediction methodologies is the scarcity of publ… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 10 pages, 10 figures, 3 tables

  3. arXiv:2403.18442  [pdf, other

    cs.CV

    Backpropagation-free Network for 3D Test-time Adaptation

    Authors: Yanshuo Wang, Ali Cheraghian, Zeeshan Hayder, Jie Hong, Sameera Ramasinghe, Shafin Rahman, David Ahmedt-Aristizabal, Xuesong Li, Lars Petersson, Mehrtash Harandi

    Abstract: Real-world systems often encounter new data over time, which leads to experiencing target domain shifts. Existing Test-Time Adaptation (TTA) methods tend to apply computationally heavy and memory-intensive backpropagation-based approaches to handle this. Here, we propose a novel method that uses a backpropagation-free approach for TTA for the specific case of 3D data. Our model uses a two-stream a… ▽ More

    Submitted 24 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  4. arXiv:2403.14886  [pdf, other

    cs.CV

    DSGG: Dense Relation Transformer for an End-to-end Scene Graph Generation

    Authors: Zeeshan Hayder, Xuming He

    Abstract: Scene graph generation aims to capture detailed spatial and semantic relationships between objects in an image, which is challenging due to incomplete labelling, long-tailed relationship categories, and relational semantic overlap. Existing Transformer-based methods either employ distinct queries for objects and predicates or utilize holistic queries for relation triplets and hence often suffer fr… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  5. arXiv:2403.14235  [pdf, other

    astro-ph.GA astro-ph.CO astro-ph.IM cs.CV cs.LG

    RG-CAT: Detection Pipeline and Catalogue of Radio Galaxies in the EMU Pilot Survey

    Authors: Nikhel Gupta, Ray P. Norris, Zeeshan Hayder, Minh Huynh, Lars Petersson, X. Rosalind Wang, Andrew M. Hopkins, Heinz Andernach, Yjan Gordon, Simone Riggi, Miranda Yew, Evan J. Crawford, Bärbel Koribalski, Miroslav D. Filipović, Anna D. Kapinśka, Stanislav Shabala, Tessa Vernstrom, Joshua R. Marvil

    Abstract: We present source detection and catalogue construction pipelines to build the first catalogue of radio galaxies from the 270 $\rm deg^2$ pilot survey of the Evolutionary Map of the Universe (EMU-PS) conducted with the Australian Square Kilometre Array Pathfinder (ASKAP) telescope. The detection pipeline uses Gal-DINO computer-vision networks (Gupta et al., 2024) to predict the categories of radio… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted for publication in PASA. The paper has 22 pages, 12 figures and 5 tables

  6. arXiv:2312.10930  [pdf, other

    cs.CV

    Deep Learning Approaches for Seizure Video Analysis: A Review

    Authors: David Ahmedt-Aristizabal, Mohammad Ali Armin, Zeeshan Hayder, Norberto Garcia-Cairasco, Lars Petersson, Clinton Fookes, Simon Denman, Aileen McGonigal

    Abstract: Seizure events can manifest as transient disruptions in the control of movements which may be organized in distinct behavioral sequences, accompanied or not by other observable features such as altered facial expressions. The analysis of these clinical signs, referred to as semiology, is subject to observer variations when specialists evaluate video-recorded events in the clinical setting. To enha… ▽ More

    Submitted 4 March, 2024; v1 submitted 17 December, 2023; originally announced December 2023.

    Comments: Accepted in Epilepsy & Behavior

  7. arXiv:2312.06728  [pdf, other

    cs.CV astro-ph.CO astro-ph.GA astro-ph.IM

    A Multimodal Dataset and Benchmark for Radio Galaxy and Infrared Host Detection

    Authors: Nikhel Gupta, Zeeshan Hayder, Ray P. Norris, Minh Hyunh, Lars Petersson

    Abstract: We present a novel multimodal dataset developed by expert astronomers to automate the detection and localisation of multi-component extended radio galaxies and their corresponding infrared hosts. The dataset comprises 4,155 instances of galaxies in 2,800 images with both radio and infrared modalities. Each instance contains information on the extended radio galaxy class, its corresponding bounding… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted in NeurIPS 2023 conference ML4PS workshop (https://nips.cc/). The full version accepted in PASA, is available at https://doi.org/10.1017/pasa.2023.64

  8. arXiv:2312.00306  [pdf, other

    astro-ph.IM astro-ph.CO astro-ph.GA cs.CV

    RadioGalaxyNET: Dataset and Novel Computer Vision Algorithms for the Detection of Extended Radio Galaxies and Infrared Hosts

    Authors: Nikhel Gupta, Zeeshan Hayder, Ray P. Norris, Minh Huynh, Lars Petersson

    Abstract: Creating radio galaxy catalogues from next-generation deep surveys requires automated identification of associated components of extended sources and their corresponding infrared hosts. In this paper, we introduce RadioGalaxyNET, a multimodal dataset, and a suite of novel computer vision algorithms designed to automate the detection and localization of multi-component extended radio galaxies and t… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

    Comments: Accepted for publication in PASA. The paper has 17 pages, 6 figures, 5 tables

  9. arXiv:2308.12558  [pdf, other

    cs.CV

    Hyperbolic Audio-visual Zero-shot Learning

    Authors: Jie Hong, Zeeshan Hayder, Junlin Han, Pengfei Fang, Mehrtash Harandi, Lars Petersson

    Abstract: Audio-visual zero-shot learning aims to classify samples consisting of a pair of corresponding audio and video sequences from classes that are not present during training. An analysis of the audio-visual data reveals a large degree of hyperbolicity, indicating the potential benefit of using a hyperbolic transformation to achieve curvature-aware geometric learning, with the aim of exploring more co… ▽ More

    Submitted 16 December, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: ICCV 2023

  10. arXiv:2308.05166  [pdf, other

    astro-ph.IM astro-ph.CO astro-ph.GA cs.CV cs.LG

    Deep Learning for Morphological Identification of Extended Radio Galaxies using Weak Labels

    Authors: Nikhel Gupta, Zeeshan Hayder, Ray P. Norris, Minh Huynh, Lars Petersson, X. Rosalind Wang, Heinz Andernach, Bärbel S. Koribalski, Miranda Yew, Evan J. Crawford

    Abstract: The present work discusses the use of a weakly-supervised deep learning algorithm that reduces the cost of labelling pixel-level masks for complex radio galaxies with multiple components. The algorithm is trained on weak class-level labels of radio galaxies to get class activation maps (CAMs). The CAMs are further refined using an inter-pixel relations network (IRNet) to get instance segmentation… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 14 pages, 6 figues, accepted for publication in PASA

  11. Topological Deep Learning: A Review of an Emerging Paradigm

    Authors: Ali Zia, Abdelwahed Khamis, James Nichols, Zeeshan Hayder, Vivien Rolland, Lars Petersson

    Abstract: Topological data analysis (TDA) provides insight into data shape. The summaries obtained by these methods are principled global descriptions of multi-dimensional data whilst exhibiting stable properties such as robustness to deformation and noise. Such properties are desirable in deep learning pipelines but they are typically obtained using non-TDA strategies. This is partly caused by the difficul… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

    Comments: 7 pages and 2 references

  12. arXiv:1709.07322  [pdf, other

    cs.CV

    Playing for Benchmarks

    Authors: Stephan R. Richter, Zeeshan Hayder, Vladlen Koltun

    Abstract: We present a benchmark suite for visual perception. The benchmark is based on more than 250K high-resolution video frames, all annotated with ground-truth data for both low-level and high-level vision tasks, including optical flow, semantic instance segmentation, object detection and tracking, object-level 3D scene layout, and visual odometry. Ground-truth data for all tasks is available for every… ▽ More

    Submitted 21 September, 2017; originally announced September 2017.

    Comments: Published at the International Conference on Computer Vision (ICCV 2017)

    ACM Class: I.4.8

  13. arXiv:1612.03129  [pdf, other

    cs.CV

    Boundary-aware Instance Segmentation

    Authors: Zeeshan Hayder, Xuming He, Mathieu Salzmann

    Abstract: We address the problem of instance-level semantic segmentation, which aims at jointly detecting, segmenting and classifying every individual object in an image. In this context, existing methods typically propose candidate objects, usually as bounding boxes, and directly predict a binary mask within each such proposal. As a consequence, they cannot recover from errors in the object candidate gener… ▽ More

    Submitted 6 April, 2017; v1 submitted 9 December, 2016; originally announced December 2016.