Showing 1–15 of 15 results for author: Pei, G

  1. arXiv:2407.03178  [pdf, other]

    cs.MM cs.CV cs.LG

    Relating CNN-Transformer Fusion Network for Change Detection

    Authors: Yuhao Gao, Gensheng Pei, Mengmeng Sheng, Zeren Sun, Tao Chen, Yazhou Yao

    Abstract: While deep learning, particularly convolutional neural networks (CNNs), has revolutionized remote sensing (RS) change detection (CD), existing approaches often miss crucial features due to neglecting global context and incomplete change learning. Additionally, transformer networks struggle with low-level details. RCTNet addresses these limitations by introducing (1) an early fusion backbo…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: accepted by IEEE Conference on Multimedia Expo

  2. arXiv:2407.02768  [pdf, other]

    cs.CV

    Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation

    Authors: Tao Chen, XiRuo Jiang, Gensheng Pei, Zeren Sun, Yucheng Wang, Yazhou Yao

    Abstract: Though adversarial erasing has prevailed in weakly supervised semantic segmentation to help activate integral object regions, existing approaches still suffer from the dilemma of under-activation and over-expansion due to the difficulty in determining when to stop erasing. In this paper, we propose a Knowledge Transfer with Simulated Inter-Image Erasing (KTSE) a…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: accepted by the European Conference on Computer Vision (ECCV), 2024

  3. arXiv:2405.11742  [pdf, other]

    cs.MM

    Universal Organizer of SAM for Unsupervised Semantic Segmentation

    Authors: Tingting Li, Gensheng Pei, Xinhao Cai, Huafeng Liu, Qiong Wang, Yazhou Yao

    Abstract: Unsupervised semantic segmentation (USS) aims to achieve high-quality segmentation without manual pixel-level annotations. Existing USS models provide coarse category classification for regions, but the results often have blurry and imprecise edges. Recently, a robust framework called the segment anything model (SAM) has been proven to deliver precise boundary object masks. Therefore, this paper p…

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: accepted by IEEE International Conference on Multimedia & Expo

  4. arXiv:2405.05990  [pdf, other]

    cs.CR cs.AI cs.CL cs.LG

    Special Characters Attack: Toward Scalable Training Data Extraction From Large Language Models

    Authors: Yang Bai, Ge Pei, Jindong Gu, Yong Yang, Xingjun Ma

    Abstract: Large language models (LLMs) have achieved remarkable performance on a wide range of tasks. However, recent studies have shown that LLMs can memorize training data and simple repeated tokens can trick the model to leak the data. In this paper, we take a step further and show that certain special characters or their combinations with English letters are stronger memory triggers, leading to more sev…

    Submitted 20 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  5. arXiv:2404.19311  [pdf, other]

    cs.CV cs.MM

    A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images

    Authors: Wang Zhang, Tingting Li, Yuntian Zhang, Gensheng Pei, Xiruo Jiang, Yazhou Yao

    Abstract: Matching visible and near-infrared (NIR) images remains a significant challenge in remote sensing image fusion. The nonlinear radiometric differences between heterogeneous remote sensing images make the image matching task even more difficult. Deep learning has gained substantial attention in computer vision tasks in recent years. However, many methods rely on supervised learning and necessitate l…

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: accepted by Information Fusion

  6. arXiv:2404.13505  [pdf, other]

    cs.CV

    Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised Video Object Segmentation

    Authors: Gensheng Pei, Yazhou Yao, Jianbo Jiao, Wenguan Wang, Liqiang Nie, Jinhui Tang

    Abstract: Conventional video object segmentation (VOS) methods usually necessitate a substantial volume of pixel-level annotated video data for fully supervised learning. In this paper, we present HVC, a hybrid static-dynamic visual correspondence framework for self-supervised VOS. HVC extracts pseudo-dynamic signals from static images, enabling an efficient and scalable VOS model…

    Submitted 20 April, 2024; originally announced April 2024.

  7. arXiv:2403.17881  [pdf, other]

    cs.CV

    Deepfake Generation and Detection: A Benchmark and Survey

    Authors: Gan Pei, Jiangning Zhang, Menghan Hu, Zhenyu Zhang, Chengjie Wang, Yunsheng Wu, Guangtao Zhai, Jian Yang, Chunhua Shen, Dacheng Tao

    Abstract: Deepfake is a technology dedicated to creating highly realistic facial images and videos under specific conditions, which has significant application potential in fields such as entertainment, movie production, digital human creation, to name a few. With the advancements in deep learning, techniques primarily represented by Variational Autoencoders and Generative Adversarial Networks have achieved…

    Submitted 16 May, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: We closely follow the latest developments in https://github.com/flyingby/Awesome-Deepfake-Generation-and-Detection

  8. arXiv:2402.19082  [pdf, other]

    cs.CV

    VideoMAC: Video Masked Autoencoders Meet ConvNets

    Authors: Gensheng Pei, Tao Chen, Xiruo Jiang, Huafeng Liu, Zeren Sun, Yazhou Yao

    Abstract: Recently, the advancement of self-supervised learning techniques, like masked autoencoders (MAE), has greatly influenced visual representation learning for images and videos. Nevertheless, it is worth noting that the predominant approaches in existing masked image / video modeling rely excessively on resource-intensive vision transformers (ViTs) as the feature encoder. In this paper, we propose a…

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: accepted by IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  9. arXiv:2312.09525  [pdf, other]

    cs.CV

    Hierarchical Graph Pattern Understanding for Zero-Shot VOS

    Authors: Gensheng Pei, Fumin Shen, Yazhou Yao, Tao Chen, Xian-Sheng Hua, Heng-Tao Shen

    Abstract: The optical flow guidance strategy is ideal for obtaining motion information of objects in the video. It is widely utilized in video segmentation tasks. However, existing optical flow-based methods have a significant dependency on optical flow, which results in poor performance when the optical flow estimation fails for a particular scene. The temporal consistency provided by the optical flow coul…

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: accepted by IEEE Transactions on Image Processing

    Journal ref: IEEE Transactions on Image Processing 2023

  10. Co-attention Propagation Network for Zero-Shot Video Object Segmentation

    Authors: Gensheng Pei, Yazhou Yao, Fumin Shen, Dan Huang, Xingguo Huang, Heng-Tao Shen

    Abstract: Zero-shot video object segmentation (ZS-VOS) aims to segment foreground objects in a video sequence without prior knowledge of these objects. However, existing ZS-VOS methods often struggle to distinguish between foreground and background or to keep track of the foreground in complex scenarios. The common practice of introducing motion information, such as optical flow, can lead to overreliance on…

    Submitted 8 April, 2023; originally announced April 2023.

    Comments: accepted by IEEE Transactions on Image Processing

  11. arXiv:2207.08485  [pdf, other]

    cs.CV

    Hierarchical Feature Alignment Network for Unsupervised Video Object Segmentation

    Authors: Gensheng Pei, Fumin Shen, Yazhou Yao, Guo-Sen Xie, Zhenmin Tang, Jinhui Tang

    Abstract: Optical flow is an easily conceived and precious cue for advancing unsupervised video object segmentation (UVOS). Most of the previous methods directly extract and fuse the motion and appearance features for segmenting target objects in the UVOS setting. However, optical flow is intrinsically an instantaneous velocity of all pixels among consecutive frames, thus making the motion features not alig…

    Submitted 19 July, 2022; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: Accepted by ECCV-2022

  12. arXiv:2201.00628  [pdf]

    eess.SP cs.AI

    An EEG-based approach for Parkinson's disease diagnosis using Capsule network

    Authors: Shujie Wang, Gongshu Wang, Guangying Pei

    Abstract: As the second most common neurodegenerative disease, Parkinson's disease has caused serious problems worldwide. However, the cause and mechanism of PD are not clear, and no systematic early diagnosis and treatment of PD have been established. Many patients with PD have not been diagnosed or misdiagnosed. In this paper, we proposed an EEG-based approach to diagnosing Parkinson's disease. It mapped…

    Submitted 11 January, 2022; v1 submitted 27 December, 2021; originally announced January 2022.

    Comments: 6 pages, 2 images, 3 tables

    ACM Class: I.2.8

  13. arXiv:2111.02040  [pdf, other]

    cs.IR

    Three-dimensional Cooperative Localization of Commercial-Off-The-Shelf Sensors

    Authors: Yulong Wang, Shenghong Li, Wei Ni, David Abbott, Mark Johnson, Guangyu Pei, Mark Hedley

    Abstract: Many location-based services use Received Signal Strength (RSS) measurements due to their universal availability. In this paper, we study the association of a large number of low-cost Internet-of-Things (IoT) sensors and their possible installation locations, which can enable various sensing and automation-related applications. We propose an efficient approach to solve the corresponding permutatio…

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: 10 pages, 12 figures

    ACM Class: I.m; I.6.5

  14. arXiv:1208.0811  [pdf, ps, other]

    cs.DC cs.DS

    Efficient Algorithms for Maximum Link Scheduling in Distributed Computing Models with SINR Constraints

    Authors: Guanhong Pei, Anil Kumar S. Vullikanti

    Abstract: A fundamental problem in wireless networks is the maximum link scheduling problem: given a set $L$ of links, compute the largest possible subset $L'\subseteq L$ of links that can be scheduled simultaneously without interference. This problem is particularly challenging in the physical interference model based on SINR constraints (referred to as the SINR model), which has gained a lot of interest i…

    Submitted 16 November, 2012; v1 submitted 3 August, 2012; originally announced August 2012.

  15. arXiv:1206.1113  [pdf, ps, other]

    cs.DC cs.DS

    A Fast Distributed Approximation Algorithm for Minimum Spanning Trees in the SINR Model

    Authors: Maleq Khan, V. S. Anil Kumar, Gopal Pandurangan, Guanhong Pei

    Abstract: A fundamental problem in wireless networks is the minimum spanning tree (MST) problem: given a set $V$ of wireless nodes, compute a spanning tree $T$, so that the total cost of $T$ is minimized. In recent years, there has been a lot of interest in the physical interference model based on SINR constraints. Distributed algorithms are especially challenging in the SINR model, because of the no…

    Submitted 5 June, 2012; originally announced June 2012.

    ACM Class: C.2.4; F.2.2