Skip to main content

Showing 1–50 of 144 results for author: He, F

  1. arXiv:2407.03953  [pdf, other

    cs.LG cs.SI

    Generalizing Graph Transformers Across Diverse Graphs and Tasks via Pre-Training on Industrial-Scale Data

    Authors: Yufei He, Zhenyu Hou, Yukuo Cen, Feng He, Xu Cheng, Bryan Hooi

    Abstract: Graph pre-training has been concentrated on graph-level on small graphs (e.g., molecular graphs) or learning node representations on a fixed graph. Extending graph pre-trained models to web-scale graphs with billions of nodes in industrial scenarios, while avoiding negative transfer across graphs or tasks, remains a challenge. We aim to develop a general graph pre-trained model with inductive abil… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Work in progress

  2. arXiv:2407.03361  [pdf, ps, other

    cs.SD cs.AI eess.AS

    PianoBART: Symbolic Piano Music Generation and Understanding with Large-Scale Pre-Training

    Authors: Xiao Liang, Zijian Zhao, Weichao Zeng, Yutong He, Fupeng He, Yiyi Wang, Chengying Gao

    Abstract: Learning musical structures and composition patterns is necessary for both music generation and understanding, but current methods do not make uniform use of learned features to generate and comprehend music simultaneously. In this paper, we propose PianoBART, a pre-trained model that uses BART for both symbolic piano music generation and understanding. We devise a multi-level object selection str… ▽ More

    Submitted 25 June, 2024; originally announced July 2024.

  3. arXiv:2406.18159  [pdf, other

    cs.CV cs.GR

    Human-Aware 3D Scene Generation with Spatially-constrained Diffusion Models

    Authors: Xiaolin Hong, Hongwei Yi, Fazhi He, Qiong Cao

    Abstract: Generating 3D scenes from human motion sequences supports numerous applications, including virtual reality and architectural design. However, previous auto-regression-based human-aware 3D scene generation methods have struggled to accurately capture the joint distribution of multiple objects and input humans, often resulting in overlapping object generation in the same space. To address this limit… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  4. arXiv:2406.04941  [pdf, ps, other

    cs.CL

    TCMD: A Traditional Chinese Medicine QA Dataset for Evaluating Large Language Models

    Authors: Ping Yu, Kaitao Song, Fengchen He, Ming Chen, Jianfeng Lu

    Abstract: The recently unprecedented advancements in Large Language Models (LLMs) have propelled the medical community by establishing advanced medical-domain models. However, due to the limited collection of medical datasets, there are only a few comprehensive benchmarks available to gauge progress in this area. In this paper, we introduce a new medical question-answering (QA) dataset that contains massive… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  5. arXiv:2406.01435  [pdf, other

    cs.LG stat.ML

    Learning Analysis of Kernel Ridgeless Regression with Asymmetric Kernel Learning

    Authors: Fan He, Mingzhen He, Lei Shi, Xiaolin Huang, Johan A. K. Suykens

    Abstract: Ridgeless regression has garnered attention among researchers, particularly in light of the ``Benign Overfitting'' phenomenon, where models interpolating noisy samples demonstrate robust generalization. However, kernel ridgeless regression does not always perform well due to the lack of flexibility. This paper enhances kernel ridgeless regression with Locally-Adaptive-Bandwidths (LAB) RBF kernels,… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.05236

  6. arXiv:2405.07800  [pdf, other

    cs.LG

    Data Imputation by Pursuing Better Classification: A Supervised Kernel-Based Method

    Authors: Ruikai Yang, Fan He, Mingzhen He, Kaijie Wang, Xiaolin Huang

    Abstract: Data imputation, the process of filling in missing feature elements for incomplete data sets, plays a crucial role in data-driven learning. A fundamental belief is that data imputation is helpful for learning performance, and it follows that the pursuit of better classification can guide the data imputation process. While some works consider using label information to assist in this task, their si… ▽ More

    Submitted 9 July, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  7. arXiv:2405.07791  [pdf, ps, other

    cs.LG cs.DC stat.ML

    Decentralized Kernel Ridge Regression Based on Data-Dependent Random Feature

    Authors: Ruikai Yang, Fan He, Mingzhen He, Jie Yang, Xiaolin Huang

    Abstract: Random feature (RF) has been widely used for node consistency in decentralized kernel ridge regression (KRR). Currently, the consistency is guaranteed by imposing constraints on coefficients of features, necessitating that the random features on different nodes are identical. However, in many applications, data on different nodes varies significantly on the number or distribution, which calls for… ▽ More

    Submitted 5 July, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  8. arXiv:2403.10980  [pdf, other

    cs.GT eess.SY math.OC

    Inverse learning of black-box aggregator for robust Nash equilibrium

    Authors: Guanpu Chen, Gehui Xu, Fengxiang He, Dacheng Tao, Thomas Parisini, Karl Henrik Johansson

    Abstract: In this note, we investigate the robustness of Nash equilibria (NE) in multi-player aggregative games with coupling constraints. There are many algorithms for computing an NE of an aggregative game given a known aggregator. When the coupling parameters are affected by uncertainty, robust NE need to be computed. We consider a scenario where players' weight in the aggregator is unknown, making the a… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  9. arXiv:2402.18331  [pdf, other

    cs.CV

    FineDiffusion: Scaling up Diffusion Models for Fine-grained Image Generation with 10,000 Classes

    Authors: Ziying Pan, Kun Wang, Gang Li, Feihong He, Yongxuan Lai

    Abstract: The class-conditional image generation based on diffusion models is renowned for generating high-quality and diverse images. However, most prior efforts focus on generating images for general categories, e.g., 1000 classes in ImageNet-1k. A more challenging task, large-scale fine-grained image generation, remains the boundary to explore. In this work, we present a parameter-efficient strategy, cal… ▽ More

    Submitted 3 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  10. arXiv:2402.13815  [pdf, other

    cs.SE cs.CR

    An Empirical Study on Oculus Virtual Reality Applications: Security and Privacy Perspectives

    Authors: Hanyang Guo, Hong-Ning Dai, Xiapu Luo, Zibin Zheng, Gengyang Xu, Fengliang He

    Abstract: Although Virtual Reality (VR) has accelerated its prevalent adoption in emerging metaverse applications, it is not a fundamentally new technology. On one hand, most VR operating systems (OS) are based on off-the-shelf mobile OS. As a result, VR apps also inherit privacy and security deficiencies from conventional mobile apps. On the other hand, in contrast to conventional mobile apps, VR apps can… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted by ICSE 2024

  11. arXiv:2402.13523  [pdf, other

    eess.SP cs.LG q-bio.NC

    Balancing Spectral, Temporal and Spatial Information for EEG-based Alzheimer's Disease Classification

    Authors: Stephan Goerttler, Fei He, Min Wu

    Abstract: The prospect of future treatment warrants the development of cost-effective screening for Alzheimer's disease (AD). A promising candidate in this regard is electroencephalography (EEG), as it is one of the most economic imaging modalities. Recent efforts in EEG analysis have shifted towards leveraging spatial information, employing novel frameworks such as graph signal processing or graph neural n… ▽ More

    Submitted 30 April, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: 4 pages, 3 figures, conference paper

  12. arXiv:2401.15636  [pdf, other

    cs.CV eess.IV

    FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models

    Authors: Feihong He, Gang Li, Mengyuan Zhang, Leilei Yan, Lingyu Si, Fanzhang Li, Li Shen

    Abstract: The rapid development of generative diffusion models has significantly advanced the field of style transfer. However, most current style transfer methods based on diffusion models typically involve a slow iterative optimization process, e.g., model fine-tuning and textual inversion of style concept. In this paper, we introduce FreeStyle, an innovative style transfer method built upon a pre-trained… ▽ More

    Submitted 18 July, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

  13. arXiv:2401.10529  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences

    Authors: Xiyao Wang, Yuhang Zhou, Xiaoyu Liu, Hongjin Lu, Yuancheng Xu, Feihong He, Jaehong Yoon, Taixi Lu, Gedas Bertasius, Mohit Bansal, Huaxiu Yao, Furong Huang

    Abstract: Multimodal Large Language Models (MLLMs) have demonstrated proficiency in handling a variety of visual-language tasks. However, current MLLM benchmarks are predominantly designed to evaluate reasoning based on static information about a single image, and the ability of modern MLLMs to extrapolate from image sequences, which is essential for understanding our ever-changing world, has been less inve… ▽ More

    Submitted 24 January, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: 27 pages, 23 figures

  14. arXiv:2401.01656  [pdf, other

    cs.GT cs.AI

    Deep Automated Mechanism Design for Integrating Ad Auction and Allocation in Feed

    Authors: Xuejian Li, Ze Wang, Bingqi Zhu, Fei He, Yongkang Wang, Xingxing Wang

    Abstract: E-commerce platforms usually present an ordered list, mixed with several organic items and an advertisement, in response to each user's page view request. This list, the outcome of ad auction and allocation processes, directly impacts the platform's ad revenue and gross merchandise volume (GMV). Specifically, the ad auction determines which ad is displayed and the corresponding payment, while the… ▽ More

    Submitted 11 April, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

    Comments: 9 pages, 2 figures, Posting

  15. arXiv:2312.17624  [pdf, other

    cs.LG cs.AI

    XAI for In-hospital Mortality Prediction via Multimodal ICU Data

    Authors: Xingqiao Li, Jindong Gu, Zhiyong Wang, Yancheng Yuan, Bo Du, Fengxiang He

    Abstract: Predicting in-hospital mortality for intensive care unit (ICU) patients is key to final clinical outcomes. AI has shown advantaged accuracy but suffers from the lack of explainability. To address this issue, this paper proposes an eXplainable Multimodal Mortality Predictor (X-MMP) approaching an efficient, explainable AI solution for predicting in-hospital mortality via multimodal ICU data. We emp… ▽ More

    Submitted 29 December, 2023; originally announced December 2023.

  16. arXiv:2312.04377  [pdf, other

    cs.IT eess.SP

    HARQ-IR Aided Short Packet Communications: BLER Analysis and Throughput Maximization

    Authors: Fuchao He, Zheng Shi, Guanghua Yang, Xiaofan Li, Xinrong Ye, Shaodan Ma

    Abstract: This paper introduces hybrid automatic repeat request with incremental redundancy (HARQ-IR) to boost the reliability of short packet communications. The finite blocklength information theory and correlated decoding events tremendously preclude the analysis of average block error rate (BLER). Fortunately, the recursive form of average BLER motivates us to calculate its value through the trapezoidal… ▽ More

    Submitted 9 January, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: 13 pages, 10 figures

  17. arXiv:2311.16140  [pdf

    cs.CV cs.AI cs.LG

    Adapting Segment Anything Model (SAM) through Prompt-based Learning for Enhanced Protein Identification in Cryo-EM Micrographs

    Authors: Fei He, Zhiyuan Yang, Mingyue Gao, Biplab Poudel, Newgin Sam Ebin Sam Dhas, Rajan Gyawali, Ashwin Dhakal, Jianlin Cheng, Dong Xu

    Abstract: Cryo-electron microscopy (cryo-EM) remains pivotal in structural biology, yet the task of protein particle picking, integral for 3D protein structure construction, is laden with manual inefficiencies. While recent AI tools such as Topaz and crYOLO are advancing the field, they do not fully address the challenges of cryo-EM images, including low contrast, complex shapes, and heterogeneous conformat… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

  18. arXiv:2310.19654  [pdf, other

    cs.CV cs.AI

    MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient image-text retrieval

    Authors: Youbo Lei, Feifei He, Chen Chen, Yingbin Mo, Si Jia Li, Defeng Xie, Haonan Lu

    Abstract: Due to the success of large-scale visual-language pretraining (VLP) models and the widespread use of image-text retrieval in industry areas, it is now critically necessary to reduce the model size and streamline their mobile-device deployment. Single- and dual-stream model structures are commonly used in image-text retrieval with the goal of closing the semantic gap between textual and visual moda… ▽ More

    Submitted 1 April, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted by NAACL 2024 Findings

  19. arXiv:2310.09071  [pdf, other

    cs.LG eess.SY

    Online Relocating and Matching of Ride-Hailing Services: A Model-Based Modular Approach

    Authors: Chang Gao, Xi Lin, Fang He, Xindi Tang

    Abstract: This study proposes an innovative model-based modular approach (MMA) to dynamically optimize order matching and vehicle relocation in a ride-hailing platform. MMA utilizes a two-layer and modular modeling structure. The upper layer determines the spatial transfer patterns of vehicle flow within the system to maximize the total revenue of the current and future stages. With the guidance provided by… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  20. arXiv:2310.05236  [pdf, other

    cs.LG

    Enhancing Kernel Flexibility via Learning Asymmetric Locally-Adaptive Kernels

    Authors: Fan He, Mingzhen He, Lei Shi, Xiaolin Huang, Johan A. K. Suykens

    Abstract: The lack of sufficient flexibility is the key bottleneck of kernel-based learning that relies on manually designed, pre-given, and non-trainable kernels. To enhance kernel flexibility, this paper introduces the concept of Locally-Adaptive-Bandwidths (LAB) as trainable parameters to enhance the Radial Basis Function (RBF) kernel, giving rise to the LAB RBF kernel. The parameters in LAB RBF kernels… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

  21. arXiv:2310.03517  [pdf, other

    cs.CV

    PrototypeFormer: Learning to Explore Prototype Relationships for Few-shot Image Classification

    Authors: Feihong He, Gang Li, Lingyu Si, Leilei Yan, Fanzhang Li, Fuchun Sun

    Abstract: Few-shot image classification has received considerable attention for addressing the challenge of poor classification performance with limited samples in novel classes. However, numerous studies have employed sophisticated learning strategies and diversified feature extraction methods to address this issue. In this paper, we propose our method called PrototypeFormer, which aims to significantly ad… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: Submitted to AAAI2024

  22. arXiv:2310.02638  [pdf, other

    cs.CV

    P2CADNet: An End-to-End Reconstruction Network for Parametric 3D CAD Model from Point Clouds

    Authors: Zhihao Zong, Fazhi He, Rubin Fan, Yuxin Liu

    Abstract: Computer Aided Design (CAD), especially the feature-based parametric CAD, plays an important role in modern industry and society. However, the reconstruction of featured CAD model is more challenging than the reconstruction of other CAD models. To this end, this paper proposes an end-to-end network to reconstruct featured CAD model from point cloud (P2CADNet). Initially, the proposed P2CADNet arch… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  23. arXiv:2310.02152  [pdf, other

    q-bio.NC cs.LG q-bio.QM

    Graph Neural Network-based EEG Classification: A Survey

    Authors: Dominik Klepl, Min Wu, Fei He

    Abstract: Graph neural networks (GNN) are increasingly used to classify EEG for tasks such as emotion recognition, motor imagery and neurological diseases and disorders. A wide range of methods have been proposed to design GNN-based classifiers. Therefore, there is a need for a systematic review and categorisation of these approaches. We exhaustively search the published literature on this topic and derive… ▽ More

    Submitted 20 December, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: 14 pages, 3 figures

  24. arXiv:2309.14834  [pdf, other

    cs.LO

    Leveraging Datapath Propagation in IC3 for Hardware Model Checking

    Authors: Hongyu Fan, Fei He

    Abstract: IC3 is a famous bit-level framework for safety verification. By incorporating datapath abstraction, a notable enhancement in the efficiency of hardware verification can be achieved. However, datapath abstraction entails a coarse level of abstraction where all datapath operations are approximated as uninterpreted functions. This level of abstraction, albeit useful, can lead to an increased computat… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

  25. arXiv:2309.08375  [pdf, other

    cs.LG cs.CY

    Boosting Fair Classifier Generalization through Adaptive Priority Reweighing

    Authors: Zhihao Hu, Yiran Xu, Mengnan Du, Jindong Gu, Xinmei Tian, Fengxiang He

    Abstract: With the increasing penetration of machine learning applications in critical decision-making areas, calls for algorithmic fairness are more prominent. Although there have been various modalities to improve algorithmic fairness through learning with fairness constraints, their performance does not generalize well in the test set. A performance-promising fair algorithm with better generalizability i… ▽ More

    Submitted 20 May, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

  26. arXiv:2309.08251  [pdf, other

    cs.CV

    Cartoondiff: Training-free Cartoon Image Generation with Diffusion Transformer Models

    Authors: Feihong He, Gang Li, Lingyu Si, Leilei Yan, Shimeng Hou, Hongwei Dong, Fanzhang Li

    Abstract: Image cartoonization has attracted significant interest in the field of image generation. However, most of the existing image cartoonization techniques require re-training models using images of cartoon style. In this paper, we present CartoonDiff, a novel training-free sampling approach which generates image cartoonization using diffusion transformer models. Specifically, we decompose the reverse… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: 5 pages,5 figures

  27. arXiv:2308.15364  [pdf, other

    cs.LG stat.ML

    Heterogeneous Multi-Task Gaussian Cox Processes

    Authors: Feng Zhou, Quyu Kong, Zhijie Deng, Fengxiang He, Peng Cui, Jun Zhu

    Abstract: This paper presents a novel extension of multi-task Gaussian Cox processes for modeling multiple heterogeneous correlated tasks jointly, e.g., classification and regression, via multi-output Gaussian processes (MOGP). A MOGP prior over the parameters of the dedicated likelihoods for classification, regression and point process tasks can facilitate sharing of information between heterogeneous tasks… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  28. arXiv:2307.06742  [pdf, other

    eess.SY cs.AI cs.LG

    Vehicle Dispatching and Routing of On-Demand Intercity Ride-Pooling Services: A Multi-Agent Hierarchical Reinforcement Learning Approach

    Authors: Jinhua Si, Fang He, Xi Lin, Xindi Tang

    Abstract: The integrated development of city clusters has given rise to an increasing demand for intercity travel. Intercity ride-pooling service exhibits considerable potential in upgrading traditional intercity bus services by implementing demand-responsive enhancements. Nevertheless, its online operations suffer the inherent complexities due to the coupling of vehicle resource allocation among cities and… ▽ More

    Submitted 20 March, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

  29. arXiv:2306.08105  [pdf

    q-fin.PM cs.LG q-fin.GN q-fin.RM

    Model-Free Market Risk Hedging Using Crowding Networks

    Authors: Vadim Zlotnikov, Jiayu Liu, Igor Halperin, Fei He, Lisa Huang

    Abstract: Crowding is widely regarded as one of the most important risk factors in designing portfolio strategies. In this paper, we analyze stock crowding using network analysis of fund holdings, which is used to compute crowding scores for stocks. These scores are used to construct costless long-short portfolios, computed in a distribution-free (model-free) way and without using any numerical optimization… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: 8 pages, 6 figures

  30. arXiv:2306.06722  [pdf, other

    cs.CV cs.AI

    $E(2)$-Equivariant Vision Transformer

    Authors: Renjun Xu, Kaifan Yang, Ke Liu, Fengxiang He

    Abstract: Vision Transformer (ViT) has achieved remarkable performance in computer vision. However, positional encoding in ViT makes it substantially difficult to learn the intrinsic equivariance in data. Initial attempts have been made on designing equivariant ViT but are proved defective in some cases in this paper. To address this issue, we design a Group Equivariant Vision Transformer (GE-ViT) via a nov… ▽ More

    Submitted 7 July, 2023; v1 submitted 11 June, 2023; originally announced June 2023.

    Comments: Accept to UAI2023

  31. arXiv:2306.03679  [pdf, other

    cs.CV cs.AI cs.CR cs.LG stat.ML

    Human-imperceptible, Machine-recognizable Images

    Authors: Fusheng Hao, Fengxiang He, Yikai Wang, Fuxiang Wu, Jing Zhang, Jun Cheng, Dacheng Tao

    Abstract: Massive human-related data is collected to train neural networks for computer vision tasks. A major conflict is exposed relating to software engineers between better developing AI systems and distancing from the sensitive training data. To reconcile this conflict, this paper proposes an efficient privacy-preserving learning paradigm, where images are first encrypted to become ``human-imperceptible… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

  32. arXiv:2306.02913  [pdf, other

    cs.LG cs.CY cs.DC eess.SY stat.ML

    Decentralized SGD and Average-direction SAM are Asymptotically Equivalent

    Authors: Tongtian Zhu, Fengxiang He, Kaixuan Chen, Mingli Song, Dacheng Tao

    Abstract: Decentralized stochastic gradient descent (D-SGD) allows collaborative learning on massive devices simultaneously without the control of a central server. However, existing theories claim that decentralization invariably undermines generalization. In this paper, we challenge the conventional belief and present a completely new perspective for understanding decentralized learning. We prove that D-S… ▽ More

    Submitted 9 November, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: 40th International Conference on Machine Learning (ICML 2023)

  33. arXiv:2305.13871  [pdf, other

    cs.LG

    Improving Heterogeneous Model Reuse by Density Estimation

    Authors: Anke Tang, Yong Luo, Han Hu, Fengxiang He, Kehua Su, Bo Du, Yixin Chen, Dacheng Tao

    Abstract: This paper studies multiparty learning, aiming to learn a model using the private data of different participants. Model reuse is a promising solution for multiparty learning, assuming that a local model has been trained for each party. Considering the potential sample selection bias among different parties, some heterogeneous model reuse approaches have been developed. However, although pre-traine… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 9 pages, 5 figues. Accepted by IJCAI 2023

  34. arXiv:2304.05874  [pdf, other

    q-bio.NC cs.AI cs.LG cs.NE eess.SP q-bio.QM

    Adaptive Gated Graph Convolutional Network for Explainable Diagnosis of Alzheimer's Disease using EEG Data

    Authors: Dominik Klepl, Fei He, Min Wu, Daniel J. Blackburn, Ptolemaios G. Sarrigiannis

    Abstract: Graph neural network (GNN) models are increasingly being used for the classification of electroencephalography (EEG) data. However, GNN-based diagnosis of neurological disorders, such as Alzheimer's disease (AD), remains a relatively unexplored area of research. Previous studies have relied on functional connectivity methods to infer brain graph structures and used simple GNN architectures for the… ▽ More

    Submitted 27 September, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

    Comments: 16 pages, 16 figures

  35. arXiv:2303.05093  [pdf, other

    cs.CV cs.CL

    Improving Video Retrieval by Adaptive Margin

    Authors: Feng He, Qi Wang, Zhifan Feng, Wenbin Jiang, Yajuan Lv, Yong zhu, Xiao Tan

    Abstract: Video retrieval is becoming increasingly important owing to the rapid emergence of videos on the Internet. The dominant paradigm for video retrieval learns video-text representations by pushing the distance between the similarity of positive pairs and that of negative pairs apart from a fixed margin. However, negative pairs used for training are sampled randomly, which indicates that the semantics… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: Accepted by SIGIR 2021

  36. arXiv:2303.01205  [pdf, other

    cs.RO

    Distributed Consistent Multi-robot Cooperative Localization: A Coordinate Transformation Approach

    Authors: Chungeng Tian, Ning Hao, Fenghua He, Haodi Yao

    Abstract: This paper considers the problem of distributed cooperative localization (CL) via robot-to-robot measurements for a multi-robot system. We propose a distributed consistent CL algorithm. The key idea is to perform the EKF-based state estimation in a transformed coordinate system. Specifically, a coordinate transformation is constructed by decomposing the state-propagation Jacobian by which the corr… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

  37. arXiv:2303.00501  [pdf, other

    cs.LG cs.AI

    OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System

    Authors: Chao Xue, Wei Liu, Shuai Xie, Zhenfang Wang, Jiaxing Li, Xuyang Peng, Liang Ding, Shanshan Zhao, Qiong Cao, Yibo Yang, Fengxiang He, Bohua Cai, Rongcheng Bian, Yiyan Zhao, Heliang Zheng, Xiangyang Liu, Dongkai Liu, Daqing Liu, Li Shen, Chang Li, Shijin Zhang, Yukang Zhang, Guanpu Chen, Shixiang Chen, Yibing Zhan , et al. (3 additional authors not shown)

    Abstract: Automated machine learning (AutoML) seeks to build ML models with minimal human effort. While considerable research has been conducted in the area of AutoML in general, aiming to take humans out of the loop when building artificial intelligence (AI) applications, scant literature has focused on how AutoML works well in open-environment scenarios such as the process of training and updating large m… ▽ More

    Submitted 8 July, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

  38. arXiv:2302.11085  [pdf, other

    cs.LG stat.ML

    Learning to Generalize Provably in Learning to Optimize

    Authors: Junjie Yang, Tianlong Chen, Mingkang Zhu, Fengxiang He, Dacheng Tao, Yingbin Liang, Zhangyang Wang

    Abstract: Learning to optimize (L2O) has gained increasing popularity, which automates the design of optimizers by data-driven approaches. However, current L2O methods often suffer from poor generalization performance in at least two folds: (i) applying the L2O-learned optimizer to unseen optimizees, in terms of lowering their loss function values (optimizer generalization, or ``generalizable learning of op… ▽ More

    Submitted 28 March, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: This paper is accepted in AISTATS 2023

  39. arXiv:2302.10043  [pdf, other

    cs.AI cs.SI

    Friend Ranking in Online Games via Pre-training Edge Transformers

    Authors: Liang Yao, Jiazhen Peng, Shenggong Ji, Qiang Liu, Hongyun Cai, Feng He, Xu Cheng

    Abstract: Friend recall is an important way to improve Daily Active Users (DAU) in online games. The problem is to generate a proper lost friend ranking list essentially. Traditional friend recall methods focus on rules like friend intimacy or training a classifier for predicting lost players' return probability, but ignore feature information of (active) players and historical friend recall events. In this… ▽ More

    Submitted 26 April, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: Accepted by the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2023)

  40. arXiv:2301.08015  [pdf, other

    cs.GT cs.LG math.OC stat.ML

    Global Nash Equilibrium in Non-convex Multi-player Game: Theory and Algorithms

    Authors: Guanpu Chen, Gehui Xu, Fengxiang He, Yiguang Hong, Leszek Rutkowski, Dacheng Tao

    Abstract: Wide machine learning tasks can be formulated as non-convex multi-player games, where Nash equilibrium (NE) is an acceptable solution to all players, since no one can benefit from changing its strategy unilaterally. Attributed to the non-convexity, obtaining the existence condition of global NE is challenging, let alone designing theoretically guaranteed realization algorithms. This paper takes co… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

  41. arXiv:2301.03167  [pdf

    cs.CG cs.CV

    Machining feature recognition using descriptors with range constraints for mechanical 3D models

    Authors: Seungeun Lim, Changmo Yeo, Fazhi He, Jinwon Lee, Duhwan Mun

    Abstract: In machining feature recognition, geometric elements generated in a three-dimensional computer-aided design model are identified. This technique is used in manufacturability evaluation, process planning, and tool path generation. Here, we propose a method of recognizing 16 types of machining features using descriptors, often used in shape-based part retrieval studies. The base face is selected for… ▽ More

    Submitted 8 January, 2023; originally announced January 2023.

  42. arXiv:2301.01882  [pdf, other

    cs.CV

    InsPro: Propagating Instance Query and Proposal for Online Video Instance Segmentation

    Authors: Fei He, Haoyang Zhang, Naiyu Gao, Jian Jia, Yanhu Shan, Xin Zhao, Kaiqi Huang

    Abstract: Video instance segmentation (VIS) aims at segmenting and tracking objects in videos. Prior methods typically generate frame-level or clip-level object instances first and then associate them by either additional tracking heads or complex instance matching algorithms. This explicit instance association approach increases system complexity and fails to fully exploit temporal cues in videos. In this… ▽ More

    Submitted 4 January, 2023; originally announced January 2023.

    Comments: NeurIPS 2022

  43. arXiv:2212.08272  [pdf, other

    cs.DC

    Communication-Efficient Federated Learning for Heterogeneous Edge Devices Based on Adaptive Gradient Quantization

    Authors: Heting Liu, Fang He, Guohong Cao

    Abstract: Federated learning (FL) enables geographically dispersed edge devices (i.e., clients) to learn a global model without sharing the local datasets, where each client performs gradient descent with its local data and uploads the gradients to a central server to update the global model. However, FL faces massive communication overhead resulted from uploading the gradients in each training round. To ad… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  44. arXiv:2212.02809  [pdf, other

    cs.CV

    An advanced YOLOv3 method for small object detection

    Authors: Baokai Liu, Fengjie He, Shiqiang Du, Jiacheng Li, Wenjie Liu

    Abstract: Small object detection has important application value in the fields of autonomous driving and drone scene analysis. As one of the most advanced object detection algorithms, YOLOv3 suffers some challenges when detecting small objects, such as the problem of detection failure of small objects and occluded objects. To solve these problems, an improved YOLOv3 algorithm for small object detection is p… ▽ More

    Submitted 22 March, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

  45. arXiv:2212.01461  [pdf, other

    cs.CV

    Learning Disentangled Label Representations for Multi-label Classification

    Authors: Jian Jia, Fei He, Naiyu Gao, Xiaotang Chen, Kaiqi Huang

    Abstract: Although various methods have been proposed for multi-label classification, most approaches still follow the feature learning mechanism of the single-label (multi-class) classification, namely, learning a shared image feature to classify multiple labels. However, we find this One-shared-Feature-for-Multiple-Labels (OFML) mechanism is not conducive to learning discriminative label features and make… ▽ More

    Submitted 2 December, 2022; originally announced December 2022.

    Comments: 17 pages, 9 figures

  46. arXiv:2212.00935  [pdf, other

    cs.CV cs.AI

    Dunhuang murals contour generation network based on convolution and self-attention fusion

    Authors: Baokai Liu, Fengjie He, Shiqiang Du, Kaiwu Zhang, Jianhua Wang

    Abstract: Dunhuang murals are a collection of Chinese style and national style, forming a self-contained Chinese-style Buddhist art. It has very high historical and cultural value and research significance. Among them, the lines of Dunhuang murals are highly general and expressive. It reflects the character's distinctive character and complex inner emotions. Therefore, the outline drawing of murals is of gr… ▽ More

    Submitted 13 March, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

  47. arXiv:2211.16192  [pdf, other

    cs.CV cs.CR

    Be Careful with Rotation: A Uniform Backdoor Pattern for 3D Shape

    Authors: Linkun Fan, Fazhi He, Qing Guo, Wei Tang, Xiaolin Hong, Bing Li

    Abstract: For saving cost, many deep neural networks (DNNs) are trained on third-party datasets downloaded from internet, which enables attacker to implant backdoor into DNNs. In 2D domain, inherent structures of different image formats are similar. Hence, backdoor attack designed for one image format will suite for others. However, when it comes to 3D world, there is a huge disparity among different 3D dat… ▽ More

    Submitted 1 December, 2022; v1 submitted 28 November, 2022; originally announced November 2022.

  48. arXiv:2211.15953  [pdf, other

    cs.DC

    A Decentralized Framework for Kernel PCA with Projection Consensus Constraints

    Authors: Fan He, Ruikai Yang, Lei Shi, Xiaolin Huang

    Abstract: This paper studies kernel PCA in a decentralized setting, where data are distributively observed with full features in local nodes and a fusion center is prohibited. Compared with linear PCA, the use of kernel brings challenges to the design of decentralized consensus optimization: the local projection directions are data-dependent. As a result, the consensus constraint in distributed linear PCA i… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  49. CLOP: Video-and-Language Pre-Training with Knowledge Regularizations

    Authors: Guohao Li, Hu Yang, Feng He, Zhifan Feng, Yajuan Lyu, Hua Wu, Haifeng Wang

    Abstract: Video-and-language pre-training has shown promising results for learning generalizable representations. Most existing approaches usually model video and text in an implicit manner, without considering explicit structural representations of the multi-modal content. We denote such form of representations as structural knowledge, which express rich semantics of multiple granularities. There are relat… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: ACM Multimedia 2022 (MM'22)

  50. arXiv:2211.00824  [pdf, other

    cs.LG cs.CV

    Adversarial Auto-Augment with Label Preservation: A Representation Learning Principle Guided Approach

    Authors: Kaiwen Yang, Yanchao Sun, Jiahao Su, Fengxiang He, Xinmei Tian, Furong Huang, Tianyi Zhou, Dacheng Tao

    Abstract: Data augmentation is a critical contributing factor to the success of deep learning but heavily relies on prior domain knowledge which is not always available. Recent works on automatic data augmentation learn a policy to form a sequence of augmentation operations, which are still pre-defined and restricted to limited options. In this paper, we show that a prior-free autonomous data augmentation's… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)