Showing 1–50 of 202 results for author: Qin, X

  1. arXiv:2407.00943  [pdf, other]

    cs.DC cs.LG

    FedEx: Expediting Federated Learning over Heterogeneous Mobile Devices by Overlapping and Participant Selection

    Authors: Jiaxiang Geng, Boyu Li, Xiaoqi Qin, Yixuan Li, Liang Li, Yanzhao Hou, Miao Pan

    Abstract: Training latency is critical for the success of numerous intrigued applications ignited by federated learning (FL) over heterogeneous mobile devices. By revolutionarily overlapping local gradient transmission with continuous local computing, FL can remarkably reduce its training latency over homogeneous clients, yet encounter severe model staleness, model drifts, memory cost and straggler issues i…

    Submitted 2 July, 2024; v1 submitted 30 June, 2024; originally announced July 2024.

    Comments: 21 pages, 10 figures, submitted to SenSys 2024

  2. arXiv:2406.19251  [pdf, other]

    cs.CL cs.AI

    AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation

    Authors: Jia Fu, Xiaoting Qin, Fangkai Yang, Lu Wang, Jue Zhang, Qingwei Lin, Yubo Chen, Dongmei Zhang, Saravan Rajmohan, Qi Zhang

    Abstract: Recent advancements in Large Language Models have transformed ML/AI development, necessitating a reevaluation of AutoML principles for the Retrieval-Augmented Generation (RAG) systems. To address the challenges of hyper-parameter optimization and online adaptation in RAG, we propose the AutoRAG-HP framework, which formulates the hyper-parameter tuning as an online multi-armed bandit (MAB) problem…

    Submitted 27 June, 2024; originally announced June 2024.

  3. arXiv:2406.11519  [pdf, other]

    cs.CV eess.IV

    HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model

    Authors: Di Wang, Meiqi Hu, Yao Jin, Yuchun Miao, Jiaqi Yang, Yichu Xu, Xiaolei Qin, Jiaqi Ma, Lingyu Sun, Chenxing Li, Chuan Fu, Hongruixuan Chen, Chengxi Han, Naoto Yokoya, Jing Zhang, Minqiang Xu, Lin Liu, Lefei Zhang, Chen Wu, Bo Du, Dacheng Tao, Liangpei Zhang

    Abstract: Foundation models (FMs) are revolutionizing the analysis and understanding of remote sensing (RS) scenes, including aerial RGB, multispectral, and SAR images. However, hyperspectral images (HSIs), which are rich in spectral information, have not seen much application of FMs, with existing methods often restricted to specific tasks and lacking generality. To fill this gap, we introduce HyperSIGMA,…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: The code and models will be released at https://github.com/WHU-Sigma/HyperSIGMA

  4. arXiv:2406.10765  [pdf, other]

    cs.DC

    PWDFT-SW: Extending the Limit of Plane-Wave DFT Calculations to 16K Atoms on the New Sunway Supercomputer

    Authors: Qingcai Jiang, Zhenwei Cao, Junshi Chen, Xinming Qin, Wei Hu, Hong An, Jinlong Yang

    Abstract: First-principles density functional theory (DFT) with plane wave (PW) basis set is the most widely used method in quantum mechanical material simulations due to its advantages in accuracy and universality. However, a perceived drawback of PW-based DFT calculations is their substantial computational cost and memory usage, which currently limits their ability to simulate large-scale complex systems…

    Submitted 15 June, 2024; originally announced June 2024.

  5. arXiv:2406.09534  [pdf, other]

    cs.DB cs.LG

    FeatNavigator: Automatic Feature Augmentation on Tabular Data

    Authors: Jiaming Liang, Chuan Lei, Xiao Qin, Jiani Zhang, Asterios Katsifodimos, Christos Faloutsos, Huzefa Rangwala

    Abstract: Data-centric AI focuses on understanding and utilizing high-quality, relevant data in training machine learning (ML) models, thereby increasing the likelihood of producing accurate and useful results. Automatic feature augmentation, aiming to augment the initial base table with useful features from other tables, is critical in data preparation as it improves model performance, robustness, and gene…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 15 pages, 41 figures

  6. arXiv:2406.07390  [pdf, other]

    eess.SP cs.IT eess.IV

    DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling

    Authors: Sixian Wang, Jincheng Dai, Kailin Tan, Xiaoqi Qin, Kai Niu, Ping Zhang

    Abstract: End-to-end visual communication systems typically optimize a trade-off between channel bandwidth costs and signal-level distortion metrics. However, under challenging physical conditions, this traditional discriminative communication paradigm often results in unrealistic reconstructions with perceptible blurring and aliasing artifacts, despite the inclusion of perceptual or adversarial losses for…

    Submitted 11 June, 2024; originally announced June 2024.

  7. arXiv:2406.06446  [pdf, other]

    cs.IT cs.LG cs.MM

    Deep Generative Modeling Reshapes Compression and Transmission: From Efficiency to Resiliency

    Authors: Jincheng Dai, Xiaoqi Qin, Sixian Wang, Lexi Xu, Kai Niu, Ping Zhang

    Abstract: Information theory and machine learning are inextricably linked and have even been referred to as "two sides of the same coin". One particularly elegant connection is the essential equivalence between probabilistic generative modeling and data compression or transmission. In this article, we reveal the dual-functionality of deep generative models that reshapes both data compression for efficiency…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Publication in IEEE Wireless Communications

  8. arXiv:2406.03726  [pdf]

    cs.LG

    Efficient Graph Encoder Embedding for Large Sparse Graphs in Python

    Authors: Xihan Qin, Cencheng Shen

    Abstract: Graph is a ubiquitous representation of data in various research fields, and graph embedding is a prevalent machine learning technique for capturing key features and generating fixed-sized attributes. However, most state-of-the-art graph embedding methods are computationally and spatially expensive. Recently, the Graph Encoder Embedding (GEE) has been shown as the fastest graph embedding technique…

    Submitted 5 June, 2024; originally announced June 2024.

  9. arXiv:2405.09459  [pdf, other]

    cs.CV cs.AI

    Fourier Boundary Features Network with Wider Catchers for Glass Segmentation

    Authors: Xiaolin Qin, Jiacen Liu, Qianlei Wang, Shaolin Zhang, Fei Zhu, Zhang Yi

    Abstract: Glass largely blurs the boundary between the real world and the reflection. The special transmittance and reflectance quality have confused the semantic tasks related to machine vision. Therefore, how to clear the boundary built by glass, and avoid over-capturing features as false positive information in deep structure, matters for constraining the segmentation of reflection surface and penetratin…

    Submitted 15 May, 2024; originally announced May 2024.

  10. arXiv:2405.07250  [pdf]

    cs.DC

    Towards Cloud Efficiency with Large-scale Workload Characterization

    Authors: Anjaly Parayil, Jue Zhang, Xiaoting Qin, Íñigo Goiri, Lexiang Huang, Timothy Zhu, Chetan Bansal

    Abstract: Cloud providers introduce features (e.g., Spot VMs, Harvest VMs, and Burstable VMs) and optimizations (e.g., oversubscription, auto-scaling, power harvesting, and overclocking) to improve efficiency and reliability. To effectively utilize these features, it's crucial to understand the characteristics of workloads running in the cloud. However, workload characteristics can be complex and depend on…

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 6 figures, 13 tables

  11. arXiv:2405.04122  [pdf, other]

    cs.LG cs.DC

    Ranking-based Client Selection with Imitation Learning for Efficient Federated Learning

    Authors: Chunlin Tian, Zhan Shi, Xinpeng Qin, Li Li, Chengzhong Xu

    Abstract: Federated Learning (FL) enables multiple devices to collaboratively train a shared model while ensuring data privacy. The selection of participating devices in each training round critically affects both the model performance and training efficiency, especially given the vast heterogeneity in training capabilities and data distribution across devices. To address these challenges, we introduce a no…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML 2024

  12. arXiv:2405.02861  [pdf, other]

    cs.CL cs.AI cs.LG

    Revisiting a Pain in the Neck: Semantic Phrase Processing Benchmark for Language Models

    Authors: Yang Liu, Melissa Xiaohui Qin, Hongming Li, Chao Huang

    Abstract: We introduce LexBench, a comprehensive evaluation suite enabled to test language models (LMs) on ten semantic phrase processing tasks. Unlike prior studies, it is the first work to propose a framework from the comparative perspective to model the general semantic phrase (i.e., lexical collocation) and three fine-grained semantic phrases, including idiomatic expression, noun compound, and verbal co…

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: 24 pages, 17 figures, 10 tables

    MSC Class: 68T50; ACM Class: I.2.7

  13. arXiv:2405.00885  [pdf, other]

    cs.LG cs.NI eess.IV

    WHALE-FL: Wireless and Heterogeneity Aware Latency Efficient Federated Learning over Mobile Devices via Adaptive Subnetwork Scheduling

    Authors: Huai-an Su, Jiaxiang Geng, Liang Li, Xiaoqi Qin, Yanzhao Hou, Xin Fu, Miao Pan

    Abstract: As a popular distributed learning paradigm, federated learning (FL) over mobile devices fosters numerous applications, while their practical deployment is hindered by participating devices' computing and communication heterogeneity. Some pioneering research efforts proposed to extract subnetworks from the global model, and assign as large a subnetwork as possible to the device for local training b…

    Submitted 1 May, 2024; originally announced May 2024.

  14. arXiv:2404.19143  [pdf, other]

    cs.DC

    Workload Intelligence: Punching Holes Through the Cloud Abstraction

    Authors: Lexiang Huang, Anjaly Parayil, Jue Zhang, Xiaoting Qin, Chetan Bansal, Jovan Stojkovic, Pantea Zardoshti, Pulkit Misra, Eli Cortez, Raphael Ghelman, Íñigo Goiri, Saravan Rajmohan, Jim Kleewein, Rodrigo Fonseca, Timothy Zhu, Ricardo Bianchini

    Abstract: Today, cloud workloads are essentially opaque to the cloud platform. Typically, the only information the platform receives is the virtual machine (VM) type and possibly a decoration to the type (e.g., the VM is evictable). Similarly, workloads receive little to no information from the platform; generally, workloads might receive telemetry from their VMs or exceptional signals (e.g., shortly before…

    Submitted 29 April, 2024; originally announced April 2024.

  15. arXiv:2404.18209  [pdf, other]

    cs.LG cs.DB

    4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on Relational DBs

    Authors: Minjie Wang, Quan Gan, David Wipf, Zhenkun Cai, Ning Li, Jianheng Tang, Yanlin Zhang, Zizhao Zhang, Zunyao Mao, Yakun Song, Yanbo Wang, Jiahang Li, Han Zhang, Guang Yang, Xiao Qin, Chuan Lei, Muhan Zhang, Weinan Zhang, Christos Faloutsos, Zheng Zhang

    Abstract: Although RDBs store vast amounts of rich, informative data spread across interconnected tables, the progress of predictive machine learning models as applied to such tasks arguably falls well behind advances in other domains such as computer vision or natural language processing. This deficit stems, at least in part, from the lack of established/public RDB benchmarks as needed for training and eva…

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: Under review

  16. PromptCL: Improving Event Representation via Prompt Template and Contrastive Learning

    Authors: Yubo Feng, Lishuang Li, Yi Xiang, Xueyang Qin

    Abstract: The representation of events in text plays a significant role in various NLP tasks. Recent research demonstrates that contrastive learning has the ability to improve event comprehension capabilities of Pre-trained Language Models (PLMs) and enhance the performance of event representation learning. However, the efficacy of event representation learning based on contrastive learning and PLMs is limi…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: NLPCC 2023 Best Student Paper

    Journal ref: Natural Language Processing and Chinese Computing (NLPCC 2023)

  17. arXiv:2404.13434  [pdf, other]

    cs.CV cs.AI

    Nested-TNT: Hierarchical Vision Transformers with Multi-Scale Feature Processing

    Authors: Yuang Liu, Zhiheng Qiu, Xiaokai Qin

    Abstract: Transformer has been applied in the field of computer vision due to its excellent performance in natural language processing, surpassing traditional convolutional neural networks and achieving new state-of-the-art. ViT divides an image into several local patches, known as "visual sentences". However, the information contained in the image is vast and complex, and focusing only on the features at t…

    Submitted 20 April, 2024; originally announced April 2024.

  18. arXiv:2403.15157  [pdf, other]

    cs.SE

    AllHands: Ask Me Anything on Large-scale Verbatim Feedback via Large Language Models

    Authors: Chaoyun Zhang, Zicheng Ma, Yuhao Wu, Shilin He, Si Qin, Minghua Ma, Xiaoting Qin, Yu Kang, Yuyi Liang, Xiaoyu Gou, Yajie Xue, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang

    Abstract: Verbatim feedback constitutes a valuable repository of user experiences, opinions, and requirements essential for software development. Effectively and efficiently extracting valuable insights from such data poses a challenging task. This paper introduces Allhands, an innovative analytic framework designed for large-scale feedback analysis through a natural language interface, leveraging large la…

    Submitted 3 April, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

  19. arXiv:2403.14232  [pdf, other]

    cs.LG

    Contrastive Balancing Representation Learning for Heterogeneous Dose-Response Curves Estimation

    Authors: Minqin Zhu, Anpeng Wu, Haoxuan Li, Ruoxuan Xiong, Bo Li, Xiaoqing Yang, Xuan Qin, Peng Zhen, Jiecheng Guo, Fei Wu, Kun Kuang

    Abstract: Estimating the individuals' potential response to varying treatment doses is crucial for decision-making in areas such as precision medicine and management science. Most recent studies predict counterfactual outcomes by learning a covariate representation that is independent of the treatment variable. However, such independence constraints neglect much of the covariate information that is useful f…

    Submitted 21 March, 2024; originally announced March 2024.

  20. arXiv:2403.11380  [pdf, other]

    cs.CV

    Boosting Order-Preserving and Transferability for Neural Architecture Search: a Joint Architecture Refined Search and Fine-tuning Approach

    Authors: Beichen Zhang, Xiaoxing Wang, Xiaohan Qin, Junchi Yan

    Abstract: Supernet is a core component in many recent Neural Architecture Search (NAS) methods. It not only helps embody the search space but also provides a (relative) estimation of the final performance of candidate architectures. Thus, it is critical that the top architectures ranked by a supernet should be consistent with those ranked by true performance, which is known as the order-preserving ability.…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  21. arXiv:2403.08593  [pdf, other]

    cs.CL cs.AI

    Call Me When Necessary: LLMs can Efficiently and Faithfully Reason over Structured Environments

    Authors: Sitao Cheng, Ziyuan Zhuang, Yong Xu, Fangkai Yang, Chaoyun Zhang, Xiaoting Qin, Xiang Huang, Ling Chen, Qingwei Lin, Dongmei Zhang, Saravan Rajmohan, Qi Zhang

    Abstract: Large Language Models (LLMs) have shown potential in reasoning over structured environments, e.g., knowledge graph and table. Such tasks typically require multi-hop reasoning, i.e., match natural language utterance with instances in the environment. Previous methods leverage LLMs to incrementally build a reasoning path, where the LLMs either invoke tools or pick up schemas by step-by-step interact…

    Submitted 3 July, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: Accepted by ACL 2024 Findings. 21 pages, 7 figures, 17 tables

  22. arXiv:2403.07653  [pdf, other]

    cs.DB

    OmniMatch: Effective Self-Supervised Any-Join Discovery in Tabular Data Repositories

    Authors: Christos Koutras, Jiani Zhang, Xiao Qin, Chuan Lei, Vasileios Ioannidis, Christos Faloutsos, George Karypis, Asterios Katsifodimos

    Abstract: How can we discover join relationships among columns of tabular data in a data repository? Can this be done effectively when metadata is missing? Traditional column matching works mainly rely on similarity measures based on exact value overlaps, hence missing important semantics or failing to handle noise in the data. At the same time, recent dataset discovery methods focusing on deep table repres…

    Submitted 12 March, 2024; originally announced March 2024.

  23. arXiv:2403.06532  [pdf, other]

    eess.IV cs.CV q-bio.NC

    Reconstructing Visual Stimulus Images from EEG Signals Based on Deep Visual Representation Model

    Authors: Hongguang Pan, Zhuoyi Li, Yunpeng Fu, Xuebin Qin, Jianchen Hu

    Abstract: Reconstructing visual stimulus images is a significant task in neural decoding, and up to now, most studies consider the functional magnetic resonance imaging (fMRI) as the signal source. However, the fMRI-based image reconstruction methods are difficult to apply widely because of the complexity and high cost of the acquisition equipment. Considering the advantages of low cost and easy portabil…

    Submitted 11 March, 2024; originally announced March 2024.

  24. arXiv:2403.00673  [pdf, other]

    cs.LG

    Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency

    Authors: Yanxiao Zhao, Yangge Qian, Tianyi Wang, Jingyang Shan, Xiaolin Qin

    Abstract: Deep reinforcement learning (DRL) algorithms require substantial samples and computational resources to achieve higher performance, which restricts their practical application and poses challenges for further development. Given the constraint of limited resources, it is essential to leverage existing computational work (e.g., learned policies, samples) to enhance sample efficiency and reduce the c…

    Submitted 12 March, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: Under review

  25. arXiv:2402.06854  [pdf, other]

    cs.CV cs.GR cs.LG

    Gyroscope-Assisted Motion Deblurring Network

    Authors: Simin Luan, Cong Yang, Zeyd Boukhers, Xue Qin, Dongfeng Cheng, Wei Sui, Zhijun Li

    Abstract: Image research has shown substantial attention in deblurring networks in recent years. Yet, their practical usage in real-world deblurring, especially motion blur, remains limited due to the lack of pixel-aligned training triplets (background, blurred image, and blur heat map) and restricted information inherent in blurred images. This paper presents a simple yet efficient framework to synthetic a…

    Submitted 9 February, 2024; originally announced February 2024.

  26. arXiv:2401.12694  [pdf, other]

    cs.CV

    Pragmatic Communication in Multi-Agent Collaborative Perception

    Authors: Yue Hu, Xianghe Pang, Xiaoqi Qin, Yonina C. Eldar, Siheng Chen, Ping Zhang, Wenjun Zhang

    Abstract: Collaborative perception allows each agent to enhance its perceptual abilities by exchanging messages with others. It inherently results in a trade-off between perception ability and communication costs. Previous works transmit complete full-frame high-dimensional feature maps among agents, resulting in substantial communication costs. To promote communication efficiency, we propose only transmitt…

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 18 pages

  27. arXiv:2401.11775  [pdf, other]

    cs.CV

    Collaborative Position Reasoning Network for Referring Image Segmentation

    Authors: Jianjian Cao, Beiya Dai, Yulin Li, Xiameng Qin, Jingdong Wang

    Abstract: Given an image and a natural language expression as input, the goal of referring image segmentation is to segment the foreground masks of the entities referred by the expression. Existing methods mainly focus on interactive learning between vision and language to enhance the multi-modal representations for global context reasoning. However, predicting directly in pixel-level space can lead to coll…

    Submitted 22 January, 2024; originally announced January 2024.

  28. arXiv:2401.07753  [pdf, other]

    cs.CV

    Low-light Stereo Image Enhancement and De-noising in the Low-frequency Information Enhanced Image Space

    Authors: Minghua Zhao, Xiangdong Qin, Shuangli Du, Xuefei Bai, Jiahao Lyu, Yiguang Liu

    Abstract: Unlike single image task, stereo image enhancement can use another view information, and its key stage is how to perform cross-view feature interaction to extract useful information from another view. However, complex noise in low-light image and its impact on subsequent feature encoding and interaction are ignored by the existing methods. In this paper, a method is proposed to perform enhancement…

    Submitted 15 January, 2024; originally announced January 2024.

  29. arXiv:2401.01176  [pdf, other]

    cs.IT cs.LG eess.SP

    Fundamental Limitation of Semantic Communications: Neural Estimation for Rate-Distortion

    Authors: Dongxu Li, Jianhao Huang, Chuan Huang, Xiaoqi Qin, Han Zhang, Ping Zhang

    Abstract: This paper studies the fundamental limit of semantic communications over the discrete memoryless channel. We consider the scenario to send a semantic source consisting of an observation state and its corresponding semantic state, both of which are recovered at the receiver. To derive the performance limitation, we adopt the semantic rate-distortion function (SRDF) to study the relationship among t…

    Submitted 2 January, 2024; originally announced January 2024.

  30. arXiv:2401.00865  [pdf, other]

    cs.DC

    Xorbits: Automating Operator Tiling for Distributed Data Science

    Authors: Weizheng Lu, Kaisheng He, Xuye Qin, Chengjie Li, Zhong Wang, Tao Yuan, Xia Liao, Feng Zhang, Yueguo Chen, Xiaoyong Du

    Abstract: Data science pipelines commonly utilize dataframe and array operations for tasks such as data preprocessing, analysis, and machine learning. The most popular tools for these tasks are pandas and NumPy. However, these tools are limited to executing on a single node, making them unsuitable for processing large-scale data. Several systems have attempted to distribute data science applications to clus…

    Submitted 19 March, 2024; v1 submitted 29 December, 2023; originally announced January 2024.

    Comments: ICDE 2024 Industrial and Application Track

  31. arXiv:2312.16274  [pdf, other]

    cs.CV

    Towards Flexible, Scalable, and Adaptive Multi-Modal Conditioned Face Synthesis

    Authors: Jingjing Ren, Cheng Xu, Haoyu Chen, Xinran Qin, Lei Zhu

    Abstract: Recent progress in multi-modal conditioned face synthesis has enabled the creation of visually striking and accurately aligned facial images. Yet, current methods still face issues with scalability, limited flexibility, and a one-size-fits-all approach to control strength, not accounting for the differing levels of conditional entropy, a measure of unpredictability in data given some condition, ac…

    Submitted 21 March, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

  32. arXiv:2312.14521  [pdf, other]

    quant-ph cs.ET

    Tuning Quantum Computing Privacy through Quantum Error Correction

    Authors: Hui Zhong, Keyi Ju, Manojna Sistla, Xinyue Zhang, Xiaoqi Qin, Xin Fu, Miao Pan

    Abstract: Quantum computing is a promising paradigm for efficiently solving large and high-complexity problems. To protect quantum computing privacy, pioneering research efforts proposed to redefine differential privacy (DP) in quantum computing, i.e., quantum differential privacy (QDP), and harvest inherent noises generated by quantum computing to implement QDP. However, such an implementation approach is…

    Submitted 22 December, 2023; originally announced December 2023.

  33. arXiv:2312.11126  [pdf, other]

    quant-ph cs.CR cs.LG

    Harnessing Inherent Noises for Privacy Preservation in Quantum Machine Learning

    Authors: Keyi Ju, Xiaoqi Qin, Hui Zhong, Xinyue Zhang, Miao Pan, Baoling Liu

    Abstract: Quantum computing revolutionizes the way of solving complex problems and handling vast datasets, which shows great potential to accelerate the machine learning process. However, data leakage in quantum machine learning (QML) may present privacy risks. Although differential privacy (DP), which protects privacy through the injection of artificial noise, is a well-established approach, its applicatio…

    Submitted 6 March, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 6 pages, 4 figures

  34. arXiv:2311.17541  [pdf, other]

    cs.AI

    TaskWeaver: A Code-First Agent Framework

    Authors: Bo Qiao, Liqun Li, Xu Zhang, Shilin He, Yu Kang, Chaoyun Zhang, Fangkai Yang, Hang Dong, Jue Zhang, Lu Wang, Minghua Ma, Pu Zhao, Si Qin, Xiaoting Qin, Chao Du, Yong Xu, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang

    Abstract: Large Language Models (LLMs) have shown impressive abilities in natural language understanding and generation, leading to their widespread use in applications such as chatbots and virtual assistants. However, existing LLM frameworks face limitations in handling domain-specific data analytics tasks with rich data structures. Moreover, they struggle with flexibility to meet diverse user requirements…

    Submitted 19 June, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

  35. arXiv:2311.15593  [pdf, other]

    cs.IT cs.PF eess.SP

    Performance Analysis of MDMA-Based Cooperative MRC Networks with Relays in Dissimilar Rayleigh Fading Channels

    Authors: Lei Teng, Wannian An, Chen Dong, Xiaoqi Qin, Xiaodong Xu

    Abstract: Multiple access technology is a key technology in various generations of wireless communication systems. As a potential multiple access technology for the next generation wireless communication systems, model division multiple access (MDMA) technology improves spectrum efficiency and feasibility regions. This implies that the MDMA scheme can achieve greater performance gains compared to traditiona…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 6 pages, 4 figures, conference

  36. arXiv:2311.09932  [pdf, other]

    cs.CY

    The Communication GSC System with Energy Harvesting Nodes aided by Opportunistic Routing

    Authors: Hanyu Liu, Lei Teng, Wannian An, Xiaoqi Qin, Chen Dong, Xiaodong Xu

    Abstract: In this paper, a cooperative communication network based on energy-harvesting (EH) decode-and-forward (DF) relays is proposed. The relay nodes in this system use a harvest-storage-use (HSU) structure, and energy can be obtained from the surrounding environment through energy buffering. In order to improve the performance of the communication system, the opportunistic routing algorithm and the…

    Submitted 16 November, 2023; originally announced November 2023.

  37. arXiv:2311.08896  [pdf, other]

    cs.CL

    HeLM: Highlighted Evidence augmented Language Model for Enhanced Table-to-Text Generation

    Authors: Junyi Bian, Xiaolei Qin, Wuhe Zou, Mengzuo Huang, Congyi Luo, Ke Zhang, Weidong Zhang

    Abstract: Large models have demonstrated significant progress across various domains, particularly in tasks related to text generation. In the domain of Table to Text, many Large Language Model (LLM)-based methods currently resort to modifying prompts to invoke public APIs, incurring potential costs and information leaks. With the advent of open-source large models, fine-tuning LLMs has become feasible. In…

    Submitted 27 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  38. arXiv:2311.06002  [pdf, other]

    eess.SP cs.IT

    Fully-Passive versus Semi-Passive IRS-Enabled Sensing: SNR and CRB Comparison

    Authors: Xianxin Song, Xinmin Li, Xiaoqi Qin, Jie Xu, Tony Xiao Han, Derrick Wing Kwan Ng

    Abstract: This paper investigates the sensing performance of two intelligent reflecting surface (IRS)-enabled non-line-of-sight (NLoS) sensing systems with fully-passive and semi-passive IRSs, respectively. In particular, we consider a fundamental setup with one base station (BS), one uniform linear array (ULA) IRS, and one point target in the NLoS region of the BS. Accordingly, we analyze the sensing signa…

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: 13 pages, 7 figures

  39. arXiv:2310.14318  [pdf, other]

    cs.IR

    Intent Contrastive Learning with Cross Subsequences for Sequential Recommendation

    Authors: Xiuyuan Qin, Huanhuan Yuan, Pengpeng Zhao, Guanfeng Liu, Fuzhen Zhuang, Victor S. Sheng

    Abstract: The user purchase behaviors are mainly influenced by their intentions (e.g., buying clothes for decoration, buying brushes for painting, etc.). Modeling a user's latent intention can significantly improve the performance of recommendations. Previous works model users' intentions by considering the predefined label in auxiliary information or introducing stochastic data augmentation to learn purpos…

    Submitted 25 November, 2023; v1 submitted 22 October, 2023; originally announced October 2023.

    Comments: 10 pages, 5 figures, WSDM 2024. arXiv admin note: text overlap with arXiv:2304.07763

  40. arXiv:2310.09656  [pdf, other]

    cs.LG

    Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space

    Authors: Hengrui Zhang, Jiani Zhang, Balasubramaniam Srinivasan, Zhengyuan Shen, Xiao Qin, Christos Faloutsos, Huzefa Rangwala, George Karypis

    Abstract: Recent advances in tabular data generation have greatly enhanced synthetic data quality. However, extending diffusion models to tabular data is challenging due to the intricately varied distributions and a blend of data types of tabular data. This paper introduces Tabsyn, a methodology that synthesizes tabular data by leveraging a diffusion model within a variational autoencoder (VAE) crafted late…

    Submitted 11 May, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: Accepted by ICLR 2024 (Oral Presentation). Code is available at: https://github.com/amazon-science/tabsyn

  41. CCAE: A Corpus of Chinese-based Asian Englishes

    Authors: Yang Liu, Melissa Xiaohui Qin, Long Wang, Chao Huang

    Abstract: Language models have been foundations in various scenarios of NLP applications, but it has not been well applied in language variety studies, even for the most popular language like English. This paper represents one of the few initial efforts to utilize the NLP technology in the paradigm of World Englishes, specifically in creating a multi-variety corpus for studying Asian Englishes. We present a…

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: NLPCC'2023 (12 pages, 3 figures, 4 charts)

    MSC Class: 68T50; ACM Class: I.2.7

  42. arXiv:2310.04760  [pdf, other]

    eess.AS cs.SD

    Multi-objective Progressive Clustering for Semi-supervised Domain Adaptation in Speaker Verification

    Authors: Ze Li, Yuke Lin, Ning Jiang, Xiaoyi Qin, Guoqing Zhao, Haiying Wu, Ming Li

    Abstract: Utilizing the pseudo-labeling algorithm with large-scale unlabeled data becomes crucial for semi-supervised domain adaptation in speaker verification tasks. In this paper, we propose a novel pseudo-labeling method named Multi-objective Progressive Clustering (MoPC), specifically designed for semi-supervised domain adaptation. Firstly, we utilize limited labeled data from the target domain to deriv…

    Submitted 7 October, 2023; originally announced October 2023.

  43. arXiv:2309.16721  [pdf, other]

    cs.AI cs.RO

    GPT-Lab: Next Generation Of Optimal Chemistry Discovery By GPT Driven Robotic Lab

    Authors: Xiaokai Qin, Mingda Song, Yangguan Chen, Zhehong Ai, Jing Jiang

    Abstract: The integration of robots in chemical experiments has enhanced experimental efficiency, but lacking the human intelligence to comprehend literature, they seldom provide assistance in experimental design. Therefore, achieving full-process autonomy from experiment design to validation in self-driven laboratories (SDL) remains a challenge. The introduction of Generative Pre-trained Transformers (GPT)…

    Submitted 15 September, 2023; originally announced September 2023.

  44. arXiv:2309.14359  [pdf, other]

    math.OC cs.AI

    Optimizing Chance-Constrained Submodular Problems with Variable Uncertainties

    Authors: Xiankun Yan, Anh Viet Do, Feng Shi, Xiaoyu Qin, Frank Neumann

    Abstract: Chance constraints are frequently used to limit the probability of constraint violations in real-world optimization problems where the constraints involve stochastic components. We study chance-constrained submodular optimization problems, which capture a wide range of optimization problems with stochastic constraints. Previous studies considered submodular problems with stochastic knapsack constr…

    Submitted 23 September, 2023; originally announced September 2023.

  45. arXiv:2309.14109  [pdf, other]

    eess.AS cs.SD

    Haha-Pod: An Attempt for Laughter-based Non-Verbal Speaker Verification

    Authors: Yuke Lin, Xiaoyi Qin, Ning Jiang, Guoqing Zhao, Ming Li

    Abstract: It is widely acknowledged that discriminative representation for speaker verification can be extracted from verbal speech. However, how much speaker information non-verbal vocalization carries is still a puzzle. This paper explores speaker verification based on the most ubiquitous form of non-verbal voice, laughter. First, we use a semi-automatic pipeline to collect a new Haha-Pod dataset fro…

    Submitted 9 October, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: accepted by ASRU 2023

  46. arXiv:2309.13643  [pdf, other]

    cs.LG cs.NI

    REWAFL: Residual Energy and Wireless Aware Participant Selection for Efficient Federated Learning over Mobile Devices

    Authors: Y. Li, X. Qin, J. Geng, R. Chen, Y. Hou, Y. Gong, M. Pan, P. Zhang

    Abstract: Participant selection (PS) helps to accelerate federated learning (FL) convergence, which is essential for the practical deployment of FL over mobile devices. However, most existing PS approaches focus on improving training accuracy and efficiency rather than on the residual energy of mobile devices, which fundamentally determines whether the selected devices can participate. Meanwhile, the impacts of mobil…

    Submitted 24 September, 2023; originally announced September 2023.

  47. arXiv:2309.09187  [pdf, ps, other]

    eess.SY cs.LG cs.RO

    Data-Driven Reachability Analysis of Stochastic Dynamical Systems with Conformal Inference

    Authors: Navid Hashemi, Xin Qin, Lars Lindemann, Jyotirmoy V. Deshmukh

    Abstract: We consider data-driven reachability analysis of discrete-time stochastic dynamical systems using conformal inference. We assume that we are not provided with a symbolic representation of the stochastic system, but instead have access to a dataset of $K$-step trajectories. The reachability problem is to construct a probabilistic flowpipe such that the probability that a $K$-step trajectory can vio…

    Submitted 17 September, 2023; originally announced September 2023.

  48. arXiv:2309.08180  [pdf, other]

    cs.RO

    AVM-SLAM: Semantic Visual SLAM with Multi-Sensor Fusion in a Bird's Eye View for Automated Valet Parking

    Authors: Ye Li, Wenchao Yang, Dekun Lin, Qianlei Wang, Zhe Cui, Xiaolin Qin

    Abstract: Accurate localization in challenging garage environments -- marked by poor lighting, sparse textures, repetitive structures, dynamic scenes, and the absence of GPS -- is crucial for automated valet parking (AVP) tasks. Addressing these challenges, our research introduces AVM-SLAM, a cutting-edge semantic visual SLAM architecture with multi-sensor fusion in a bird's eye view (BEV). This novel frame…

    Submitted 1 July, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: Accepted by IROS 2024

  49. arXiv:2309.07796  [pdf, other]

    cs.CV

    For A More Comprehensive Evaluation of 6DoF Object Pose Tracking

    Authors: Yang Li, Fan Zhong, Xin Wang, Shuangbing Song, Jiachen Li, Xueying Qin, Changhe Tu

    Abstract: Previous evaluations on 6DoF object pose tracking have presented obvious limitations along with the development of this area. In particular, the evaluation protocols are not unified for different methods, the widely-used YCBV dataset contains significant annotation error, and the error metrics also may be biased. As a result, it is hard to fairly compare the methods, which has become a big obstacl…

    Submitted 14 September, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

  50. arXiv:2309.07495  [pdf, other]

    cs.CV cs.AI

    HDTR-Net: A Real-Time High-Definition Teeth Restoration Network for Arbitrary Talking Face Generation Methods

    Authors: Yongyuan Li, Xiuyuan Qin, Chao Liang, Mingqiang Wei

    Abstract: Talking Face Generation (TFG) aims to reconstruct facial movements to achieve highly natural lip movements from audio and facial features that are under potential connections. Existing TFG methods have made significant advancements to produce natural and realistic images. However, most work rarely takes visual quality into consideration. It is challenging to ensure lip synchronization while avoiding…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: 15 pages, 6 figures, PRCV 2023