Skip to main content

Showing 1–50 of 1,560 results for author: Zhou, W

  1. arXiv:2407.08585  [pdf, other

    cs.RO cs.AI cs.LG

    HACMan++: Spatially-Grounded Motion Primitives for Manipulation

    Authors: Bowen Jiang, Yilin Wu, Wenxuan Zhou, Chris Paxton, David Held

    Abstract: Although end-to-end robot learning has shown some success for robot manipulation, the learned policies are often not sufficiently robust to variations in object pose or geometry. To improve the policy generalization, we introduce spatially-grounded parameterized motion primitives in our method HACMan++. Specifically, we propose an action representation consisting of three components: what primitiv… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2407.08125  [pdf, ps, other

    cs.LG

    Real-Time Summarization of Twitter

    Authors: Yixin Jin, Meiqi Wang, Meng Li, Wenjing Zhou, Yi Shen, Hao Liu

    Abstract: In this paper, we describe our approaches to TREC Real-Time Summarization of Twitter. We focus on real time push notification scenario, which requires a system monitors the stream of sampled tweets and returns the tweets relevant and novel to given interest profiles. Dirichlet score with and with very little smoothing (baseline) are employed to classify whether a tweet is relevant to a given inter… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: This paper was accepted to International Conference on Artificial Intelligence and Electromechanical Automation 2024

  3. arXiv:2407.06779  [pdf, other

    cs.CL

    Using Pretrained Large Language Model with Prompt Engineering to Answer Biomedical Questions

    Authors: Wenxin Zhou, Thuy Hang Ngo

    Abstract: Our team participated in the BioASQ 2024 Task12b and Synergy tasks to build a system that can answer biomedical questions by retrieving relevant articles and snippets from the PubMed database and generating exact and ideal answers. We propose a two-level information retrieval and question-answering system based on pre-trained large language models (LLM), focused on LLM prompt engineering and respo… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Submitted to Conference and Labs of the Evaluation Forum (CLEF) 2024 CEUR-WS

  4. arXiv:2407.06153  [pdf, other

    cs.SE cs.CL

    What's Wrong with Your Code Generated by Large Language Models? An Extensive Study

    Authors: Shihan Dou, Haoxiang Jia, Shenxi Wu, Huiyuan Zheng, Weikang Zhou, Muling Wu, Mingxu Chai, Jessica Fan, Caishuang Huang, Yunbo Tao, Yan Liu, Enyu Zhou, Ming Zhang, Yuhao Zhou, Yueming Wu, Rui Zheng, Ming Wen, Rongxiang Weng, Jingang Wang, Xunliang Cai, Tao Gui, Xipeng Qiu, Qi Zhang, Xuanjing Huang

    Abstract: The increasing development of large language models (LLMs) in code generation has drawn significant attention among researchers. To enhance LLM-based code generation ability, current efforts are predominantly directed towards collecting high-quality datasets and leveraging diverse training technologies. However, there is a notable lack of comprehensive studies examining the limitations and boundar… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 17 pages, 7 figures

  5. arXiv:2407.06151  [pdf, other

    cs.CE

    Auto-PICNN: Automated machine learning for physics-informed convolutional neural networks

    Authors: Wanyun Zhou, Xiaowen Chu

    Abstract: Recent advances in deep learning for solving partial differential equations (PDEs) have introduced physics-informed neural networks (PINNs), which integrate machine learning with physical laws. Physics-informed convolutional neural networks (PICNNs) extend PINNs by leveraging CNNs for enhanced generalization and efficiency. However, current PICNNs depend on manual design, and inappropriate designs… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  6. Robust Precoding Designs for Multiuser MIMO Systems with Limited Feedback

    Authors: Wentao Zhou, Di Zhang, Merouane Debbah, Inkyu Lee

    Abstract: It has been well known that the achievable rate of multiuser multiple-input multiple-output systems with limited feedback is severely degraded by quantization errors when the number of feedback bits is not sufficient. To overcome such a rate degradation, we propose new robust precoding designs which can compensate for the quantization errors. In this paper, we first analyze the achievable rate of… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: IEEE Trans. Wireless Commun., Early Access, Feb. 2024

  7. arXiv:2407.05480  [pdf, other

    cs.CL

    Biomedical Nested NER with Large Language Model and UMLS Heuristics

    Authors: Wenxin Zhou

    Abstract: In this paper, we present our system for the BioNNE English track, which aims to extract 8 types of biomedical nested named entities from biomedical text. We use a large language model (Mixtral 8x7B instruct) and ScispaCy NER model to identify entities in an article and build custom heuristics based on unified medical language system (UMLS) semantic types to categorize the entities. We discuss the… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Submitted to CEUR-WS for the BioNNE task of BioASQ Lab in Conference and Labs of the Evaluation Forum (CLEF) 2024 as a working note

  8. arXiv:2407.05388  [pdf, other

    cs.CV

    Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis

    Authors: Qi Sun, Hang Zhou, Wengang Zhou, Li Li, Houqiang Li

    Abstract: Synthesizing realistic 3D indoor scenes is a challenging task that traditionally relies on manual arrangement and annotation by expert designers. Recent advances in autoregressive models have automated this process, but they often lack semantic understanding of the relationships and hierarchies present in real-world scenes, yielding limited performance. In this paper, we propose Forest2Seq, a fram… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  9. arXiv:2407.04254  [pdf, other

    eess.SY

    A Complex-Coefficient Voltage Control for Virtual Synchronous Generators for Dynamic Enhancement and Power-Voltage Decoupling

    Authors: Jingzhe Xu, Weihua Zhou, Behrooz Bahrani

    Abstract: As electric power systems evolve towards decarbonization, the transition to inverter-based resources (IBRs) presents challenges to grid stability, necessitating innovative control solutions. Virtual synchronous generator (VSG) emerges as a prominent solution. However, conventional VSGs are prone to instability in strong grids, slow voltage regulation, and coupled power-voltage response. To address… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 13 pages, 26 figures. Preparing for journal submission

  10. arXiv:2407.01853  [pdf, other

    cs.CL cs.AI cs.LG

    Improving Multilingual Instruction Finetuning via Linguistically Natural and Diverse Datasets

    Authors: Sathish Reddy Indurthi, Wenxuan Zhou, Shamil Chollampatt, Ravi Agrawal, Kaiqiang Song, Lingxiao Zhao, Chenguang Zhu

    Abstract: Advancements in Large Language Models (LLMs) have significantly enhanced instruction-following capabilities. However, most Instruction Fine-Tuning (IFT) datasets are predominantly in English, limiting model performance in other languages. Traditional methods for creating multilingual IFT datasets such as translating existing English IFT datasets or converting existing NLP datasets into IFT dataset… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  11. arXiv:2407.01796  [pdf, other

    cs.CL

    Ground Every Sentence: Improving Retrieval-Augmented LLMs with Interleaved Reference-Claim Generation

    Authors: Sirui Xia, Xintao Wang, Jiaqing Liang, Yifei Zhang, Weikang Zhou, Jiaji Deng, Fei Yu, Yanghua Xiao

    Abstract: Retrieval-Augmented Generation (RAG) has been widely adopted to enhance Large Language Models (LLMs) in knowledge-intensive tasks. Recently, Attributed Text Generation (ATG) has attracted growing attention, which provides citations to support the model's responses in RAG, so as to enhance the credibility of LLM-generated content and facilitate verification. Prior methods mainly adopt coarse-graine… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 15 pages,2 figures

  12. arXiv:2407.01364  [pdf

    econ.GN

    Co-benefits of Agricultural Diversification and Technology for Food and Nutrition Security in China

    Authors: Thomas Cherico Wanger, Estelle Raveloaritiana, Siyan Zeng, Haixiu Gao, Xueqing He, Yiwen Shao, Panlong Wu, Kris A. G. Wyckhuys, Wenwu Zhou, Yi Zou, Zengrong Zhu, Ling Li, Haiyan Cen, Yunhui Liu, Shenggen Fan

    Abstract: China is the leading crop producer and has successfully implemented sustainable development programs related to agriculture. Sustainable agriculture has been promoted to achieve national food security targets such as food self-sufficiency through the well-facilitated farmland construction (WFFC) approach. The WFFC is introduced in Chinas current national 10-year plan to consolidate farmlands into… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  13. arXiv:2407.01251  [pdf, other

    cs.CR cs.AI

    QUEEN: Query Unlearning against Model Extraction

    Authors: Huajie Chen, Tianqing Zhu, Lefeng Zhang, Bo Liu, Derui Wang, Wanlei Zhou, Minhui Xue

    Abstract: Model extraction attacks currently pose a non-negligible threat to the security and privacy of deep learning models. By querying the model with a small dataset and usingthe query results as the ground-truth labels, an adversary can steal a piracy model with performance comparable to the original model. Two key issues that cause the threat are, on the one hand, accurate and unlimited queries can be… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  14. arXiv:2407.00882  [pdf, other

    stat.ME

    Subgroup Identification with Latent Factor Structure

    Authors: Yong He, Dong Liu, Fuxin Wang, Mingjuan Zhang, Wen-Xin Zhou

    Abstract: Subgroup analysis has attracted growing attention due to its ability to identify meaningful subgroups from a heterogeneous population and thereby improving predictive power. However, in many scenarios such as social science and biology, the covariates are possibly highly correlated due to the existence of common factors, which brings great challenges for group identification and is neglected in th… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  15. arXiv:2406.18927  [pdf, other

    cs.CV

    RoFIR: Robust Fisheye Image Rectification Framework Impervious to Optical Center Deviation

    Authors: Zhaokang Liao, Hao Feng, Shaokai Liu, Wengang Zhou, Houqiang Li

    Abstract: Fisheye images are categorized fisheye into central and deviated based on the optical center position. Existing rectification methods are limited to central fisheye images, while this paper proposes a novel method that extends to deviated fisheye image rectification. The challenge lies in the variant global distortion distribution pattern caused by the random optical center position. To address th… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  16. arXiv:2406.18532  [pdf, other

    cs.CL cs.AI cs.LG

    Symbolic Learning Enables Self-Evolving Agents

    Authors: Wangchunshu Zhou, Yixin Ou, Shengwei Ding, Long Li, Jialong Wu, Tiannan Wang, Jiamin Chen, Shuai Wang, Xiaohua Xu, Ningyu Zhang, Huajun Chen, Yuchen Eleanor Jiang

    Abstract: The AI community has been exploring a pathway to artificial general intelligence (AGI) by developing "language agents", which are complex large language models (LLMs) pipelines involving both prompting techniques and tool usage methods. While language agents have demonstrated impressive capabilities for many real-world tasks, a fundamental limitation of current language agents research is that the… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Code available at https://github.com/aiwaves-cn/agents

  17. arXiv:2406.18311  [pdf, other

    cs.LG

    Online Learning of Multiple Tasks and Their Relationships : Testing on Spam Email Data and EEG Signals Recorded in Construction Fields

    Authors: Yixin Jin, Wenjing Zhou, Meiqi Wang, Meng Li, Xintao Li, Tianyu Hu

    Abstract: This paper examines an online multi-task learning (OMTL) method, which processes data sequentially to predict labels across related tasks. The framework learns task weights and their relatedness concurrently. Unlike previous models that assumed static task relatedness, our approach treats tasks as initially independent, updating their relatedness iteratively using newly calculated weight vectors.… ▽ More

    Submitted 29 June, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

  18. arXiv:2406.18108  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Token-Weighted RNN-T for Learning from Flawed Data

    Authors: Gil Keren, Wei Zhou, Ozlem Kalinli

    Abstract: ASR models are commonly trained with the cross-entropy criterion to increase the probability of a target token sequence. While optimizing the probability of all tokens in the target sequence is sensible, one may want to de-emphasize tokens that reflect transcription errors. In this work, we propose a novel token-weighted RNN-T criterion that augments the RNN-T objective with token-specific weights… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  19. arXiv:2406.17777  [pdf, other

    cs.CV

    Text-Animator: Controllable Visual Text Video Generation

    Authors: Lin Liu, Quande Liu, Shengju Qian, Yuan Zhou, Wengang Zhou, Houqiang Li, Lingxi Xie, Qi Tian

    Abstract: Video generation is a challenging yet pivotal task in various industries, such as gaming, e-commerce, and advertising. One significant unresolved aspect within T2V is the effective visualization of text within generated videos. Despite the progress achieved in Text-to-Video~(T2V) generation, current methods still cannot effectively visualize texts in videos directly, as they mainly focus on summar… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Project Page: https://laulampaul.github.io/text-animator.html

  20. arXiv:2406.16963  [pdf, other

    cs.LG cs.AI cs.CR cs.SI

    Large Language Models for Link Stealing Attacks Against Graph Neural Networks

    Authors: Faqian Guan, Tianqing Zhu, Hui Sun, Wanlei Zhou, Philip S. Yu

    Abstract: Graph data contains rich node features and unique edge information, which have been applied across various domains, such as citation networks or recommendation systems. Graph Neural Networks (GNNs) are specialized for handling such data and have shown impressive performance in many applications. However, GNNs may contain of sensitive information and susceptible to privacy attacks. For example, lin… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  21. arXiv:2406.16641  [pdf, other

    cs.CV cs.AI

    Vision-Language Consistency Guided Multi-modal Prompt Learning for Blind AI Generated Image Quality Assessment

    Authors: Jun Fu, Wei Zhou, Qiuping Jiang, Hantao Liu, Guangtao Zhai

    Abstract: Recently, textual prompt tuning has shown inspirational performance in adapting Contrastive Language-Image Pre-training (CLIP) models to natural image quality assessment. However, such uni-modal prompt learning method only tunes the language branch of CLIP models. This is not enough for adapting CLIP models to AI generated image quality assessment (AGIQA) since AGIs visually differ from natural im… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted by IEEE Signal Processing Letter

  22. arXiv:2406.14289  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.str-el

    Electrical switching of chirality in rhombohedral graphene Chern insulators

    Authors: Jing Ding, Hanxiao Xiang, Jiannan Hua, Wenqiang Zhou, Naitian Liu, Le Zhang, Na Xin, Kenji Watanabe, Takashi Taniguchi, Wei Zhu, Shuigang Xu

    Abstract: A Chern insulator hosts topologically protected chiral edge currents with quantized conductance characterized by its Chern number. Switching the chirality of the Chern insulator, namely, the direction of the edge current, is highly challenging due to topologically forbidden backscattering but is of considerable importance for the design of topological devices. Nevertheless, this can be achieved by… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 21 pages, 4 figures in main text

  23. arXiv:2406.14130  [pdf, other

    cs.CV

    ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning

    Authors: Zhongjie Duan, Wenmeng Zhou, Cen Chen, Yaliang Li, Weining Qian

    Abstract: Recently, advancements in video synthesis have attracted significant attention. Video synthesis models such as AnimateDiff and Stable Video Diffusion have demonstrated the practical applicability of diffusion models in creating dynamic visual content. The emergence of SORA has further spotlighted the potential of video generation technologies. Nonetheless, the extension of video lengths has been c… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 8 pages, 5 figures

  24. arXiv:2406.13607  [pdf, other

    cs.CV

    Ultra-High-Definition Restoration: New Benchmarks and A Dual Interaction Prior-Driven Solution

    Authors: Liyan Wang, Cong Wang, Jinshan Pan, Weixiang Zhou, Xiaoran Sun, Wei Wang, Zhixun Su

    Abstract: Ultra-High-Definition (UHD) image restoration has acquired remarkable attention due to its practical demand. In this paper, we construct UHD snow and rain benchmarks, named UHD-Snow and UHD-Rain, to remedy the deficiency in this field. The UHD-Snow/UHD-Rain is established by simulating the physics process of rain/snow into consideration and each benchmark contains 3200 degraded/clear image pairs o… ▽ More

    Submitted 22 June, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

  25. arXiv:2406.13368  [pdf

    cond-mat.mtrl-sci

    Lewis Acidity and Basicity Diagnostics of Molten Salt for its Properties and Structure Online Monitoring

    Authors: Changzu Zhu, Jia Song, Xiaorui Xu, Chengyu Wang, Yang Tong, Lve Lin, Shaoqiang Guo, Wentao Zhou, Adrien Couet, Yafei Wang

    Abstract: Analogous to the aqueous solution where the pH of the solvent affects its multiple behaviors, the Lewis acidity-basicity of molten salts also greatly influences their thermophysical and thermochemical properties. In the study, we develop ion probes to quantitatively determine the acidity-basicity scale of molten NaCl-xAlCl3 (x = 1.5-2.1) salt using in-situ ultra-violet visible (UV-Vis) spectroscop… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  26. arXiv:2406.12516  [pdf, other

    cs.CR cs.DC cs.LG

    Update Selective Parameters: Federated Machine Unlearning Based on Model Explanation

    Authors: Heng Xu, Tianqing Zhu, Lefeng Zhang, Wanlei Zhou, Philip S. Yu

    Abstract: Federated learning is a promising privacy-preserving paradigm for distributed machine learning. In this context, there is sometimes a need for a specialized process called machine unlearning, which is required when the effect of some specific training samples needs to be removed from a learning model due to privacy, security, usability, and/or legislative factors. However, problems arise when curr… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted by IEEE Transactions on Big Data

  27. arXiv:2406.12180  [pdf

    cond-mat.mtrl-sci quant-ph

    Unusual charge density wave introduced by Janus structure in monolayer vanadium dichalcogenides

    Authors: Ziqiang Xu, Yan Shao, Chun Huang, Genyu Hu, Shihao Hu, Zhi-Lin Li, Xiaoyu Hao, Yanhui Hou, Teng Zhang, Jin-An Shi, Chen Liu, Jia-Ou Wang, Wu Zhou, Jiadong Zhou, Wei Ji, Jingsi Qiao, Xu Wu, Hong-Jun Gao, Yeliang Wang

    Abstract: As a fundamental structural feature, the symmetry of materials determines the exotic quantum properties in transition metal dichalcogenides (TMDs) with charge density wave (CDW). Breaking the inversion symmetry, the Janus structure, an artificially constructed lattice, provides an opportunity to tune the CDW states and the related properties. However, limited by the difficulties in atomic-level fa… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  28. arXiv:2406.12178  [pdf, other

    cs.CV

    FCA-RAC: First Cycle Annotated Repetitive Action Counting

    Authors: Jiada Lu, WeiWei Zhou, Xiang Qian, Dongze Lian, Yanyu Xu, Weifeng Wang, Lina Cao, Shenghua Gao

    Abstract: Repetitive action counting quantifies the frequency of specific actions performed by individuals. However, existing action-counting datasets have limited action diversity, potentially hampering model performance on unseen actions. To address this issue, we propose a framework called First Cycle Annotated Repetitive Action Counting (FCA-RAC). This framework contains 4 parts: 1) a labeling technique… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  29. arXiv:2406.11839  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    mDPO: Conditional Preference Optimization for Multimodal Large Language Models

    Authors: Fei Wang, Wenxuan Zhou, James Y. Huang, Nan Xu, Sheng Zhang, Hoifung Poon, Muhao Chen

    Abstract: Direct preference optimization (DPO) has shown to be an effective method for large language model (LLM) alignment. Recent works have attempted to apply DPO to multimodal scenarios but have found it challenging to achieve consistent improvement. Through a comparative experiment, we identify the unconditional preference problem in multimodal preference optimization, where the model overlooks the ima… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  30. arXiv:2406.11827  [pdf, other

    cs.CL cs.AI cs.LG

    WPO: Enhancing RLHF with Weighted Preference Optimization

    Authors: Wenxuan Zhou, Ravi Agrawal, Shujian Zhang, Sathish Reddy Indurthi, Sanqiang Zhao, Kaiqiang Song, Silei Xu, Chenguang Zhu

    Abstract: Reinforcement learning from human feedback (RLHF) is a promising solution to align large language models (LLMs) more closely with human values. Off-policy preference optimization, where the preference data is obtained from other models, is widely adopted due to its cost efficiency and scalability. However, off-policy preference optimization often suffers from a distributional gap between the polic… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  31. arXiv:2406.11467  [pdf, other

    econ.EM

    Resilience of international oil trade networks under extreme event shock-recovery simulations

    Authors: Na Wei, Wen-Jie Xie, Wei-Xing Zhou

    Abstract: With the frequent occurrence of black swan events, global energy security situation has become increasingly complex and severe. Assessing the resilience of the international oil trade network (iOTN) is crucial for evaluating its ability to withstand extreme shocks and recover thereafter, ensuring energy security. We overcomes the limitations of discrete historical data by developing a simulation m… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  32. arXiv:2406.10954  [pdf, other

    cs.LG cs.CR

    Towards Efficient Target-Level Machine Unlearning Based on Essential Graph

    Authors: Heng Xu, Tianqing Zhu, Lefeng Zhang, Wanlei Zhou, Wei Zhao

    Abstract: Machine unlearning is an emerging technology that has come to attract widespread attention. A number of factors, including regulations and laws, privacy, and usability concerns, have resulted in this need to allow a trained model to forget some of its training data. Existing studies of machine unlearning mainly focus on unlearning requests that forget a cluster of instances or all instances from o… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  33. arXiv:2406.10953  [pdf, other

    cs.CR

    Really Unlearned? Verifying Machine Unlearning via Influential Sample Pairs

    Authors: Heng Xu, Tianqing Zhu, Lefeng Zhang, Wanlei Zhou

    Abstract: Machine unlearning enables pre-trained models to eliminate the effects of partial training samples. Previous research has mainly focused on proposing efficient unlearning strategies. However, the verification of machine unlearning, or in other words, how to guarantee that a sample has been successfully unlearned, has been overlooked for a long time. Existing verification schemes typically rely on… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  34. arXiv:2406.10951  [pdf, other

    cs.CR

    Don't Forget Too Much: Towards Machine Unlearning on Feature Level

    Authors: Heng Xu, Tianqing Zhu, Wanlei Zhou, Wei Zhao

    Abstract: Machine unlearning enables pre-trained models to remove the effect of certain portions of training data. Previous machine unlearning schemes have mainly focused on unlearning a cluster of instances or all instances belonging to a specific class. These types of unlearning might have a significant impact on the model utility; and they may be inadequate for situations where we only need to unlearn fe… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  35. arXiv:2406.10884  [pdf, other

    cs.LG cs.CR cs.DC

    Linkage on Security, Privacy and Fairness in Federated Learning: New Balances and New Perspectives

    Authors: Linlin Wang, Tianqing Zhu, Wanlei Zhou, Philip S. Yu

    Abstract: Federated learning is fast becoming a popular paradigm for applications involving mobile devices, banking systems, healthcare, and IoT systems. Hence, over the past five years, researchers have undertaken extensive studies on the privacy leaks, security threats, and fairness associated with these emerging models. For the most part, these three critical concepts have been studied in isolation; howe… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  36. arXiv:2406.10861  [pdf, other

    cs.LG cs.DC

    Knowledge Distillation in Federated Learning: a Survey on Long Lasting Challenges and New Solutions

    Authors: Laiqiao Qin, Tianqing Zhu, Wanlei Zhou, Philip S. Yu

    Abstract: Federated Learning (FL) is a distributed and privacy-preserving machine learning paradigm that coordinates multiple clients to train a model while keeping the raw data localized. However, this traditional FL poses some challenges, including privacy risks, data heterogeneity, communication bottlenecks, and system heterogeneity issues. To tackle these challenges, knowledge distillation (KD) has been… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  37. arXiv:2406.10501  [pdf, other

    cs.CV

    Self-Supervised Representation Learning with Spatial-Temporal Consistency for Sign Language Recognition

    Authors: Weichao Zhao, Wengang Zhou, Hezhen Hu, Min Wang, Houqiang Li

    Abstract: Recently, there have been efforts to improve the performance in sign language recognition by designing self-supervised learning methods. However, these methods capture limited information from sign pose data in a frame-wise learning manner, leading to sub-optimal solutions. To this end, we propose a simple yet effective self-supervised contrastive learning framework to excavate rich context via sp… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: Accepted by TIP2023

  38. arXiv:2406.09411  [pdf, other

    cs.CV cs.AI cs.CL

    MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

    Authors: Fei Wang, Xingyu Fu, James Y. Huang, Zekun Li, Qin Liu, Xiaogeng Liu, Mingyu Derek Ma, Nan Xu, Wenxuan Zhou, Kai Zhang, Tianyi Lorena Yan, Wenjie Jacky Mo, Hsiang-Hui Liu, Pan Lu, Chunyuan Li, Chaowei Xiao, Kai-Wei Chang, Dan Roth, Sheng Zhang, Hoifung Poon, Muhao Chen

    Abstract: We introduce MuirBench, a comprehensive benchmark that focuses on robust multi-image understanding capabilities of multimodal LLMs. MuirBench consists of 12 diverse multi-image tasks (e.g., scene understanding, ordering) that involve 10 categories of multi-image relations (e.g., multiview, temporal relations). Comprising 11,264 images and 2,600 multiple-choice questions, MuirBench is created in a… ▽ More

    Submitted 1 July, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: typos corrected, references added, Project Page: https://muirbench.github.io/

  39. arXiv:2406.08203  [pdf, other

    eess.AS cs.SD

    LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation

    Authors: Wenhao Guan, Kaidi Wang, Wangjin Zhou, Yang Wang, Feng Deng, Hui Wang, Lin Li, Qingyang Hong, Yong Qin

    Abstract: Recently, the application of diffusion models has facilitated the significant development of speech and audio generation. Nevertheless, the quality of samples generated by diffusion models still needs improvement. And the effectiveness of the method is accompanied by the extensive number of sampling steps, leading to an extended synthesis time necessary for generating high-quality audio. Previous… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted at Interspeech2024

  40. arXiv:2406.08173  [pdf, other

    cs.CL

    Semi-Supervised Spoken Language Glossification

    Authors: Huijie Yao, Wengang Zhou, Hao Zhou, Houqiang Li

    Abstract: Spoken language glossification (SLG) aims to translate the spoken language text into the sign language gloss, i.e., a written record of sign language. In this work, we present a framework named $S$emi-$S$upervised $S$poken $L$anguage $G$lossification ($S^3$LG) for SLG. To tackle the bottleneck of limited parallel data in SLG, our $S^3$LG incorporates large-scale monolingual spoken language text in… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL2024 main

  41. arXiv:2406.07973  [pdf, other

    cs.CR

    Unique Security and Privacy Threats of Large Language Model: A Comprehensive Survey

    Authors: Shang Wang, Tianqing Zhu, Bo Liu, Ming Ding, Xu Guo, Dayong Ye, Wanlei Zhou, Philip S. Yu

    Abstract: With the rapid development of artificial intelligence, large language models (LLMs) have made remarkable advancements in natural language processing. These models are trained on vast datasets to exhibit powerful language understanding and generation capabilities across various applications, including machine translation, chatbots, and agents. However, LLMs have revealed a variety of privacy and se… ▽ More

    Submitted 18 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  42. arXiv:2406.07293  [pdf, other

    cs.SI

    Exploring Cognitive Bias Triggers in COVID-19 Misinformation Tweets: A Bot vs. Human Perspective

    Authors: Lynnette Hui Xian Ng, Wenqi Zhou, Kathleen M. Carley

    Abstract: During the COVID-19 pandemic, the proliferation of misinformation on social media has been rapidly increasing. Automated Bot authors are believed to be significant contributors of this surge. It is hypothesized that Bot authors deliberately craft online misinformation aimed at triggering and exploiting human cognitive biases, thereby enhancing tweet engagement and persuasive influence. This study… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  43. arXiv:2406.07023  [pdf, other

    cs.CV

    LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection

    Authors: Jiahua Xu, Si Zuo, Chenfeng Wei, Wei Zhou

    Abstract: With the rapid proliferation of autonomous driving, there has been a heightened focus on the research of lidar-based 3D semantic segmentation and object detection methodologies, aiming to ensure the safety of traffic participants. In recent decades, learning-based approaches have emerged, demonstrating remarkable performance gains in comparison to conventional algorithms. However, the segmentation… ▽ More

    Submitted 11 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

  44. arXiv:2406.06104  [pdf

    cond-mat.mtrl-sci

    Correlated electrons of the flat band in charge density wave state of 4Hb-TaSexS2-x

    Authors: Yanyan Geng, Jianfeng Guo, Fanyu Meng, Manyu Wang, Shuo Mi, Li Huang, Rui Xu, Fei Pang, Kai Liu, Shancai Wang, Hong-Jun Gao, Weichang Zhou, Wei Ji, Hechang Lei, Zhihai Cheng

    Abstract: Many intriguing quantum states of matter, such as unconventional superconductivity, magnetic phases and fractional quantum Hall physics, emergent from the spatially-correlated localized electrons in the flat band of solid materials. By using scanning tunneling microscopy and spectroscopy (STM/STS), we report the real-space investigation of correlated electrons in the flat band of superlattice 4Hb-… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 18 pages, 4 figures

  45. arXiv:2406.05981  [pdf, other

    cs.LG cs.AI cs.CL

    ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

    Authors: Haoran You, Yipin Guo, Yichao Fu, Wei Zhou, Huihong Shi, Xiaofan Zhang, Souvik Kundu, Amir Yazdanbakhsh, Yingyan, Lin

    Abstract: Large language models (LLMs) have shown impressive performance on language tasks but face challenges when deployed on resource-constrained devices due to their extensive parameters and reliance on dense multiplications, resulting in high memory demands and latency bottlenecks. Shift-and-add reparameterization offers a promising solution by replacing costly multiplications with hardware-friendly pr… ▽ More

    Submitted 11 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  46. arXiv:2406.05510  [pdf, other

    cs.LG cs.CL

    Representation Learning with Conditional Information Flow Maximization

    Authors: Dou Hu, Lingwei Wei, Wei Zhou, Songlin Hu

    Abstract: This paper proposes an information-theoretic representation learning framework, named conditional information flow maximization, to extract noise-invariant sufficient representations for the input data and target task. It promotes the learned representations have good feature uniformity and sufficient predictive ability, which can enhance the generalization of pre-trained language models (PLMs) fo… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: 16 pages, accepted to ACL 2024 (main conference)

  47. arXiv:2406.04980  [pdf, other

    astro-ph.SR astro-ph.GA

    M17 MIR: A Massive Star is Forming via Episodic Mass Accretion

    Authors: Wei Zhou, Zhiwei Chen, Zhibo Jiang, Haoran Feng, Yu Jiang

    Abstract: We analyzed the Atacama Large Millimeter/submillimeter Array (ALMA) band 6 data for the outbursting massive protostar M17~MIR. The ALMA CO $J=2-1$ data reveal a collimated and bipolar north-south outflow from M17~MIR. The blue-shifted outflow exhibits four CO knots (N1 to N4) along the outflow axis, while the red-shifted outflow appears as a single knot (S1). The extremely high velocity (EHV) emis… ▽ More

    Submitted 17 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted for publication in ApJL; typos corrected

  48. arXiv:2406.04598  [pdf, other

    cs.AI

    OCDB: Revisiting Causal Discovery with a Comprehensive Benchmark and Evaluation Framework

    Authors: Wei Zhou, Hong Huang, Guowen Zhang, Ruize Shi, Kehan Yin, Yuanyuan Lin, Bang Liu

    Abstract: Large language models (LLMs) have excelled in various natural language processing tasks, but challenges in interpretability and trustworthiness persist, limiting their use in high-stakes fields. Causal discovery offers a promising approach to improve transparency and reliability. However, current evaluations are often one-sided and lack assessments focused on interpretability performance. Addition… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  49. arXiv:2406.04076  [pdf, other

    cs.CR

    Federated TrustChain: Blockchain-Enhanced LLM Training and Unlearning

    Authors: Xuhan Zuo, Minghao Wang, Tianqing Zhu, Lefeng Zhang, Dayong Ye, Shui Yu, Wanlei Zhou

    Abstract: The development of Large Language Models (LLMs) faces a significant challenge: the exhausting of publicly available fresh data. This is because training a LLM needs a large demanding of new data. Federated learning emerges as a promising solution, enabling collaborative model to contribute their private data to LLM global model. However, integrating federated learning with LLMs introduces new chal… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 16 pages, 7 figures,

  50. arXiv:2406.03758  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Phonon heat conduction across slippery interfaces in twisted graphite

    Authors: Fuwei Yang, Wenjiang Zhou, Zhibin Zhang, Xuanyu Huang, Jingwen Zhang, Nianjie Liang, Wujuan Yan, Yuxi Wang, Mingchao Ding, Quanlin Guo, Yu Han, Te-Huan Liu, Kaihui Liu, Quanshui Zheng, Bai Song

    Abstract: Interlayer rotation in van der Waals (vdW) materials offers great potential for manipulating phonon dynamics and heat flow in advanced electronics with ever higher compactness and power density. However, despite extensive theoretical efforts in recent years, experimental measurements remain scarce especially due to the critical challenges of preparing single-crystalline twisted interfaces and prob… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.