Showing 1–50 of 531 results for author: Deng, W

  1. arXiv:2407.07780  [pdf, other]

    cs.CV

    Cross Domain Object Detection via Multi-Granularity Confidence Alignment based Mean Teacher

    Authors: Jiangming Chen, Li Liu, Wanxia Deng, Zhen Liu, Yu Liu, Yingmei Wei, Yongxiang Liu

    Abstract: Cross domain object detection learns an object detector for an unlabeled target domain by transferring knowledge from an annotated source domain. Promising results have been achieved via Mean Teacher, however, pseudo labeling which is the bottleneck of mutual learning remains to be further explored. In this study, we find that confidence misalignment of the predictions, including category-level ov…

    Submitted 10 July, 2024; originally announced July 2024.

  2. arXiv:2407.06935  [pdf, other]

    cs.LG stat.CO stat.ML

    Bayesian Federated Learning with Hamiltonian Monte Carlo: Algorithm and Theory

    Authors: Jiajun Liang, Qian Zhang, Wei Deng, Qifan Song, Guang Lin

    Abstract: This work introduces a novel and efficient Bayesian federated learning algorithm, namely, the Federated Averaging stochastic Hamiltonian Monte Carlo (FA-HMC), for parameter estimation and uncertainty quantification. We establish rigorous convergence guarantees of FA-HMC on non-iid distributed data sets, under the strong convexity and Hessian smoothness assumptions. Our analysis investigates the ef…

    Submitted 9 July, 2024; originally announced July 2024.

  3. arXiv:2407.03868  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Observation of exceptional line semimetal in three-dimensional non-Hermitian phononic crystals

    Authors: Yejian Hu, Jien Wu, Peidong Ye, Weiyin Deng, Jiuyang Lu, Xueqin Huang, Ziyu Wang, Manzhu Ke, Zhengyou Liu

    Abstract: Non-Hermitian topological phases, which exhibit unique features such as skin effect and exceptional points originated from nontrivial band topologies in complex plane, have attracted enormous attention in condensed-matter physics and metamaterials. Here we report the realization of an exceptional line semimetal in a three-dimensional non-Hermitian phononic crystal. A pair of exceptional rings with…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 5 figures

  4. arXiv:2407.03598  [pdf, other]

    cs.CV

    ASteISR: Adapting Single Image Super-resolution Pre-trained Model for Efficient Stereo Image Super-resolution

    Authors: Yuanbo Zhou, Yuyang Xue, Wei Deng, Xinlin Zhang, Qinquan Gao, Tong Tong

    Abstract: Despite advances in the paradigm of pre-training then fine-tuning in low-level vision tasks, significant challenges persist particularly regarding the increased size of pre-trained models such as memory usage and training time. Another concern often encountered is the unsatisfying results yielded when directly applying pre-trained single-image models to multi-image domain. In this paper, we propos…

    Submitted 3 July, 2024; originally announced July 2024.

  5. arXiv:2407.02886  [pdf, other]

    cs.CR

    A Wolf in Sheep's Clothing: Practical Black-box Adversarial Attacks for Evading Learning-based Windows Malware Detection in the Wild

    Authors: Xiang Ling, Zhiyu Wu, Bin Wang, Wei Deng, Jingzheng Wu, Shouling Ji, Tianyue Luo, Yanjun Wu

    Abstract: Given the remarkable achievements of existing learning-based malware detection in both academia and industry, this paper presents MalGuise, a practical black-box adversarial attack framework that evaluates the security risks of existing learning-based Windows malware detection systems under the black-box setting. MalGuise first employs a novel semantics-preserving transformation of call-based redi…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: This paper has been accepted by 33rd USENIX Security Symposium 2024

  6. arXiv:2406.18957  [pdf, other]

    cs.DC cs.GT

    A Treatment of EIP-1559: Enhancing Transaction Fee Mechanism through Nth-Price Auction

    Authors: Kun Li, Guangpeng Qi, Guangyong Shang, Wanli Deng, Minghui Xu, Xiuzhen Cheng

    Abstract: With the widespread adoption of blockchain technology, the transaction fee mechanism (TFM) in blockchain systems has become a prominent research topic. An ideal TFM should satisfy user incentive compatibility (UIC), miner incentive compatibility (MIC), and miner-user side contract proofness ($c$-SCP). However, state-of-the-art works either fail to meet these three properties simultaneously or only…

    Submitted 27 June, 2024; originally announced June 2024.

  7. arXiv:2406.11147  [pdf, other]

    cs.SE cs.AI

    Vul-RAG: Enhancing LLM-based Vulnerability Detection via Knowledge-level RAG

    Authors: Xueying Du, Geng Zheng, Kaixin Wang, Jiayi Feng, Wentai Deng, Mingwei Liu, Bihuan Chen, Xin Peng, Tao Ma, Yiling Lou

    Abstract: Vulnerability detection is essential for software quality assurance. In recent years, deep learning models (especially large language models) have shown promise in vulnerability detection. In this work, we propose a novel LLM-based vulnerability detection technique Vul-RAG, which leverages knowledge-level retrieval-augmented generation (RAG) framework to detect vulnerability for the given code in…

    Submitted 19 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

  8. arXiv:2406.09908  [pdf, other]

    cs.LG cs.CV

    What Does Softmax Probability Tell Us about Classifiers Ranking Across Diverse Test Conditions?

    Authors: Weijie Tu, Weijian Deng, Liang Zheng, Tom Gedeon

    Abstract: This work aims to develop a measure that can accurately rank the performance of various classifiers when they are tested on unlabeled data from out-of-distribution (OOD) distributions. We commence by demonstrating that conventional uncertainty metrics, notably the maximum Softmax prediction probability, possess inherent utility in forecasting model generalization across certain OOD contexts. Build…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: TMLR 2024 (https://openreview.net/forum?id=vtiDUgGjyx)

  9. arXiv:2406.08772  [pdf, other]

    cs.CV cs.CL

    MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs

    Authors: Xuannan Liu, Zekun Li, Peipei Li, Shuhan Xia, Xing Cui, Linzhi Huang, Huaibo Huang, Weihong Deng, Zhaofeng He

    Abstract: Current multimodal misinformation detection (MMD) methods often assume a single source and type of forgery for each sample, which is insufficient for real-world scenarios where multiple forgery sources coexist. The lack of a benchmark for mixed-source misinformation has hindered progress in this field. To address this, we introduce MMFakeBench, the first comprehensive benchmark for mixed-source MM…

    Submitted 12 June, 2024; originally announced June 2024.

  10. arXiv:2405.19039  [pdf, other]

    hep-ph hep-ex hep-lat

    Heavy baryons in the relativized quark model with chromodynamics

    Authors: Xin-Zhen Weng, Wei-Zhen Deng, Shi-Lin Zhu

    Abstract: Following the work of Capstick and Isgur [Phys. Rev. D 34, 2809 (1986), https://doi.org/10.1103/PhysRevD.34.2809], we systematically study the mass spectrum of the heavy baryons in the relativized quark potential model with chromodynamics. Besides the original Godfrey-Isgur (GI) model, we also adopt a modified GI model which replaces the linear confinement by a screened one. The two models…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 20 pages, 10 figures

  11. arXiv:2405.18979  [pdf, other]

    cs.LG stat.ML

    MANO: Exploiting Matrix Norm for Unsupervised Accuracy Estimation Under Distribution Shifts

    Authors: Renchunzi Xie, Ambroise Odonnat, Vasilii Feofanov, Weijian Deng, Jianfeng Zhang, Bo An

    Abstract: Leveraging the models' outputs, specifically the logits, is a common approach to estimating the test accuracy of a pre-trained neural network on out-of-distribution (OOD) samples without requiring access to the corresponding ground truth labels. Despite their ease of implementation and computational efficiency, current logit-based methods are vulnerable to overconfidence issues, leading to predict…

    Submitted 24 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: The three first authors contributed equally

  12. arXiv:2405.16423  [pdf, other]

    gr-qc

    Energy flux and waveform of gravitational wave generated by coalescing slow-spinning binary system in effective one-body theory

    Authors: Weike Deng, Sheng Long, Jiliang Jing

    Abstract: We extend our research on the energy flux and waveform characteristics of gravitational waves generated by merging nonspinning binary black holes through self-consistent effective one-body theory \cite{L2023} to include binary systems with slowly spinning black holes. Initially, we decompose the equation for the null tetrad component of the gravitationally perturbed Weyl tensor $ψ^B_{4}$ into radi…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 38 pages, 2 figures

  13. arXiv:2405.14280  [pdf, other]

    cs.IR

    ASI++: Towards Distributionally Balanced End-to-End Generative Retrieval

    Authors: Yuxuan Liu, Tianchi Yang, Zihan Zhang, Minghui Song, Haizhen Huang, Weiwei Deng, Feng Sun, Qi Zhang

    Abstract: Generative retrieval, a promising new paradigm in information retrieval, employs a seq2seq model to encode document features into parameters and decode relevant document identifiers (IDs) based on search queries. Existing generative retrieval solutions typically rely on a preprocessing stage to pre-define document IDs, which can suffer from a semantic gap between these IDs and the retrieval task.…

    Submitted 23 May, 2024; originally announced May 2024.

  14. arXiv:2405.12130  [pdf, other]

    cs.CL cs.LG

    MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

    Authors: Ting Jiang, Shaohan Huang, Shengyue Luo, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang, Deqing Wang, Fuzhen Zhuang

    Abstract: Low-rank adaptation is a popular parameter-efficient fine-tuning method for large language models. In this paper, we analyze the impact of low-rank updating, as implemented in LoRA. Our findings suggest that the low-rank updating mechanism may limit the ability of LLMs to effectively learn and memorize new knowledge. Inspired by this observation, we propose a new method called MoRA, which employs…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Work in Progress

  15. arXiv:2405.07839  [pdf, other]

    cs.LG cs.AI stat.ML

    Constrained Exploration via Reflected Replica Exchange Stochastic Gradient Langevin Dynamics

    Authors: Haoyang Zheng, Hengrong Du, Qi Feng, Wei Deng, Guang Lin

    Abstract: Replica exchange stochastic gradient Langevin dynamics (reSGLD) is an effective sampler for non-convex learning in large-scale datasets. However, the simulation may encounter stagnation issues when the high-temperature chain delves too deeply into the distribution tails. To tackle this issue, we propose reflected reSGLD (r2SGLD): an algorithm tailored for constrained non-convex exploration by util…

    Submitted 3 June, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: 28 pages, 13 figures

  16. arXiv:2405.04795  [pdf, other]

    cs.LG

    Variational Schrödinger Diffusion Models

    Authors: Wei Deng, Weijian Luo, Yixin Tan, Marin Biloš, Yu Chen, Yuriy Nevmyvaka, Ricky T. Q. Chen

    Abstract: Schrödinger bridge (SB) has emerged as the go-to method for optimizing transportation plans in diffusion models. However, SB requires estimating the intractable forward score functions, inevitably resulting in the costly implicit training loss based on simulated trajectories. To improve the scalability while preserving efficient transportation plans, we leverage variational inference to linearize…

    Submitted 19 June, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  17. arXiv:2405.02241  [pdf, other]

    cs.RO

    WeightedPose: Generalizable Cross-Pose Estimation via Weighted SVD

    Authors: Xuxin Cheng, Heng Yu, Harry Zhang, Wenxing Deng

    Abstract: We introduce a new approach for robotic manipulation tasks in human settings that necessitates understanding the 3D geometric connections between a pair of objects. Conventional end-to-end training approaches, which convert pixel observations directly into robot actions, often fail to effectively understand complex pose relationships and do not easily adapt to new object configurations. To overcom…

    Submitted 21 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2211.09325

  18. arXiv:2404.17227  [pdf, other]

    econ.GN cs.CE cs.CR cs.CY q-fin.RM

    Trust Dynamics and Market Behavior in Cryptocurrency: A Comparative Study of Centralized and Decentralized Exchanges

    Authors: Xintong Wu, Wanling Deng, Yuotng Quan, Luyao Zhang

    Abstract: In the evolving landscape of digital finance, the transition from centralized to decentralized trust mechanisms, primarily driven by blockchain technology, plays a critical role in shaping the cryptocurrency ecosystem. This paradigm shift raises questions about the traditional reliance on centralized trust and introduces a novel, decentralized trust framework built upon distributed networks. Our r…

    Submitted 26 April, 2024; originally announced April 2024.

  19. arXiv:2404.16484  [pdf, other]

    cs.CV eess.IV

    Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey

    Authors: Marcos V. Conde, Zhijun Lei, Wen Li, Cosmin Stejerean, Ioannis Katsavounidis, Radu Timofte, Kihwan Yoon, Ganzorig Gankhuyag, Jiangtao Lv, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Zhiyuan Li, Hao Wei, Chenyang Ge, Dongyang Zhang, Tianle Liu, Huaian Chen, Yi Jin, Menghan Zhou, Yiqiang Yan, Si Gao, Biao Wu, Shaoli Liu, et al. (50 additional authors not shown)

    Abstract: This paper introduces a novel benchmark as part of the AIS 2024 Real-Time Image Super-Resolution (RTSR) Challenge, which aims to upscale compressed images from 540p to 4K resolution (4x factor) in real-time on commercial GPUs. For this, we use a diverse test set containing a variety of 4K images ranging from digital art to gaming and photography. The images are compressed using the modern AVIF cod…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: CVPR 2024, AI for Streaming (AIS) Workshop

  20. arXiv:2404.14248  [pdf, other]

    cs.CV

    NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results

    Authors: Xiaoning Liu, Zongwei Wu, Ao Li, Florin-Alexandru Vasluianu, Yulun Zhang, Shuhang Gu, Le Zhang, Ce Zhu, Radu Timofte, Zhi Jin, Hongjun Wu, Chenxi Wang, Haitao Ling, Yuanhao Cai, Hao Bian, Yuxin Zheng, Jing Lin, Alan Yuille, Ben Shao, Jin Guo, Tianli Liu, Mohao Wu, Yixu Feng, Shuo Hou, Haotian Lin, et al. (87 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 low light image enhancement challenge, highlighting the proposed solutions and results. The aim of this challenge is to discover an effective network design or solution capable of generating brighter, clearer, and visually appealing results when dealing with a variety of conditions, including ultra-high resolution (4K and beyond), non-uniform illumination, backlig…

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: NTIRE 2024 Challenge Report

  21. arXiv:2404.05412  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Valley edge states as bound states in the continuum

    Authors: Shunda Yin, Liping Ye, Hailong He, Xueqin Huang, Manzhu Ke, Weiyin Deng, Jiuyang Lu, Zhengyou Liu

    Abstract: Bound states in the continuum (BICs) are spatially localized states with energy embedded in the continuum spectrum of extended states. The combination of BICs physics and nontrivial band topology theory giving rise to topological BICs, which are robust against disorders and meanwhile of the merit of conventional BICs, is attracting wide attention recently. Here, we report valley edge states as top…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: A revised version has been accepted by Science Bulletin

  22. arXiv:2404.00563  [pdf, other]

    cs.CV

    Exploiting Inter-sample and Inter-feature Relations in Dataset Distillation

    Authors: Wenxiao Deng, Wenbin Li, Tianyu Ding, Lei Wang, Hongguang Zhang, Kuihua Huang, Jing Huo, Yang Gao

    Abstract: Dataset distillation has emerged as a promising approach in deep learning, enabling efficient training with small synthetic datasets derived from larger real ones. Particularly, distribution matching-based distillation methods attract attention thanks to its effectiveness and low computational cost. However, these methods face two primary limitations: the dispersed feature distribution within the…

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024

  23. arXiv:2403.19322  [pdf, other]

    cs.CV cs.CL

    Plug-and-Play Grounding of Reasoning in Multimodal Large Language Models

    Authors: Jiaxing Chen, Yuxuan Liu, Dehu Li, Xiang An, Weimo Deng, Ziyong Feng, Yongle Zhao, Yin Xie

    Abstract: The rise of Multimodal Large Language Models (MLLMs), renowned for their advanced instruction-following and reasoning capabilities, has significantly propelled the field of visual reasoning. However, due to limitations in their image tokenization processes, most MLLMs struggle to capture fine details of text and objects in images, especially in high-resolution samples. To overcome this limitation,…

    Submitted 18 June, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: 15 pages, 8 figures

  24. arXiv:2403.17752  [pdf, other]

    cs.CL

    Can multiple-choice questions really be useful in detecting the abilities of LLMs?

    Authors: Wangyue Li, Liangzhi Li, Tong Xiang, Xiao Liu, Wei Deng, Noa Garcia

    Abstract: Multiple-choice questions (MCQs) are widely used in the evaluation of large language models (LLMs) due to their simplicity and efficiency. However, there are concerns about whether MCQs can truly measure LLM's capabilities, particularly in knowledge-intensive scenarios where long-form generation (LFG) answers are required. The misalignment between the task and the evaluation method demands a thoug…

    Submitted 23 May, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: LREC-COLING 2024

  25. arXiv:2403.14760  [pdf, other]

    cs.CV

    Can 3D Vision-Language Models Truly Understand Natural Language?

    Authors: Weipeng Deng, Jihan Yang, Runyu Ding, Jiahui Liu, Yijiang Li, Xiaojuan Qi, Edith Ngai

    Abstract: Rapid advancements in 3D vision-language (3D-VL) tasks have opened up new avenues for human interaction with embodied agents or robots using natural language. Despite this progress, we find a notable limitation: existing 3D-VL models exhibit sensitivity to the styles of language input, struggling to understand sentences with the same semantic meaning but written in different variants. This observa…

    Submitted 3 July, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: https://github.com/VincentDENGP/3D-LR

  26. arXiv:2403.10873  [pdf, other]

    cs.IT eess.SP

    CSI Transfer From Sub-6G to mmWave: Reduced-Overhead Multi-User Hybrid Beamforming

    Authors: Weicao Deng, Min Li, Ming-Min Zhao, Min-Jian Zhao, Osvaldo Simeone

    Abstract: Hybrid beamforming is vital in modern wireless systems, especially for massive MIMO and millimeter-wave deployments, offering efficient directional transmission with reduced hardware complexity. However, effective beamforming in multi-user scenarios relies heavily on accurate channel state information, the acquisition of which often incurs excessive pilot overhead, degrading system performance. To…

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 13 pages, 12 figures, submitted

  27. arXiv:2403.09500  [pdf, other]

    cs.CV

    Faceptor: A Generalist Model for Face Perception

    Authors: Lixiong Qin, Mei Wang, Xuannan Liu, Yuhang Zhang, Wei Deng, Xiaoshuai Song, Weiran Xu, Weihong Deng

    Abstract: With the comprehensive research conducted on various face analysis tasks, there is a growing interest among researchers to develop a unified approach to face perception. Existing methods mainly discuss unified representation and training, which lack task extensibility and application efficiency. To tackle this issue, we focus on the unified model structure, exploring a face generalist model. As an…

    Submitted 14 March, 2024; originally announced March 2024.

  28. Energy flux and waveforms by coalescing spinless binary system in effective one-body theory

    Authors: Sheng Long, Weike Deng, Jiliang Jing

    Abstract: We present a study on the energy radiation rate and waveforms of the gravitational wave generated by coalescing spinless binary systems up to the third post-Minkowskian approximation in the effective one-body theory. To derive an analytical expansion of the null tetrad components of the gravitational perturbed Weyl tensor $\varPsi_{4}$ in the effective spacetime, we utilize the method proposed by…

    Submitted 9 May, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: 33 pages, 5 figures

    Journal ref: SCIENCE CHINA Physics, Mechanics & Astronomy, 2024, V67, 260412

  29. arXiv:2403.08497  [pdf, other]

    math.AG

    Viro's patchworking and the signed reduced A-discriminant

    Authors: Weixun Deng, J. Maurice Rojas, Máté L. Telek

    Abstract: Computing the isotopy type of a hypersurface, defined as the positive real zero set of a multivariate polynomial, is a challenging problem in real algebraic geometry. We focus on the case where the defining polynomial has combinatorially restricted exponent vectors and fixed coefficient signs, enabling faster computation of the isotopy type. In particular, Viro's patchworking provides a polyhedral…

    Submitted 13 March, 2024; originally announced March 2024.

  30. arXiv:2403.06529  [pdf, other]

    cs.CV

    Confidence-Aware RGB-D Face Recognition via Virtual Depth Synthesis

    Authors: Zijian Chen, Mei Wang, Weihong Deng, Hongzhi Shi, Dongchao Wen, Yingjie Zhang, Xingchen Cui, Jian Zhao

    Abstract: 2D face recognition encounters challenges in unconstrained environments due to varying illumination, occlusion, and pose. Recent studies focus on RGB-D face recognition to improve robustness by incorporating depth information. However, collecting sufficient paired RGB-D training data is expensive and time-consuming, hindering wide deployment. In this work, we first construct a diverse depth datase…

    Submitted 16 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 9 pages, 5 figures

  31. arXiv:2403.06104  [pdf, other]

    cs.CV

    Debiased Noise Editing for Fair Medical Image Classification

    Authors: Ruinan Jin, Wenlong Deng, Minghui Chen, Xiaoxiao Li

    Abstract: In the era of Foundation Models' (FMs) rising prominence in AI, our study addresses the challenge of biases in medical images while the model operates in black-box (e.g., using FM API), particularly spurious correlations between pixels and sensitive attributes. Traditional methods for bias mitigation face limitations due to the restricted access to web-hosted FMs and difficulties in addressing the…

    Submitted 10 July, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

    Comments: 13 pages, 3 figures. Accepted by MICCAI 2024

  32. arXiv:2403.05523  [pdf, other]

    cs.CV

    Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapolation

    Authors: Yijiang Li, Sucheng Ren, Weipeng Deng, Yuzhi Xu, Ying Gao, Edith Ngai, Haohan Wang

    Abstract: Out-of-distribution (OOD) generalization is a favorable yet challenging property for deep neural networks. The core challenges lie in the limited availability of source domains that help models learn an invariant representation from the spurious features. Various domain augmentation have been proposed but largely rely on interpolating existing domains and frequently face difficulties in creating t…

    Submitted 11 March, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: Preprint. Paper under review

  33. arXiv:2403.01988  [pdf, other]

    cs.CL

    FakeNewsGPT4: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs

    Authors: Xuannan Liu, Peipei Li, Huaibo Huang, Zekun Li, Xing Cui, Jiahao Liang, Lixiong Qin, Weihong Deng, Zhaofeng He

    Abstract: The massive generation of multimodal fake news exhibits substantial distribution discrepancies, prompting the need for generalized detectors. However, the insulated nature of training within specific domains restricts the capability of classical detectors to obtain open-world facts. In this paper, we propose FakeNewsGPT4, a novel framework that augments Large Vision-Language Models (LVLMs) with fo…

    Submitted 4 March, 2024; originally announced March 2024.

  34. arXiv:2402.18039  [pdf, other]

    cs.CL cs.AI

    ResLoRA: Identity Residual Mapping in Low-Rank Adaption

    Authors: Shuhua Shi, Shaohan Huang, Minghui Song, Zhoujun Li, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

    Abstract: As one of the most popular parameter-efficient fine-tuning (PEFT) methods, low-rank adaptation (LoRA) is commonly applied to fine-tune large language models (LLMs). However, updating the weights of LoRA blocks effectively and expeditiously is challenging due to the long calculation path in the original model. To address this, we propose ResLoRA, an improved framework of LoRA. By adding residual pa…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 14 pages, 7 figures

  35. arXiv:2402.15754  [pdf, other]

    cs.CL

    HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition

    Authors: Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

    Abstract: Large language models (LLMs) have emerged as a promising alternative to expensive human evaluations. However, the alignment and coverage of LLM-based evaluations are often limited by the scope and potential bias of the evaluation prompts and criteria. To address this challenge, we propose HD-Eval, a novel framework that iteratively aligns LLM-based evaluators with human preference via Hierarchical…

    Submitted 24 February, 2024; originally announced February 2024.

    Comments: 20 pages, 13 figures

  36. arXiv:2402.14843  [pdf, other]

    cs.CL cs.AI cs.LG

    Text Diffusion with Reinforced Conditioning

    Authors: Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

    Abstract: Diffusion models have demonstrated exceptional capability in generating high-quality images, videos, and audio. Due to their adaptiveness in iterative refinement, they provide a strong potential for achieving better non-autoregressive sequence generation. However, existing text diffusion models still fall short in their performance due to a challenge in handling the discreteness of language. This…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 9 pages, 3 figures

  37. arXiv:2402.14208  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG

    LLM-Assisted Content Conditional Debiasing for Fair Text Embedding

    Authors: Wenlong Deng, Blair Chen, Beidi Zhao, Chiyu Zhang, Xiaoxiao Li, Christos Thrampoulidis

    Abstract: Mitigating biases in machine learning models has become an increasing concern in Natural Language Processing (NLP), particularly in developing fair text embeddings, which are crucial yet challenging for real-world applications like search engines. In response, this paper proposes a novel method for learning fair text embeddings. First, we define a novel content-conditional equal distance (CCED) fa…

    Submitted 24 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  38. arXiv:2402.13874  [pdf, other]

    cs.CL

    $Se^2$: Sequential Example Selection for In-Context Learning

    Authors: Haoyu Liu, Jianfeng Liu, Shaohan Huang, Yuefeng Zhan, Hao Sun, Weiwei Deng, Furu Wei, Qi Zhang

    Abstract: The remarkable capability of large language models (LLMs) for in-context learning (ICL) needs to be activated by demonstration examples. Prior work has extensively explored the selection of examples for ICL, predominantly following the "select then organize" paradigm, such approaches often neglect the internal relationships between examples and exist an inconsistency between the training and infer…

    Submitted 6 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted by ACL 2024 Findings

  39. arXiv:2402.10924  [pdf]

    physics.app-ph

    Compact Ka-Band Metalens Antenna Enabled by Physics Assisted Particle Swarm Optimization (PA-PSO) Algorithm

    Authors: Shibin Jiang, Wenjun Deng, Weiming Zhu

    Abstract: The design of multiple-feed lens antennas requires multivariate and multi-objective optimization processes, which can be accelerated by PSO algorithms. However, the PSO algorithm often fails to achieve optimal results with limited computation resources since the spaces of candidate solutions are quite large for lens antenna designs. This paper presents a design paradigm for multiple-feed lens ante…

    Submitted 29 January, 2024; originally announced February 2024.

  40. arXiv:2402.10797  [pdf, other]

    cs.MS cs.LG stat.CO stat.ML

    BlackJAX: Composable Bayesian inference in JAX

    Authors: Alberto Cabezas, Adrien Corenflos, Junpeng Lao, Rémi Louf, Antoine Carnec, Kaustubh Chaudhari, Reuben Cohn-Gordon, Jeremie Coullon, Wei Deng, Sam Duffield, Gerardo Durán-Martín, Marcin Elantkowski, Dan Foreman-Mackey, Michele Gregori, Carlos Iguaran, Ravin Kumar, Martin Lysy, Kevin Murphy, Juan Camilo Orduz, Karm Patel, Xi Wang, Rob Zinkov

    Abstract: BlackJAX is a library implementing sampling and variational inference algorithms commonly used in Bayesian computation. It is designed for ease of use, speed, and modularity by taking a functional approach to the algorithms' implementation. BlackJAX is written in Python, using JAX to compile and run NumPy-like samplers and variational methods on CPUs, GPUs, and TPUs. The library integrates well w…

    Submitted 22 February, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: Companion paper for the library https://github.com/blackjax-devs/blackjax Update: minor changes and updated the list of authors to include technical contributors

  41. arXiv:2402.07417  [pdf, other]

    cs.CV cs.LG

    An Empirical Study Into What Matters for Calibrating Vision-Language Models

    Authors: Weijie Tu, Weijian Deng, Dylan Campbell, Stephen Gould, Tom Gedeon

    Abstract: Vision-Language Models (VLMs) have emerged as the dominant approach for zero-shot recognition, adept at handling diverse scenarios and significant distribution changes. However, their deployment in risk-sensitive areas requires a deeper understanding of their uncertainty estimation capabilities, a relatively uncharted area. In this study, we explore the calibration properties of VLMs across differ…

    Submitted 14 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: ICML 2024 Camera Ready

  42. arXiv:2402.07410  [pdf, other]

    cs.CV cs.LG

    A Closer Look at the Robustness of Contrastive Language-Image Pre-Training (CLIP)

    Authors: Weijie Tu, Weijian Deng, Tom Gedeon

    Abstract: Contrastive Language-Image Pre-training (CLIP) models have demonstrated remarkable generalization capabilities across multiple challenging distribution shifts. However, there is still much to be explored in terms of their robustness to the variations of specific visual factors. In real-world applications, reliable and safe systems must consider other safety objectives beyond classification accurac…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted by NeurIPS 2023

  43. arXiv:2402.02935  [pdf, other]

    nucl-th astro-ph.SR nucl-ex

    Nuclear mass table in deformed relativistic Hartree-Bogoliubov theory in continuum, II: Even-$Z$ nuclei

    Authors: DRHBc Mass Table Collaboration, Peng Guo, Xiaojie Cao, Kangmin Chen, Zhihui Chen, Myung-Ki Cheoun, Yong-Beom Choi, Pak Chung Lam, Wenmin Deng, Jianmin Dong, Pengxiang Du, Xiaokai Du, Kangda Duan, Xiaohua Fan, Wei Gao, Lisheng Geng, Eunja Ha, Xiao-Tao He, Jinniu Hu, Jingke Huang, Kun Huang, Yanan Huang, Zidan Huang, Kim Da Hyung, Hoi Yat Chan, et al. (58 additional authors not shown)

    Abstract: The mass table in the deformed relativistic Hartree-Bogoliubov theory in continuum (DRHBc) with the PC-PK1 density functional has been established for even-$Z$ nuclei with $8\le Z\le120$, extended from the previous work for even-even nuclei [Zhang et al. (DRHBc Mass Table Collaboration), At. Data Nucl. Data Tables 144, 101488 (2022)]. The calculated binding energies, two-nucleon and one-ne…

    Submitted 10 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 394 pages, 17 figures, 2 tables, published in Atomic Data and Nuclear Data Tables, data file in the TXT form is available for download under "Ancillary files"

    Journal ref: Peng Guo et al. (DRHBc Mass Table Collaboration), Atomic Data and Nuclear Data Tables 158 (2024) 101661

  44. arXiv:2401.16729  [pdf, other]

    cs.LG

    Widely Linear Matched Filter: A Lynchpin towards the Interpretability of Complex-valued CNNs

    Authors: Qingchen Wang, Zhe Li, Zdenka Babic, Wei Deng, Ljubiša Stanković, Danilo P. Mandic

    Abstract: A recent study on the interpretability of real-valued convolutional neural networks (CNNs) {Stankovic_Mandic_2023CNN} has revealed a direct and physically meaningful link with the task of finding features in data through matched filters. However, applying this paradigm to illuminate the interpretability of complex-valued CNNs meets a formidable obstacle: the extension of matched filtering to a gen…

    Submitted 31 January, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  45. arXiv:2401.15897  [pdf, other]

    cs.CY cs.HC cs.LG

    Red-Teaming for Generative AI: Silver Bullet or Security Theater?

    Authors: Michael Feffer, Anusha Sinha, Wesley Hanwen Deng, Zachary C. Lipton, Hoda Heidari

    Abstract: In response to rising concerns surrounding the safety, security, and trustworthiness of Generative AI (GenAI) models, practitioners and regulators alike have pointed to AI red-teaming as a key component of their strategies for identifying and mitigating these risks. However, despite AI red-teaming's central role in policy discussions and corporate messaging, significant questions remain about what…

    Submitted 15 May, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  46. Extremely intrinsic chirality in two-dimensional planar waveguide grating induced by quasi-bound states in the continuum

    Authors: Dandan Zhang, Tingting Liu, Linlin Lei, Weimin Deng, Tongbiao Wang, Qinghua Liao, Wenxing Liu, Shuyuan Xiao, Tianbao Yu

    Abstract: The strong chiral light-matter interaction is crucial for various important fields such as chiral optics, quantum optics, and biomedical optics, driving a quest for the extreme intrinsic chirality assisted by ultrahigh quality ($Q$-) factor resonances. In this quest, we propose a straightforward method to achieve extreme intrinsic chirality in lossless planar structures by manipulating the quasi-B…

    Submitted 28 January, 2024; originally announced January 2024.

    Journal ref: Physical Review B 109 (20), 205403 (2024)

  47. arXiv:2401.13154  [pdf, other]

    cs.OS

    Nomad: Non-Exclusive Memory Tiering via Transactional Page Migration

    Authors: Lingfeng Xiang, Zhen Lin, Weishu Deng, Hui Lu, Jia Rao, Yifan Yuan, Ren Wang

    Abstract: With the advent of byte-addressable memory devices, such as CXL memory, persistent memory, and storage-class memory, tiered memory systems have become a reality. Page migration is the de facto method within operating systems for managing tiered memory. It aims to bring hot data whenever possible into fast memory to optimize the performance of data accesses while using slow memory to accommodate da…

    Submitted 17 June, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  48. arXiv:2401.12507  [pdf, other]

    cs.CV

    Open-Set Facial Expression Recognition

    Authors: Yuhang Zhang, Yue Yao, Xuannan Liu, Lixiong Qin, Wenjing Wang, Weihong Deng

    Abstract: Facial expression recognition (FER) models are typically trained on datasets with a fixed number of seven basic classes. However, recent research works point out that there are far more expressions than the basic ones. Thus, when these models are deployed in the real world, they may encounter unknown classes, such as compound expressions that cannot be classified into existing basic classes. To ad…

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted by AAAI 2024

  49. arXiv:2401.11665  [pdf, other]

    stat.ML cs.AI cs.LG

    Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo

    Authors: Haoyang Zheng, Wei Deng, Christian Moya, Guang Lin

    Abstract: Approximate Thompson sampling with Langevin Monte Carlo broadens its reach from Gaussian posterior sampling to encompass more general smooth posteriors. However, it still encounters scalability issues in high-dimensional problems when demanding high accuracy. To address this, we propose an approximate Thompson sampling strategy, utilizing underdamped Langevin Monte Carlo, where the latter is the g…

    Submitted 20 June, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

    Comments: 52 pages, 2 figures

  50. arXiv:2401.09119  [pdf, other]

    eess.SP

    Anchor-points Assisted Uplink Sensing in Perceptive Mobile Networks

    Authors: Yanmo Hu, J. Andrew Zhang, Weibo Deng, Y. Jay Guo

    Abstract: Uplink sensing in integrated sensing and communications (ISAC) systems, such as Perceptive Mobile Networks, is challenging due to the clock asynchronism between transmitter and receiver. Existing solutions typically require the presence of a dominating line-of-sight path and the knowledge of transmitter location at the receiver. In this paper, relaxing these requirements, we propose a novel and ef…

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: 14 pages, 12 figures, journal paper