Showing 1–50 of 339 results for author: Du, W

  1. arXiv:2407.09001  [pdf]

    cond-mat.mtrl-sci

    Coupling multi-space topologies in 2D ferromagnetic lattice

    Authors: Zhonglin He, Wenhui Du, Kaiying Dou, Ying Dai, Baibiao Huang, Yandong Ma

    Abstract: Topology can manifest topological magnetism (e.g., skyrmion and bimeron) in real space and quantum anomalous Hall (QAH) state in momentum space, which have changed the modern conceptions of matter phase. While the topologies in different spaces are widely studied separately, their coexistence and coupling in a single phase is seldom explored. Here, we report a novel phenomenon that arises from the…

    Submitted 12 July, 2024; originally announced July 2024.

  2. arXiv:2407.07531  [pdf, other]

    cs.CL

    Beyond Benchmarking: A New Paradigm for Evaluation and Assessment of Large Language Models

    Authors: Jin Liu, Qingquan Li, Wenlong Du

    Abstract: In current benchmarks for evaluating large language models (LLMs), there are issues such as evaluation content restriction, untimely updates, and lack of optimization guidance. In this paper, we propose a new paradigm for the measurement of LLMs: Benchmarking-Evaluation-Assessment. Our paradigm shifts the "location" of LLM evaluation from the "examination room" to the "hospital". Through conductin…

    Submitted 10 July, 2024; originally announced July 2024.

  3. arXiv:2407.04115  [pdf, other]

    cs.RO

    LiDAR-based Real-Time Object Detection and Tracking in Dynamic Environments

    Authors: Wenqiang Du, Giovanni Beltrame

    Abstract: In dynamic environments, the ability to detect and track moving objects in real-time is crucial for autonomous robots to navigate safely and effectively. Traditional methods for dynamic object detection rely on high accuracy odometry and maps to detect and track moving objects. However, these methods are not suitable for long-term operation in dynamic environments where the surrounding environment…

    Submitted 4 July, 2024; originally announced July 2024.

  4. MARLP: Time-series Forecasting Control for Agricultural Managed Aquifer Recharge

    Authors: Yuning Chen, Kang Yang, Zhiyu An, Brady Holder, Luke Paloutzian, Khaled Bali, Wan Du

    Abstract: The rapid decline in groundwater around the world poses a significant challenge to sustainable agriculture. To address this issue, agricultural managed aquifer recharge (Ag-MAR) is proposed to recharge the aquifer by artificially flooding agricultural lands using surface water. Ag-MAR requires a carefully selected flooding schedule to avoid affecting the oxygen absorption of crop roots. However, c…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Accepted by KDD 2024

  5. arXiv:2406.17245  [pdf, other]

    cs.LG cs.AI cs.CL

    Unlocking Continual Learning Abilities in Language Models

    Authors: Wenyu Du, Shuang Cheng, Tongxu Luo, Zihan Qiu, Zeyu Huang, Ka Chun Cheung, Reynold Cheng, Jie Fu

    Abstract: Language models (LMs) exhibit impressive performance and generalization capabilities. However, LMs struggle with the persistent challenge of catastrophic forgetting, which undermines their long-term sustainability in continual learning (CL). Existing approaches usually address the issue by incorporating old task data or task-wise inductive bias into LMs. However, old data and accurate task informa…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: preprint, 19 pages

  6. arXiv:2406.16571  [pdf, other]

    math.OC cs.AI cs.LG eess.SY

    Differentiable Distributionally Robust Optimization Layers

    Authors: Xutao Ma, Chao Ning, Wenli Du

    Abstract: In recent years, there has been a growing research interest in decision-focused learning, which embeds optimization problems as a layer in learning pipelines and demonstrates performance superior to the prediction-focused approach. However, for distributionally robust optimization (DRO), a popular paradigm for decision-making under uncertainty, it is still unknown how to embed it as a layer, i…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: In Forty-first International Conference on Machine Learning (2024)

  7. arXiv:2406.12747  [pdf, other]

    cs.LG cs.AI

    TSI-Bench: Benchmarking Time Series Imputation

    Authors: Wenjie Du, Jun Wang, Linglong Qian, Yiyuan Yang, Fanxing Liu, Zepu Wang, Zina Ibrahim, Haoxin Liu, Zhiyuan Zhao, Yingjie Zhou, Wenjia Wang, Kaize Ding, Yuxuan Liang, B. Aditya Prakash, Qingsong Wen

    Abstract: Effective imputation is a crucial preprocessing step for time series analysis. Despite the development of numerous deep learning algorithms for time series imputation, the community lacks standardized and comprehensive benchmark platforms to effectively evaluate imputation performance across different settings. Moreover, although many deep learning forecasting algorithms have demonstrated excellen…

    Submitted 18 June, 2024; originally announced June 2024.

  8. arXiv:2406.12206  [pdf, other]

    astro-ph.SR astro-ph.CO astro-ph.GA

    The Absolute Age of NGC 3201 derived from Detached Eclipsing Binaries and the Hess Diagram

    Authors: Jiaqi Ying, Brian Chaboyer, Wenxin Du

    Abstract: We estimate the absolute age of the globular cluster NGC 3201 using $10,000$ sets of theoretical isochrones constructed through Monte Carlo simulation using the Dartmouth Stellar Evolution Program. These isochrones take into consideration the uncertainty introduced by the choice of stellar evolution parameters. We fit isochrones with 3 detached eclipsing binaries and obtained an age independent of…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 14 pages, 10 figures, 3 Tables; Accepted for Publication ApJ

  9. arXiv:2406.11906  [pdf, other]

    q-bio.QM cs.AI

    NovoBench: Benchmarking Deep Learning-based De Novo Peptide Sequencing Methods in Proteomics

    Authors: Jingbo Zhou, Shaorong Chen, Jun Xia, Sizhe Liu, Tianze Ling, Wenjie Du, Yue Liu, Jianwei Yin, Stan Z. Li

    Abstract: Tandem mass spectrometry has played a pivotal role in advancing proteomics, enabling the high-throughput analysis of protein composition in biological tissues. Many deep learning methods have been developed for the \emph{de novo} peptide sequencing task, i.e., predicting the peptide sequence for the observed mass spectrum. However, two key challenges seriously hinder the further advancement of this im…

    Submitted 16 June, 2024; originally announced June 2024.

  10. arXiv:2406.11231  [pdf, other]

    cs.RO cs.AI cs.CL cs.LG

    Enabling robots to follow abstract instructions and complete complex dynamic tasks

    Authors: Ruaridh Mon-Williams, Gen Li, Ran Long, Wenqian Du, Chris Lucas

    Abstract: Completing complex tasks in unpredictable settings like home kitchens challenges robotic systems. These challenges include interpreting high-level human commands, such as "make me a hot beverage" and performing actions like pouring a precise amount of water into a moving mug. To address these challenges, we present a novel framework that combines Large Language Models (LLMs), a curated Knowledge B…

    Submitted 17 June, 2024; originally announced June 2024.

  11. arXiv:2406.09209  [pdf, other]

    astro-ph.CO gr-qc hep-th

    Acceleration of the Universe without the Hubble tension with Kaniadakis holographic dark energy using the Hubble horizon as the IR cut-off

    Authors: Wei Fang, Guo Chen, Chao-Jun Feng, Wei Du, Chenggang Shu

    Abstract: We introduce a holographic dark energy model that incorporates the first-order approximate Kaniadakis entropy, utilizing the Hubble horizon, $1/H$, as the infrared cutoff. We investigate the cosmological evolution within this framework. The model introduces an extra parameter relative to the $\Lambda$CDM model. It posits a Universe that is initially dominated by dark matter, which then evolves to a phas…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 9 pages, 5 figures

  12. arXiv:2406.06652  [pdf, other]

    cs.LG cs.AI

    Improving Generalization of Neural Vehicle Routing Problem Solvers Through the Lens of Model Architecture

    Authors: Yubin Xiao, Di Wang, Xuan Wu, Yuesong Wu, Boyang Li, Wei Du, Liupu Wang, You Zhou

    Abstract: Neural models produce promising results when solving Vehicle Routing Problems (VRPs), but often fall short in generalization. Recent attempts to enhance model generalization often incur unnecessarily large training cost or cannot be directly applied to other models solving different VRP variants. To address these issues, we take a novel perspective on model architecture in this study. Specifically…

    Submitted 17 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: 13 pages, 6 figures, and 6 tables

  13. arXiv:2406.01719  [pdf, other]

    astro-ph.IM astro-ph.GA

    Imputation of Missing Photometric Data and Photometric Redshift Estimation for CSST

    Authors: Zhijian Luo, Zhirui Tang, Zhu Chen, Liping Fu, Wei Du, Shaohua Zhang, Yan Gong, Chenggang Shu, Junhao Lu, Yicheng Li, Xian-Min Meng, Xingchen Zhou, Zuhui Fan

    Abstract: Accurate photometric redshift (photo-$z$) estimation requires support from multi-band observational data. However, in the actual process of astronomical observations and data processing, some sources may have missing observational data in certain bands for various reasons. This could greatly affect the accuracy and reliability of photo-$z$ estimation for these sources, and even render some estimat…

    Submitted 3 June, 2024; originally announced June 2024.

  14. arXiv:2405.17508  [pdf, other]

    cs.LG stat.ML

    Unveiling the Secrets: How Masking Strategies Shape Time Series Imputation

    Authors: Linglong Qian, Zina Ibrahim, Wenjie Du, Yiyuan Yang, Richard JB Dobson

    Abstract: In this study, we explore the impact of different masking strategies on time series imputation models. We evaluate the effects of pre-masking versus in-mini-batch masking, normalization timing, and the choice between augmenting and overlaying artificial missingness. Using three diverse datasets, we benchmark eleven imputation models with different missing rates. Our results demonstrate that maskin…

    Submitted 26 May, 2024; originally announced May 2024.

  15. arXiv:2405.15319  [pdf, other]

    cs.CL cs.AI

    Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training

    Authors: Wenyu Du, Tongxu Luo, Zihan Qiu, Zeyu Huang, Yikang Shen, Reynold Cheng, Yike Guo, Jie Fu

    Abstract: LLMs are computationally expensive to pre-train due to their large scale. Model growth emerges as a promising approach by leveraging smaller models to accelerate the training of larger ones. However, the viability of these model growth methods in efficient LLM pre-training remains underexplored. This work identifies three critical $\underline{\textit{O}}$bstacles: ($\textit{O}$1) lack of comprehen…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Preprint; the project link: https://llm-stacking.github.io/

  16. arXiv:2405.13761  [pdf]

    cond-mat.mtrl-sci

    Monolithic Germanium Tin on Si Avalanche Photodiodes

    Authors: Justin Rudie, Sylvester Amoah, Xiaoxin Wang, Rajesh Kumar, Grey Abernathy, Steven Akwabli, Perry C. Grant, Jifeng Liu, Baohua Li, Wei Du, Shui-Qing Yu

    Abstract: We demonstrate monolithically grown germanium-tin (GeSn) on silicon avalanche photodiodes (APDs) for infrared light detection. A relatively thin Ge buffer design was adopted to allow effective photo carriers to transport from the GeSn absorber to the Si multiplication layer such that clear punch-through behavior and a saturated primary responsivity of 0.3 A/W at 1550 nm were observed before ava…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 8 pages, 5 figures, invited paper

  17. arXiv:2405.13401  [pdf, ps, other]

    cs.CR cs.CL

    TrojanRAG: Retrieval-Augmented Generation Can Be Backdoor Driver in Large Language Models

    Authors: Pengzhou Cheng, Yidong Ding, Tianjie Ju, Zongru Wu, Wei Du, Ping Yi, Zhuosheng Zhang, Gongshen Liu

    Abstract: Large language models (LLMs) have raised concerns about potential security threats despite performing significantly in Natural Language Processing (NLP). Backdoor attacks initially verified that LLM is doing substantial harm at all stages, but the cost and robustness have been criticized. Attacking LLMs is inherently risky in security review, while prohibitively expensive. Besides, the continuous…

    Submitted 7 July, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: 19 pages, 14 figures, 4 tables

  18. arXiv:2405.10163  [pdf]

    physics.optics physics.app-ph

    Electrically Injected mid-infrared GeSn laser on Si operating at 140 K

    Authors: Sudip Acharya, Hryhorii Stanchu, Rajesh Kumar, Solomon Ojo, Mourad Benamara, Guo-En Chang, Baohua Li, Wei Du, Shui-Qing Yu

    Abstract: Owing to their true direct bandgap and tunable bandgap energies, GeSn alloys are increasingly attractive as gain media for mid-IR lasers that can be monolithically integrated on Si. Demonstrations of optically pumped GeSn lasers at room temperature under pulsed conditions and at cryogenic temperature under continuous-wave excitation show great promise of GeSn lasers to be efficient electrically injected light sour…

    Submitted 16 May, 2024; originally announced May 2024.

  19. Low surface brightness galaxies from BASS+MzLS with Machine Learning

    Authors: Peng-Liang Du, Wei Du, Bing-Qing Zhang, Zhen-Ping Yi, Min He, Hong Wu

    Abstract: From $\sim$ 5000 deg$^{2}$ of the combination of the Beijing-Arizona Sky Survey (BASS) and Mayall $z$-band Legacy Survey (MzLS) which is also the northern sky region of the Dark Energy Spectroscopic Instrument (DESI) Legacy Imaging Surveys, we selected a sample of 31,825 candidates of low surface brightness galaxies (LSBGs) with the mean effective surface brightness 24.2 $< \bar{\mu}_{\rm eff,g} <$ 28…

    Submitted 29 April, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

    Comments: 20 pages, 11 figures, 1 table, accepted by Research in Astronomy and Astrophysics

  20. arXiv:2404.10515  [pdf, other]

    cs.NE

    An Enhanced Differential Grouping Method for Large-Scale Overlapping Problems

    Authors: Maojiang Tian, Mingke Chen, Wei Du, Yang Tang, Yaochu Jin

    Abstract: Large-scale overlapping problems are prevalent in practical engineering applications, and the optimization challenge is significantly amplified due to the existence of shared variables. Decomposition-based cooperative coevolution (CC) algorithms have demonstrated promising performance in addressing large-scale overlapping problems. However, current CC frameworks designed for overlapping problems r…

    Submitted 16 April, 2024; originally announced April 2024.

  21. arXiv:2404.08169  [pdf, other]

    stat.ME

    AutoGFI: Streamlined Generalized Fiducial Inference for Modern Inference Problems

    Authors: Wei Du, Jan Hannig, Thomas C. M. Lee, Yi Su, Chunzhe Zhang

    Abstract: The origins of fiducial inference trace back to the 1930s when R. A. Fisher first introduced the concept as a response to what he perceived as a limitation of Bayesian inference - the requirement for a subjective prior distribution on model parameters in cases where no prior information was available. However, Fisher's initial fiducial approach fell out of favor as complications arose, particularl…

    Submitted 11 April, 2024; originally announced April 2024.

  22. arXiv:2404.00819  [pdf, other]

    quant-ph hep-th nucl-th

    Ultra-relativistic quark-nucleus scattering on quantum computers

    Authors: Sihao Wu, Weijie Du, Xingbo Zhao, James P. Vary

    Abstract: Quantum computing provides a promising approach for solving the real-time dynamics of systems consisting of quarks and gluons from first-principle calculations that are intractable with classical computers. In this work, we start with an initial problem of the ultra-relativistic quark-nucleus scattering and present an efficient and precise approach to quantum simulate the dynamics on the light front.…

    Submitted 15 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: 27 pages, 11 figures. Comments are welcome

  23. arXiv:2404.00555  [pdf, other]

    astro-ph.GA

    Gas-rich Ultra-diffuse Galaxies Are Originated from High Specific Angular Momentum

    Authors: Yu Rong, Huijie Hu, Min He, Wei Du, Qi Guo, Hui-Yuan Wang, Hong-Xin Zhang, Houjun Mo

    Abstract: Ultra-diffuse galaxies, characterized by comparable effective radii to the Milky Way but possessing 100-1,000 times fewer stars, offer a unique opportunity to garner novel insights into the mechanisms governing galaxy formation. Nevertheless, the existing corpus of observational and simulation studies has not yet yielded a definitive constraint or comprehensive consensus on the formation mechanism…

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: comments welcome

  24. arXiv:2403.15393  [pdf, other]

    cs.CL cs.LG cs.SI

    Detection of Opioid Users from Reddit Posts via an Attention-based Bidirectional Recurrent Neural Network

    Authors: Yuchen Wang, Zhengyu Fang, Wei Du, Shuai Xu, Rong Xu, Jing Li

    Abstract: The opioid epidemic, referring to the growing hospitalizations and deaths because of overdose of opioid usage and addiction, has become a severe health problem in the United States. Many strategies have been developed by the federal and local governments and health communities to combat this crisis. Among them, improving our understanding of the epidemic through better health surveillance is one o…

    Submitted 9 February, 2024; originally announced March 2024.

  25. arXiv:2403.12130  [pdf, other]

    astro-ph.GA

    Almost Optically Dark Galaxies in DECaLS (I): Detection, Optical Properties and Possible Origins

    Authors: Lin Du, Wei Du, Cheng Cheng, Ming Zhu, Haiyang Yu, Hong Wu

    Abstract: We report the discovery of eight optical counterparts of ALFALFA extragalactic objects from DECaLS, five of which are discovered for the first time. These objects were flagged as HI emission sources with no optical counterparts in SDSS before. Multi-band data reveal their unusual physical properties. They are faint and blue ($g-r=-0.35\sim0.55$), with quite low surface brightness (…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 32 pages, 11 figures, accepted by the Astrophysical Journal

  26. arXiv:2403.07013  [pdf, other]

    q-bio.QM cs.LG q-bio.BM

    AdaNovo: Adaptive \emph{De Novo} Peptide Sequencing with Conditional Mutual Information

    Authors: Jun Xia, Shaorong Chen, Jingbo Zhou, Tianze Ling, Wenjie Du, Sizhe Liu, Stan Z. Li

    Abstract: Tandem mass spectrometry has played a pivotal role in advancing proteomics, enabling the analysis of protein composition in biological samples. Despite the development of various deep learning methods for identifying amino acid sequences (peptides) responsible for observed spectra, challenges persist in \emph{de novo} peptide sequencing. Firstly, prior methods struggle to identify amino acids with…

    Submitted 15 March, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  27. arXiv:2403.03425  [pdf, other]

    cs.LG physics.chem-ph q-bio.BM

    Sculpting Molecules in 3D: A Flexible Substructure Aware Framework for Text-Oriented Molecular Optimization

    Authors: Kaiwei Zhang, Yange Lin, Guangcheng Wu, Yuxiang Ren, Xuecang Zhang, Bo Wang, Xiaoyu Zhang, Weitao Du

    Abstract: The integration of deep learning, particularly AI-Generated Content, with high-quality data derived from ab initio calculations has emerged as a promising avenue for transforming the landscape of scientific research. However, the challenge of designing molecular drugs or materials that incorporate multi-modality prior knowledge remains a critical and complex undertaking. Specifically, achieving a…

    Submitted 5 March, 2024; originally announced March 2024.

  28. arXiv:2403.01192  [pdf, other]

    math.OC cs.LG cs.NE

    A Composite Decomposition Method for Large-Scale Global Optimization

    Authors: Maojiang Tian, Minyang Chen, Wei Du, Yang Tang, Yaochu Jin, Gary G. Yen

    Abstract: Cooperative co-evolution (CC) algorithms, based on the divide-and-conquer strategy, have emerged as the predominant approach to solving large-scale global optimization (LSGO) problems. The efficiency and accuracy of the grouping stage significantly impact the performance of the optimization process. While the general separability grouping (GSG) method has overcome the limitation of previous differ…

    Submitted 8 March, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

  29. arXiv:2403.00172  [pdf, other]

    eess.SY cs.AI cs.LG

    Go Beyond Black-box Policies: Rethinking the Design of Learning Agent for Interpretable and Verifiable HVAC Control

    Authors: Zhiyu An, Xianzhong Ding, Wan Du

    Abstract: Recent research has shown the potential of Model-based Reinforcement Learning (MBRL) to enhance energy efficiency of Heating, Ventilation, and Air Conditioning (HVAC) systems. However, existing methods rely on black-box thermal dynamics models and stochastic optimizers, lacking reliability guarantees and posing risks to occupant health. In this work, we overcome the reliability bottleneck by redes…

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: Accepted for the 61st Design Automation Conference (DAC)

  30. arXiv:2402.18945  [pdf, other]

    cs.CR cs.AI cs.CL

    SynGhost: Imperceptible and Universal Task-agnostic Backdoor Attack in Pre-trained Language Models

    Authors: Pengzhou Cheng, Wei Du, Zongru Wu, Fengwei Zhang, Libo Chen, Gongshen Liu

    Abstract: Pre-training has been a necessary phase for deploying pre-trained language models (PLMs) to achieve remarkable performance in downstream tasks. However, we empirically show that backdoor attacks exploit such a phase as a vulnerable entry point for task-agnostic. In this paper, we first propose $\mathtt{maxEntropy}$, an entropy-based poisoning filtering defense, to prove that existing task-agnostic…

    Submitted 24 May, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: 18 pages, 19 figures, 13 tables

  31. arXiv:2402.16918  [pdf, other]

    cs.LG cs.CV

    m2mKD: Module-to-Module Knowledge Distillation for Modular Transformers

    Authors: Ka Man Lo, Yiming Liang, Wenyu Du, Yuantao Fan, Zili Wang, Wenhao Huang, Lei Ma, Jie Fu

    Abstract: Modular neural architectures are gaining attention for their powerful generalization and efficient adaptation to new domains. However, training these models poses challenges due to optimization difficulties arising from intrinsic sparse connectivity. Leveraging knowledge from monolithic models through techniques like knowledge distillation can facilitate training and enable integration of diverse…

    Submitted 7 July, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

  32. arXiv:2402.16061  [pdf, other]

    cs.CL

    How Large Language Models Encode Context Knowledge? A Layer-Wise Probing Study

    Authors: Tianjie Ju, Weiwei Sun, Wei Du, Xinwei Yuan, Zhaochun Ren, Gongshen Liu

    Abstract: Previous work has showcased the intriguing capability of large language models (LLMs) in retrieving facts and processing context knowledge. However, only limited research exists on the layer-wise capability of LLMs to encode knowledge, which challenges our understanding of their internal mechanisms. In this paper, we make the first attempt to investigate the layer-wise capability of LLMs through…

    Submitted 4 March, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: Accepted at LREC-COLING 2024 (Long Paper)

  33. arXiv:2402.14600  [pdf, other]

    cs.AI

    Diffusion Model-Based Multiobjective Optimization for Gasoline Blending Scheduling

    Authors: Wenxuan Fang, Wei Du, Renchu He, Yang Tang, Yaochu Jin, Gary G. Yen

    Abstract: Gasoline blending scheduling uses resource allocation and operation sequencing to meet a refinery's production requirements. The presence of nonlinearity, integer constraints, and a large number of decision variables adds complexity to this problem, posing challenges for traditional and evolutionary algorithms. This paper introduces a novel multiobjective optimization approach driven by a diffusio…

    Submitted 4 February, 2024; originally announced February 2024.

  34. arXiv:2402.13419  [pdf, ps, other]

    cs.AI

    Reward Bound for Behavioral Guarantee of Model-based Planning Agents

    Authors: Zhiyu An, Xianzhong Ding, Wan Du

    Abstract: Recent years have seen an emerging interest in the trustworthiness of machine learning-based agents in the wild, especially in robotics, to provide safety assurance for the industry. Obtaining behavioral guarantees for these agents remains an important problem. In this work, we focus on guaranteeing a model-based planning agent reaches a goal state within a specific future time step. We show that…

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: To be published in ICLR 24 tiny paper track

  35. arXiv:2402.12720  [pdf, other]

    cs.CR cs.AI

    Revisiting the Information Capacity of Neural Network Watermarks: Upper Bound Estimation and Beyond

    Authors: Fangqi Li, Haodong Zhao, Wei Du, Shilin Wang

    Abstract: To trace the copyright of deep neural networks, an owner can embed its identity information into its model as a watermark. The capacity of the watermark quantifies the maximal volume of information that can be verified from the watermarked model. Current studies on capacity focus on the ownership verification accuracy under ordinary removal attacks and fail to capture the relationship between robust…

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted by AAAI 2024

  36. arXiv:2402.11900  [pdf, other]

    cs.CL

    Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models

    Authors: Tianjie Ju, Yijin Chen, Xinwei Yuan, Zhuosheng Zhang, Wei Du, Yubin Zheng, Gongshen Liu

    Abstract: Recent work has showcased the powerful capability of large language models (LLMs) in recalling knowledge and reasoning. However, the reliability of LLMs in combining these two capabilities into reasoning through multi-hop facts has not been widely explored. This paper systematically investigates the possibilities for LLMs to utilize shortcuts based on direct connections between the initial and ter…

    Submitted 2 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Accepted at ACL 2024 (Long Paper. Main Conference)

  37. arXiv:2402.11205  [pdf, other]

    nucl-th math.NA quant-ph

    An Efficient Quantum Circuit for Block Encoding a Pairing Hamiltonian

    Authors: Diyi Liu, Weijie Du, Lin Lin, James P. Vary, Chao Yang

    Abstract: We present an efficient quantum circuit for block encoding a pairing Hamiltonian often studied in nuclear physics. Our block encoding scheme does not require mapping the creation and annihilation operators to the Pauli operators and representing the Hamiltonian as a linear combination of unitaries. Instead, we show how to encode the Hamiltonian directly using controlled swap operations. We analyze t…

    Submitted 21 February, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

    Comments: 27 pages, 18 figures

    MSC Class: 68Q12; 81P68

  38. arXiv:2402.10760  [pdf, other]

    q-fin.ST cs.LG

    RAGIC: Risk-Aware Generative Adversarial Model for Stock Interval Construction

    Authors: Jingyi Gu, Wenlu Du, Guiling Wang

    Abstract: Efforts to predict stock market outcomes have yielded limited success due to the inherently stochastic nature of the market, influenced by numerous unpredictable factors. Many existing prediction approaches focus on single-point predictions, lacking the depth needed for effective decision-making and often overlooking market risk. To bridge this gap, we propose a novel model, RAGIC, which introduce…

    Submitted 16 February, 2024; originally announced February 2024.

  39. arXiv:2402.08969  [pdf, other]

    quant-ph nucl-th

    Hamiltonian input model and spectroscopy on quantum computers

    Authors: Weijie Du, James P. Vary

    Abstract: We present a novel input model for general second-quantized Hamiltonians of relativistic or non-relativistic many-fermion systems. This input model incorporates the fermionic anticommutation relations, particle number variations, and respects the symmetries of the Hamiltonian. Based on our input model, we propose a hybrid framework for spectral calculations on future quantum hardware. We provide…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: We welcome comments. Please send comments to duweigy@gmail.com

  40. arXiv:2402.04059  [pdf, other]

    cs.LG cs.AI

    Deep Learning for Multivariate Time Series Imputation: A Survey

    Authors: Jun Wang, Wenjie Du, Wei Cao, Keli Zhang, Wenjia Wang, Yuxuan Liang, Qingsong Wen

    Abstract: The ubiquitous missing values cause the multivariate time series data to be partially observed, destroying the integrity of time series and hindering effective time series data analysis. Recently, deep learning imputation methods have demonstrated remarkable success in elevating the quality of corrupted time series data, subsequently enhancing performance in downstream tasks. In this paper, we…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 9 pages, 1 figure, 5 tables, 58 referred papers

  41. arXiv:2402.03781  [pdf, other]

    q-bio.QM cs.AI cs.LG

    MolTC: Towards Molecular Relational Modeling In Language Models

    Authors: Junfeng Fang, Shuai Zhang, Chang Wu, Zhengyi Yang, Zhiyuan Liu, Sihang Li, Kun Wang, Wenjie Du, Xiang Wang

    Abstract: Molecular Relational Learning (MRL), aiming to understand interactions between molecular pairs, plays a pivotal role in advancing biochemical research. Recently, the adoption of large language models (LLMs), known for their vast knowledge repositories and advanced logical inference capabilities, has emerged as a promising way for efficient and effective MRL. Despite their potential, these methods…

    Submitted 10 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: ACL 2024

  42. arXiv:2402.01204  [pdf, other]

    cs.LG cs.AI

    A Survey on Self-Supervised Learning for Non-Sequential Tabular Data

    Authors: Wei-Yao Wang, Wei-Wei Du, Derek Xu, Wei Wang, Wen-Chih Peng

    Abstract: Self-supervised learning (SSL) has been incorporated into many state-of-the-art models in various domains, where SSL defines pretext tasks based on unlabeled datasets to learn contextualized and robust representations. Recently, SSL has been a new trend in exploring the representation learning capability in the realm of tabular data, which is more challenging due to not having explicit relations f…

    Submitted 5 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: The paper list can be found at https://github.com/wwweiwei/awesome-self-supervised-learning-for-tabular-data

  43. arXiv:2401.17138  [pdf, other

    nucl-th quant-ph

    Nuclear scattering via quantum computing

    Authors: Peiyan Wang, Weijie Du, Wei Zuo, James P. Vary

    Abstract: We propose a hybrid quantum-classical framework to solve the elastic scattering phase shift of two well-bound nuclei in an uncoupled channel. Within this framework, we develop a many-body formalism in which the continuum scattering states of the two colliding nuclei are regulated by a weak external harmonic oscillator potential with varying strength. Based on our formalism, we propose an approach…

    Submitted 15 June, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: We welcome comments!

  44. arXiv:2401.15122  [pdf, other

    cs.LG cs.AI q-bio.BM q-bio.QM stat.ML

    A Multi-Grained Symmetric Differential Equation Model for Learning Protein-Ligand Binding Dynamics

    Authors: Shengchao Liu, Weitao Du, Yanjing Li, Zhuoxinran Li, Vignesh Bhethanabotla, Nakul Rampal, Omar Yaghi, Christian Borgs, Anima Anandkumar, Hongyu Guo, Jennifer Chayes

    Abstract: In drug discovery, molecular dynamics (MD) simulation for protein-ligand binding provides a powerful tool for predicting binding affinities, estimating transport properties, and exploring pocket sites. There has been a long history of improving the efficiency of MD simulations through better numerical methods and, more recently, by utilizing machine learning (ML) methods. Yet, challenges remain, s…

    Submitted 1 February, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

  45. arXiv:2401.14944  [pdf, other

    astro-ph.IM

    Modeling the wavelength dependence of photo-response non-uniformity of a CCD sensor

    Authors: Zun Luo, Wei Du, Baocun Chen, Xianmin Meng, Hu Zhan

    Abstract: Precision measurements of astrometry and photometry require stringent control of systematics such as those arising from imperfect correction of sensor effects. In this work, we develop a parametric method to model the wavelength dependence of photo-response non-uniformity (PRNU) for a laser annealed backside-illuminated charge-coupled device. The model accurately reproduces the PRNU patterns of fl…

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 28 pages, 10 figures, comments are welcome

  46. arXiv:2401.12975  [pdf, other

    cs.CV cs.AI cs.CL

    HAZARD Challenge: Embodied Decision Making in Dynamically Changing Environments

    Authors: Qinhong Zhou, Sunli Chen, Yisong Wang, Haozhe Xu, Weihua Du, Hongxin Zhang, Yilun Du, Joshua B. Tenenbaum, Chuang Gan

    Abstract: Recent advances in high-fidelity virtual environments serve as one of the major driving forces for building intelligent embodied agents to perceive, reason and interact with the physical world. Typically, these environments remain unchanged unless agents interact with them. However, in real-world scenarios, agents might also face dynamically changing environments characterized by unexpected events…

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: ICLR 2024. The first two authors contributed equally to this work

  47. arXiv:2401.10274  [pdf, ps, other

    cs.NE cs.AI

    Knowledge-Assisted Dual-Stage Evolutionary Optimization of Large-Scale Crude Oil Scheduling

    Authors: Wanting Zhang, Wei Du, Guo Yu, Renchu He, Wenli Du, Yaochu Jin

    Abstract: With the scaling up of crude oil scheduling in modern refineries, large-scale crude oil scheduling problems (LSCOSPs) emerge with thousands of binary variables and non-linear constraints, which are challenging to be optimized by traditional optimization methods. To solve LSCOSPs, we take the practical crude oil scheduling from a marine-access refinery as an example and start with modeling LSCOSPs…

    Submitted 9 January, 2024; originally announced January 2024.

  48. arXiv:2401.06786  [pdf, other

    cs.DC cs.AI

    CloudEval-YAML: A Practical Benchmark for Cloud Configuration Generation

    Authors: Yifei Xu, Yuning Chen, Xumiao Zhang, Xianshang Lin, Pan Hu, Yunfei Ma, Songwu Lu, Wan Du, Zhuoqing Mao, Ennan Zhai, Dennis Cai

    Abstract: Among the thriving ecosystem of cloud computing and the proliferation of Large Language Model (LLM)-based code generation tools, there is a lack of benchmarking for code generation in cloud-native applications. In response to this need, we present CloudEval-YAML, a practical benchmark for cloud configuration generation. CloudEval-YAML tackles the diversity challenge by focusing on YAML, the de fac…

    Submitted 9 November, 2023; originally announced January 2024.

  49. arXiv:2401.03072  [pdf, other

    stat.ME math.ST

    Optimal Nonparametric Inference on Network Effects with Dependent Edges

    Authors: Wenqin Du, Yuan Zhang, Wen Zhou

    Abstract: Testing network effects in weighted directed networks is a foundational problem in econometrics, sociology, and psychology. Yet, the prevalent edge dependency poses a significant methodological challenge. Most existing methods are model-based and come with stringent assumptions, limiting their applicability. In response, we introduce a novel, fully nonparametric framework that requires only minima…

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 29 pages, 3 figures

    MSC Class: 62E17; 62G10; 91D30

  50. arXiv:2401.01801  [pdf, other

    cs.LG cs.AI physics.comp-ph

    A quantum inspired neural network for geometric modeling

    Authors: Weitao Du, Shengchao Liu, Xuecang Zhang

    Abstract: By conceiving physical systems as 3D many-body point clouds, geometric graph neural networks (GNNs), such as SE(3)/E(3) equivalent GNNs, have showcased promising performance. In particular, their effective message-passing mechanics make them adept at modeling molecules and crystalline materials. However, current geometric GNNs only offer a mean-field approximation of the many-body system, encapsul…

    Submitted 28 January, 2024; v1 submitted 3 January, 2024; originally announced January 2024.