Showing 101–150 of 1,481 results for author: Wu, D

  1. arXiv:2403.05014  [pdf, other]

    cs.LG cs.AI

    Simple Multigraph Convolution Networks

    Authors: Danyang Wu, Xinjie Shen, Jitao Lu, Jin Xu, Feiping Nie

    Abstract: Existing multigraph convolution methods either ignore the cross-view interaction among multiple graphs, or induce extremely high computational cost due to standard cross-view polynomial operators. To alleviate this problem, this paper proposes a Simple MultiGraph Convolution Networks (SMGCN) which first extracts consistent cross-view topology from multigraphs including edge-level and subgraph-leve…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted by WWW 2024 Short

  2. arXiv:2403.04458  [pdf, ps, other]

    cond-mat.stat-mech physics.plasm-ph

    Extended Time-Dependent Density Functional Theory for Multi-Body Densities

    Authors: Jiong-Hang Liang, Tian-Xing Hu, D. Wu, Zheng-Mao Sheng, J. Zhang

    Abstract: Time-dependent density functional theory (TDDFT) is widely used for understanding and predicting properties and behaviors of matter. As one of the fundamental theorems in TDDFT, van Leeuwen's theorem [Phys. Rev. Lett. 82, 3863 (1999)] guarantees how to construct a unique potential with the same one-body density evolution. Here we extend van Leeuwen's theorem by exploring truncation criteria in BBG…

    Submitted 7 March, 2024; originally announced March 2024.

  3. arXiv:2403.04268  [pdf]

    quant-ph cs.LG

    Qubit-Wise Architecture Search Method for Variational Quantum Circuits

    Authors: Jialin Chen, Zhiqiang Cai, Ke Xu, Di Wu, Wei Cao

    Abstract: Considering the noise level limit, one crucial aspect for quantum machine learning is to design a high-performing variational quantum circuit architecture with small number of quantum gates. As the classical neural architecture search (NAS), quantum architecture search methods (QAS) employ methods like reinforcement learning, evolutionary algorithms and supernet optimization to improve the search…

    Submitted 7 March, 2024; originally announced March 2024.

  4. arXiv:2403.03004  [pdf, other]

    astro-ph.CO gr-qc hep-ph

    Ultralight vector dark matter search using data from the KAGRA O3GK run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi , et al. (1778 additional authors not shown)

    Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 20 pages, 5 figures

    Report number: LIGO-P2300250

  5. arXiv:2403.02775  [pdf, other]

    cs.AI cs.LG

    EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs

    Authors: Hanlin Tang, Yifu Sun, Decheng Wu, Kai Liu, Jianchen Zhu, Zhanhui Kang

    Abstract: Large language models (LLMs) have proven to be very superior to conventional methods in various tasks. However, their expensive computations and high memory requirements are prohibitive for deployment. Model quantization is an effective method for reducing this overhead. The problem is that in most previous works, the quantized model was calibrated using few samples from the training data, which m…

    Submitted 5 March, 2024; originally announced March 2024.

  6. arXiv:2403.02716  [pdf, other]

    cs.SE

    Pre-trained Model-based Actionable Warning Identification: A Feasibility Study

    Authors: Xiuting Ge, Chunrong Fang, Quanjun Zhang, Daoyuan Wu, Bowen Yu, Qirui Zheng, An Guo, Shangwei Lin, Zhihong Zhao, Yang Liu, Zhenyu Chen

    Abstract: Actionable Warning Identification (AWI) plays a pivotal role in improving the usability of static code analyzers. Currently, Machine Learning (ML)-based AWI approaches, which mainly learn an AWI classifier from labeled warnings, are notably common. However, these approaches still face the problem of restricted performance due to the direct reliance on a limited number of labeled warnings to develo…

    Submitted 5 March, 2024; originally announced March 2024.

  7. arXiv:2403.02696  [pdf, ps, other]

    math.ST stat.ME

    Low-rank matrix estimation via nonconvex spectral regularized methods in errors-in-variables matrix regression

    Authors: Xin Li, Dongya Wu

    Abstract: High-dimensional matrix regression has been studied in various aspects, such as statistical properties, computational efficiency and application to specific instances including multivariate regression, system identification and matrix compressed sensing. Current studies mainly consider the idealized case that the covariate matrix is obtained without noise, while the more realistic scenario that th…

    Submitted 5 March, 2024; originally announced March 2024.

  8. arXiv:2403.02360  [pdf, other]

    cs.LG cs.AI

    Towards Optimal Customized Architecture for Heterogeneous Federated Learning with Contrastive Cloud-Edge Model Decoupling

    Authors: Xingyan Chen, Tian Du, Mu Wang, Tiancheng Gu, Yu Zhao, Gang Kou, Changqiao Xu, Dapeng Oliver Wu

    Abstract: Federated learning, as a promising distributed learning paradigm, enables collaborative training of a global model across multiple network edge clients without the need for central data collecting. However, the heterogeneity of edge data distribution drags the model towards the local minima, which can be distant from the global optimum. Such heterogeneity often leads to slow convergence and substa…

    Submitted 4 March, 2024; originally announced March 2024.

  9. arXiv:2403.01744  [pdf, other]

    cs.IR

    NoteLLM: A Retrievable Large Language Model for Note Recommendation

    Authors: Chao Zhang, Shiwei Wu, Haoxin Zhang, Tong Xu, Yan Gao, Yao Hu, Di Wu, Enhong Chen

    Abstract: People enjoy sharing "notes" including their experiences within online communities. Therefore, recommending notes aligned with user interests has become a crucial task. Existing online methods only input notes into BERT-based models to generate note embeddings for assessing similarity. However, they may underutilize some important cues, e.g., hashtags or categories, which represent the key concept…

    Submitted 25 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: Published as a WWW'24 full paper

  10. arXiv:2403.01702  [pdf]

    q-bio.MN

    Hill Function-based Model of Transcriptional Response: Impact of Nonspecific Binding and RNAP Interactions

    Authors: Wenjia Shi, Yao Ma, Peilin Hu, Mi Pang, Xiaona Huang, Yiting Dang, Yuxin Xie, Danni Wu

    Abstract: Hill function is one of the widely used gene transcription regulation models. Its attribute of fitting may result in a lack of an underlying physical picture, yet the fitting parameters can provide information about biochemical reactions, such as the number of transcription factors (TFs) and the binding energy between regulatory elements. However, it remains unclear when and how much biochemical i…

    Submitted 3 March, 2024; originally announced March 2024.

  11. arXiv:2402.18846  [pdf, other]

    cs.LG

    Multi-Fidelity Residual Neural Processes for Scalable Surrogate Modeling

    Authors: Ruijia Niu, Dongxia Wu, Kai Kim, Yi-An Ma, Duncan Watson-Parris, Rose Yu

    Abstract: Multi-fidelity surrogate modeling aims to learn an accurate surrogate at the highest fidelity level by combining data from multiple sources. Traditional methods relying on Gaussian processes can hardly scale to high-dimensional data. Deep learning approaches utilize neural network based encoders and decoders to improve scalability. These approaches share encoded representations across fidelities w…

    Submitted 24 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: A novel probabilistic inference approach for scalable multi-fidelity surrogate modeling

  12. arXiv:2402.18458  [pdf, other]

    cs.CL

    Meta-Task Prompting Elicits Embedding from Large Language Models

    Authors: Yibin Lei, Di Wu, Tianyi Zhou, Tao Shen, Yu Cao, Chongyang Tao, Andrew Yates

    Abstract: In this work, we introduce a new unsupervised embedding method, Meta-Task Prompting with Explicit One-Word Limitation (MetaEOL), for generating high-quality sentence embeddings from Large Language Models (LLMs) without the need for model fine-tuning or task-specific engineering. Leveraging meta-task prompting, MetaEOL guides LLMs to produce embeddings through a series of carefully designed prompts…

    Submitted 28 February, 2024; originally announced February 2024.

  13. arXiv:2402.18122  [pdf, other]

    cs.CV cs.MM

    G4G: A Generic Framework for High Fidelity Talking Face Generation with Fine-grained Intra-modal Alignment

    Authors: Juan Zhang, Jiahao Chen, Cheng Wang, Zhiwang Yu, Tangquan Qi, Di Wu

    Abstract: Despite numerous completed studies, achieving high fidelity talking face generation with highly synchronized lip movements corresponding to arbitrary audio remains a significant challenge in the field. The shortcomings of published studies continue to confuse many researchers. This paper introduces G4G, a generic framework for high fidelity talking face generation with fine-grained intra-modal ali…

    Submitted 2 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  14. arXiv:2402.18012  [pdf, other]

    cs.LG cs.AI

    Diffusion Models as Constrained Samplers for Optimization with Unknown Constraints

    Authors: Lingkai Kong, Yuanqi Du, Wenhao Mu, Kirill Neklyudov, Valentin De Bortoli, Haorui Wang, Dongxia Wu, Aaron Ferber, Yi-An Ma, Carla P. Gomes, Chao Zhang

    Abstract: Addressing real-world optimization problems becomes particularly challenging when analytic objective functions or constraints are unavailable. While numerous studies have addressed the issue of unknown objectives, limited research has focused on scenarios where feasibility constraints are not given explicitly. Overlooking these constraints can lead to spurious solutions that are unrealistic in pra…

    Submitted 29 April, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  15. arXiv:2402.16579  [pdf, other]

    cond-mat.stat-mech cond-mat.dis-nn math.CO physics.comp-ph

    Sparse Autoregressive Neural Networks for Classical Spin Systems

    Authors: Indaco Biazzo, Dian Wu, Giuseppe Carleo

    Abstract: Efficient sampling and approximation of Boltzmann distributions involving large sets of binary variables, or spins, are pivotal in diverse scientific fields even beyond physics. Recent advances in generative neural networks have significantly impacted this domain. However, these neural networks are often treated as black boxes, with architectures primarily influenced by data-driven problems in com…

    Submitted 21 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: 15 pages, 7 figures

    Journal ref: Mach. Learn.: Sci. Technol. 5 025074 (2024)

  16. arXiv:2402.16272  [pdf, other]

    physics.ins-det hep-ex

    Mass production and performance study on the 20-inch PMT acrylic protection covers in JUNO

    Authors: Miao He, Zhonghua Qin, Diru Wu, Meihang Xu, Wan Xie, Fang Chen, Xiaoping Jing, Genhua Yin, Shengjiong Yin, Linhua Gu, Xiaofeng Xia, Qinchang Wang

    Abstract: The Jiangmen Underground Neutrino Observatory is a neutrino experiment that incorporates 20,012 20-inch photomultiplier tubes (PMTs) and 25,600 3-inch PMTs. A dedicated system was designed to protect the PMTs from an implosion chain reaction underwater. As a crucial element of the protection system, over 20,000 acrylic covers were manufactured through injection molding, ensuring high dimensional p…

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: 12 pages, 10 figures

  17. arXiv:2402.15727  [pdf, other]

    cs.CR cs.AI

    LLMs Can Defend Themselves Against Jailbreaking in a Practical Manner: A Vision Paper

    Authors: Daoyuan Wu, Shuai Wang, Yang Liu, Ning Liu

    Abstract: Jailbreaking is an emerging adversarial attack that bypasses the safety alignment deployed in off-the-shelf large language models (LLMs). A considerable amount of research exists proposing more effective jailbreak attacks, including the recent Greedy Coordinate Gradient (GCG) attack, jailbreak template-based attacks such as using "Do-Anything-Now" (DAN), and multilingual jailbreak. In contrast, th…

    Submitted 4 March, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

    Comments: Fixed the bibliography reference issue in our LLM jailbreak defense vision paper submitted on 24 Feb 2024

  18. arXiv:2402.15531  [pdf, other]

    hep-th gr-qc

    Topological classes of thermodynamics of the rotating charged AdS black holes in gauged supergravities

    Authors: Xiao-Dan Zhu, Di Wu, Dan Wen

    Abstract: In this paper, we investigate the topological numbers of rotating charged AdS black holes in both four- and five-dimensional gauged supergravity theories. Our analysis is conducted within the framework of the thermodynamical topological approach to black holes, utilizing the generalized off-shell Helmholtz free energy. We demonstrate that the number of rotation parameters plays a significant role…

    Submitted 30 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 10 pages, 10 figures, 1 table, revtex4-1.cls. arXiv admin note: text overlap with arXiv:2307.02030

  19. arXiv:2402.15187  [pdf]

    nucl-ex physics.plasm-ph

    Ultra-short lifetime isomer studies from photonuclear reactions using laser-driven ultra-intense γ-ray

    Authors: Di Wu, Haoyang Lan, Jiaxing Liu, Huangang Lu, Jianyao Zhang, Jianfeng Lv, Xuezhi Wu, Hui Zhang, Yadong Xia, Qiangyou He, Jie Cai, Qianyi Ma, Yuhui Xia, Zhenan Wang, Meizhi Wang, Zhiyan Yang, Xinlu Xu, Yixing Geng, Chen Lin, Wenjun Ma, Yanying Zhao, Haoran Wang, Fulong Liu, Chuangye He, Jinqing Yu , et al. (7 additional authors not shown)

    Abstract: Isomers, ubiquitous populations of relatively long-lived nuclear excited states, play a crucial role in nuclear physics. However, isomers with half-life times of several seconds or less barely had experimental cross section data due to the lack of a suitable measuring method. We report a method of online γ spectroscopy for ultra-short-lived isomers from photonuclear reactions using laser-driven ul…

    Submitted 23 February, 2024; originally announced February 2024.

  20. arXiv:2402.15069  [pdf, other]

    astro-ph.HE

    Investigation of profile shifting and subpulse movement in PSR J0344-0901 with FAST

    Authors: H. M. Tedila, R. Yuen, N. Wang, D. Li, Z. G. Wen, W. M. Yan, J. P. Yuan, X. H. Han, P. Wang, W. W. Zhu, S. J. Dang, S. Q. Wang, J. T. Xie, Q. D. Wu, Sh. Khasanov, FAST Collaboration

    Abstract: We report two phenomena detected in PSR J0344$-$0901 from two observations conducted at frequency centered at 1.25 GHz using the Five-hundred-meter Aperture Spherical radio Telescope (FAST). The first phenomenon manifests as shifting in the pulse emission to later longitudinal phases and then gradually returns to its original location. The event lasts for about 216 pulse periods, with an average s…

    Submitted 22 February, 2024; originally announced February 2024.

  21. arXiv:2402.14052  [pdf, other]

    cs.CL

    On Leveraging Encoder-only Pre-trained Language Models for Effective Keyphrase Generation

    Authors: Di Wu, Wasi Uddin Ahmad, Kai-Wei Chang

    Abstract: This study addresses the application of encoder-only Pre-trained Language Models (PLMs) in keyphrase generation (KPG) amidst the broader availability of domain-tailored encoder-only models compared to encoder-decoder models. We investigate three core inquiries: (1) the efficacy of encoder-only PLMs in KPG, (2) optimal architectural decisions for employing encoder-only PLMs in KPG, and (3) a perfor…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: LREC-COLING 2024 camera ready. arXiv admin note: text overlap with arXiv:2212.10233

  22. arXiv:2402.13921  [pdf, ps, other]

    cs.DS math.PR

    Robust recovery for stochastic block models, simplified and generalized

    Authors: Sidhanth Mohanty, Prasad Raghavendra, David X. Wu

    Abstract: We study the problem of $\textit{robust community recovery}$: efficiently recovering communities in sparse stochastic block models in the presence of adversarial corruptions. In the absence of adversarial corruptions, there are efficient algorithms when the $\textit{signal-to-noise ratio}$ exceeds the $\textit{Kesten--Stigum (KS) threshold}$, widely believed to be the computational threshold for t…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 33 pages

  23. arXiv:2402.13886  [pdf, other]

    cond-mat.supr-con cond-mat.mtrl-sci

    Multigap superconductivity in lithium intercalated bilayer Mo$_2$C

    Authors: Can Hong, Danhong Wu, Xi-Bo Li, Feipeng Zheng

    Abstract: Interlayer coupling can significantly influence the physical properties of layered transition metal compounds. The superconductivity in layered Mo$_2$C systems, belonging to the emergent family of MXene, has garnered considerable attention. However, the impact of interlayer coupling on superconductivity, and the anisotropic superconducting properties in these systems are not yet clear. By performi…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 9 pages, 7 figures

    Journal ref: Physical Review B 2024, 109: 064515

  24. arXiv:2402.10928  [pdf, other]

    physics.app-ph physics.optics

    Passive Aperiodic Optical Phased Array based on Uniform Random Shuffle

    Authors: Bowen Yu, Dachuan Wu, Yasha Yi

    Abstract: Grating lobes arise from the periodic nature of element spacing in the optical phased array. Essentially, the phased array performs the Spatial Fourier Transform on light; the steering capability of the main lobe is governed by phase shift variations among waveguides, and the Sidelobe Suppression Ratio (SLSR) correlates with the uniformity of emitter positions. Leveraging this understanding, we ha…

    Submitted 30 January, 2024; originally announced February 2024.

  25. arXiv:2402.10387  [pdf, other]

    q-bio.BM cs.LG

    MFBind: a Multi-Fidelity Approach for Evaluating Drug Compounds in Practical Generative Modeling

    Authors: Peter Eckmann, Dongxia Wu, Germano Heinzelmann, Michael K Gilson, Rose Yu

    Abstract: Current generative models for drug discovery primarily use molecular docking to evaluate the quality of generated compounds. However, such models are often not useful in practice because even compounds with high docking scores do not consistently show experimental activity. More accurate methods for activity prediction exist, such as molecular dynamics based binding free energy calculations, but t…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 9 pages, 4 figures

  26. arXiv:2402.10127  [pdf, other]

    stat.ML cs.LG math.PR math.ST

    Nonlinear spiked covariance matrices and signal propagation in deep neural networks

    Authors: Zhichao Wang, Denny Wu, Zhou Fan

    Abstract: Many recent works have studied the eigenvalue spectrum of the Conjugate Kernel (CK) defined by the nonlinear feature map of a feedforward neural network. However, existing results only establish weak convergence of the empirical eigenvalue distribution, and fall short of providing precise quantitative characterizations of the "spike" eigenvalues and eigenvectors that often capture the low-dimens…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 55 pages

  27. arXiv:2402.09240  [pdf, other]

    cs.LG cs.CV

    Switch EMA: A Free Lunch for Better Flatness and Sharpness

    Authors: Siyuan Li, Zicheng Liu, Juanxi Tian, Ge Wang, Zedong Wang, Weiyang Jin, Di Wu, Cheng Tan, Tao Lin, Yang Liu, Baigui Sun, Stan Z. Li

    Abstract: Exponential Moving Average (EMA) is a widely used weight averaging (WA) regularization to learn flat optima for better generalizations without extra cost in deep neural network (DNN) optimization. Despite achieving better flatness, existing WA methods might fall into worse final performances or require extra test-time computations. This work unveils the full potential of EMA with a single line of…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Preprint V1. Source code and models at https://github.com/Westlake-AI/SEMA

  28. arXiv:2402.07090  [pdf]

    cs.NI cs.ET eess.SP

    Design of a W-band High-PAE Class A&AB Power Amplifier in 150nm GaAs Technology

    Authors: Jun Yan Lee, Duo Wu, Xuanrui Guo, Mohammad Mahdi Ariannejad, Mohammad Arif Sobhan Bhuiyan, Mahdi H. Miraz

    Abstract: Nanometer scale power amplifiers (PA) at sub-THz suffer from severe parasitic effects that lead to experience limited maximum frequency and reduced power performance at the device transceiver front end. The integrated circuits researchers proposed different PA design architecture combinations at scaled down technologies to overcome these limitations. Although the designs meet the minimum requireme…

    Submitted 10 February, 2024; originally announced February 2024.

    Journal ref: Transactions on Electrical and Electronic Materials (TEEM), Electronic ISSN: 2092-7592, Print ISSN: 1229-7607, 10th February 2024, Available: https://link.springer.com/article/10.1007/s42341-024-00513-8

  29. arXiv:2402.05383  [pdf, other]

    nucl-ex hep-ex

    First measurement of the yield of $^8$He isotopes produced in liquid scintillator by cosmic-ray muons at Daya Bay

    Authors: Daya Bay Collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, Y. C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, X. Y. Ding , et al. (177 additional authors not shown)

    Abstract: Daya Bay presents the first measurement of cosmogenic $^8$He isotope production in liquid scintillator, using an innovative method for identifying cascade decays of $^8$He and its child isotope, $^8$Li. We also measure the production yield of $^9$Li isotopes using well-established methodology. The results, in units of 10$^{-8}μ^{-1}$g$^{-1}$cm$^{2}$, are 0.307$\pm$0.042, 0.341$\pm$0.040, and 0.546…

    Submitted 7 February, 2024; originally announced February 2024.

  30. arXiv:2402.03726  [pdf, other]

    cs.LG stat.ML

    Learning Granger Causality from Instance-wise Self-attentive Hawkes Processes

    Authors: Dongxia Wu, Tsuyoshi Idé, Aurélie Lozano, Georgios Kollias, Jiří Navrátil, Naoki Abe, Yi-An Ma, Rose Yu

    Abstract: We address the problem of learning Granger causality from asynchronous, interdependent, multi-type event sequences. In particular, we are interested in discovering instance-level causal structures in an unsupervised manner. Instance-level causality identifies causal relationships among individual events, providing more fine-grained information for decision-making. Existing work in the literature e…

    Submitted 29 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  31. arXiv:2402.02616   

    cs.LG

    The Virtues of Pessimism in Inverse Reinforcement Learning

    Authors: David Wu, Gokul Swamy, J. Andrew Bagnell, Zhiwei Steven Wu, Sanjiban Choudhury

    Abstract: Inverse Reinforcement Learning (IRL) is a powerful framework for learning complex behaviors from expert demonstrations. However, it traditionally requires repeatedly solving a computationally expensive reinforcement learning (RL) problem in its inner loop. It is desirable to reduce the exploration burden by leveraging expert demonstrations in the inner-loop RL. As an example, recent work resets th…

    Submitted 8 February, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: This paper has been withdrawn by the authors pending edits from other authors

  32. arXiv:2402.02608  [pdf, other]

    cs.LG

    Accelerating Inverse Reinforcement Learning with Expert Bootstrapping

    Authors: David Wu, Sanjiban Choudhury

    Abstract: Existing inverse reinforcement learning methods (e.g. MaxEntIRL, $f$-IRL) search over candidate reward functions and solve a reinforcement learning problem in the inner loop. This creates a rather strange inversion where a harder problem, reinforcement learning, is in the inner loop of a presumably easier problem, imitation learning. In this work, we show that better utilization of expert demonstr…

    Submitted 4 February, 2024; originally announced February 2024.

  33. arXiv:2402.02338  [pdf, other]

    cs.NI cs.LG

    NetLLM: Adapting Large Language Models for Networking

    Authors: Duo Wu, Xianda Wang, Yaqi Qiao, Zhi Wang, Junchen Jiang, Shuguang Cui, Fangxin Wang

    Abstract: Many networking tasks now employ deep learning (DL) to solve complex prediction and system optimization problems. However, current design philosophy of DL-based algorithms entails intensive engineering overhead due to the manual design of deep neural networks (DNNs) for different networking tasks. Besides, DNNs tend to achieve poor generalization performance on unseen data distributions/environmen…

    Submitted 5 May, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

    Comments: This paper has been accepted by ACM SIGCOMM 2024

  34. arXiv:2402.01760  [pdf, other]

    cs.CY cs.AI

    Trust and ethical considerations in a multi-modal, explainable AI-driven chatbot tutoring system: The case of collaboratively solving Rubik's Cube

    Authors: Kausik Lakkaraju, Vedant Khandelwal, Biplav Srivastava, Forest Agostinelli, Hengtao Tang, Prathamjeet Singh, Dezhi Wu, Matt Irvin, Ashish Kundu

    Abstract: Artificial intelligence (AI) has the potential to transform education with its power of uncovering insights from massive data about student learning patterns. However, ethical and trustworthy concerns of AI have been raised but are unsolved. Prominent ethical issues in high school AI education include data privacy, information leakage, abusive language, and fairness. This paper describes technolog…

    Submitted 30 January, 2024; originally announced February 2024.

    Comments: Accepted at 'Neural Conversational AI Workshop - What's left to TEACH (Trustworthy, Enhanced, Adaptable, Capable, and Human-centric) chatbots?' at ICML 2023

  35. arXiv:2402.01010  [pdf, other]

    cs.CE

    A generalized essentially non-hourglass total Lagrangian SPH solid dynamics

    Authors: Dong Wu, Xiaojing Tang, Shuaihao Zhang, Xiangyu Hu

    Abstract: In this paper, we tackle a persistent numerical instability within the total Lagrangian smoothed particle hydrodynamics (TLSPH) solid dynamics. Specifically, we address the hourglass modes that may grow and eventually deteriorate the reliability of simulation, particularly in the scenarios characterized by large deformations. We propose a generalized essentially non-hourglass formulation based on…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 61 pages, 37 figures

  36. arXiv:2402.00641  [pdf, other]

    cs.CR

    Testing side-channel security of cryptographic implementations against future microarchitectures

    Authors: Gilles Barthe, Marcel Böhme, Sunjay Cauligi, Chitchanok Chuengsatiansup, Daniel Genkin, Marco Guarnieri, David Mateos Romero, Peter Schwabe, David Wu, Yuval Yarom

    Abstract: How will future microarchitectures impact the security of existing cryptographic implementations? As we cannot keep reducing the size of transistors, chip vendors have started developing new microarchitectural optimizations to speed up computation. A recent study (Sanchez Vicarte et al., ISCA 2021) suggests that these optimizations might open the Pandora's box of microarchitectural attacks. Howeve…

    Submitted 1 February, 2024; originally announced February 2024.

  37. arXiv:2402.00537  [pdf, other]

    cs.RO

    Robust Path Planning via Learning from Demonstrations for Robotic Catheters in Deformable Environments

    Authors: Zhen Li, Chiara Lambranzi, Di Wu, Alice Segato, Federico De Marco, Emmanuel Vander Poorten, Jenny Dankelman, Elena De Momi

    Abstract: Navigation through tortuous and deformable vessels using catheters with limited steering capability underscores the need for reliable path planning. State-of-the-art path planners do not fully account for the deformable nature of the environment. This work proposes a robust path planner via a learning from demonstrations method, named Curriculum Generative Adversarial Imitation Learning (C-GAIL).…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Under review in IEEE Transactions on Biomedical Engineering (TBME)

  38. Topological classes of thermodynamics of the static multi-charge AdS black holes in gauged supergravities: novel temperature-dependent thermodynamic topological phase transition

    Authors: Di Wu, Shuang-Yong Gu, Xiao-Dan Zhu, Qing-Quan Jiang, Shu-Zheng Yang

    Abstract: In this paper, we investigate, in the framework of the topological approach to black hole thermodynamics, using the generalized off-shell Helmholtz free energy, the topological numbers of the static multi-charge AdS black holes in four- and five-dimensional gauged supergravities. We find that the topological number of the static-charged AdS black holes in four-dimensional Kaluza-Klein (K-K) gauged…

    Submitted 28 June, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

    Comments: 35 pages, 29 figures, 2 tables, JHEP3.cls, match with the published version in JHEP

    Journal ref: JHEP 06 (2024) 213

  39. arXiv:2401.17880  [pdf, other]

    cs.MA cs.IT cs.LG

    Graph Attention-based Reinforcement Learning for Trajectory Design and Resource Assignment in Multi-UAV Assisted Communication

    Authors: Zikai Feng, Di Wu, Mengxing Huang, Chau Yuen

    Abstract: In the multiple unmanned aerial vehicle (UAV)-assisted downlink communication, it is challenging for UAV base stations (UAV BSs) to realize trajectory design and resource assignment in unknown environments. The cooperation and competition between UAV BSs in the communication network leads to a Markov game problem. Multi-agent reinforcement learning is a significant solution for the above decision…

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 13 pages

    MSC Class: 68M11 ACM Class: I.2.11

  40. arXiv:2401.16832  [pdf, other]

    cs.CY cs.LG stat.ML

    Analysis of Knowledge Tracing performance on synthesised student data

    Authors: Panagiotis Pagonis, Kai Hartung, Di Wu, Munir Georges, Sören Gröttrup

    Abstract: Knowledge Tracing (KT) aims to predict the future performance of students by tracking the development of their knowledge states. Despite all the recent progress made in this field, the application of KT models in education systems is still restricted from the data perspectives: 1) limited access to real life data due to data protection concerns, 2) lack of diversity in public datasets, 3) noises i…

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted at AI4AI Education workshop 2023 ( https://sme.uni-bamberg.de/ai4ai/ )

  41. arXiv:2401.16185  [pdf, other]

    cs.CR cs.AI cs.SE

    LLM4Vuln: A Unified Evaluation Framework for Decoupling and Enhancing LLMs' Vulnerability Reasoning

    Authors: Yuqiang Sun, Daoyuan Wu, Yue Xue, Han Liu, Wei Ma, Lyuye Zhang, Miaolei Shi, Yang Liu

    Abstract: Large language models (LLMs) have demonstrated significant potential for many downstream tasks, including those requiring human-level intelligence, such as vulnerability detection. However, recent attempts to use LLMs for vulnerability detection are still preliminary, as they lack an in-depth understanding of a subject LLM's vulnerability reasoning capability -- whether it originates from the mode…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: This is a technical report by Nanyang Technological University

  42. arXiv:2401.15819  [pdf, ps, other]

    math.AP nlin.SI

    Stability of KdV solitons

    Authors: Derchyi Wu

    Abstract: We prove an orbital stability theorem of KdV $n$-solitons with explicit phase shifts in the soliton region with cones around the $x$-axis and lines determined by bound states of the KdV $n$-solitons removed.

    Submitted 31 March, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

    MSC Class: 35Q53; 35P25

  43. arXiv:2401.14617  [pdf, other]

    cs.SE cs.AI

    A Systematic Literature Review on Explainability for Machine/Deep Learning-based Software Engineering Research

    Authors: Sicong Cao, Xiaobing Sun, Ratnadira Widyasari, David Lo, Xiaoxue Wu, Lili Bo, Jiale Zhang, Bin Li, Wei Liu, Di Wu, Yixin Chen

    Abstract: The remarkable achievements of Artificial Intelligence (AI) algorithms, particularly in Machine Learning (ML) and Deep Learning (DL), have fueled their extensive deployment across multiple sectors, including Software Engineering (SE). However, due to their black-box nature, these promising AI-driven SE models are still far from being deployed in practice. This lack of explainability poses unwanted…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: submitted to ACM Computing Surveys. arXiv admin note: text overlap with arXiv:2202.06840 by other authors

  44. arXiv:2401.13792  [pdf, other]

    cs.NI

    Probabilistic Mobility Load Balancing for Multi-band 5G and Beyond Networks

    Authors: Saria Al Lahham, Di Wu, Ekram Hossain, Xue Liu, Gregory Dudek

    Abstract: The ever-increasing demand for data services and the proliferation of user equipment (UE) have resulted in a significant rise in the volume of mobile traffic. Moreover, in multi-band networks, non-uniform traffic distribution among different operational bands can lead to congestion, which can adversely impact the user's quality of experience. Load balancing is a critical aspect of network optimiza…

    Submitted 24 January, 2024; originally announced January 2024.

  45. arXiv:2401.12985  [pdf, other]

    cs.CL

    The Effect of Human v/s Synthetic Test Data and Round-tripping on Assessment of Sentiment Analysis Systems for Bias

    Authors: Kausik Lakkaraju, Aniket Gupta, Biplav Srivastava, Marco Valtorta, Dezhi Wu

    Abstract: Sentiment Analysis Systems (SASs) are data-driven Artificial Intelligence (AI) systems that output polarity and emotional intensity when given a piece of text as input. Like other AI systems, SASs are known to behave unstably when subjected to changes in data, which can make them problematic to trust out of concerns like bias when AI works with humans and data has protected attributes like gende…

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: text overlap with arXiv:2302.02038

    Journal ref: The Fifth IEEE International Conference on Trust, Privacy and Security in Intelligent Systems, and Applications (2023)

  46. arXiv:2401.12413  [pdf, other]

    cs.CL cs.LG

    How Far Can 100 Samples Go? Unlocking Overall Zero-Shot Multilingual Translation via Tiny Multi-Parallel Data

    Authors: Di Wu, Shaomu Tan, Yan Meng, David Stap, Christof Monz

    Abstract: Zero-shot translation aims to translate between language pairs not seen during training in Multilingual Machine Translation (MMT) and is largely considered an open problem. A common, albeit resource-consuming, solution is to add as many related translation directions as possible to the training corpus. In this paper, we show that for an English-centric model, surprisingly large zero-shot improveme…

    Submitted 26 February, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: 15 pages, 5 figures

  47. arXiv:2401.11894  [pdf, other]

    physics.plasm-ph

    Exact Normal Modes of Quantum Plasmas

    Authors: Tian-Xing Hu, Dong Wu, Z. M. Sheng, J. Zhang

    Abstract: The normal modes, i.e., the eigensolutions of the dispersion relation, are the most fundamental properties of a plasma and are also of key importance to many nonlinear effects such as parametric and two-plasmon decay, and Raman scattering. The real part indicates the intrinsic oscillation frequency, while the imaginary part indicates the Landau damping rate. In most of the literature, the normal mo…

    Submitted 22 January, 2024; originally announced January 2024.

  48. arXiv:2401.11891  [pdf, other]

    physics.plasm-ph

    Validation of Classical Transport Cross Section for Ion-Ion Interactions Under Repulsive Yukawa Potential

    Authors: Tian-Xing Hu, Dong Wu, C. L. Lin, Z. M. Sheng, B. He, J. Zhang

    Abstract: The cross section is a fundamental parameter for describing the transport of charged particles in matter. Because nuclei are orders of magnitude more massive than electrons, and for convenience in realistic calculations, the cross section of elastic nucleus-nucleus collisions is usually treated via classical mechanics. The famous Bohr criterion was first proposed to judge whether the treatment via classical mec…

    Submitted 22 January, 2024; originally announced January 2024.

  49. arXiv:2401.10213  [pdf]

    cs.CV cs.CY cs.LG

    Improving automatic detection of driver fatigue and distraction using machine learning

    Authors: Dongjiang Wu

    Abstract: Changes and advances in information technology have played an important role in the development of intelligent vehicle systems in recent years. Driver fatigue and distracted driving are important factors in traffic accidents. Thus, onboard monitoring of driving behavior has become a crucial component of advanced driver assistance systems for intelligent vehicles. In this article, we present techni…

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: Master's thesis, 55 pages

  50. arXiv:2401.09719  [pdf, ps, other]

    stat.ME

    Kernel-based multi-marker tests of association based on the accelerated failure time model

    Authors: Chenxi Li, Di Wu, Qing Lu

    Abstract: Kernel-based multi-marker tests for survival outcomes primarily use the Cox model to adjust for covariates. The proportional hazards assumption made by the Cox model could be unrealistic, especially in long-term follow-up. We develop a suite of novel multi-marker survival tests for genetic association based on the accelerated failure time model, which is a popular alternative to the Cox model…

    Submitted 17 January, 2024; originally announced January 2024.