Showing 1–50 of 322 results for author: Cheng, D

  1. arXiv:2406.14491  [pdf, other]

    cs.CL

    Instruction Pre-Training: Language Models are Supervised Multitask Learners

    Authors: Daixuan Cheng, Yuxian Gu, Shaohan Huang, Junyu Bi, Minlie Huang, Furu Wei

    Abstract: Unsupervised multitask pre-training has been the critical method behind the recent success of language models (LMs). However, supervised multitask learning still holds significant promise, as scaling it in the post-training stage trends towards better generalization. In this paper, we explore supervised multitask pre-training by proposing Instruction Pre-Training, a framework that scalably augment…

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2406.12920  [pdf, ps, other]

    math.RA math.OC

    Cross-Dimensional Mathematics: A Foundation For STP/STA

    Authors: Daizhan Cheng

    Abstract: A new mathematical structure, called the cross-dimensional mathematics (CDM), is proposed. The CDM considered in this paper consists of three parts: hyper algebra, hyper geometry, and hyper Lie group/Lie algebra. Hyper algebra proposes some new algebraic structures such as hyper group, hyper ring, and hyper module over matrices and vectors with mixed dimensions (MVMDs). They have sets of classical…

    Submitted 27 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

  3. arXiv:2406.08830  [pdf, other]

    cs.LG cs.AI

    Center-Sensitive Kernel Optimization for Efficient On-Device Incremental Learning

    Authors: Dingwen Zhang, Yan Li, De Cheng, Nannan Wang, Junwei Han

    Abstract: To facilitate the evolution of edge intelligence in ever-changing environments, we study on-device incremental learning constrained in limited computation resource in this paper. Current on-device training methods just focus on efficient training without considering the catastrophic forgetting, preventing the model getting stronger when continually exploring the world. To solve this problem, a dir…

    Submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2406.08698  [pdf, other]

    astro-ph.HE hep-ph

    Constraints on Ultra Heavy Dark Matter Properties from Dwarf Spheroidal Galaxies with LHAASO Observations

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

    Abstract: In this work we try to search for signals generated by ultra-heavy dark matter at the Large High Altitude Air Shower Observatory (LHAASO) data. We look for possible gamma-ray by dark matter annihilation or decay from 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter which have low fluxes…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 17 pages, 12 figures, accepted by PRL

  5. arXiv:2406.05658  [pdf, other]

    cs.CV cs.AI

    Visual Prompt Tuning in Null Space for Continual Learning

    Authors: Yue Lu, Shizhou Zhang, De Cheng, Yinghui Xing, Nannan Wang, Peng Wang, Yanning Zhang

    Abstract: Existing prompt-tuning methods have demonstrated impressive performances in continual learning (CL), by selecting and updating relevant prompts in the vision-transformer models. On the contrary, this paper aims to learn each task by tuning the prompts in the direction orthogonal to the subspace spanned by previous tasks' features, so as to ensure no interference on tasks that have been learned to…

    Submitted 10 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

    Comments: 20 pages, 10 figures

  6. arXiv:2406.03751  [pdf, other]

    cs.LG

    Adaptive Multi-Scale Decomposition Framework for Time Series Forecasting

    Authors: Yifan Hu, Peiyuan Liu, Peng Zhu, Dawei Cheng, Tao Dai

    Abstract: Transformer-based and MLP-based methods have emerged as leading approaches in time series forecasting (TSF). While Transformer-based methods excel in capturing long-range dependencies, they suffer from high computational complexities and tend to overfit. Conversely, MLP-based methods offer computational efficiency and adeptness in modeling temporal dynamics, but they struggle with capturing comple…

    Submitted 6 June, 2024; originally announced June 2024.

  7. arXiv:2406.00321  [pdf, other]

    physics.optics cond-mat.other quant-ph

    Non-Abelian lattice gauge fields in the photonic synthetic frequency dimension

    Authors: Dali Cheng, Kai Wang, Charles Roques-Carmes, Eran Lustig, Olivia Y. Long, Heming Wang, Shanhui Fan

    Abstract: Non-Abelian gauge fields provide a conceptual framework for the description of particles having spins. The theoretical importance of non-Abelian gauge fields motivates their experimental synthesis and explorations. Here, we demonstrate non-Abelian lattice gauge fields for photons. In the study of gauge fields, lattice models are essential for the understanding of their implications in extended sys…

    Submitted 1 June, 2024; originally announced June 2024.

  8. First results of AUP Nb3Sn quadrupole horizontal tests

    Authors: M. Baldini, G. Ambrosio, G. Apollinari, J. Blowers, R. Bossert, R. Carcagno, G. Chlachidze, J. DiMarco, S. Feher, S. Krave, V. Lombardo, L. Martin, C. Narug, T. H. Nicol, V. Nikolic, A. Nobrega, V. Marinozzi, C. Orozco, T. Page, S. Stoynev, T. Strauss, M. Turenne, D. Turrioni, A. Vouris, M. Yu, et al. (26 additional authors not shown)

    Abstract: The Large Hadron Collider will soon undergo an upgrade to increase its luminosity by a factor of ~10 [1]. A crucial part of this upgrade will be replacement of the NbTi focusing magnets with Nb3Sn magnets that achieve a ~50% increase in the field strength. This will be the first ever large-scale implementation of Nb3Sn magnets in a particle accelerator. The High-Luminosity LHC Upgrade, HL-LHC is a…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: IPAC'24 - 15th International Particle Accelerator Conference

    Report number: FERMILAB-CONF-24-0273-TD

    Journal ref: JACoW IPAC2024 (2024) THYN1

  9. arXiv:2405.14622  [pdf, other]

    cs.LG cs.CL cs.CV

    Calibrated Self-Rewarding Vision Language Models

    Authors: Yiyang Zhou, Zhiyuan Fan, Dongjie Cheng, Sihan Yang, Zhaorun Chen, Chenhang Cui, Xiyao Wang, Yun Li, Linjun Zhang, Huaxiu Yao

    Abstract: Large Vision-Language Models (LVLMs) have made substantial progress by integrating pre-trained large language models (LLMs) and vision models through instruction tuning. Despite these advancements, LVLMs often exhibit the hallucination phenomenon, where generated text responses appear linguistically plausible but contradict the input image, indicating a misalignment between image and text pairs. T…

    Submitted 31 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: fix some typos and add acknowledgement section in V3

  10. arXiv:2405.11826  [pdf, other]

    astro-ph.IM hep-ex physics.ins-det

    Data quality control system and long-term performance monitor of the LHAASO-KM2A

    Authors: Zhen Cao, F. Aharonian, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen, et al. (263 additional authors not shown)

    Abstract: The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To…

    Submitted 13 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 15 pages, 9 figures

  11. arXiv:2405.07691  [pdf, other]

    astro-ph.HE

    Discovery of Very-high-energy Gamma-ray Emissions from the Low Luminosity AGN NGC 4278 by LHAASO

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

    Abstract: The first source catalog of Large High Altitude Air Shower Observatory reported the detection of a very-high-energy gamma ray source, 1LHAASO J1219+2915. In this paper a further detailed study of the spectral and temporal behavior of this point-like source have been carried. The best-fit position of the TeV source ($\rm{RA}=185.05^{\circ}\pm0.04^{\circ}$, $\rm{Dec}=29.25^{\circ}\pm0.03^{\circ}$) i…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 11 pages, 5 figures

  12. arXiv:2404.15688  [pdf, ps, other]

    math.OC

    Observer-Based Realization of Control Systems

    Authors: Daizhan Cheng, Changxi Li, Xiao Zhang, Zhengping Ji

    Abstract: Lebesgue-type of dynamic control systems and dimension-keeping semi-tensor product (DK-STP) of matrices are introduced. Using bridge matrices, the DK-STP is used to construct approximated observer-based realization (OR) of linear control systems, as Lebesgue-type control systems, are proposed. A necessary and sufficient condition for the OR-system to have exactly same observer dynamics is obtained…

    Submitted 24 April, 2024; originally announced April 2024.

  13. arXiv:2404.11825  [pdf, other]

    cs.LG

    Hypergraph Self-supervised Learning with Sampling-efficient Signals

    Authors: Fan Li, Xiaoyang Wang, Dawei Cheng, Wenjie Zhang, Ying Zhang, Xuemin Lin

    Abstract: Self-supervised learning (SSL) provides a promising alternative for representation learning on hypergraphs without costly labels. However, existing hypergraph SSL models are mostly based on contrastive methods with the instance-level discrimination strategy, suffering from two significant limitations: (1) They select negative samples arbitrarily, which is unreliable in deciding similar and dissimi…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 9 pages, 4 figures, 4 tables

  14. arXiv:2404.04801  [pdf, ps, other]

    astro-ph.IM astro-ph.HE

    LHAASO-KM2A detector simulation using Geant4

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (254 additional authors not shown)

    Abstract: KM2A is one of the main sub-arrays of LHAASO, working on gamma ray astronomy and cosmic ray physics at energies above 10 TeV. Detector simulation is the important foundation for estimating detector performance and data analysis. It is a big challenge to simulate the KM2A detector in the framework of Geant4 due to the need to track numerous photons from a large number of detector units (>6000) with…

    Submitted 7 April, 2024; originally announced April 2024.

  15. arXiv:2404.03259  [pdf, ps, other]

    cs.CL cs.AI

    Enhancing the Performance of Aspect-Based Sentiment Analysis Systems

    Authors: Chen Li, Huidong Tang, Peng Ju, Debo Cheng, Yasuhiko Morimoto

    Abstract: Aspect-based sentiment analysis aims to predict sentiment polarity with fine granularity. While Graph Convolutional Networks (GCNs) are widely utilized for sentimental feature extraction, their naive application for syntactic feature extraction can compromise information preservation. This study introduces an innovative edge-enhanced GCN, named SentiSys, to navigate the syntactic graph while prese…

    Submitted 19 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

  16. arXiv:2404.03254  [pdf, ps, other]

    cs.DC

    Mining Area Skyline Objects from Map-based Big Data using Apache Spark Framework

    Authors: Chen Li, Ye Zhu, Yang Cao, Jinli Zhang, Annisa Annisa, Debo Cheng, Yasuhiko Morimoto

    Abstract: The computation of the skyline provides a mechanism for utilizing multiple location-based criteria to identify optimal data points. However, the efficiency of these computations diminishes and becomes more challenging as the input data expands. This study presents a novel algorithm aimed at mitigating this challenge by harnessing the capabilities of Apache Spark, a distributed processing platform,…

    Submitted 4 April, 2024; originally announced April 2024.

  17. arXiv:2403.17458  [pdf, ps, other]

    cs.CR cs.LG

    Expectations Versus Reality: Evaluating Intrusion Detection Systems in Practice

    Authors: Jake Hesford, Daniel Cheng, Alan Wan, Larry Huynh, Seungho Kim, Hyoungshick Kim, Jin B. Hong

    Abstract: Our paper provides empirical comparisons between recent IDSs to provide an objective comparison between them to help users choose the most appropriate solution based on their requirements. Our results show that no one solution is the best, but is dependent on external variables such as the types of attacks, complexity, and network environment in the dataset. For example, BoT_IoT and Stratosphere I…

    Submitted 28 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: 10 pages

    MSC Class: 68M25; 68M20 ACM Class: C.4; D.m

  18. arXiv:2403.12865  [pdf, other]

    cs.RO

    PE-Planner: A Performance-Enhanced Quadrotor Motion Planner for Autonomous Flight in Complex and Dynamic Environments

    Authors: Jiaxin Qiu, Qingchen Liu, Jiahu Qin, Dewang Cheng, Yawei Tian, Qichao Ma

    Abstract: The role of a motion planner is pivotal in quadrotor applications, yet existing methods often struggle to adapt to complex environments, limiting their ability to achieve fast, safe, and robust flight. In this letter, we introduce a performance-enhanced quadrotor motion planner designed for autonomous flight in complex environments including dense obstacles, dynamic obstacles, and unknown disturba…

    Submitted 19 March, 2024; originally announced March 2024.

  19. arXiv:2403.10339  [pdf, other]

    cs.LG

    Generation is better than Modification: Combating High Class Homophily Variance in Graph Anomaly Detection

    Authors: Rui Zhang, Dawei Cheng, Xin Liu, Jie Yang, Yi Ouyang, Xian Wu, Yefeng Zheng

    Abstract: Graph-based anomaly detection is currently an important research topic in the field of graph neural networks (GNNs). We find that in graph anomaly detection, the homophily distribution differences between different classes are significantly greater than those in homophilic and heterophilic graphs. For the first time, we introduce a new metric called Class Homophily Variance, which quantitatively d…

    Submitted 15 March, 2024; originally announced March 2024.

  20. Measurements of All-Particle Energy Spectrum and Mean Logarithmic Mass of Cosmic Rays from 0.3 to 30 PeV with LHAASO-KM2A

    Authors: The LHAASO Collaboration, Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, et al. (256 additional authors not shown)

    Abstract: We present the measurements of all-particle energy spectrum and mean logarithmic mass of cosmic rays in the energy range of 0.3-30 PeV using data collected from LHAASO-KM2A between September 2021 and December 2022, which is based on a nearly composition-independent energy reconstruction method, achieving unprecedented accuracy. Our analysis reveals the position of the knee at…

    Submitted 26 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: 8 pages, 3 figures

    Journal ref: Physical Review Letters 132, 131002 (2024)

  21. arXiv:2403.07292  [pdf, other]

    cs.CV cs.AI

    Continual All-in-One Adverse Weather Removal with Knowledge Replay on a Unified Network Structure

    Authors: De Cheng, Yanling Ji, Dong Gong, Yan Li, Nannan Wang, Junwei Han, Dingwen Zhang

    Abstract: In real-world applications, image degeneration caused by adverse weather is always complex and changes with different weather conditions from days and seasons. Systems in real-world environments constantly encounter adverse weather conditions that are not previously observed. Therefore, it practically requires adverse weather removal models to continually learn from incrementally collected data re…

    Submitted 11 March, 2024; originally announced March 2024.

  22. arXiv:2403.06107  [pdf, other]

    cs.CV

    Textureless Object Recognition: An Edge-based Approach

    Authors: Frincy Clement, Kirtan Shah, Dhara Pancholi, Gabriel Lugo Bustillo, Dr. Irene Cheng

    Abstract: Textureless object recognition has become a significant task in Computer Vision with the advent of Robotics and its applications in manufacturing sector. It has been challenging to obtain good accuracy in real time because of its lack of discriminative features and reflectance properties which makes the techniques for textured object recognition insufficient for textureless objects. A lot of work…

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:1910.14255

  23. arXiv:2402.15759  [pdf]

    cs.CV cs.AI

    Increasing SAM Zero-Shot Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation

    Authors: Zekun Jiang, Dongjie Cheng, Ziyuan Qin, Jun Gao, Qicheng Lao, Kang Li, Le Zhang

    Abstract: This study develops and evaluates a novel multimodal medical image zero-shot segmentation algorithm named Text-Visual-Prompt SAM (TV-SAM) without any manual annotations. TV-SAM incorporates and integrates large language model GPT-4, Vision Language Model GLIP, and Segment Anything Model (SAM), to autonomously generate descriptive text prompts and visual bounding box prompts from medical images, th…

    Submitted 24 February, 2024; originally announced February 2024.

    Comments: 12 pages, 4 figures, 4 tables

  24. arXiv:2402.09668  [pdf, other]

    cs.LG cs.AI cs.CL

    How to Train Data-Efficient LLMs

    Authors: Noveen Sachdeva, Benjamin Coleman, Wang-Cheng Kang, Jianmo Ni, Lichan Hong, Ed H. Chi, James Caverlee, Julian McAuley, Derek Zhiyuan Cheng

    Abstract: The training of large language models (LLMs) is expensive. In this paper, we study data-efficient approaches for pre-training LLMs, i.e., techniques that aim to optimize the Pareto frontier of model quality and training resource/data consumption. We seek to understand the tradeoffs associated with data selection routines based on (i) expensive-to-compute data-quality estimates, and (ii) maximizati…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Under review. 44 pages, 30 figures

  25. arXiv:2402.07235  [pdf, other]

    econ.GN

    Scientific Talent Leaks Out of Funding Gaps

    Authors: Wei Yang Tham, Joseph Staudt, Elisabeth Ruth Perlman, Stephanie D. Cheng

    Abstract: We study how delays in NIH grant funding affect the career outcomes of research personnel. Using comprehensive earnings and tax records linked to university transaction data along with a difference-in-differences design, we find that a funding interruption of more than 30 days has a substantial effect on job placements for personnel who work in labs with a single NIH R01 research grant, including…

    Submitted 11 February, 2024; originally announced February 2024.

  26. arXiv:2402.06854  [pdf, other]

    cs.CV cs.GR cs.LG

    Gyroscope-Assisted Motion Deblurring Network

    Authors: Simin Luan, Cong Yang, Zeyd Boukhers, Xue Qin, Dongfeng Cheng, Wei Sui, Zhijun Li

    Abstract: Image research has shown substantial attention in deblurring networks in recent years. Yet, their practical usage in real-world deblurring, especially motion blur, remains limited due to the lack of pixel-aligned training triplets (background, blurred image, and blur heat map) and restricted information inherent in blurred images. This paper presents a simple yet efficient framework to synthetic a…

    Submitted 9 February, 2024; originally announced February 2024.

  27. arXiv:2402.04852  [pdf, other]

    cs.LG

    Multi-Patch Prediction: Adapting LLMs for Time Series Representation Learning

    Authors: Yuxuan Bian, Xuan Ju, Jiangtong Li, Zhijian Xu, Dawei Cheng, Qiang Xu

    Abstract: In this study, we present aLLM4TS, an innovative framework that adapts Large Language Models (LLMs) for time-series representation learning. Central to our approach is that we reconceive time-series forecasting as a self-supervised, multi-patch prediction task, which, compared to traditional contrastive learning or mask-and-reconstruction methods, captures temporal dynamics in patch representation…

    Submitted 9 March, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  28. arXiv:2402.04141  [pdf, other]

    cs.SE cs.AI

    Multi-line AI-assisted Code Authoring

    Authors: Omer Dunay, Daniel Cheng, Adam Tait, Parth Thakkar, Peter C Rigby, Andy Chiu, Imad Ahmad, Arun Ganesan, Chandra Maddila, Vijayaraghavan Murali, Ali Tayyebi, Nachiappan Nagappan

    Abstract: CodeCompose is an AI-assisted code authoring tool powered by large language models (LLMs) that provides inline suggestions to 10's of thousands of developers at Meta. In this paper, we present how we scaled the product from displaying single-line suggestions to multi-line suggestions. This evolution required us to overcome several unique challenges in improving the usability of these suggestions f…

    Submitted 6 February, 2024; originally announced February 2024.

  29. arXiv:2402.01242  [pdf, other]

    cs.LG

    Two Heads Are Better Than One: Boosting Graph Sparse Training via Semantic and Topological Awareness

    Authors: Guibin Zhang, Yanwei Yue, Kun Wang, Junfeng Fang, Yongduo Sui, Kai Wang, Yuxuan Liang, Dawei Cheng, Shirui Pan, Tianlong Chen

    Abstract: Graph Neural Networks (GNNs) excel in various graph learning tasks but face computational challenges when applied to large-scale graphs. A promising solution is to remove non-essential edges to reduce the computational overheads in GNN. Previous literature generally falls into two categories: topology-guided and semantic-guided. The former maintains certain graph topological properties yet often u…

    Submitted 2 February, 2024; originally announced February 2024.

  30. arXiv:2402.00672  [pdf, other]

    cs.CV cs.AI

    Exploring Homogeneous and Heterogeneous Consistent Label Associations for Unsupervised Visible-Infrared Person ReID

    Authors: Lingfeng He, De Cheng, Nannan Wang, Xinbo Gao

    Abstract: Unsupervised visible-infrared person re-identification (USL-VI-ReID) aims to retrieve pedestrian images of the same identity from different modalities without annotations. While prior work focuses on establishing cross-modality pseudo-label associations to bridge the modality-gap, they ignore maintaining the instance-level homogeneous and heterogeneous consistency in pseudo-label space, resulting…

    Submitted 4 February, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  31. arXiv:2401.01873  [pdf, other]

    quant-ph cond-mat.mtrl-sci physics.optics

    Observation of the Magnonic Dicke Superradiant Phase Transition

    Authors: Dasom Kim, Sohail Dasgupta, Xiaoxuan Ma, Joong-Mok Park, Hao-Tian Wei, Liang Luo, Jacques Doumani, Xinwei Li, Wanting Yang, Di Cheng, Richard H. J. Kim, Henry O. Everitt, Shojiro Kimura, Hiroyuki Nojiri, Jigang Wang, Shixun Cao, Motoaki Bamba, Kaden R. A. Hazzard, Junichiro Kono

    Abstract: Two-level atoms coupled with single-mode cavity photons are predicted to exhibit a quantum phase transition when the coupling strength exceeds a critical value, entering a phase in which atomic polarization and photonic field are finite even at zero temperature and without external driving. However, this phenomenon, the superradiant phase transition (SRPT), is forbidden by a no-go theorem due to t…

    Submitted 3 January, 2024; originally announced January 2024.

  32. arXiv:2312.10912  [pdf, other]

    cond-mat.str-el cond-mat.supr-con quant-ph

    Discovery of an Unconventional Quantum Echo by Interference of Higgs Coherence

    Authors: C. Huang, M. Mootz, L. Luo, D. Cheng, J. M. Park, R. H. J. Kim, Y. Qiang, V. L. Quito, Yongxin Yao, P. P. Orth, I. E. Perakis, J. Wang

    Abstract: Nonlinearities in quantum systems are fundamentally characterized by the interplay of phase coherences, their interference, and state transition amplitudes. Yet the question of how quantum coherence and interference manifest in transient, massive Higgs excitations, prevalent within both the quantum vacuum and superconductors, remains elusive. One hallmark example is photon echo, enabled by the gen…

    Submitted 17 December, 2023; originally announced December 2023.

  33. arXiv:2312.07175  [pdf, other]

    cs.LG cs.AI stat.ME

    Instrumental Variable Estimation for Causal Inference in Longitudinal Data with Time-Dependent Latent Confounders

    Authors: Debo Cheng, Ziqi Xu, Jiuyong Li, Lin Liu, Jixue Liu, Wentao Gao, Thuc Duy Le

    Abstract: Causal inference from longitudinal observational data is a challenging problem due to the difficulty in correctly identifying the time-dependent confounders, especially in the presence of latent time-dependent confounders. Instrumental variable (IV) is a powerful tool for addressing the latent confounders issue, but the traditional IV technique cannot deal with latent time-dependent confounders in…

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 13 pages, 7 figures and 3 tables

  34. arXiv:2312.06323  [pdf, other]

    cs.CV

    Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models

    Authors: Yubin Wang, Xinyang Jiang, De Cheng, Dongsheng Li, Cairong Zhao

    Abstract: Prompt learning has become a prevalent strategy for adapting vision-language foundation models to downstream tasks. As large language models (LLMs) have emerged, recent studies have explored the use of category-related descriptions as input to enhance prompt effectiveness. Nevertheless, conventional descriptions fall short of structured information that effectively represents the interconnections…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: AAAI2024

  35. arXiv:2312.05404  [pdf, other]

    cs.LG cs.AI stat.ME

    Disentangled Latent Representation Learning for Tackling the Confounding M-Bias Problem in Causal Inference

    Authors: Debo Cheng, Yang Xie, Ziqi Xu, Jiuyong Li, Lin Liu, Jixue Liu, Yinghao Zhang, Zaiwen Feng

    Abstract: In causal inference, it is a fundamental task to estimate the causal effect from observational data. However, latent confounders pose major challenges in causal inference in observational data, for example, confounding bias and M-bias. Recent data-driven causal effect estimators tackle the confounding bias problem via balanced representation learning, but assume no M-bias in the system, thus they…

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 10 pages, 3 figures and 5 tables. Accepted by ICDM2023

  36. arXiv:2312.02483  [pdf, other]

    cs.CV

    EtC: Temporal Boundary Expand then Clarify for Weakly Supervised Video Grounding with Multimodal Large Language Model

    Authors: Guozhang Li, Xinpeng Ding, De Cheng, Jie Li, Nannan Wang, Xinbo Gao

    Abstract: Early weakly supervised video grounding (WSVG) methods often struggle with incomplete boundary detection due to the absence of temporal boundary annotations. To bridge the gap between video-level and boundary-level annotation, explicit-supervision methods, i.e., generating pseudo-temporal boundaries for training, have achieved great success. However, data augmentations in these methods might disru…

    Submitted 6 March, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

  37. arXiv:2311.08593  [pdf, other]

    cs.CL cs.IR

    ACID: Abstractive, Content-Based IDs for Document Retrieval with Language Models

    Authors: Haoxin Li, Phillip Keung, Daniel Cheng, Jungo Kasai, Noah A. Smith

    Abstract: Generative retrieval (Wang et al., 2022; Tay et al., 2022) is a new approach for end-to-end document retrieval that directly generates document identifiers given an input query. Techniques for designing effective, high-quality document IDs remain largely unexplored. We introduce ACID, in which each document's ID is composed of abstractive keyphrases generated by a large language model, rather than…

    Submitted 14 November, 2023; originally announced November 2023.

  38. arXiv:2311.08430  [pdf, other]

    cs.LG cs.AI cs.IR

    Rankitect: Ranking Architecture Search Battling World-class Engineers at Meta Scale

    Authors: Wei Wen, Kuang-Hung Liu, Igor Fedorov, Xin Zhang, Hang Yin, Weiwei Chu, Kaveh Hassani, Mengying Sun, Jiang Liu, Xu Wang, Lin Jiang, Yuxin Chen, Buyun Zhang, Xi Liu, Dehua Cheng, Zhengxing Chen, Guang Zhao, Fangqiu Han, Jiyan Yang, Yuchen Hao, Liang Xiong, Wen-Yen Chen

    Abstract: Neural Architecture Search (NAS) has demonstrated its efficacy in computer vision and potential for ranking systems. However, prior work focused on academic problems, which are evaluated at small scale under well-controlled fixed baselines. In industry system, such as ranking system in Meta, it is unclear whether NAS algorithms from the literature can outperform production baselines because of: (1…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: Wei Wen and Kuang-Hung Liu contribute equally

  39. arXiv:2311.07124  [pdf, other]

    math.OC

    Design of zero-determinant strategies and its application to networked repeated games

    Authors: Daizhan Cheng, Changxi Li

    Abstract: Using semi-tensor product (STP) of matrices, the profile evolutionary equation (PEE) for repeated finite games is obtained. By virtue of PEE, the zero-determinant (ZD) strategies are developed for general finite games. A formula is then obtained to design ZD strategies for general finite games with multi-player and asymmetric strategies. A necessary and sufficient condition is obtained to ensure t…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2107.03255

  40. arXiv:2311.06761  [pdf, other]

    cs.CL

    Learning Knowledge-Enhanced Contextual Language Representations for Domain Natural Language Understanding

    Authors: Ruyao Xu, Taolin Zhang, Chengyu Wang, Zhongjie Duan, Cen Chen, Minghui Qiu, Dawei Cheng, Xiaofeng He, Weining Qian

    Abstract: Knowledge-Enhanced Pre-trained Language Models (KEPLMs) improve the performance of various downstream NLP tasks by injecting knowledge facts from large-scale Knowledge Graphs (KGs). However, existing methods for pre-training KEPLMs with relational triples are difficult to be adapted to close domains due to the lack of sufficient domain graph semantics. In this paper, we propose a Knowledge-enhance…

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: emnlp 2023

  41. arXiv:2311.05812  [pdf, other]

    cs.CL

    CFBenchmark: Chinese Financial Assistant Benchmark for Large Language Model

    Authors: Yang Lei, Jiangtong Li, Dawei Cheng, Zhijun Ding, Changjun Jiang

    Abstract: Large language models (LLMs) have demonstrated great potential in the financial domain. Thus, it becomes important to assess the performance of LLMs in the financial tasks. In this work, we introduce CFBenchmark, to evaluate the performance of LLMs for Chinese financial assistant. The basic version of CFBenchmark is designed to evaluate the basic ability in Chinese financial text processing from t…

    Submitted 21 May, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: 12 pages, 4 figures

  42. Strain-Tunable Magnetic Compensation Temperature of Epitaxial Tb$_3$Fe$_5$O$_{12}$ Thin Films

    Authors: Yufei Li, Xihui Yang, Hua Bai, Mingzhi Wang, Dashuai Cheng, Cheng Song, Zhe Yuan, Yi Liu, Zhong Shi

    Abstract: High-quality rare-earth iron garnet (ReIG) Tb$_3$Fe$_5$O$_{12}$ (TbIG) thin films are epitaxially grown on a series of (111)-oriented garnet substrates with various lattice constants. The coherent growth induces a substrate-dependent in-plane tensile or compressive strain in the TbIG film. Measurements of the anomalous Hall-like effect (AHLE) in TbIG/Pt heterostructures show that the compensation…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 28 pages, 5 figures

  43. arXiv:2310.17082  [pdf, ps, other]

    astro-ph.HE

    Does or did the supernova remnant Cassiopeia A operate as a PeVatron?

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

    Abstract: For decades, supernova remnants (SNRs) have been considered the prime sources of Galactic Cosmic rays (CRs). But whether SNRs can accelerate CR protons to PeV energies and thus dominate CR flux up to the knee is currently under intensive theoretical and phenomenological debate. The direct test of the ability of SNRs to operate as CR PeVatrons can be provided by ultrahigh-energy (UHE;…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 11 pages, 3 figures, Accepted by the APJL

  44. arXiv:2310.14266  [pdf, ps, other]

    math.NA math.RA

    On Universal Eigenvalues and Eigenvectors of Hypermatrices

    Authors: Daizhan Cheng, Zhengping Ji

    Abstract: A cubic hypermatrix of order $d$ can be considered as a structure matrix of a tensor with covariant order $r$ and contra-variant order $s=d-r$. Corresponding to this matrix expression of the hypermatrix, an eigenvector $x$ with respect to an eigenvalue $λ$ is proposed, called the universal eigenvector and eigenvalue of the hypermatrix. According to the action of tensors, if $x$ is decomposable, it…

    Submitted 22 October, 2023; originally announced October 2023.

  45. arXiv:2310.09983  [pdf, other]

    cs.LG cs.AI cs.CL cs.IR

    Farzi Data: Autoregressive Data Distillation

    Authors: Noveen Sachdeva, Zexue He, Wang-Cheng Kang, Jianmo Ni, Derek Zhiyuan Cheng, Julian McAuley

    Abstract: We study data distillation for auto-regressive machine learning tasks, where the input and output have a strict left-to-right causal structure. More specifically, we propose Farzi, which summarizes an event sequence dataset into a small number of synthetic sequences -- Farzi Data -- which are optimized to maintain (if not improve) model performance compared to training on the full dataset. Under t…

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: Under review. 23 pages, 9 figures

  46. arXiv:2310.09762  [pdf, other]

    cs.CL cs.AI

    Diversifying the Mixture-of-Experts Representation for Language Models with Orthogonal Optimizer

    Authors: Boan Liu, Liang Ding, Li Shen, Keqin Peng, Yu Cao, Dazhao Cheng, Dacheng Tao

    Abstract: The Mixture of Experts (MoE) has emerged as a highly successful technique in deep learning, based on the principle of divide-and-conquer to maximize model capacity without significant additional computational cost. Even in the era of large-scale language models (LLMs), MoE continues to play a crucial role, as some researchers have indicated that GPT-4 adopts the MoE structure to ensure diverse inf…

    Submitted 15 October, 2023; originally announced October 2023.

  47. Very high energy gamma-ray emission beyond 10 TeV from GRB 221009A

    Authors: Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

    Abstract: The highest energy gamma-rays from gamma-ray bursts (GRBs) have important implications for their radiation mechanism. Here we report for the first time the detection of gamma-rays up to 13 TeV from the brightest GRB 221009A by the Large High Altitude Air-shower Observatory (LHAASO). The LHAASO-KM2A detector registered more than 140 gamma-rays with energies above 3 TeV during 230$-$900s after the t…

    Submitted 22 November, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: 49pages, 11figures

    Journal ref: Science Advances, 9, eadj2778 (2023) 15 November 2023

  48. arXiv:2310.08300  [pdf, ps, other]

    math.DG math.AP

    Existence of constant mean curvature disks in $\mathbb{R}^3$ with capillary boundary condition

    Authors: Da Rong Cheng

    Abstract: We extend Struwe's result (Acta Math., 1988) on the existence of free boundary constant mean curvature disks to almost every prescribed boundary contact angle in $(0, π)$. Specifically, let $Σ$ be a surface in $\mathbb{R}^3$ diffeomorphic to the sphere, and let $Σ'$ be a convex surface enclosing $Σ$. Given $τ\in (-1, 1)$ and a constant $H \geq 0$ below the infimum of the mean curvature of $Σ'$, we…

    Submitted 12 October, 2023; originally announced October 2023.

  49. arXiv:2310.02589  [pdf, other]

    cond-mat.supr-con

    Evidence for highly damped Higgs mode in infinite-layer nickelates

    Authors: Bing Cheng, Di Cheng, Kyuho Lee, Martin Mootz, Chuankun Huang, Liang Luo, Zhuoyu Chen, Yonghun Lee, Bai Yang Wang, Ilias E. Perakis, Zhi-Xun Shen, Harold Y. Hwang, Jigang Wang

    Abstract: The dynamics of Higgs mode in superconductors, manifested as coherent oscillations of the superconducting order parameter amplitude, provides vital insights into the nature of the superconducting gap structure and symmetry. Here we utilize two-dimensional terahertz coherent spectroscopy to investigate Higgs dynamics of a newly discovered infinite-layer nickelate superconductor. While we observe di…

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 10 pages, 5 figures

  50. Low-energy electrodynamics of infinite-layer nickelates: evidence for d-wave superconductivity in the dirty limit

    Authors: Bing Cheng, Di Cheng, Kyuho Lee, Liang Luo, Zhuoyu Chen, Yonghun Lee, Bai Yang Wang, Martin Mootz, Ilias E. Perakis, Zhi-Xun Shen, Harold Y. Hwang, Jigang Wang

    Abstract: The discovery of superconductivity in infinite-layer nickelates establishes a new category of unconventional superconductors that share structural and electronic similarities with cuprates. Despite exciting advances, such as the establishment of a cuprate-like phase diagram and the observation of charge order and short-range antiferromagnetic fluctuation, the key issues of superconducting pairing…

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 8 pages, 4 figures

    Journal ref: Nature Materials (2024)