Showing 1–50 of 3,144 results for author: Yang, C

  1. arXiv:2407.08901  [pdf, other]

    physics.comp-ph math.NA

    Accelerating Eigenvalue Computation for Nuclear Structure Calculations via Perturbative Corrections

    Authors: Dong Min Roh, Esmond Ng, Chao Yang, Dean Lee, Pieter Maris, James P. Vary

    Abstract: We present a new method for computing the lowest few eigenvalues and the corresponding eigenvectors of a nuclear many-body Hamiltonian represented in a truncated configuration interaction subspace, i.e., the no-core shell model (NCSM). The method uses the hierarchical structure of the NCSM Hamiltonian to partition the Hamiltonian as the sum of two matrices. The first matrix corresponds to the Hami…

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2407.07924  [pdf, other]

    math.OC cs.AI cs.CL cs.LG

    Solving General Natural-Language-Description Optimization Problems with Large Language Models

    Authors: Jihai Zhang, Wei Wang, Siyan Guo, Li Wang, Fangquan Lin, Cheng Yang, Wotao Yin

    Abstract: Optimization problems seek to find the best solution to an objective under a set of constraints, and have been widely investigated in real-world applications. Modeling and solving optimization problems in a specific domain typically require a combination of domain knowledge, mathematical skills, and programming ability, making it difficult for general users and even domain professionals. In this p…

    Submitted 9 July, 2024; originally announced July 2024.

  3. arXiv:2407.07690  [pdf]

    physics.optics physics.app-ph

    High power GaSb-based distributed feedback laser with laterally coupled dielectric gratings at 1.95μm

    Authors: Zhengqing Ding, Juntian Cao, Kun Zhan, Yihang Chen, Lidan Zhou, Hao Tan, Chenao Yang, Ying Yu, Zhichuan Niu, Siyuan Yu

    Abstract: Traditional Distributed Feedback (DFB) or Distributed Bragg Reflector (DBR) lasers typically utilize buried gratings as frequency-selective optical feedback mechanisms. However, the fabrication of such gratings often necessitates regrowth processes, which can pose technical challenges for materials platforms such as GaAs and GaSb. Metal gratings were also used for GaSb lasers but they introduce ad…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 9 pages, 7 figures, 1 table

    MSC Class: 78A60 ACM Class: J.2.6

  4. arXiv:2407.07061  [pdf, other]

    cs.CL

    Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence

    Authors: Weize Chen, Ziming You, Ran Li, Yitong Guan, Chen Qian, Chenyang Zhao, Cheng Yang, Ruobing Xie, Zhiyuan Liu, Maosong Sun

    Abstract: The rapid advancement of large language models (LLMs) has paved the way for the development of highly capable autonomous agents. However, existing multi-agent frameworks often struggle with integrating diverse capable third-party agents due to reliance on agents defined within their own ecosystems. They also face challenges in simulating distributed environments, as most frameworks are limited to…

    Submitted 10 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: work in progress

  5. arXiv:2407.06957  [pdf, other]

    eess.AS cs.CL cs.CY

    Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language Models

    Authors: Yi-Cheng Lin, Tzu-Quan Lin, Chih-Kai Yang, Ke-Han Lu, Wei-Chih Chen, Chun-Yi Kuan, Hung-yi Lee

    Abstract: Speech Integrated Large Language Models (SILLMs) combine large language models with speech perception to perform diverse tasks, such as emotion recognition to speaker verification, demonstrating universal audio understanding capability. However, these models may amplify biases present in training data, potentially leading to biased access to information for marginalized groups. This work introduce…

    Submitted 9 July, 2024; originally announced July 2024.

  6. arXiv:2407.06103  [pdf, other]

    quant-ph

    QTRL: Toward Practical Quantum Reinforcement Learning via Quantum-Train

    Authors: Chen-Yu Liu, Chu-Hsuan Abraham Lin, Chao-Han Huck Yang, Kuan-Cheng Chen, Min-Hsiu Hsieh

    Abstract: Quantum reinforcement learning utilizes quantum layers to process information within a machine learning model. However, both pure and hybrid quantum reinforcement learning face challenges such as data encoding and the use of quantum computers during the inference stage. We apply the Quantum-Train method to reinforcement learning tasks, called QTRL, training the classical policy network model using…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 6 pages, 1 figure

  7. arXiv:2407.05934  [pdf, other]

    cs.LG cs.AI

    Graph Anomaly Detection with Noisy Labels by Reinforcement Learning

    Authors: Zhu Wang, Shuang Zhou, Junnan Dong, Chang Yang, Xiao Huang, Shengjie Zhao

    Abstract: Graph anomaly detection (GAD) has been widely applied in many areas, e.g., fraud detection in finance and robot accounts in social networks. Existing methods are dedicated to identifying the outlier nodes that deviate from normal ones. While they heavily rely on high-quality annotation, which is hard to obtain in real-world scenarios, this could lead to severely degraded performance based on noisy…

    Submitted 8 July, 2024; originally announced July 2024.

  8. arXiv:2407.05813  [pdf, other]

    hep-ex astro-ph.CO

    DarkSide-20k sensitivity to light dark matter particles

    Authors: DarkSide-20k Collaboration: F. Acerbi, P. Adhikari, P. Agnes, I. Ahmad, S. Albergo, I. F. M. Albuquerque, T. Alexander, A. K. Alton, P. Amaudruz, M. Angiolilli, E. Aprile, R. Ardito, M. Atzori Corona, D. J. Auty, M. Ave, I. C. Avetisov, O. Azzolini, H. O. Back, Z. Balmforth, A. Barrado Olmedo, P. Barrillon, G. Batignani, P. Bhowmick, et al. (289 additional authors not shown)

    Abstract: The dual-phase liquid argon time projection chamber is presently one of the leading technologies to search for dark matter particles with masses below 10 GeV/c$^2$. This was demonstrated by the DarkSide-50 experiment with approximately 50 kg of low-radioactivity liquid argon as target material. The next generation experiment DarkSide-20k, currently under construction, will use 1,000 times more arg…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: submitted to Nature Communications

  9. arXiv:2407.05718  [pdf, other]

    cs.CL

    A Factuality and Diversity Reconciled Decoding Method for Knowledge-Grounded Dialogue Generation

    Authors: Chenxu Yang, Zheng Lin, Chong Tian, Liang Pang, Lanrui Wang, Zhengyang Tong, Qirong Ho, Yanan Cao, Weiping Wang

    Abstract: Grounding external knowledge can enhance the factuality of responses in dialogue generation. However, excessive emphasis on it might result in the lack of engaging and diverse expressions. Through the introduction of randomness in sampling, current approaches can increase the diversity. Nevertheless, such sampling method could undermine the factuality in dialogue generation. In this study, to disc…

    Submitted 8 July, 2024; originally announced July 2024.

  10. arXiv:2407.05536  [pdf, other]

    quant-ph

    Effective Many-body Interactions in Reduced-Dimensionality Spaces Through Neural Network Models

    Authors: Senwei Liang, Karol Kowalski, Chao Yang, Nicholas P. Bauman

    Abstract: Accurately describing properties of challenging problems in physical sciences often requires complex mathematical models that are unmanageable to tackle head-on. Therefore, developing reduced dimensionality representations that encapsulate complex correlation effects in many-body systems is crucial to advance the understanding of these complicated problems. However, a numerical evaluation of these…

    Submitted 7 July, 2024; originally announced July 2024.

  11. arXiv:2407.05361  [pdf, other]

    eess.AS cs.CL

    Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation

    Authors: Haorui He, Zengqiang Shang, Chaoren Wang, Xuyuan Li, Yicheng Gu, Hua Hua, Liwei Liu, Chen Yang, Jiaqi Li, Peiyang Shi, Yuancheng Wang, Kai Chen, Pengyuan Zhang, Zhizheng Wu

    Abstract: Recently, speech generation models have made significant progress by using large-scale training data. However, the research community struggles to produce highly spontaneous and human-like speech due to the lack of large-scale, diverse, and spontaneous speech data. This paper presents Emilia, the first multilingual speech generation dataset from in-the-wild speech data, and Emilia-Pipe, th…

    Submitted 7 July, 2024; originally announced July 2024.

  12. arXiv:2407.05216  [pdf, other]

    cs.CL

    Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course

    Authors: Cheng-Han Chiang, Wei-Chih Chen, Chun-Yi Kuan, Chienchou Yang, Hung-yi Lee

    Abstract: Using large language models (LLMs) for automatic evaluation has become an important evaluation method in NLP research. However, it is unclear whether these LLM-based evaluators can be applied in real-world classrooms to assess student assignments. This empirical report shares how we use GPT-4 as an automatic assignment evaluator in a university course with 1,028 students. Based on student response…

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: An empirical report of our course: Introduction to Generative AI 2024 Spring (https://speech.ee.ntu.edu.tw/~hylee/genai/2024-spring.php)

  13. arXiv:2407.05149  [pdf]

    physics.bio-ph physics.app-ph physics.chem-ph physics.optics

    Quantized Acoustic Phonons Map the Dynamics of a Single Virus

    Authors: Yaqing Zhang, Rihan Wu, Md Shahjahan, Canchai Yang, Dohun Pyeon, Elad Harel

    Abstract: The natural vibrational frequencies of biological particles such as viruses and bacteria encode critical information about their mechanical and biological states as they interact with their local environment and undergo structural evolution. However, detecting and tracking these vibrations within a biological context at the single particle level has remained elusive. In this study, we track the vi…

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: Main Manuscript: 19 pages, 4 figures Supplementary Information: 29 pages, 17 figures

  14. arXiv:2407.04738  [pdf]

    eess.SP cs.LG cs.RO

    A Contrastive Learning Based Convolutional Neural Network for ERP Brain-Computer Interfaces

    Authors: Yuntian Cui, Xinke Shen, Dan Zhang, Chen Yang

    Abstract: ERP-based EEG detection is gaining increasing attention in the field of brain-computer interfaces. However, due to the complexity of ERP signal components, their low signal-to-noise ratio, and significant inter-subject variability, cross-subject ERP signal detection has been challenging. The continuous advancement in deep learning has greatly contributed to addressing this issue. This brief propos…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 5 pages, 2 figures, 2 tables

  15. arXiv:2407.03053  [pdf]

    physics.optics physics.app-ph

    Visible, Near-, and Mid-infrared Computational Spectrometer Enabled by Single-Spinning Film Encoder

    Authors: Junren Wen, Weiming Shi, Cheng Gao, Yujie Liu, Shuaibo Feng, Yu Shao, Haiqi Gao, Yuchuan Shao, Yueguang Zhang, Weidong Shen, Chenying Yang

    Abstract: Computational spectrometers are pivotal in enabling low-cost, in-situ and rapid spectral analysis, with potential applications in chemistry, biology, and environmental science. However, filter-based spectral encoding approaches typically use filter arrays, complicating the manufacturing process and hindering device consistency. By capitalizing on the polarization separation effect under oblique in…

    Submitted 3 July, 2024; originally announced July 2024.

  16. arXiv:2407.02935  [pdf, other]

    nucl-ex hep-ex nucl-th

    Properties of the QCD Matter -- An Experimental Review of Selected Results from RHIC BES Program

    Authors: Jinhui Chen, Xin Dong, Xionghong He, Huanzhong Huang, Feng Liu, Xiaofeng Luo, Yu-Gang Ma, Lijuan Ruan, Ming Shao, Shusu Shi, Xu Sun, Aihong Tang, Zebo Tang, Fuqiang Wang, Hai Wang, Yi Wang, Zhigang Xiao, Guannan Xie, Nu Xu, Qinghua Xu, Zhangbu Xu, Chi Yang, Shuai Yang, Wangmei Zha, Yapeng Zhang, et al. (3 additional authors not shown)

    Abstract: In the paper, we discuss the development of the multi-gap resistive plate chamber Time-of-Flight (TOF) technology and the production of the STAR TOF detector in China at the beginning of the 21st century. Then we review recent experimental results from the first beam energy scan program (BES-I) at the Relativistic Heavy Ion Collider (RHIC). Topics cover measurements of collectivity, chirality, cri…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 31 pages, 33 figures. This review is dedicated to Professor Wenqing Shen on the occasion to celebrate his leadership of the Chinese STAR Collaboration, the development and production of the STAR MRPC TOF detector in China and many physics analyses

  17. A Pairwise DomMix Attentive Adversarial Network for Unsupervised Domain Adaptive Object Detection

    Authors: Jie Shao, Jiacheng Wu, Wenzhong Shen, Cheng Yang

    Abstract: Unsupervised Domain Adaptive Object Detection (DAOD) could adapt a model trained on a source domain to an unlabeled target domain for object detection. Existing unsupervised DAOD methods usually perform feature alignments from the target to the source. Unidirectional domain transfer would omit information about the target samples and result in suboptimal adaptation when there are large domain shif…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Published in IEEE Signal Processing Letters, 2023

  18. arXiv:2407.02511  [pdf, other]

    cs.RO cs.AI cs.CL

    LLM-A*: Large Language Model Enhanced Incremental Heuristic Search on Path Planning

    Authors: Silin Meng, Yiwei Wang, Cheng-Fu Yang, Nanyun Peng, Kai-Wei Chang

    Abstract: Path planning is a fundamental scientific problem in robotics and autonomous navigation, requiring the derivation of efficient routes from starting to destination points while avoiding obstacles. Traditional algorithms like A* and its variants are capable of ensuring path validity but suffer from significant computational and memory inefficiencies as the state space grows. Conversely, large langua…

    Submitted 19 June, 2024; originally announced July 2024.

    Comments: Submitted to The 2024 Conference on Empirical Methods in Natural Language Processing

  19. arXiv:2407.02235  [pdf]

    cs.CL

    Towards a Holistic Framework for Multimodal Large Language Models in Three-dimensional Brain CT Report Generation

    Authors: Cheng-Yi Li, Kao-Jung Chang, Cheng-Fu Yang, Hsin-Yu Wu, Wenting Chen, Hritik Bansal, Ling Chen, Yi-Ping Yang, Yu-Chun Chen, Shih-Pin Chen, Jiing-Feng Lirng, Kai-Wei Chang, Shih-Hwa Chiou

    Abstract: Multi-modal large language models (MLLMs) have been given free rein to explore exciting medical applications with a primary focus on radiology report generation. Nevertheless, the preliminary success in 2D radiology captioning is incompetent to reflect the real-world diagnostic challenge in the volumetric 3D anatomy. To mitigate three crucial limitation aspects in the existing literature, includin…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 6 figures, 5 supplementary figures, 8 supplementary tables

  20. arXiv:2407.02047  [pdf, other]

    cs.CV

    CountFormer: Multi-View Crowd Counting Transformer

    Authors: Hong Mo, Xiong Zhang, Jianchao Tan, Cheng Yang, Qiong Gu, Bo Hang, Wenqi Ren

    Abstract: Multi-view counting (MVC) methods have shown their superiority over single-view counterparts, particularly in situations characterized by heavy occlusion and severe perspective distortions. However, hand-crafted heuristic features and identical camera layout requirements in conventional MVC methods limit their applicability and scalability in real-world scenarios. In this work, we propose a concise…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted By ECCV2024

  21. arXiv:2407.01885  [pdf, other]

    cs.CL cs.AI

    Survey on Knowledge Distillation for Large Language Models: Methods, Evaluation, and Application

    Authors: Chuanpeng Yang, Wang Lu, Yao Zhu, Yidong Wang, Qian Chen, Chenlong Gao, Bingjie Yan, Yiqiang Chen

    Abstract: Large Language Models (LLMs) have showcased exceptional capabilities in various domains, attracting significant interest from both academia and industry. Despite their impressive performance, the substantial size and computational demands of LLMs pose considerable challenges for practical deployment, particularly in environments with limited resources. The endeavor to compress language models whil…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 28 pages

  22. arXiv:2407.00903  [pdf, other]

    quant-ph

    Observation of topological transitions associated with a Weyl exceptional ring

    Authors: Hao-Long Zhang, Pei-Rong Han, Xue-Jia Yu, Shou-Bang Yang, Jia-Hao Lü, Wen Ning, Fan Wu, Qi-Ping Su, Chui-Ping Yang, Zhen-Biao Yang, Shi-Biao Zheng

    Abstract: The environment-induced dissipation of an open system, once thought as a nuisance, can actually lead to emergence of many intriguing phenomena that are absent in an isolated system. Among these, Weyl exceptional rings (WER), extended from point-like singularities, are particularly interesting. Theoretically, a WER was predicted to carry a topological charge with a nonzero Chern number, but it has…

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 16 pages, 10 figures

  23. arXiv:2407.00427  [pdf, ps, other]

    math.CO

    On the boundedness of degenerate hypergraphs

    Authors: Jianfeng Hou, Caiyun Hu, Heng Li, Xizhi Liu, Caihong Yang, Yixiao Zhang

    Abstract: We investigate the impact of a high-degree vertex in Turán problems for degenerate hypergraphs (including graphs). We say an $r$-graph $F$ is bounded if there exist constants $α, β>0$ such that for large $n$, every $n$-vertex $F$-free $r$-graph with a vertex of degree at least $α\binom{n-1}{r-1}$ has fewer than $(1-β) \cdot \mathrm{ex}(n,F)$ edges. The boundedness property is crucial for recent wo…

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: comments are welcome

  24. arXiv:2407.00365  [pdf, other]

    cs.CL

    Financial Knowledge Large Language Model

    Authors: Cehao Yang, Chengjin Xu, Yiyan Qi

    Abstract: Artificial intelligence is making significant strides in the finance industry, revolutionizing how data is processed and interpreted. Among these technologies, large language models (LLMs) have demonstrated substantial potential to transform financial services by automating complex tasks, enhancing customer service, and providing detailed financial analysis. Firstly, we introduce IDEA-FinBench, an…

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 66 pages

  25. arXiv:2407.00072  [pdf, other]

    cs.IR cs.CL

    Pistis-RAG: A Scalable Cascading Framework Towards Trustworthy Retrieval-Augmented Generation

    Authors: Yu Bai, Yukai Miao, Li Chen, Dan Li, Yanyu Ren, Hongtao Xie, Ce Yang, Xuhui Cai

    Abstract: In Greek mythology, Pistis symbolized good faith, trust, and reliability. Drawing inspiration from these principles, Pistis-RAG is a scalable multi-stage framework designed to address the challenges of large-scale retrieval-augmented generation (RAG) systems. This framework consists of distinct stages: matching, pre-ranking, ranking, reasoning, and aggregating. Each stage contributes to narrowing…

    Submitted 11 July, 2024; v1 submitted 21 June, 2024; originally announced July 2024.

  26. arXiv:2406.20015  [pdf, other]

    cs.CL cs.AI

    ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models

    Authors: Yuxiang Zhang, Jing Chen, Junjie Wang, Yaxin Liu, Cheng Yang, Chufan Shi, Xinyu Zhu, Zihao Lin, Hanwen Wan, Yujiu Yang, Tetsuya Sakai, Tian Feng, Hayato Yamana

    Abstract: Tool-augmented large language models (LLMs) are rapidly being integrated into real-world applications. Due to the lack of benchmarks, the community still needs to fully understand the hallucination issues within these models. To address this challenge, we introduce a comprehensive diagnostic benchmark, ToolBH. Specifically, we assess the LLM's hallucinations through two perspectives: depth and bre…

    Submitted 28 June, 2024; originally announced June 2024.

  27. Quantum voting machine encoded with microwave photons

    Authors: Yu Zhang, Chuiping Yang, Qiping Su, Yihao Kang, Wen Zheng, Shaoxiong Li, Yang Yu

    Abstract: We propose a simple quantum voting machine using microwave photon qubit encoding, based on a setup comprising multiple microwave cavities and a coupled superconducting flux qutrit. This approach primarily relies on a multi-control single-target quantum phase gate. The scheme offers operational simplicity, requiring only a single step, while ensuring verifiability through the measurement of a singl…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 14 pages, 4 figures. arXiv admin note: text overlap with arXiv:2306.02227

    MSC Class: 81V99

    Journal ref: Chin. Phys. Lett. 41 070302 (2024)

  28. arXiv:2406.18181  [pdf, ps, other]

    cs.SE

    An Empirical Study of Unit Test Generation with Large Language Models

    Authors: Lin Yang, Chen Yang, Shutao Gao, Weijing Wang, Bo Wang, Qihao Zhu, Xiao Chu, Jianyi Zhou, Guangtai Liang, Qianxiang Wang, Junjie Chen

    Abstract: Unit testing is an essential activity in software development for verifying the correctness of software components. However, manually writing unit tests is challenging and time-consuming. The emergence of Large Language Models (LLMs) offers a new direction for automating unit test generation. Existing research primarily focuses on closed-source LLMs (e.g., ChatGPT and CodeX) with fixed prompting s…

    Submitted 26 June, 2024; originally announced June 2024.

  29. arXiv:2406.17934  [pdf, other]

    astro-ph.EP

    Rapid protoplanet formation in vortices: three-dimensional local simulations with selfgravity

    Authors: Wladimir Lyra, Chao-Chin Yang, Jacob B. Simon, Orkan M. Umurhan, Andrew N. Youdin

    Abstract: Disk vortices, seen in numerical simulations of protoplanetary disks and found observationally in ALMA and VLA images of these objects, are promising sites for planet formation given their pebble trapping abilities. Previous works have shown strong concentration of pebbles in vortices, but gravitational collapse has only been shown in low-resolution, two-dimensional, global models. In this letter,…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 9 pages, 5 figures, accepted for publication in ApJ letters

  30. arXiv:2406.17720  [pdf, other]

    cs.CV

    Arboretum: A Large Multimodal Dataset Enabling AI for Biodiversity

    Authors: Chih-Hsuan Yang, Benjamin Feuer, Zaki Jubery, Zi K. Deng, Andre Nakkab, Md Zahid Hasan, Shivani Chiranjeevi, Kelly Marshall, Nirmal Baishnab, Asheesh K Singh, Arti Singh, Soumik Sarkar, Nirav Merchant, Chinmay Hegde, Baskar Ganapathysubramanian

    Abstract: We introduce Arboretum, the largest publicly accessible dataset designed to advance AI for biodiversity applications. This dataset, curated from the iNaturalist community science platform and vetted by domain experts to ensure accuracy, includes 134.6 million images, surpassing existing datasets in scale by an order of magnitude. The dataset encompasses image-language paired data for a diverse set…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Preprint under review

  31. arXiv:2406.17488  [pdf, other]

    eess.SP

    Environmental Variation or Instrumental Drift? A Probabilistic Approach to Gas Sensor Drift Modeling and Evaluation

    Authors: Cheng Yang, Gustav Bohlin, Tobias Oechtering

    Abstract: Drift is a significant issue that undermines the reliability of gas sensors. This paper introduces a probabilistic model to distinguish between environmental variation and instrumental drift, using low-cost non-dispersive infrared (NDIR) CO2 sensors as a case study. Data from a long-term field experiment is analyzed to evaluate both sensor performance and environmental changes over time. Our appro…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: This conference paper has been submitted to IEEE SENSORS 2024

  32. arXiv:2406.16744  [pdf]

    cond-mat.mtrl-sci

    Lone Pair Induced 1D Character and Weak Cation-anion Interactions: Two Ingredients for Low Thermal Conductivity in Mixed-anion Metal Chalcohalides

    Authors: Xingchen Shen, Koushik Pal, Paribesh Acharyya, Bernard Raveau, Philippe Boullay, Carmelo Prestipino, Susumu Fujii, Chun-Chuen Yang, I-Yu Tsao, Adele Renaud, Pierric Lemoine, Christophe Candolfi, Emmanuel Guilmeau

    Abstract: Mixed-anion compounds, which incorporate multiple types of anions into materials, display tailored crystal structures and physical/chemical properties, garnering immense interest in various applications such as batteries, catalysis, photovoltaics, and thermoelectrics. However, detailed studies regarding correlations between crystal structure, chemical bonding, and thermal/vibrational properties…

    Submitted 24 June, 2024; originally announced June 2024.

  33. arXiv:2406.16715  [pdf, other]

    cs.LG

    GC-Bench: A Benchmark Framework for Graph Condensation with New Insights

    Authors: Shengbo Gong, Juntong Ni, Noveen Sachdeva, Carl Yang, Wei Jin

    Abstract: Graph condensation (GC) is an emerging technique designed to learn a significantly smaller graph that retains the essential information of the original graph. This condensed graph has shown promise in accelerating graph neural networks while preserving performance comparable to those achieved with the original, larger graphs. Additionally, this technique facilitates downstream applications such as…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 9 pages

  34. arXiv:2406.16529  [pdf, other]

    cs.CL

    Towards Better Graph-based Cross-document Relation Extraction via Non-bridge Entity Enhancement and Prediction Debiasing

    Authors: Hao Yue, Shaopeng Lai, Chengyi Yang, Liang Zhang, Junfeng Yao, Jinsong Su

    Abstract: Cross-document Relation Extraction aims to predict the relation between target entities located in different documents. In this regard, the dominant models commonly retain useful information for relation prediction via bridge entities, which allows the model to elaborately capture the intrinsic interdependence between target entities. However, these studies ignore the non-bridge entities, each of…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 Findings

  35. arXiv:2406.16303  [pdf, other]

    eess.SP

    Hybrid Precoding With Low-Resolution PSs for Wideband Terahertz Communication Systems in The Face of Beam Squint

    Authors: Yang Wang, Chuang Yang, Mugen Peng

    Abstract: Terahertz (THz) communication is considered one of the most critical technologies for 6G because of its abundant bandwidth. To compensate the high propagation of THz, analog/digital hybrid precoding for THz massive multiple input multiple output (MIMO) is proposed to focus signals and extend communication range. Notably, considering hardware cost and power consumption, infinite and high-resolution…

    Submitted 24 June, 2024; originally announced June 2024.

  36. arXiv:2406.15784  [pdf]

    cs.AI

    Data Issues in Industrial AI System: A Meta-Review and Research Strategy

    Authors: Xuejiao Li, Cheng Yang, Charles Møller, Jay Lee

    Abstract: In the era of Industry 4.0, artificial intelligence (AI) is assuming an increasingly pivotal role within industrial systems. Despite the recent trend within various industries to adopt AI, the actual adoption of AI is not as developed as perceived. A significant factor contributing to this lag is the data issues in AI implementation. How to address these data issues stands as a significant concern…

    Submitted 22 June, 2024; originally announced June 2024.

  37. arXiv:2406.15486  [pdf, other]

    cs.CL cs.AI cs.LG

    SampleAttention: Near-Lossless Acceleration of Long Context LLM Inference with Adaptive Structured Sparse Attention

    Authors: Qianchao Zhu, Jiangfei Duan, Chang Chen, Siran Liu, Xiuhong Li, Guanyu Feng, Xin Lv, Huanqi Cao, Xiao Chuanfu, Xingcheng Zhang, Dahua Lin, Chao Yang

    Abstract: Large language models (LLMs) now support extremely long context windows, but the quadratic complexity of vanilla attention results in significantly long Time-to-First-Token (TTFT) latency. Existing approaches to address this complexity require additional pretraining or finetuning, and often sacrifice model accuracy. In this paper, we first provide both theoretical and empirical foundations for nea…

    Submitted 28 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  38. arXiv:2406.14928  [pdf, other]

    cs.AI cs.CL cs.HC cs.MA cs.SI

    Autonomous Agents for Collaborative Task under Information Asymmetry

    Authors: Wei Liu, Chenxi Wang, Yifei Wang, Zihao Xie, Rennai Qiu, Yufan Dang, Zhuoyun Du, Weize Chen, Cheng Yang, Chen Qian

    Abstract: Large Language Model Multi-Agent Systems (LLM-MAS) have achieved great progress in solving complex tasks. It performs communication among agents within the system to collaboratively solve tasks, under the premise of shared information. However, when agents' communication is leveraged to enhance human cooperation, a new challenge arises due to information asymmetry, since each agent can only access…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 16 pages, 8 figures, 5 tables, Work in progress

  39. arXiv:2406.14083  [pdf, ps, other]

    math.CO

    Tight bounds for rainbow partial $F$-tiling in edge-colored complete hypergraphs

    Authors: Jinghua Deng, Jianfeng Hou, Xizhi Liu, Caihong Yang

    Abstract: For an $r$-graph $F$ and integers $n,t$ satisfying $t \le n/v(F)$, let $\mathrm{ar}(n,tF)$ denote the minimum integer $N$ such that every edge-coloring of $K_{n}^{r}$ using $N$ colors contains a rainbow copy of $tF$, where $tF$ is the $r$-graphs consisting of $t$ vertex-disjoint copies of $F$. The case $t=1$ is the classical anti-Ramsey problem proposed by Erdős--Simonovits--Sós~\cite{ESS75}. When…

    Submitted 21 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: 19 pages, 1 figure, comments are welcome

  40. arXiv:2406.14036  [pdf, other]

    cs.LG cs.AI cs.CL

    Toward Infinite-Long Prefix in Transformer

    Authors: Jiuxiang Gu, Yingyu Liang, Zhenmei Shi, Zhao Song, Chiwun Yang

    Abstract: Prompting and contextual-based fine-tuning methods, which we call Prefix Learning, have been proposed to enhance the performance of language models on various downstream tasks that can match full parameter fine-tuning. There remains a limited theoretical understanding of how these methods work. In this paper, we aim to relieve this limitation by studying the learning ability of Prefix Learning fro…

    Submitted 20 June, 2024; originally announced June 2024.

  41. arXiv:2406.13988  [pdf, other]

    cs.CV

    LGmap: Local-to-Global Mapping Network for Online Long-Range Vectorized HD Map Construction

    Authors: Kuang Wu, Sulei Nian, Can Shen, Chuan Yang, Zhanbin Li

    Abstract: This report introduces the first-place winning solution for the Autonomous Grand Challenge 2024 - Mapless Driving. In this report, we introduce a novel online mapping pipeline, LGmap, which is adept at long-range temporal modeling. Firstly, we propose symmetric view transformation (SVT), a hybrid view transformation module. Our approach overcomes the limitations of forward sparse feature representation an…

    Submitted 20 June, 2024; originally announced June 2024.

  42. arXiv:2406.13912  [pdf, other]

    cs.CV

    From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment

    Authors: Yusuke Hirota, Ryo Hachiuma, Chao-Han Huck Yang, Yuta Nakashima

    Abstract: Large language models (LLMs) have enhanced the capacity of vision-language models to caption visual text. This generative approach to image caption enrichment further makes textual captions more descriptive, improving alignment with the visual context. However, while many studies focus on benefits of generative caption enrichment (GCE), are there any negative side effects? We compare standard-form…

    Submitted 19 June, 2024; originally announced June 2024.

  43. arXiv:2406.13873  [pdf, other]

    cs.AI

    A Pure Transformer Pretraining Framework on Text-attributed Graphs

    Authors: Yu Song, Haitao Mao, Jiachen Xiao, Jingzhe Liu, Zhikai Chen, Wei Jin, Carl Yang, Jiliang Tang, Hui Liu

    Abstract: Pretraining plays a pivotal role in acquiring generalized knowledge from large-scale data, achieving remarkable successes as evidenced by large models in CV and NLP. However, progress in the graph domain remains limited due to fundamental challenges such as feature heterogeneity and structural heterogeneity. Recently, increasing efforts have been made to enhance node feature quality with Large Lan…

    Submitted 19 June, 2024; originally announced June 2024.

  44. arXiv:2406.13173  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Biomedical Visual Instruction Tuning with Clinician Preference Alignment

    Authors: Hejie Cui, Lingjun Mao, Xin Liang, Jieyu Zhang, Hui Ren, Quanzheng Li, Xiang Li, Carl Yang

    Abstract: Recent advancements in multimodal foundation models have showcased impressive capabilities in understanding and reasoning with visual and textual information. Adapting these foundation models trained for general usage to specialized domains like biomedicine requires large-scale domain-specific instruction datasets. While existing works have explored curating such datasets automatically, the result…

    Submitted 29 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

    MSC Class: 68T50; 68T45; 68T37; 68T05; 68T07; 68T09; ACM Class: I.2.7; I.2.6; I.2.10

  45. arXiv:2406.13055  [pdf, other]

    nucl-th astro-ph.CO physics.plasm-ph

    Self-consistent strong screening applied to thermonuclear reactions

    Authors: Christopher Grayson, Cheng Tao Yang, Martin Formanek, Johann Rafelski

    Abstract: Self-consistent strong plasma screening around light nuclei is implemented in the Big Bang nucleosynthesis (BBN) epoch to determine the short-range screening potential, $eφ(r)/T \geq 1$, relevant for thermonuclear reactions. We numerically solve the non-linear Poisson-Boltzmann equation incorporating Fermi-Dirac statistics adopting a generalized screening mass to find the electric potential in the…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 16 pages, 5 figures, typeset using LATEX default style in AASTeX631

  46. arXiv:2406.11817  [pdf, other]

    cs.CL cs.AI cs.LG

    Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level

    Authors: Jie Liu, Zhanhui Zhou, Jiaheng Liu, Xingyuan Bu, Chao Yang, Han-Sen Zhong, Wanli Ouyang

    Abstract: Direct Preference Optimization (DPO), a standard method for aligning language models with human preferences, is traditionally applied to offline preferences. Recent studies show that DPO benefits from iterative training with online preferences labeled by a trained reward model. In this work, we identify a pitfall of vanilla iterative DPO - improved response quality can lead to increased verbosity.…

    Submitted 17 June, 2024; originally announced June 2024.

  47. arXiv:2406.11683  [pdf, other]

    cs.CL

    HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing

    Authors: Jing Chen, Xinyu Zhu, Cheng Yang, Chufan Shi, Yadong Xi, Yuxiang Zhang, Junjie Wang, Jiashu Pu, Rongsheng Zhang, Yujiu Yang, Tian Feng

    Abstract: Generative AI has demonstrated unprecedented creativity in the field of computer vision, yet such phenomena have not been observed in natural language processing. In particular, large language models (LLMs) can hardly produce written works at the level of human experts due to the extremely high complexity of literature writing. In this paper, we present HoLLMwood, an automated framework for unleas…

    Submitted 17 June, 2024; originally announced June 2024.

  48. arXiv:2406.11160  [pdf, other]

    cs.AI

    Context Graph

    Authors: Chengjin Xu, Muzhi Li, Cehao Yang, Xuhui Jiang, Lumingyuan Tang, Yiyan Qi, Jian Guo

    Abstract: Knowledge Graphs (KGs) are foundational structures in many AI applications, representing entities and their interrelations through triples. However, triple-based KGs lack the contextual information of relational knowledge, like temporal dynamics and provenance details, which are crucial for comprehensive knowledge representation and effective reasoning. Instead, Context Graphs (CGs) expan…

    Submitted 27 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

  49. arXiv:2406.11085  [pdf, other]

    cs.CL

    Multiple Sources are Better Than One: Incorporating External Knowledge in Low-Resource Glossing

    Authors: Changbing Yang, Garrett Nicolai, Miikka Silfverberg

    Abstract: In this paper, we address the data scarcity problem in automatic data-driven glossing for low-resource languages by coordinating multiple sources of linguistic expertise. We supplement models with translations at both the token and sentence level as well as leverage the extensive linguistic capability of modern LLMs. Our enhancements lead to an average absolute improvement of 5%-points in word-lev…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2403.08189

  50. arXiv:2406.10869  [pdf, other]

    eess.IV cs.CV

    Geometric Distortion Guided Transformer for Omnidirectional Image Super-Resolution

    Authors: Cuixin Yang, Rongkang Dong, Jun Xiao, Cong Zhang, Kin-Man Lam, Fei Zhou, Guoping Qiu

    Abstract: As virtual and augmented reality applications gain popularity, omnidirectional image (ODI) super-resolution has become increasingly important. Unlike 2D plain images that are formed on a plane, ODIs are projected onto spherical surfaces. Applying established image super-resolution methods to ODIs, therefore, requires performing equirectangular projection (ERP) to map the ODIs onto a plane. ODI sup…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 13 pages, 12 figures, journal