Skip to main content

Showing 1–50 of 165 results for author: Lv, X

  1. arXiv:2407.08202  [pdf

    physics.geo-ph

    A rotational ellipsoid model for solid Earth tide with high precision

    Authors: Yongfeng Yang, Yunfei Zhang, Qiang Liu, Xianqing Lv, Pu Huang

    Abstract: Solid Earth tide represents the elastic response of solid Earth to the lunar (solar) gravitational force. The yielding solid Earth due to the force has been thought to be a prolate ellipsoid since the time of Lord Kelvin, yet the ellipsoid's geometry such as semi-major axis's length, semi-minor axis's length, and oblateness remains unresolved. Additionally, the tidal displacement of solid Earth is… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 20 pages, 4 figures, 1 table

  2. arXiv:2407.04051  [pdf, other

    cs.SD cs.AI eess.AS

    FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs

    Authors: Keyu An, Qian Chen, Chong Deng, Zhihao Du, Changfeng Gao, Zhifu Gao, Yue Gu, Ting He, Hangrui Hu, Kai Hu, Shengpeng Ji, Yabin Li, Zerui Li, Heng Lu, Haoneng Luo, Xiang Lv, Bin Ma, Ziyang Ma, Chongjia Ni, Changhe Song, Jiaqi Shi, Xian Shi, Hao Wang, Wen Wang, Yuxuan Wang , et al. (8 additional authors not shown)

    Abstract: This report introduces FunAudioLLM, a model family designed to enhance natural voice interactions between humans and large language models (LLMs). At its core are two innovative models: SenseVoice, which handles multilingual speech recognition, emotion recognition, and audio event detection; and CosyVoice, which facilitates natural speech generation with control over multiple languages, timbre, sp… ▽ More

    Submitted 10 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

    Comments: Work in progress. Authors are listed in alphabetical order by family name

  3. arXiv:2406.18556  [pdf

    eess.IV cs.CV cs.LG

    Renal digital pathology visual knowledge search platform based on language large model and book knowledge

    Authors: Xiaomin Lv, Chong Lai, Liya Ding, Maode Lai, Qingrong Sun

    Abstract: Large models have become mainstream, yet their applications in digital pathology still require exploration. Meanwhile renal pathology images play an important role in the diagnosis of renal diseases. We conducted image segmentation and paired corresponding text descriptions based on 60 books for renal pathology, clustering analysis for all image and text description features based on large models,… ▽ More

    Submitted 26 May, 2024; originally announced June 2024.

    Comments: 9 pages, 6 figures

  4. arXiv:2406.15486  [pdf, other

    cs.CL cs.AI cs.LG

    SampleAttention: Near-Lossless Acceleration of Long Context LLM Inference with Adaptive Structured Sparse Attention

    Authors: Qianchao Zhu, Jiangfei Duan, Chang Chen, Siran Liu, Xiuhong Li, Guanyu Feng, Xin Lv, Huanqi Cao, Xiao Chuanfu, Xingcheng Zhang, Dahua Lin, Chao Yang

    Abstract: Large language models (LLMs) now support extremely long context windows, but the quadratic complexity of vanilla attention results in significantly long Time-to-First-Token (TTFT) latency. Existing approaches to address this complexity require additional pretraining or finetuning, and often sacrifice model accuracy. In this paper, we first provide both theoretical and empirical foundations for nea… ▽ More

    Submitted 28 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  5. arXiv:2406.12793  [pdf, other

    cs.CL

    ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

    Authors: Team GLM, :, Aohan Zeng, Bin Xu, Bowen Wang, Chenhui Zhang, Da Yin, Diego Rojas, Guanyu Feng, Hanlin Zhao, Hanyu Lai, Hao Yu, Hongning Wang, Jiadai Sun, Jiajie Zhang, Jiale Cheng, Jiayi Gui, Jie Tang, Jing Zhang, Juanzi Li, Lei Zhao, Lindong Wu, Lucen Zhong, Mingdao Liu, Minlie Huang , et al. (32 additional authors not shown)

    Abstract: We introduce ChatGLM, an evolving family of large language models that we have been developing over time. This report primarily focuses on the GLM-4 language series, which includes GLM-4, GLM-4-Air, and GLM-4-9B. They represent our most capable models that are trained with all the insights and lessons gained from the preceding three generations of ChatGLM. To date, the GLM-4 models are pre-trained… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  6. arXiv:2406.12295  [pdf, other

    cs.CL

    Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding

    Authors: Kaiyan Zhang, Jianyu Wang, Ning Ding, Biqing Qi, Ermo Hua, Xingtai Lv, Bowen Zhou

    Abstract: Large Language Models (LLMs) demonstrate impressive performance in diverse applications, yet they face significant drawbacks, including high inference latency, expensive training cost, and generation of hallucination. Collaborative decoding between large and small language models (SLMs) offers a novel approach to address these challenges. Inspired by dual-process cognitive theory, we integrate the… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  7. arXiv:2406.10744  [pdf, other

    cs.CV

    Technique Report of CVPR 2024 PBDL Challenges

    Authors: Ying Fu, Yu Li, Shaodi You, Boxin Shi, Linwei Chen, Yunhao Zou, Zichun Wang, Yichen Li, Yuze Han, Yingkai Zhang, Jianan Wang, Qinglin Liu, Wei Yu, Xiaoqian Lv, Jianing Li, Shengping Zhang, Xiangyang Ji, Yuanpei Chen, Yuhan Zhang, Weihang Peng, Liwen Zhang, Zhe Xu, Dingyong Gou, Cong Li, Senyan Xu , et al. (75 additional authors not shown)

    Abstract: The intersection of physics-based vision and deep learning presents an exciting frontier for advancing computer vision technologies. By leveraging the principles of physics to inform and enhance deep learning models, we can develop more robust and accurate vision systems. Physics-based vision aims to invert the processes to recover scene properties such as shape, reflectance, light distribution, a… ▽ More

    Submitted 12 July, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

    Comments: CVPR 2024 PBDL Challenges: https://pbdl-ws.github.io/pbdl2024/challenge/index.html

  8. arXiv:2406.03949  [pdf, other

    cs.CL

    UltraMedical: Building Specialized Generalists in Biomedicine

    Authors: Kaiyan Zhang, Sihang Zeng, Ermo Hua, Ning Ding, Zhang-Ren Chen, Zhiyuan Ma, Haoxin Li, Ganqu Cui, Biqing Qi, Xuekai Zhu, Xingtai Lv, Hu Jinfang, Zhiyuan Liu, Bowen Zhou

    Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities across various domains and are moving towards more specialized areas. Recent advanced proprietary models such as GPT-4 and Gemini have achieved significant advancements in biomedicine, which have also raised privacy and security challenges. The construction of specialized generalists hinges largely on high-quality datasets, enh… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Datasets and models are available at https://github.com/TsinghuaC3I/UltraMedical

  9. arXiv:2406.00434  [pdf, other

    cs.CV

    MoDGS: Dynamic Gaussian Splatting from Causually-captured Monocular Videos

    Authors: Qingming Liu, Yuan Liu, Jiepeng Wang, Xianqiang Lv, Peng Wang, Wenping Wang, Junhui Hou

    Abstract: In this paper, we propose MoDGS, a new pipeline to render novel-view images in dynamic scenes using only casually captured monocular videos. Previous monocular dynamic NeRF or Gaussian Splatting methods strongly rely on the rapid movement of input cameras to construct multiview consistency but fail to reconstruct dynamic scenes on casually captured input videos whose cameras are static or move slo… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  10. arXiv:2405.11870  [pdf, other

    cs.CL cs.AI

    Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process

    Authors: Ermo Hua, Biqing Qi, Kaiyan Zhang, Yue Yu, Ning Ding, Xingtai Lv, Kai Tian, Bowen Zhou

    Abstract: Supervised Fine-Tuning (SFT) and Preference Optimization (PO) are two fundamental processes for enhancing the capabilities of Language Models (LMs) post pre-training, aligning them better with human preferences. Although SFT advances in training efficiency, PO delivers better alignment, thus they are often combined. However, common practices simply apply them sequentially without integrating their… ▽ More

    Submitted 28 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  11. arXiv:2405.09552  [pdf, other

    eess.IV cs.AI cs.CV

    ODFormer: Semantic Fundus Image Segmentation Using Transformer for Optic Nerve Head Detection

    Authors: Jiayi Wang, Yi-An Mao, Xiaoyu Ma, Sicen Guo, Yuting Shao, Xiao Lv, Wenting Han, Mark Christopher, Linda M. Zangwill, Yanlong Bi, Rui Fan

    Abstract: Optic nerve head (ONH) detection has been a crucial area of study in ophthalmology for years. However, the significant discrepancy between fundus image datasets, each generated using a single type of fundus camera, poses challenges to the generalizability of ONH detection approaches developed based on semantic segmentation networks. Despite the numerous recent advancements in general-purpose seman… ▽ More

    Submitted 2 June, 2024; v1 submitted 15 April, 2024; originally announced May 2024.

  12. arXiv:2404.19584  [pdf, other

    physics.optics

    Broadband microwave-rate dark pulse microcombs in dissipation-engineered LiNbO$_3$ microresonators

    Authors: Xiaomin Lv, Binbin Nie, Chen Yang, Rui Ma, Ze Wang, Yanwu Liu, Xing Jin, Kaixuan Zhu, Zhenyu Chen, Du Qian, Guanyu Zhang, Guowei Lv, Qihuang Gong, Fang Bo, Qi-Fan Yang

    Abstract: Kerr microcombs generated in optical microresonators provide broadband light sources bridging optical and microwave signals. Their translation to thin-film lithium niobate unlocks second-order nonlinear optical interfaces such as electro-optic modulation and frequency doubling for completing comb functionalities. However, the strong Raman response of LiNbO$_3$ has complicated the formation of Kerr… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  13. arXiv:2404.16687  [pdf, other

    cs.CV

    NTIRE 2024 Quality Assessment of AI-Generated Content Challenge

    Authors: Xiaohong Liu, Xiongkuo Min, Guangtao Zhai, Chunyi Li, Tengchuan Kou, Wei Sun, Haoning Wu, Yixuan Gao, Yuqin Cao, Zicheng Zhang, Xiele Wu, Radu Timofte, Fei Peng, Huiyuan Fu, Anlong Ming, Chuanming Wang, Huadong Ma, Shuai He, Zifei Dou, Shu Chen, Huacong Zhang, Haiyi Xie, Chengwei Wang, Baoying Chen, Jishen Zeng , et al. (89 additional authors not shown)

    Abstract: This paper reports on the NTIRE 2024 Quality Assessment of AI-Generated Content Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2024. This challenge is to address a major challenge in the field of image and video processing, namely, Image Quality Assessment (IQA) and Video Quality Assessment (VQA) for AI-Generated Conte… ▽ More

    Submitted 7 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  14. arXiv:2404.13299  [pdf, other

    cs.CV

    PCQA: A Strong Baseline for AIGC Quality Assessment Based on Prompt Condition

    Authors: Xi Fang, Weigang Wang, Xiaoxin Lv, Jun Yan

    Abstract: The development of Large Language Models (LLM) and Diffusion Models brings the boom of Artificial Intelligence Generated Content (AIGC). It is essential to build an effective quality assessment framework to provide a quantifiable evaluation of different images or videos based on the AIGC technologies. The content generated by AIGC methods is driven by the crafted prompts. Therefore, it is intuitiv… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: Published in CVPR-2024's NTIRE: New Trends in Image Restoration and Enhancement workshop and challenges

  15. arXiv:2404.10253  [pdf, other

    cs.DC

    Kilometer-Level Coupled Modeling Using 40 Million Cores: An Eight-Year Journey of Model Development

    Authors: Xiaohui Duan, Yuxuan Li, Zhao Liu, Bin Yang, Juepeng Zheng, Haohuan Fu, Shaoqing Zhang, Shiming Xu, Yang Gao, Wei Xue, Di Wei, Xiaojing Lv, Lifeng Yan, Haopeng Huang, Haitian Lu, Lingfeng Wan, Haoran Lin, Qixin Chang, Chenlin Li, Quanjie He, Zeyu Song, Xuantong Wang, Yangyang Yu, Xilong Fan, Zhaopeng Qu , et al. (16 additional authors not shown)

    Abstract: With current and future leading systems adopting heterogeneous architectures, adapting existing models for heterogeneous supercomputers is of urgent need for improving model resolution and reducing modeling uncertainty. This paper presents our three-week effort on porting a complex earth system model, CESM 2.2, to a 40-million-core Sunway supercomputer. Taking a non-intrusive approach that tries t… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 18 pages, 13 figures

  16. arXiv:2404.03577  [pdf, other

    cs.CL

    Untangle the KNOT: Interweaving Conflicting Knowledge and Reasoning Skills in Large Language Models

    Authors: Yantao Liu, Zijun Yao, Xin Lv, Yuchen Fan, Shulin Cao, Jifan Yu, Lei Hou, Juanzi Li

    Abstract: Providing knowledge documents for large language models (LLMs) has emerged as a promising solution to update the static knowledge inherent in their parameters. However, knowledge in the document may conflict with the memory of LLMs due to outdated or incorrect knowledge in the LLMs' parameters. This leads to the necessity of examining the capability of LLMs to assimilate supplemental external know… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Accepted by LREC-COLING 2024 as long paper

  17. arXiv:2403.15872  [pdf, other

    cs.CL

    RAAMove: A Corpus for Analyzing Moves in Research Article Abstracts

    Authors: Hongzheng Li, Ruojin Wang, Ge Shi, Xing Lv, Lei Lei, Chong Feng, Fang Liu, Jinkun Lin, Yangguang Mei, Lingnan Xu

    Abstract: Move structures have been studied in English for Specific Purposes (ESP) and English for Academic Purposes (EAP) for decades. However, there are few move annotation corpora for Research Article (RA) abstracts. In this paper, we introduce RAAMove, a comprehensive multi-domain corpus dedicated to the annotation of move structures in RA abstracts. The primary objective of RAAMove is to facilitate mov… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC-COLING 2024

  18. arXiv:2403.12021  [pdf, other

    quant-ph cond-mat.quant-gas physics.atom-ph

    A tweezer array with 6100 highly coherent atomic qubits

    Authors: Hannah J. Manetsch, Gyohei Nomura, Elie Bataille, Kon H. Leung, Xudong Lv, Manuel Endres

    Abstract: Optical tweezer arrays have had a transformative impact on atomic and molecular physics over the past years, and they now form the backbone for a wide range of leading experiments in quantum computing, simulation, and metrology. Underlying this development is the simplicity of single particle control and detection inherent to the technique. Typical experiments trap tens to hundreds of atomic qubit… ▽ More

    Submitted 19 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: H.J.M., G.N., and E.B. contributed equally to this work

  19. arXiv:2403.11832  [pdf, other

    astro-ph.HE hep-ph

    Precise measurement of the cosmic-ray spectrum and $\left \langle \ln A \right \rangle$ by LHAASO -- connecting the Galactic to the extragalactic components

    Authors: Xing-Jian Lv, Xiao-Jun Bi, Kun Fang, Yi-Qing Guo, Hui-Hai He, Ling-Ling Ma, Peng-Fei Yin, Qiang Yuan, Meng-Jie Zhao

    Abstract: Recently LHAASO Collaboration gives precise measurements of cosmic rays (CR) all particle energy spectrum and mean logarithmic mass $\left \langle \ln A \right \rangle$ from 0.3 PeV to 30 PeV. Combining the CR measurements by AMS-02 and DAMPE in space and that by LHAASO and Auger on the ground we construct a model to recover all these measurements from tens of GeV to tens of EeV. We find the LHAAS… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 11 pages, 2 figures, 4 tables

  20. arXiv:2403.09384  [pdf

    physics.comp-ph cond-mat.mtrl-sci

    Anomalous thermal transport and high thermoelectric performance of Cu-based vanadate CuVO3

    Authors: Xin Jin, Qiling Ou, Haoran Wei, Xianyong Ding, Fangyang Zhan, Rui Wang, Xiaolong Yang, Xuewei Lv, Peng Yu

    Abstract: Thermoelectric (TE) conversion technology, capable of transforming heat into electricity, is critical for sustainable energy solutions. Many promising TE materials contain rare or toxic elements, so the development of cost-effective and eco-friendly high-performance TE materials is highly urgent. Herein, we explore the thermal transport and TE properties of transition metal vanadate CuVO3 by using… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  21. arXiv:2403.08281  [pdf, other

    cs.CL cs.AI

    Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models

    Authors: Ning Ding, Yulin Chen, Ganqu Cui, Xingtai Lv, Weilin Zhao, Ruobing Xie, Bowen Zhou, Zhiyuan Liu, Maosong Sun

    Abstract: Underlying data distributions of natural language, programming code, and mathematical symbols vary vastly, presenting a complex challenge for large language models (LLMs) that strive to achieve high performance across all three domains simultaneously. Achieving a very high level of proficiency for an LLM within a specific domain often requires extensive training with relevant corpora, which is typ… ▽ More

    Submitted 26 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  22. Dual-Context Aggregation for Universal Image Matting

    Authors: Qinglin Liu, Xiaoqian Lv, Wei Yu, Changyong Guo, Shengping Zhang

    Abstract: Natural image matting aims to estimate the alpha matte of the foreground from a given image. Various approaches have been explored to address this problem, such as interactive matting methods that use guidance such as click or trimap, and automatic matting methods tailored to specific objects. However, existing matting methods are designed for specific objects or guidance, neglecting the common re… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Journal ref: Multimed Tools Appl (2023)

  23. arXiv:2402.15149  [pdf, other

    astro-ph.HE hep-ph

    Possible spectral irregularities in the AMS-02 positron spectrum

    Authors: Xing-Jian Lv, Xiao-Jun Bi, Kun Fang, Peng-Fei Yin, Meng-Jie Zhao

    Abstract: The excesses in the electron and positron spectra observed by many experiments, such as PAMELA and AMS-02, have sparked significant theoretical investigation. It is not easy to distinguish the two primary hypotheses dark matter annihilation/decay and pulsars from the spectral features. Should pulsars be the source of this excess, the expected variability in their distribution may introduce distinc… ▽ More

    Submitted 29 May, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: 6 pages, 6 figures

  24. arXiv:2402.14840  [pdf, other

    cs.CL cs.AI stat.AP

    RJUA-MedDQA: A Multimodal Benchmark for Medical Document Question Answering and Clinical Reasoning

    Authors: Congyun Jin, Ming Zhang, Xiaowei Ma, Li Yujiao, Yingbo Wang, Yabo Jia, Yuliang Du, Tao Sun, Haowen Wang, Cong Fan, Jinjie Gu, Chenfei Chi, Xiangguo Lv, Fangzhou Li, Wei Xue, Yiran Huang

    Abstract: Recent advancements in Large Language Models (LLMs) and Large Multi-modal Models (LMMs) have shown potential in various medical applications, such as Intelligent Medical Diagnosis. Although impressive results have been achieved, we find that existing benchmarks do not reflect the complexity of real medical reports and specialized in-depth reasoning capabilities. In this work, we introduced RJUA-Me… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 15 pages, 13 figures

  25. arXiv:2401.18058  [pdf, other

    cs.CL cs.LG

    LongAlign: A Recipe for Long Context Alignment of Large Language Models

    Authors: Yushi Bai, Xin Lv, Jiajie Zhang, Yuze He, Ji Qi, Lei Hou, Jie Tang, Yuxiao Dong, Juanzi Li

    Abstract: Extending large language models to effectively handle long contexts requires instruction fine-tuning on input sequences of similar length. To address this, we present LongAlign -- a recipe of the instruction data, training, and evaluation for long context alignment. First, we construct a long instruction-following dataset using Self-Instruct. To ensure the data diversity, it covers a broad range o… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

  26. arXiv:2401.11204  [pdf, other

    cs.CV

    Towards Category Unification of 3D Single Object Tracking on Point Clouds

    Authors: Jiahao Nie, Zhiwei He, Xudong Lv, Xueyi Zhou, Dong-Kyu Chae, Fei Xie

    Abstract: Category-specific models are provenly valuable methods in 3D single object tracking (SOT) regardless of Siamese or motion-centric paradigms. However, such over-specialized model designs incur redundant parameters, thus limiting the broader applicability of 3D SOT task. This paper first introduces unified models that can simultaneously track objects across all categories using a single network with… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: Accepted by ICLR2024 (poster)

  27. arXiv:2312.16051  [pdf, other

    cs.CV

    Inter-X: Towards Versatile Human-Human Interaction Analysis

    Authors: Liang Xu, Xintao Lv, Yichao Yan, Xin Jin, Shuwen Wu, Congsheng Xu, Yifan Liu, Yizhou Zhou, Fengyun Rao, Xingdong Sheng, Yunhui Liu, Wenjun Zeng, Xiaokang Yang

    Abstract: The analysis of the ubiquitous human-human interactions is pivotal for understanding humans as social beings. Existing human-human interaction datasets typically suffer from inaccurate body motions, lack of hand gestures and fine-grained textual descriptions. To better perceive and generate human-human interactions, we propose Inter-X, a currently largest human-human interaction dataset with accur… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: Project page: https://liangxuy.github.io/inter-x/

  28. arXiv:2312.11196  [pdf, other

    quant-ph physics.atom-ph

    Coherence time of 20 s with a single cesium atom in an optical dipole trap

    Authors: Zhuangzhuang Tian, Haobo Chang, Xin Lv, Mengna Yang, Zhihui Wang, Pengfei Yang, Pengfei Zhang, Gang Li, Tiancai Zhang

    Abstract: We analyze the decoherence between two ground electronic states of an optically trapped atom by adopting a full description of the atomic wavefunction. The motional state, i.e., the phonon state, is taken into account. In addition to the decoherence due to the variance of differential light shift (DLS), a new decoherence mechanism, phonon-jumping-induced decoherence (PJID), is discovered and verif… ▽ More

    Submitted 31 December, 2023; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 5 pages, 4 figures in the main text; 6 pages, 8 figures in the supplementary material

  29. Giant domain wall anomalous Hall effect in an antiferromagnet

    Authors: Wei Xia, Bo Bai, Xuejiao Chen, Yichen Yang, Yang Zhang, Jian Yuan, Qiang Li, Kunya Yang, Xiangqi Liu, Yang Shi, Haiyang Ma, Huali Yang, Mingquan He, Lei Li, Chuanying Xi, Li Pi, Xiaodong Lv, Xia Wang, Xuerong Liu, Shiyan Li, Xiaodong Zhou, Jianpeng Liu, Yulin Chen, Jian Shen, Dawei Shen , et al. (3 additional authors not shown)

    Abstract: The Hall effect plays a crucial role in establishment of band theory of solids and discovery of emergent new phases of interacting electrons such as the topological phases of matter. Generally, the dissipationless Hall effect requires time-reversal symmetry breaking (TRSB), where TRSB induced by external magnetic field results in ordinary Hall effect, while TRSB caused by spontaneous magnetization… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 19 pages Main Text, 5 main figures

    Journal ref: Chinese Physics Letters 2022, 39: 067101

  30. arXiv:2312.06718  [pdf, other

    cs.AI

    Large Scale Foundation Models for Intelligent Manufacturing Applications: A Survey

    Authors: Haotian Zhang, Semujju Stuart Dereck, Zhicheng Wang, Xianwei Lv, Kang Xu, Liang Wu, Ye Jia, Jing Wu, Zhuo Long, Wensheng Liang, X. G. Ma, Ruiyan Zhuang

    Abstract: Although the applications of artificial intelligence especially deep learning had greatly improved various aspects of intelligent manufacturing, they still face challenges for wide employment due to the poor generalization ability, difficulties to establish high-quality training datasets, and unsatisfactory performance of deep learning methods. The emergence of large scale foundational models(LSFM… ▽ More

    Submitted 22 December, 2023; v1 submitted 10 December, 2023; originally announced December 2023.

  31. arXiv:2311.17494  [pdf, other

    quant-ph physics.atom-ph

    Resolved Raman sideband cooling of a single optically trapped cesium atom

    Authors: Zhuangzhuang Tian, Haobo Chang, Xin Lv, Mengna Yang, Zhihui Wang, Pengfei Yang, Pengfei Zhang, Gang Li, Tiancai Zhang

    Abstract: We developed a resolved Raman sideband cooling scheme that can efficiently prepare a single optically trapped cesium (Cs) atom in its motional ground states. A two-photon Raman process between two outermost Zeeman sublevels in a single hyperfine state is applied to reduce the phonon number. Our scheme is less sensitive to the variation in the magnetic field than the commonly used scheme where the… ▽ More

    Submitted 31 December, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: 4 pages, 3 figures, 1 table

  32. arXiv:2311.13982  [pdf, other

    cs.CL cs.AI

    Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex Questions

    Authors: Shulin Cao, Jiajie Zhang, Jiaxin Shi, Xin Lv, Zijun Yao, Qi Tian, Juanzi Li, Lei Hou

    Abstract: Large language models (LLMs) are capable of answering knowledge-intensive complex questions with chain-of-thought (CoT) reasoning. However, they tend to generate factually incorrect reasoning steps when the required knowledge is not available or up-to-date in models' parameters. Recent works turn to retrieving external knowledge to augment CoT reasoning. Despite being promising, these chain-based… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: Accepted by EMNLP 2023

  33. arXiv:2311.11696  [pdf, other

    cs.CL cs.AI cs.LG

    Sparse Low-rank Adaptation of Pre-trained Language Models

    Authors: Ning Ding, Xingtai Lv, Qiaosen Wang, Yulin Chen, Bowen Zhou, Zhiyuan Liu, Maosong Sun

    Abstract: Fine-tuning pre-trained large language models in a parameter-efficient manner is widely studied for its effectiveness and efficiency. The popular method of low-rank adaptation (LoRA) offers a notable approach, hypothesizing that the adaptation process is intrinsically low-dimensional. Although LoRA has demonstrated commendable performance, it is implemented with a fixed and unalterable intrinsic r… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted to EMNLP 2023 (Main Conference)

  34. arXiv:2311.07153  [pdf

    cond-mat.supr-con

    Double Dome and Reemergence of Superconductivity in Pristine 6R-TaS2 under Pressure

    Authors: Xindeng Lv, Hao Song, Kun Chen, Sirui Liu, Yanping Huang, Yuqiang Fang, Tian Cui

    Abstract: Investigating the implications of interlayer coupling on superconductivity is essential for comprehending the intrinsic mechanisms of high temperature superconductors. Van der Waals heterojunctions have attracted extensive research due to their exotic interlayer coupling. Here, we present a natural heterojunction superconductor of 6R-TaS2 that demonstrates a double-dome of superconductivity, in ad… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  35. arXiv:2310.05635  [pdf, other

    quant-ph cond-mat.mes-hall cond-mat.stat-mech

    Nanoscale engineering and dynamical stabilization of mesoscopic spin textures

    Authors: Kieren Harkins, Christoph Fleckenstein, Noella D'Souza, Paul M. Schindler, David Marchiori, Claudia Artiaco, Quentin Reynard-Feytis, Ushoshi Basumallick, William Beatrez, Arjun Pillai, Matthias Hagn, Aniruddha Nayak, Samantha Breuer, Xudong Lv, Maxwell McAllister, Paul Reshetikhin, Emanuel Druga, Marin Bukov, Ashok Ajoy

    Abstract: Thermalization phenomena, while ubiquitous in quantum systems, have traditionally been viewed as obstacles to be mitigated. In this study, we demonstrate the ability, instead, to harness thermalization to dynamically engineer and stabilize structured quantum states in a mesoscopically large ensemble of spins. Specifically, we showcase the capacity to generate, control, stabilize, and read out 'she… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 8 + 32 pages

  36. arXiv:2309.06912  [pdf, other

    cs.IR

    Multi-behavior Recommendation with SVD Graph Neural Networks

    Authors: Shengxi Fu, Qianqian Ren, Xingfeng Lv, Jinbao Li

    Abstract: Graph Neural Networks (GNNs) have been extensively employed in the field of recommendation systems, offering users personalized recommendations and yielding remarkable outcomes. Recently, GNNs incorporating contrastive learning have demonstrated promising performance in handling the sparse data problem of recommendation systems. However, existing contrastive learning methods still have limitations… ▽ More

    Submitted 9 May, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

  37. arXiv:2309.01961  [pdf, other

    cs.CV

    NICE: CVPR 2023 Challenge on Zero-shot Image Captioning

    Authors: Taehoon Kim, Pyunghwan Ahn, Sangyun Kim, Sihaeng Lee, Mark Marsden, Alessandra Sala, Seung Hwan Kim, Bohyung Han, Kyoung Mu Lee, Honglak Lee, Kyounghoon Bae, Xiangyu Wu, Yi Gao, Hailiang Zhang, Yang Yang, Weili Guo, Jianfeng Lu, Youngtaek Oh, Jae Won Cho, Dong-jin Kim, In So Kweon, Junmo Kim, Wooyoung Kang, Won Young Jhoo, Byungseok Roh , et al. (17 additional authors not shown)

    Abstract: In this report, we introduce NICE (New frontiers for zero-shot Image Captioning Evaluation) project and share the results and outcomes of 2023 challenge. This project is designed to challenge the computer vision community to develop robust image captioning models that advance the state-of-the-art both in terms of accuracy and fairness. Through the challenge, the image captioning models were tested… ▽ More

    Submitted 10 September, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: Tech report, project page https://nice.lgresearch.ai/

  38. arXiv:2309.01341  [pdf, ps, other

    math.OC

    Decentralized Control for Discrete-time Mean-Field Systems with Multiple Controllers of Delayed Information

    Authors: Qingyuan Qi, Zhiqiang Liu, Qianqian Zhang, Xinbei Lv

    Abstract: In this paper, the finite horizon asymmetric information linear quadratic (LQ) control problem is investigated for a discrete-time mean field system. Different from previous works, multiple controllers with different information sets are involved in the mean field system dynamics. The coupling of different controllers makes it quite difficult in finding the optimal control strategy. Fortunately, b… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

  39. arXiv:2308.14508  [pdf, other

    cs.CL

    LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

    Authors: Yushi Bai, Xin Lv, Jiajie Zhang, Hongchang Lyu, Jiankai Tang, Zhidian Huang, Zhengxiao Du, Xiao Liu, Aohan Zeng, Lei Hou, Yuxiao Dong, Jie Tang, Juanzi Li

    Abstract: Although large language models (LLMs) demonstrate impressive performance for many language tasks, most of them can only handle texts a few thousand tokens long, limiting their applications on longer sequence inputs, such as books, reports, and codebases. Recent works have proposed methods to improve LLMs' long context capabilities by extending context windows and more sophisticated memory mechanis… ▽ More

    Submitted 19 June, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: ACL 2024

  40. arXiv:2308.06605  [pdf, other

    cs.DC

    Towards Exascale Computation for Turbomachinery Flows

    Authors: Yuhang Fu, Weiqi Shen, Jiahuan Cui, Yao Zheng, Guangwen Yang, Zhao Liu, Jifa Zhang, Tingwei Ji, Fangfang Xie, Xiaojing Lv, Hanyue Liu, Xu Liu, Xiyang Liu, Xiaoyu Song, Guocheng Tao, Yan Yan, Paul Tucker, Steven A. E. Miller, Shirui Luo, Seid Koric, Weimin Zheng

    Abstract: A state-of-the-art large eddy simulation code has been developed to solve compressible flows in turbomachinery. The code has been engineered with a high degree of scalability, enabling it to effectively leverage the many-core architecture of the new Sunway system. A consistent performance of 115.8 DP-PFLOPs has been achieved on a high-pressure turbine cascade consisting of over 1.69 billion mesh e… ▽ More

    Submitted 29 December, 2023; v1 submitted 12 August, 2023; originally announced August 2023.

    Comments: SC23, November, 2023, Denver, CO., USA

  41. arXiv:2307.07114  [pdf, other

    astro-ph.HE hep-ph

    Reexamine the dark matter scenario accounting for the positron excess in a new cosmic ray propagation model

    Authors: Xing-Jian Lv, Xiao-Jun Bi, Kun Fang, Peng-Fei Yin, Meng-Jie Zhao

    Abstract: The positron excess in cosmic rays has stimulated a lot of interests in the last decade. The dark matter origin of the extra positrons has attracted great attention. However, the $γ$-ray search set very stringent constraints on the dark matter annihilation/decay rate, which leads to great disfavor of the dark matter scenario. In the work, we incorporate the recent progress in cosmic rays propagati… ▽ More

    Submitted 11 February, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

    Comments: 11 pages, 4 figures

  42. arXiv:2307.03130  [pdf, other

    cs.CL cs.HC

    VisKoP: Visual Knowledge oriented Programming for Interactive Knowledge Base Question Answering

    Authors: Zijun Yao, Yuanyong Chen, Xin Lv, Shulin Cao, Amy Xin, Jifan Yu, Hailong Jin, Jianjun Xu, Peng Zhang, Lei Hou, Juanzi Li

    Abstract: We present Visual Knowledge oriented Programming platform (VisKoP), a knowledge base question answering (KBQA) system that integrates human into the loop to edit and debug the knowledge base (KB) queries. VisKoP not only provides a neural program induction module, which converts natural language questions into knowledge oriented program language (KoPL), but also maps KoPL programs into graphical e… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

  43. arXiv:2307.03115  [pdf, other

    cs.CL

    KoRC: Knowledge oriented Reading Comprehension Benchmark for Deep Text Understanding

    Authors: Zijun Yao, Yantao Liu, Xin Lv, Shulin Cao, Jifan Yu, Lei Hou, Juanzi Li

    Abstract: Deep text understanding, which requires the connections between a given document and prior knowledge beyond its text, has been highlighted by many benchmarks in recent years. However, these benchmarks have encountered two major limitations. On the one hand, most of them require human annotation of knowledge, which leads to limited knowledge coverage. On the other hand, they usually use choices or… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

  44. arXiv:2307.03084  [pdf, other

    cs.LG cs.AI cs.CL

    OpenDelta: A Plug-and-play Library for Parameter-efficient Adaptation of Pre-trained Models

    Authors: Shengding Hu, Ning Ding, Weilin Zhao, Xingtai Lv, Zhen Zhang, Zhiyuan Liu, Maosong Sun

    Abstract: The scale of large pre-trained models (PTMs) poses significant challenges in adapting to downstream tasks due to the high optimization overhead and storage costs associated with full-parameter fine-tuning. To address this, many studies explore parameter-efficient tuning methods, also framed as "delta tuning", which updates only a small subset of parameters, known as "delta modules", while keeping… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: Accepted to ACL 2023 Demo track

  45. Momentum matching and band-alignment type in van der Waals heterostructures: Interfacial effects and materials screening

    Authors: Yue-Jiao Zhang, Yin-Ti Ren, Xiao-Huan Lv, Xiao-Lin Zhao, Rui Yang, Nie-Wei Wang, Chen-Dong Jin, Hu Zhang, Ru-Qian Lian, Peng-Lai Gong, Rui-Ning Wang, Jiang-Long Wang, Xing-Qiang Shi

    Abstract: Momentum-matched type II van der Waals heterostructures (vdWHs) have been designed by assembling layered two-dimensional semiconductors (2DSs) with special band-structure combinations - that is, the valence band edge at the Gamma point (the Brillouin-zone center) for one 2DS and the conduction band edge at the Gamma point for the other [Ubrig et al., Nat. Mater. 19, 299 (2020)]. However, the band… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

    Journal ref: Phys. Rev. B 2023

  46. arXiv:2306.09296  [pdf, other

    cs.CL

    KoLA: Carefully Benchmarking World Knowledge of Large Language Models

    Authors: Jifan Yu, Xiaozhi Wang, Shangqing Tu, Shulin Cao, Daniel Zhang-Li, Xin Lv, Hao Peng, Zijun Yao, Xiaohan Zhang, Hanming Li, Chunyang Li, Zheyuan Zhang, Yushi Bai, Yantao Liu, Amy Xin, Nianyi Lin, Kaifeng Yun, Linlu Gong, Jianhui Chen, Zhili Wu, Yunjia Qi, Weikai Li, Yong Guan, Kaisheng Zeng, Ji Qi , et al. (10 additional authors not shown)

    Abstract: The unprecedented performance of large language models (LLMs) necessitates improvements in evaluations. Rather than merely exploring the breadth of LLM abilities, we believe meticulous and thoughtful designs are essential to thorough, unbiased, and applicable evaluations. Given the importance of world knowledge to LLMs, we construct a Knowledge-oriented LLM Assessment benchmark (KoLA), in which we… ▽ More

    Submitted 30 June, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted by ICLR 2024

  47. arXiv:2306.07652  [pdf

    stat.AP q-bio.TO

    Inactivated COVID-19 Vaccination did not affect In vitro fertilization (IVF) / Intra-Cytoplasmic Sperm Injection (ICSI) cycle outcomes

    Authors: Qi Wan, Ying Ling Yao, XingYu Lv, Li Hong Geng, Yue Wang, Enoch Appiah Adu-Gyamfi, Xue Jiao Wang, Yue Qian, Juan Yang, Ming Xing Chend, Zhao Hui Zhong, Yuan Li, Yu Bin Ding

    Abstract: Background: The objective of this study is to evaluate the impact of COVID-19 inactivated vaccine administration on the outcomes of in vitro fertilization (IVF) and intracytoplasmic sperm injection (ICSI) cycles in infertile couples in China. Methods: We collected data from the CYART prospective cohort, which included couples undergoing IVF treatment from January 2021 to September 2022 at Sichuan… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: 26 pages, 4 figures and 5 tables

  48. arXiv:2306.04181  [pdf, other

    cs.CL cs.LG

    Benchmarking Foundation Models with Language-Model-as-an-Examiner

    Authors: Yushi Bai, Jiahao Ying, Yixin Cao, Xin Lv, Yuze He, Xiaozhi Wang, Jifan Yu, Kaisheng Zeng, Yijia Xiao, Haozhe Lyu, Jiayin Zhang, Juanzi Li, Lei Hou

    Abstract: Numerous benchmarks have been established to assess the performance of foundation models on open-ended question answering, which serves as a comprehensive test of a model's ability to understand and generate language in a manner similar to humans. Most of these works focus on proposing new datasets, however, we see two main issues within previous benchmarking pipelines, namely testing leakage and… ▽ More

    Submitted 4 November, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 Datasets and Benchmarks

  49. arXiv:2305.19787  [pdf, other

    cs.CV cs.AI

    DeepMerge: Deep-Learning-Based Region-Merging for Image Segmentation

    Authors: Xianwei Lv, Claudio Persello, Wangbin Li, Xiao Huang, Dongping Ming, Alfred Stein

    Abstract: Image segmentation aims to partition an image according to the objects in the scene and is a fundamental step in analysing very high spatial-resolution (VHR) remote sensing imagery. Current methods struggle to effectively consider land objects with diverse shapes and sizes. Additionally, the determination of segmentation scale parameters frequently adheres to a static and empirical doctrine, posin… ▽ More

    Submitted 5 January, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

  50. arXiv:2305.15056  [pdf, other

    cs.CL

    Reasoning over Hierarchical Question Decomposition Tree for Explainable Question Answering

    Authors: Jiajie Zhang, Shulin Cao, Tingjia Zhang, Xin Lv, Jiaxin Shi, Qi Tian, Juanzi Li, Lei Hou

    Abstract: Explainable question answering (XQA) aims to answer a given question and provide an explanation why the answer is selected. Existing XQA methods focus on reasoning on a single knowledge source, e.g., structured knowledge bases, unstructured corpora, etc. However, integrating information from heterogeneous knowledge sources is essential to answer complex questions. In this paper, we propose to leve… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: has been accepted by ACL2023