Skip to main content

Showing 51–100 of 897 results for author: Tan, X

  1. arXiv:2404.14827  [pdf, other

    cs.CL

    Sentence-Level or Token-Level? A Comprehensive Study on Knowledge Distillation

    Authors: Jingxuan Wei, Linzhuang Sun, Yichong Leng, Xu Tan, Bihui Yu, Ruifeng Guo

    Abstract: Knowledge distillation, transferring knowledge from a teacher model to a student model, has emerged as a powerful technique in neural machine translation for compressing models or simplifying training targets. Knowledge distillation encompasses two primary methods: sentence-level distillation and token-level distillation. In sentence-level distillation, the student model is trained to align with t… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  2. arXiv:2404.14710  [pdf, other

    cs.SE

    Challenges of Using Pre-trained Models: the Practitioners' Perspective

    Authors: Xin Tan, Taichuan Li, Ruohe Chen, Fang Liu, Li Zhang

    Abstract: The challenges associated with using pre-trained models (PTMs) have not been specifically investigated, which hampers their effective utilization. To address this knowledge gap, we collected and analyzed a dataset of 5,896 PTM-related questions on Stack Overflow. We first analyze the popularity and difficulty trends of PTM-related questions. We find that PTM-related questions are becoming more and… ▽ More

    Submitted 1 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: SANER 2024

  3. arXiv:2404.14700  [pdf, other

    eess.AS cs.AI cs.CL cs.LG cs.SD

    FlashSpeech: Efficient Zero-Shot Speech Synthesis

    Authors: Zhen Ye, Zeqian Ju, Haohe Liu, Xu Tan, Jianyi Chen, Yiwen Lu, Peiwen Sun, Jiahao Pan, Weizhen Bian, Shulin He, Qifeng Liu, Yike Guo, Wei Xue

    Abstract: Recent progress in large-scale zero-shot speech synthesis has been significantly advanced by language models and diffusion models. However, the generation process of both methods is slow and computationally intensive. Efficient speech synthesis using a lower computing budget to achieve quality on par with previous work remains a significant challenge. In this paper, we present FlashSpeech, a large… ▽ More

    Submitted 24 April, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: Efficient zero-shot speech synthesis

  4. arXiv:2404.13659  [pdf, other

    cs.CV

    LMFNet: An Efficient Multimodal Fusion Approach for Semantic Segmentation in High-Resolution Remote Sensing

    Authors: Tong Wang, Guanzhou Chen, Xiaodong Zhang, Chenxi Liu, Xiaoliang Tan, Jiaqi Wang, Chanjuan He, Wenlin Zhou

    Abstract: Despite the rapid evolution of semantic segmentation for land cover classification in high-resolution remote sensing imagery, integrating multiple data modalities such as Digital Surface Model (DSM), RGB, and Near-infrared (NIR) remains a challenge. Current methods often process only two types of data, missing out on the rich information that additional modalities can provide. Addressing this gap,… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  5. arXiv:2404.13276  [pdf

    cond-mat.mtrl-sci

    Giant Rashba-Splitting of One-Dimensional Metallic States in Bi Dimer Lines on InAs(100)

    Authors: Polina M. Sheverdyaeva, Gustav Bihlmayer, Silvio Modesti, Vitaliy Feyer, Matteo Jugovac, Giovanni Zamborlini, Christian Tusche, Ying-Jiun Chen, Xin Liang Tan, Kenta Hagiwara, Luca Petaccia, Sangeeta Thakur, Asish K. Kundu, Carlo Carbone, Paolo Moras

    Abstract: Bismuth produces different types of ordered superstructures on the InAs(100) surface, depending on the growth procedure and coverage. The (2x1) phase forms at completion of a Bi monolayer and consists of a uniformly oriented array of parallel lines of Bi dimers. Scanning tunneling and core level spectroscopies demonstrate its metallic character, in contrast with the semiconducting properties expec… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 4 figures, includes supplemental material file

  6. arXiv:2404.13258  [pdf, ps, other

    eess.SY

    Human Motor Learning Dynamics in High-dimensional Tasks

    Authors: Ankur Kamboj, Rajiv Ranganathan, Xiaobo Tan, Vaibhav Srivastava

    Abstract: Conventional approaches to enhancing movement coordination, such as providing instructions and visual feedback, are often inadequate in complex motor tasks with multiple degrees of freedom (DoFs). To effectively address coordination deficits in such complex motor systems, it becomes imperative to develop interventions grounded in a model of human motor learning; however, modeling such learning pro… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 22 pages (single column), 9 figures

  7. arXiv:2404.12964  [pdf, ps, other

    math.PR

    On the McKean-Vlasov SDE with branching

    Authors: Julien Claisse, Jiazhi Kang, Xiaolu Tan

    Abstract: We study a nonlinear branching diffusion process in the sense of McKean, i.e., where particles are subjected to a mean-field interaction. We consider first a strong formulation of the problem and we provide an existence and uniqueness result by using contraction arguments. Then we consider the notion of weak solution and its equivalent martingale problem formulation. In this setting, we provide a… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  8. arXiv:2404.12000  [pdf, other

    cs.SE

    How far are AI-powered programming assistants from meeting developers' needs?

    Authors: Xin Tan, Xiao Long, Xianjun Ni, Yinghao Zhu, Jing Jiang, Li Zhang

    Abstract: Recent In-IDE AI coding assistant tools (ACATs) like GitHub Copilot have significantly impacted developers' coding habits. While some studies have examined their effectiveness, there lacks in-depth investigation into the actual assistance process. To bridge this gap, we simulate real development scenarios encompassing three typical types of software development tasks and recruit 27 computer scienc… ▽ More

    Submitted 24 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  9. arXiv:2404.08087  [pdf, other

    astro-ph.EP astro-ph.SR

    Time-resolved Hubble Space Telescope Wide Field Camera 3 Spectrophotometry Reveals Inefficient Day-to-Night Heat Redistribution in the Highly Irradiated Brown Dwarf SDSS 1557B

    Authors: Rachael C. Amaro, Daniel Apai, Ben W. P. Lew, Yifan Zhou, Joshua D. Lothringer, Sarah L. Casewell, Xianyu Tan, Travis Barman, Mark S. Marley, L. C. Mayorga, Vivien Parmentier

    Abstract: Brown dwarfs in ultra-short period orbits around white dwarfs offer a unique opportunity to study the properties of tidally-locked, fast rotating (1-3 hr), and highly-irradiated atmospheres. Here, we present phase-resolved spectrophotometry of the white dwarf-brown dwarf (WD-BD) binary SDSS 1557, which is the fifth WD-BD binary in our six-object sample. Using the Hubble Space Telescope Wide Field… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 19 pages and 11 figures. Accepted to Astrophysical Journal

  10. arXiv:2404.07787  [pdf, other

    astro-ph.IM

    Research on fine co-focus adjustment method for segmented solar telescope

    Authors: Kunyan Wang, Yichun Dai, Bin Wang, Xu Tan, Dehua Yang, Zhenyu Jin

    Abstract: For segmented telescopes, achieving fine co-focus adjustment is essential for realizing co-phase adjustment and maintenance, which involves adjusting the millimeter-scale piston between segments to fall within the capture range of the co-phase detection system. CGST proposes using a SHWFS for piston detection during the co-focus adjustment stage. However, the residual piston after adjustment excee… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  11. arXiv:2404.07609  [pdf, other

    math.OC eess.SY

    Achieving violation-free distributed optimization under coupling constraints

    Authors: Changxin Liu, Xiao Tan, Xuyang Wu, Dimos V. Dimarogonas, Karl H. Johansson

    Abstract: Constraint satisfaction is a critical component in a wide range of engineering applications, including but not limited to safe multi-agent control and economic dispatch in power systems. This study explores violation-free distributed optimization techniques for problems characterized by separable objective functions and coupling constraints. First, we incorporate auxiliary decision variables toget… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 13 pages, 6 figures

  12. arXiv:2404.07571  [pdf, other

    math.OC eess.SY

    A continuous-time violation-free multi-agent optimization algorithm and its applications to safe distributed control

    Authors: Xiao Tan, Changxin Liu, Karl H. Johansson, Dimos V. Dimarogonas

    Abstract: In this work, we propose a continuous-time distributed optimization algorithm with guaranteed zero coupling constraint violation and apply it to safe distributed control in the presence of multiple control barrier functions (CBF). The optimization problem is defined over a network that collectively minimizes a separable cost function with coupled linear constraints. An equivalent optimization prob… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  13. arXiv:2404.07436  [pdf, other

    hep-ex

    Measurement of $e^{+}e^{-}\to ωη^{\prime}$ cross sections at $\sqrt{s}=$ 2.000 to 3.080 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (599 additional authors not shown)

    Abstract: The Born cross sections for the process $e^{+}e^{-}\to ωη^{\prime}$ are measured at 22 center-of-mass energies from 2.000 to 3.080 GeV using data collected with the BESIII detector at the BEPCII collider. A resonant structure is observed with a statistical significance of 9.6$σ$. A Breit-Wigner fit determines its mass to be $M_R=(2153\pm30\pm31)~{\rm{MeV}}/c^{2}$ and its width to be… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  14. arXiv:2404.07149  [pdf, other

    astro-ph.IM astro-ph.EP astro-ph.SR

    Tianyu: search for the second solar system and explore the dynamic universe

    Authors: Fabo Feng, Yicheng Rui, Zhimao Du, Qing Lin, Congcong Zhang, Dan Zhou, Kaiming Cui, Masahiro Ogihara, Ming Yang, Jie Lin, Yongzhi Cai, Taozhi Yang, Xiaoying Pang, Mingjie Jian, Wenxiong Li, Hengxiao Guo, Xian Shi, Jianchun Shi, Jianyang Li, Kangrou Guo, Song Yao, Aming Chen, Peng Jia, Xianyu Tan, James S. Jenkins , et al. (10 additional authors not shown)

    Abstract: Giant planets like Jupiter and Saturn, play important roles in the formation and habitability of Earth-like planets. The detection of solar system analogs that have multiple cold giant planets is essential for our understanding of planet habitability and planet formation. Although transit surveys such as Kepler and TESS have discovered thousands of exoplanets, these missions are not sensitive to l… ▽ More

    Submitted 10 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: 48 pages, 16 figures, accepted by Acta Astronomica Sinica

  15. arXiv:2404.06393  [pdf, other

    cs.SD cs.AI eess.AS

    MuPT: A Generative Symbolic Music Pretrained Transformer

    Authors: Xingwei Qu, Yuelin Bai, Yinghao Ma, Ziya Zhou, Ka Man Lo, Jiaheng Liu, Ruibin Yuan, Lejun Min, Xueling Liu, Tianyu Zhang, Xinrun Du, Shuyue Guo, Yiming Liang, Yizhi Li, Shangda Wu, Junting Zhou, Tianyu Zheng, Ziyang Ma, Fengze Han, Wei Xue, Gus Xia, Emmanouil Benetos, Xiang Yue, Chenghua Lin, Xu Tan , et al. (4 additional authors not shown)

    Abstract: In this paper, we explore the application of Large Language Models (LLMs) to the pre-training of music. While the prevalent use of MIDI in music modeling is well-established, our findings suggest that LLMs are inherently more compatible with ABC Notation, which aligns more closely with their design and strengths, thereby enhancing the model's performance in musical composition. To address the chal… ▽ More

    Submitted 10 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  16. arXiv:2404.05231  [pdf, other

    cs.CV

    PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection

    Authors: Xiaofan Li, Zhizhong Zhang, Xin Tan, Chengwei Chen, Yanyun Qu, Yuan Xie, Lizhuang Ma

    Abstract: The vision-language model has brought great improvement to few-shot industrial anomaly detection, which usually needs to design of hundreds of prompts through prompt engineering. For automated scenarios, we first use conventional prompt learning with many-class paradigm as the baseline to automatically learn prompts but found that it can not work well in one-class anomaly detection. To address the… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR2024

  17. arXiv:2404.03204  [pdf, other

    eess.AS cs.AI cs.CL cs.LG cs.SD

    RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis

    Authors: Detai Xin, Xu Tan, Kai Shen, Zeqian Ju, Dongchao Yang, Yuancheng Wang, Shinnosuke Takamichi, Hiroshi Saruwatari, Shujie Liu, Jinyu Li, Sheng Zhao

    Abstract: We present RALL-E, a robust language modeling method for text-to-speech (TTS) synthesis. While previous work based on large language models (LLMs) shows impressive performance on zero-shot TTS, such methods often suffer from poor robustness, such as unstable prosody (weird pitch and rhythm/duration) and a high word error rate (WER), due to the autoregressive prediction style of language models. Th… ▽ More

    Submitted 19 May, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

  18. arXiv:2404.02952  [pdf, other

    cond-mat.mtrl-sci cond-mat.mes-hall

    Chirality-Driven Orbital Angular Momentum and Circular Dichroism in CoSi

    Authors: Stefanie Suzanne Brinkman, Xin Liang Tan, Bjørnulf Brekke, Anders Christian Mathisen, Øyvind Finnseth, Richard Justin Schenk, Kenta Hagiwara, Meng-Jie Huang, Jens Buck, Matthias Kalläne, Moritz Hoesch, Kai Rossnagel, Kui-Hon Ou Yang, Minn-Tsong Lin, Guo-Jiun Shu, Ying-Jiun Chen, Christian Tusche, Hendrik Bentmann

    Abstract: Chiral crystals and molecules were recently predicted to form an intriguing platform for unconventional orbital physics. Here, we report the observation of chirality-driven orbital textures in the bulk electronic structure of CoSi, a prototype member of the cubic B20 family of chiral crystals. Using circular dichroism in soft X-ray angle-resolved photoemission, we demonstrate the formation of a bu… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: To be published in Physical Review Letters

    Report number: QuSpin 2024

  19. arXiv:2404.01532  [pdf, other

    cs.CL cs.IR

    Set-Aligning Framework for Auto-Regressive Event Temporal Graph Generation

    Authors: Xingwei Tan, Yuxiang Zhou, Gabriele Pergola, Yulan He

    Abstract: Event temporal graphs have been shown as convenient and effective representations of complex temporal relations between events in text. Recent studies, which employ pre-trained language models to auto-regressively generate linearised graphs for constructing event temporal graphs, have shown promising results. However, these methods have often led to suboptimal graph generation as the linearised gr… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted to NAACL 2024. 9 + 10 pages

  20. arXiv:2403.19095  [pdf

    cs.CY

    Purposeful remixing with generative AI: Constructing designer voice in multimodal composing

    Authors: Xiao Tan, Wei Xu, Chaoran Wang

    Abstract: Voice, the discursive construction of the writer's identity, has been extensively studied and theorized in composition studies. In multimodal writing, students are able to mobilize both linguistic and non linguistic resources to express their real or imagined identities. But at the same time, when students are limited to choose from available online resources, their voices might be compromised due… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  21. arXiv:2403.19091  [pdf, other

    hep-ex

    Observation of the semileptonic decays $D^0\rightarrow K_S^0π^-π^0 e^+ ν_e$ and $D^+\rightarrow K_S^0π^+π^- e^+ ν_e$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (600 additional authors not shown)

    Abstract: By analyzing $e^+e^-$ annihilation data corresponding to an integrated luminosity of 2.93 $\rm fb^{-1}$ collected at a center-of-mass energy of 3.773 GeV with the \text{BESIII} detector, the first observation of the semileptonic decays $D^0\rightarrow K_S^0π^-π^0 e^+ ν_e$ and $D^+\rightarrow K_S^0π^+π^- e^+ ν_e$ is reported. With a dominant hadronic contribution from $K_1(1270)$, the branching fra… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 19pages

  22. arXiv:2403.17387  [pdf, other

    cs.CV

    Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection

    Authors: Jiacheng Zhang, Jiaming Li, Xiangru Lin, Wei Zhang, Xiao Tan, Junyu Han, Errui Ding, Jingdong Wang, Guanbin Li

    Abstract: We delve into pseudo-labeling for semi-supervised monocular 3D object detection (SSM3OD) and discover two primary issues: a misalignment between the prediction quality of 3D and 2D attributes and the tendency of depth supervision derived from pseudo-labels to be noisy, leading to significant optimization conflicts with other reliable forms of supervision. We introduce a novel decoupled pseudo-labe… ▽ More

    Submitted 23 April, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: To appear in CVPR2024

  23. arXiv:2403.15127  [pdf, other

    cs.CV

    Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection

    Authors: Jiaming Li, Xiangru Lin, Wei Zhang, Xiao Tan, Yingying Li, Junyu Han, Errui Ding, Jingdong Wang, Guanbin Li

    Abstract: Current semi-supervised object detection (SSOD) algorithms typically assume class balanced datasets (PASCAL VOC etc.) or slightly class imbalanced datasets (MS-COCO, etc). This assumption can be easily violated since real world datasets can be extremely class imbalanced in nature, thus making the performance of semi-supervised object detectors far from satisfactory. Besides, the research for this… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: Accepted by ICCV2023

  24. arXiv:2403.15100  [pdf, other

    cs.RO cs.AI

    Subequivariant Reinforcement Learning Framework for Coordinated Motion Control

    Authors: Haoyu Wang, Xiaoyu Tan, Xihe Qiu, Chao Qu

    Abstract: Effective coordination is crucial for motion control with reinforcement learning, especially as the complexity of agents and their motions increases. However, many existing methods struggle to account for the intricate dependencies between joints. We introduce CoordiGraph, a novel architecture that leverages subequivariant principles from physics to enhance coordination of motion control with rein… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 7 pages, 7 figures, 2024 IEEE International Conference on Robotics and Automation

  25. arXiv:2403.10065  [pdf, other

    cs.CL

    Triple GNNs: Introducing Syntactic and Semantic Information for Conversational Aspect-Based Quadruple Sentiment Analysis

    Authors: Binbin Li, Yuqing Li, Siyu Jia, Bingnan Ma, Yu Ding, Zisen Qi, Xingbang Tan, Menghan Guo, Shenghui Liu

    Abstract: Conversational Aspect-Based Sentiment Analysis (DiaASQ) aims to detect quadruples \{target, aspect, opinion, sentiment polarity\} from given dialogues. In DiaASQ, elements constituting these quadruples are not necessarily confined to individual sentences but may span across multiple utterances within a dialogue. This necessitates a dual focus on both the syntactic information of individual utteran… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted by CSCWD2024

  26. arXiv:2403.07865  [pdf, other

    cs.CL cs.AI cs.CR cs.LG cs.SE

    CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion

    Authors: Qibing Ren, Chang Gao, Jing Shao, Junchi Yan, Xin Tan, Wai Lam, Lizhuang Ma

    Abstract: The rapid advancement of Large Language Models (LLMs) has brought about remarkable generative capabilities but also raised concerns about their potential misuse. While strategies like supervised fine-tuning and reinforcement learning from human feedback have enhanced their safety, these methods primarily focus on natural languages, which may not generalize to other domains. This paper introduces C… ▽ More

    Submitted 9 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: ACL Findings 2024, Code is available at https://github.com/renqibing/CodeAttack

  27. A Magnetic Millirobot Walks on Slippery Biological Surfaces for Targeted Cargo Delivery

    Authors: Moonkwang Jeong, Xiangzhou Tan, Felix Fischer, Tian Qiu

    Abstract: Small-scale robots hold great potential for targeted cargo delivery in minimally-inv asive medicine. However, current robots often face challenges to locomote efficiently on slip pery biological tissue surfaces, especially when loaded with heavy cargos. Here, we report a magnetic millirobot that can walk on rough and slippery biological tissues by anchoring itself on the soft tissue surface altern… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 15 pages

    ACM Class: J.3

  28. arXiv:2403.03100  [pdf, other

    eess.AS cs.AI cs.CL cs.LG cs.SD

    NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

    Authors: Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Yanqing Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiang-Yang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao

    Abstract: While recent large-scale text-to-speech (TTS) models have achieved significant progress, they still fall short in speech quality, similarity, and prosody. Considering speech intricately encompasses various attributes (e.g., content, prosody, timbre, and acoustic details) that pose significant challenges for generation, a natural idea is to factorize speech into individual subspaces representing di… ▽ More

    Submitted 23 April, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Achieving human-level quality and naturalness on multi-speaker datasets (e.g., LibriSpeech) in a zero-shot way

  29. arXiv:2403.02905  [pdf, other

    cs.MM

    MMoFusion: Multi-modal Co-Speech Motion Generation with Diffusion Model

    Authors: Sen Wang, Jiangning Zhang, Weijian Cao, Xiaobin Hu, Moran Li, Xiaozhong Ji, Xin Tan, Mengtian Li, Zhifeng Xie, Chengjie Wang, Lizhuang Ma

    Abstract: The body movements accompanying speech aid speakers in expressing their ideas. Co-speech motion generation is one of the important approaches for synthesizing realistic avatars. Due to the intricate correspondence between speech and motion, generating realistic and diverse motion is a challenging task. In this paper, we propose MMoFusion, a Multi-modal co-speech Motion generation framework based o… ▽ More

    Submitted 17 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  30. arXiv:2403.02260  [pdf, other

    astro-ph.EP astro-ph.SR

    Latitude-dependent Atmospheric Waves and Long-period Modulations in Luhman 16 B from the Longest Lightcurve of an Extrasolar World

    Authors: Nguyen Fuda, Dániel Apai, Domenico Nardiello, Xianyu Tan, Theodora Karalidi, Luigi Rolly Bedin

    Abstract: In this work, we present the longest photometric monitoring of up to 1200 hours of the strongly variable brown-dwarf binaries Luhman 16 AB and provide evidence of $\pm$5% variability on a timescale of several-to-hundreds of hours for this object. We show that short-period rotational modulation around 5 hours (k = 1 wavenumber) and 2.5 hours (k = 2 wavenumber) dominate the variability under 10 hour… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 27 pages, 20 figures. Accepted for publication in ApJ (February 21, 2024)

  31. arXiv:2403.00758  [pdf, other

    cs.CL cs.AI cs.LG

    Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training

    Authors: Qingyan Guo, Rui Wang, Junliang Guo, Xu Tan, Jiang Bian, Yujiu Yang

    Abstract: While large language models (LLMs) have achieved impressive performance across diverse tasks, recent studies showcase that causal LLMs suffer from the "reversal curse". It is a typical example that the model knows "A's father is B", but is unable to reason "B's child is A". This limitation poses a challenge to the advancement of artificial general intelligence (AGI), as it suggests a gap in the mo… ▽ More

    Submitted 20 March, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

  32. arXiv:2402.19155  [pdf, other

    cs.LG

    Beyond Language Models: Byte Models are Digital World Simulators

    Authors: Shangda Wu, Xu Tan, Zili Wang, Rui Wang, Xiaobing Li, Maosong Sun

    Abstract: Traditional deep learning often overlooks bytes, the basic units of the digital world, where all forms of information and operations are encoded and manipulated in binary format. Inspired by the success of next token prediction in natural language processing, we introduce bGPT, a model with next byte prediction to simulate the digital world. bGPT matches specialized models in performance across va… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 19 pages, 5 figures, 5 tables

  33. arXiv:2402.14312  [pdf, other

    astro-ph.IM astro-ph.CO astro-ph.EP astro-ph.GA

    The Jiao Tong University Spectroscopic Telescope Project

    Authors: JUST Team, Chengze Liu, Ying Zu, Fabo Feng, Zhaoyu Li, Yu Yu, Hua Bai, Xiangqun Cui, Bozhong Gu, Yizhou Gu, Jiaxin Han, Yonghui Hou, Zhongwen Hu, Hangxin Ji, Yipeng Jing, Wei Li, Zhaoxiang Qi, Xianyu Tan, Cairang Tian, Dehua Yang, Xiangyan Yuan, Chao Zhai, Congcong Zhang, Jun Zhang, Haotong Zhang , et al. (6 additional authors not shown)

    Abstract: The Jiao Tong University Spectroscopic Telescope (JUST) is a 4.4-meter f/6.0 segmentedmirror telescope dedicated to spectroscopic observations. The JUST primary mirror is composed of 18 hexagonal segments, each with a diameter of 1.1 m. JUST provides two Nasmyth platforms for placing science instruments. One Nasmyth focus fits a field of view of 10 arcmin and the other has an extended field of vie… ▽ More

    Submitted 29 February, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: 28 pages, 6 figures

  34. arXiv:2402.10739  [pdf, other

    cs.CV

    PointMamba: A Simple State Space Model for Point Cloud Analysis

    Authors: Dingkang Liang, Xin Zhou, Wei Xu, Xingkui Zhu, Zhikang Zou, Xiaoqing Ye, Xiao Tan, Xiang Bai

    Abstract: Transformers have become one of the foundational architectures in point cloud analysis tasks due to their excellent global modeling ability. However, the attention mechanism has quadratic complexity, making the design of a linear complexity method with global modeling appealing. In this paper, we propose PointMamba, transferring the success of Mamba, a recent representative state space model (SSM)… ▽ More

    Submitted 29 May, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: Update the architecture and performance. The code is available at https://github.com/LMD0311/PointMamba

  35. arXiv:2402.06632  [pdf, other

    physics.ins-det quant-ph

    Microgram $\mathrm{BaCl}_2$ Ablation Targets for Trapped Ion Experiments

    Authors: Noah Greenberg, Akbar Jahangiri Jozani, Collin J. C. Epstein, Xinghe Tan, Rajibul Islam, Crystal Senko

    Abstract: Trapped ions for quantum information processing has been an area of intense study due to the extraordinarily high fidelity operations that have been reported experimentally. Specifically, barium trapped ions have been shown to have exceptional state-preparation and measurement (SPAM) fidelities. The $^{133}\mathrm{Ba}^+$ ($I = 1/2$) isotope in particular is a promising candidate for large-scale qu… ▽ More

    Submitted 16 January, 2024; originally announced February 2024.

  36. arXiv:2402.05239  [pdf, other

    quant-ph cs.CC

    Efficient approximate unitary designs from random Pauli rotations

    Authors: Jeongwan Haah, Yunchao Liu, Xinyu Tan

    Abstract: We construct random walks on simple Lie groups that quickly converge to the Haar measure for all moments up to order $t$. Specifically, a step of the walk on the unitary or orthognoal group of dimension $2^{\mathsf n}$ is a random Pauli rotation $e^{\mathrm i θP /2}$. The spectral gap of this random walk is shown to be $Ω(1/t)$, which coincides with the best previously known bound for a random wal… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 21 pages, 1 figure

  37. arXiv:2402.03829  [pdf, ps, other

    hep-ex

    Precise Measurement of Born Cross Sections for $e^+e^-\to D\bar{D}$ and Observation of One Structure between $\sqrt{s} = 3.80-4.95$ GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (604 additional authors not shown)

    Abstract: Using data samples collected with the BESIII detector at the BEPCII collider at center-of-mass energies ranging from 3.80 to 4.95 GeV, corresponding to an integrated luminosity of 20 fb$^{-1}$, a measurement of Born cross sections for the $e^+e^-\to D^{0}\bar{D}^{0}$ and $D^{+}D^{-}$ processes is presented with unprecedented precision. By performing a simultaneous fit to the dressed cross sections… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 9 pages, 4 figures, 1 tables, 1 Supplemental_Material

  38. arXiv:2402.03636  [pdf, other

    cs.RO

    Online Informative Sampling using Semantic Features in Underwater Environments

    Authors: Shrutika Vishal Thengane, Yu Xiang Tan, Marcel Bartholomeus Prasetyo, Malika Meghjani

    Abstract: The underwater world remains largely unexplored, with Autonomous Underwater Vehicles (AUVs) playing a crucial role in sub-sea explorations. However, continuous monitoring of underwater environments using AUVs can generate a significant amount of data. In addition, sending live data feed from an underwater environment requires dedicated on-board data storage options for AUVs which can hinder requir… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: In proceeding of IEEE/MTS OCEANS, 2024

  39. arXiv:2402.01767  [pdf, other

    cs.CL cs.AI cs.LG

    HiQA: A Hierarchical Contextual Augmentation RAG for Massive Documents QA

    Authors: Xinyue Chen, Pengyu Gao, Jiangjiang Song, Xiaoyang Tan

    Abstract: As language model agents leveraging external tools rapidly evolve, significant progress has been made in question-answering(QA) methodologies utilizing supplementary documents and the Retrieval-Augmented Generation (RAG) approach. This advancement has improved the response quality of language models and alleviates the appearance of hallucination. However, these methods exhibit limited retrieval ac… ▽ More

    Submitted 31 January, 2024; originally announced February 2024.

  40. arXiv:2401.17464  [pdf, other

    cs.CL

    Efficient Tool Use with Chain-of-Abstraction Reasoning

    Authors: Silin Gao, Jane Dwivedi-Yu, Ping Yu, Xiaoqing Ellen Tan, Ramakanth Pasunuru, Olga Golovneva, Koustuv Sinha, Asli Celikyilmaz, Antoine Bosselut, Tianlu Wang

    Abstract: To achieve faithful reasoning that aligns with human expectations, large language models (LLMs) need to ground their reasoning to real-world knowledge (e.g., web facts, math and physical rules). Tools help LLMs access this external knowledge, but there remains challenges for fine-tuning LLM agents (e.g., Toolformer) to invoke tools in multi-step reasoning problems, where inter-connected tool calls… ▽ More

    Submitted 26 February, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

  41. arXiv:2401.15289  [pdf, other

    cs.CR cs.AR

    SoK: Where's the "up"?! A Comprehensive (bottom-up) Study on the Security of Arm Cortex-M Systems

    Authors: Xi Tan, Zheyuan Ma, Sandro Pinto, Le Guan, Ning Zhang, Jun Xu, Zhiqiang Lin, Hongxin Hu, Ziming Zhao

    Abstract: Arm Cortex-M processors are the most widely used 32-bit microcontrollers among embedded and Internet-of-Things devices. Despite the widespread usage, there has been little effort in summarizing their hardware security features, characterizing the limitations and vulnerabilities of their hardware and software stack, and systematizing the research on securing these systems. The goals and contributio… ▽ More

    Submitted 13 May, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: To Appear in the 18th USENIX WOOT Conference on Offensive Technologies, August 12-13, 2024

    ACM Class: C.0; K.6.5

  42. arXiv:2401.14720  [pdf, ps, other

    hep-ex

    Observation of structures in the processes $e^+e^-\rightarrowωχ_{c1}$ and $ωχ_{c2}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (608 additional authors not shown)

    Abstract: We present measurements of the Born cross sections for the processes $e^+e^-\rightarrowωχ_{c1}$ and $ωχ_{c2}$ at center-of-mass energies $\sqrt{s}$ from 4.308 to 4.951 GeV. The measurements are performed with data samples corresponding to an integrated luminosity of 11.0 $\rm{fb}^{-1}$ collected with the BESIII detector operating at the BEPCII storage ring. Assuming the $e^+e^-\rightarrowωχ_{c2}$… ▽ More

    Submitted 24 March, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: 11 pages, 8 figures, with Supplemental Material

  43. arXiv:2401.14711  [pdf, other

    hep-ex

    Study of $e^{+}e^{-}\rightarrowπ^{+}π^{-}π^{0}$ at $\sqrt{s}$ from 2.00 to 3.08 GeV at BESIII

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (608 additional authors not shown)

    Abstract: With the data samples taken at center-of-mass energies from 2.00 to 3.08 GeV with the BESIII detector at the BEPCII collider, a partial wave analysis on the $e^{+}e^{-}\rightarrowπ^{+}π^{-}π^{0}$ process is performed. The Born cross sections for $e^{+}e^{-}\rightarrowπ^{+}π^{-}π^{0}$ and its intermediate processes $e^{+}e^{-}\rightarrowρπ$ and $ρ(1450)π$ are measured as functions of $\sqrt{s}$. Th… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  44. arXiv:2401.13027  [pdf, ps, other

    astro-ph.EP astro-ph.IM

    Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b

    Authors: Taylor J. Bell, Nicolas Crouzet, Patricio E. Cubillos, Laura Kreidberg, Anjali A. A. Piette, Michael T. Roman, Joanna K. Barstow, Jasmina Blecic, Ludmila Carone, Louis-Philippe Coulombe, Elsa Ducrot, Mark Hammond, João M. Mendonça, Julianne I. Moses, Vivien Parmentier, Kevin B. Stevenson, Lucas Teinturier, Michael Zhang, Natalie M. Batalha, Jacob L. Bean, Björn Benneke, Benjamin Charnay, Katy L. Chubb, Brice-Olivier Demory, Peter Gao , et al. (58 additional authors not shown)

    Abstract: Hot Jupiters are among the best-studied exoplanets, but it is still poorly understood how their chemical composition and cloud properties vary with longitude. Theoretical models predict that clouds may condense on the nightside and that molecular abundances can be driven out of equilibrium by zonal winds. Here we report a phase-resolved emission spectrum of the hot Jupiter WASP-43b measured from 5… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 61 pages, 13 figures, 4 tables. This preprint has been submitted to and accepted in principle for publication in Nature Astronomy without significant changes

  45. arXiv:2401.11372  [pdf, other

    cs.RO cs.LG

    Back-stepping Experience Replay with Application to Model-free Reinforcement Learning for a Soft Snake Robot

    Authors: Xinda Qi, Dong Chen, Zhaojian Li, Xiaobo Tan

    Abstract: In this paper, we propose a novel technique, Back-stepping Experience Replay (BER), that is compatible with arbitrary off-policy reinforcement learning (RL) algorithms. BER aims to enhance learning efficiency in systems with approximate reversibility, reducing the need for complex reward shaping. The method constructs reversed trajectories using back-stepping transitions to reach random or fixed t… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: Submitted to the IEEE for possible publication

  46. arXiv:2401.09146  [pdf, other

    cs.CV

    Continuous Piecewise-Affine Based Motion Model for Image Animation

    Authors: Hexiang Wang, Fengqi Liu, Qianyu Zhou, Ran Yi, Xin Tan, Lizhuang Ma

    Abstract: Image animation aims to bring static images to life according to driving videos and create engaging visual content that can be used for various purposes such as animation, entertainment, and education. Recent unsupervised methods utilize affine and thin-plate spline transformations based on keypoints to transfer the motion in driving frames to the source image. However, limited by the expressive p… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  47. arXiv:2401.06377  [pdf, other

    cs.RO

    Design and Nonlinear Modeling of a Modular Cable Driven Soft Robotic Arm

    Authors: Xinda Qi, Yu Mei, Dong Chen, Zhaojian Li, Xiaobo Tan

    Abstract: We propose a novel multi-section cable-driven soft robotic arm inspired by octopus tentacles along with a new modeling approach. Each section of the modular manipulator is made of a soft tubing backbone, a soft silicon arm body, and two rigid endcaps, which connect adjacent sections and decouple the actuation cables of different sections. The soft robotic arm is made with casting after the rigid e… ▽ More

    Submitted 15 May, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: The paper has been accepted by IEEE Transactions on Mechatronics

  48. arXiv:2401.06201  [pdf, other

    cs.CL

    EASYTOOL: Enhancing LLM-based Agents with Concise Tool Instruction

    Authors: Siyu Yuan, Kaitao Song, Jiangjie Chen, Xu Tan, Yongliang Shen, Ren Kan, Dongsheng Li, Deqing Yang

    Abstract: To address intricate real-world tasks, there has been a rising interest in tool utilization in applications of large language models (LLMs). To develop LLM-based agents, it usually requires LLMs to understand many tool functions from different tool documentation. But these documentations could be diverse, redundant or incomplete, which immensely affects the capability of LLMs in using tools. To so… ▽ More

    Submitted 27 March, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

  49. arXiv:2401.06199  [pdf, other

    q-bio.QM cs.AI cs.LG

    xTrimoPGLM: Unified 100B-Scale Pre-trained Transformer for Deciphering the Language of Protein

    Authors: Bo Chen, Xingyi Cheng, Pan Li, Yangli-ao Geng, Jing Gong, Shen Li, Zhilei Bei, Xu Tan, Boyan Wang, Xin Zeng, Chiming Liu, Aohan Zeng, Yuxiao Dong, Jie Tang, Le Song

    Abstract: Protein language models have shown remarkable success in learning biological information from protein sequences. However, most existing models are limited by either autoencoding or autoregressive pre-training objectives, which makes them struggle to handle protein understanding and generation tasks concurrently. We propose a unified protein language model, xTrimoPGLM, to address these two types of… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  50. arXiv:2401.03859  [pdf, other

    astro-ph.EP

    Modeling the day-night temperature variations of ultra-hot Jupiters: confronting non-grey general circulation models and observations

    Authors: Xianyu Tan, Thaddeus D. Komacek, Natasha E. Batalha, Drake Deming, Roxana Lupu, Vivien Parmentier, Raymond T. Pierrehumbert

    Abstract: Ultra-hot Jupiters (UHJs) are natural laboratories to study extreme physics in planetary atmospheres and their rich observational data sets are yet to be confronted with models with varying complexities at a population level. In this work, we update the general circulation model of Tan & Komacek (2019) to include a non-grey radiative transfer scheme and apply it to simulate the realistic thermal s… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: Accepted to MNRAS; data underlying this article is available in Zenodo https://doi.org/10.5281/zenodo.10121933