Skip to main content

Showing 1–50 of 172 results for author: Cao, K

  1. arXiv:2406.11200  [pdf, other

    cs.LG cs.CL

    AvaTaR: Optimizing LLM Agents for Tool-Assisted Knowledge Retrieval

    Authors: Shirley Wu, Shiyu Zhao, Qian Huang, Kexin Huang, Michihiro Yasunaga, Kaidi Cao, Vassilis N. Ioannidis, Karthik Subbian, Jure Leskovec, James Zou

    Abstract: Large language model (LLM) agents have demonstrated impressive capability in utilizing external tools and knowledge to boost accuracy and reduce hallucinations. However, developing the prompting techniques that make LLM agents able to effectively use external tools and knowledge is a heuristic and laborious task. Here, we introduce AvaTaR, a novel and automatic framework that optimizes an LLM agen… ▽ More

    Submitted 17 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 19 pages, 8 figures, 6 tables

  2. arXiv:2405.12133  [pdf

    quant-ph cond-mat.mtrl-sci physics.acc-ph physics.app-ph physics.optics

    Auger photoemission as a laser-like coherent cathode

    Authors: Yushan Zeng, Bin Zhang, Kecheng Cao, Xiao-jing Liu, Yiming Pan

    Abstract: In pursuit of quantum advancements across disciplines, a bright and coherent electron source is expected to be a cornerstone of diverse applications including electron microscopy, laser accelerators, and free electron lasers. Current cathodes, such as cold field and photoemission, can generate high-quality electron beams with different cathode materials, geometric configurations, and laser excitat… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 12 pages, 2 figures

  3. arXiv:2405.08449  [pdf, other

    cond-mat.stat-mech

    Exploring Dynamical Phase Transitions in the XY Chain through Linear Quench: Early and Long-term Perspectives

    Authors: Kaiyuan Cao, Peiqing Tong

    Abstract: We investigate the nonequilibrium dynamics induced by a finite-time linear quench in the XY chain. Initially, we examine the dynamical quantum phase transition, characterized by the nonanalytic behavior of the Loschmidt amplitude. We find distinct behaviors of DQPTs during and following the ramp. Following the ramp, the ramp crossing the critical point $h_{c}$ is the sufficient condition for the o… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 12 pages, 7 figures

  4. arXiv:2405.05188  [pdf, other

    astro-ph.IM

    ContEvol formalism: possibly a new twist on computational physics

    Authors: Kaili Cao

    Abstract: We present the ContEvol (continuous evolution) formalism, a family of implicit numerical methods which only need to solve linear equations and are almost symplectic. Combining values and derivatives of functions, ContEvol outputs allow users to recover full history and render full distributions. Using classic harmonic oscillator as a prototype case, we show that ContEvol methods lead to lower-orde… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 74 pages, 22 figures. No journal submission plan. Comments are welcome

  5. arXiv:2404.18434  [pdf, ps, other

    cs.IT

    The augmented codes of a family of linear codes with locality 2

    Authors: Ziling Heng, Keqing Cao

    Abstract: In this paper, we first generalize the class of linear codes by Ding and Ding (IEEE TIT, 61(11), pp. 5835-5842, 2015). Then we mainly study the augmented codes of this generalized class of linear codes. For one thing, we use Gaussian sums to determine the parameters and weight distributions of the augmented codes in some cases. It is shown that the augmented codes are self-orthogonal and have only… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 25 pages

  6. arXiv:2404.15275  [pdf, other

    cs.CV

    ID-Animator: Zero-Shot Identity-Preserving Human Video Generation

    Authors: Xuanhua He, Quande Liu, Shengju Qian, Xin Wang, Tao Hu, Ke Cao, Keyu Yan, Jie Zhang

    Abstract: Generating high-fidelity human video with specified identities has attracted significant attention in the content generation community. However, existing techniques struggle to strike a balance between training efficiency and identity preservation, either requiring tedious case-by-case fine-tuning or usually missing identity details in the video generation process. In this study, we present \textb… ▽ More

    Submitted 25 June, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: Project Page: https://id-animator.github.io/

  7. arXiv:2404.13207  [pdf, other

    cs.IR cs.LG

    STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases

    Authors: Shirley Wu, Shiyu Zhao, Michihiro Yasunaga, Kexin Huang, Kaidi Cao, Qian Huang, Vassilis N. Ioannidis, Karthik Subbian, James Zou, Jure Leskovec

    Abstract: Answering real-world complex queries, such as complex product search, often requires accurate retrieval from semi-structured knowledge bases that involve blend of unstructured (e.g., textual descriptions of products) and structured (e.g., entity relations of products) information. However, previous works have mostly studied textual and relational retrieval tasks as separate topics. To address the… ▽ More

    Submitted 20 May, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

    Comments: 26 pages, 6 figures

  8. arXiv:2404.05528  [pdf, other

    physics.app-ph

    NAND-like SOT-MRAM-based Approximate Storage for Error-Tolerant Applications

    Authors: Min Wang, Zhengyi Hou, Chenyi Wang, Zhengjie Yan, Shixing Li, Ao Du, Wenlong Cai, Jinhao Li, Hongchao Zhang, Kaihua Cao, Kewen Shi, Bi Wang, Yuanfu Zhao, Qingyi Xiang, Zhaohao Wang, Weisheng Zhao

    Abstract: We demonstrate approximate storage based on NAND-like spin-orbit torque (SOT) MRAM, through "device-modeling-architecture" explorations. We experimentally achieve down to 1E-5 level selectivity. Selectivity and low-power solutions are established by numerical calculation workflow. System-level power consumption is evaluated in the 512 KB last-level cache according to 5 quality levels. Error-tolera… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  9. arXiv:2404.00776  [pdf, other

    cs.LG cs.DB stat.ML

    PyTorch Frame: A Modular Framework for Multi-Modal Tabular Learning

    Authors: Weihua Hu, Yiwen Yuan, Zecheng Zhang, Akihiro Nitta, Kaidi Cao, Vid Kocijan, Jure Leskovec, Matthias Fey

    Abstract: We present PyTorch Frame, a PyTorch-based framework for deep learning over multi-modal tabular data. PyTorch Frame makes tabular deep learning easy by providing a PyTorch-based data structure to handle complex tabular data, introducing a model abstraction to enable modular implementation of tabular models, and allowing external foundation models to be incorporated to handle complex columns (e.g.,… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: https://github.com/pyg-team/pytorch-frame

  10. arXiv:2403.14173  [pdf, other

    cs.RO

    HCTO: Optimality-Aware LiDAR Inertial Odometry with Hybrid Continuous Time Optimization for Compact Wearable Mapping System

    Authors: Jianping Li, Shenghai Yuan, Muqing Cao, Thien-Minh Nguyen, Kun Cao, Lihua Xie

    Abstract: Compact wearable mapping system (WMS) has gained significant attention due to their convenience in various applications. Specifically, it provides an efficient way to collect prior maps for 3D structure inspection and robot-based "last-mile delivery" in complex environments. However, vibrations in human motion and the uneven distribution of point cloud features in complex environments often lead t… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  11. arXiv:2403.06389  [pdf, other

    cond-mat.mtrl-sci physics.app-ph

    Suppression of flux jumps in high-$J_c$ Nb$_3$Sn conductors by ferromagnetic layer

    Authors: Cun Xue, Kai-Wei Cao, Tian He, Chong Wei, Wei Liu, Jun-Yi Ge

    Abstract: Flux jumps observed in high-$J_c$ Nb$_3$Sn conductors are urgent problems to construct high field superconducting magnets. The low-field instabilities usually reduce the current-carrying capability and thus cause the premature quench of Nb$_3$Sn coils at low magnetic field. In this paper, we explore suppressing the flux jumps by ferromagnetic (FM) layer. Firstly, we experimentally and theoreticall… ▽ More

    Submitted 4 June, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

  12. arXiv:2403.06265  [pdf, other

    cs.CL cs.AI cs.LG

    Unpacking Tokenization: Evaluating Text Compression and its Correlation with Model Performance

    Authors: Omer Goldman, Avi Caciularu, Matan Eyal, Kris Cao, Idan Szpektor, Reut Tsarfaty

    Abstract: Despite it being the cornerstone of BPE, the most common tokenization algorithm, the importance of compression in the tokenization process is still unclear. In this paper, we argue for the theoretical importance of compression, that can be viewed as 0-gram language modeling where equal probability is assigned to all tokens. We also demonstrate the empirical importance of compression for downstream… ▽ More

    Submitted 22 June, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

    Comments: EMNLP 2024, Findings

  13. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  14. arXiv:2403.03249  [pdf, other

    astro-ph.SR astro-ph.GA

    Nature vs. Nurture: Distinguishing Effects from Stellar Processing and Chemical Evolution on Carbon and Nitrogen in Red Giant Stars

    Authors: John D. Roberts, Marc H. Pinsonneault, Jennifer A. Johnson, Joel C. Zinn, David H. Weinberg, Mathieu Vrard, Jamie Tayar, Dennis Stello, Benoît Mosser, James W. Johnson, Kaili Cao, Keivan G. Stassun, Guy S. Stringfellow, Aldo Serenelli, Savita Mathur, Saskia Hekker, Rafael A. García, Yvonne P. Elsworth, Enrico Corsaro

    Abstract: The surface [C/N] ratios of evolved giants are strongly affected by the first dredge-up (FDU) of nuclear-processed material from stellar cores. C and N also have distinct nucleosynthetic origins and serve as diagnostics of mixing and mass loss. We use subgiants to find strong trends in the birth [C/N] with [Fe/H], which differ between the low-$α$ and high-$α$ populations. We demonstrate that these… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 19 pages, 19 figures

  15. arXiv:2402.16264  [pdf

    cond-mat.mes-hall

    Intrinsic supercurrent diode effect in NbSe2 nanobridge

    Authors: Yiwen Zhang, Jiliang Cai, Peng Dong, Jiadian He, Yifan Ding, Jinghui Wang, Xiang Zhou, Kecheng Cao, Yueshen Wu, Jun Li

    Abstract: The significance of the superconducting diode effect lies in its potential application as a fundamental component in the development of next-generation superconducting circuit technology. The stringent operating conditions at low temperatures have posed challenges for the conventional semiconductor diode, primarily due to its exceptionally high resistivity. In response to this limitation, various… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  16. arXiv:2402.16009  [pdf, other

    cs.DL cs.CL

    PST-Bench: Tracing and Benchmarking the Source of Publications

    Authors: Fanjin Zhang, Kun Cao, Yukuo Cen, Jifan Yu, Da Yin, Jie Tang

    Abstract: Tracing the source of research papers is a fundamental yet challenging task for researchers. The billion-scale citation relations between papers hinder researchers from understanding the evolution of science efficiently. To date, there is still a lack of an accurate and scalable dataset constructed by professional researchers to identify the direct source of their studied papers, based on which au… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: 8 pages, 3 appendix pages

  17. arXiv:2402.15810  [pdf, other

    cs.DL cs.CL cs.LG

    OAG-Bench: A Human-Curated Benchmark for Academic Graph Mining

    Authors: Fanjin Zhang, Shijie Shi, Yifan Zhu, Bo Chen, Yukuo Cen, Jifan Yu, Yelin Chen, Lulu Wang, Qingfei Zhao, Yuqing Cheng, Tianyi Han, Yuwei An, Dan Zhang, Weng Lam Tam, Kun Cao, Yunhe Pang, Xinyu Guan, Huihui Yuan, Jian Song, Xiaoyan Li, Yuxiao Dong, Jie Tang

    Abstract: With the rapid proliferation of scientific literature, versatile academic knowledge services increasingly rely on comprehensive academic graph mining. Despite the availability of public academic graphs, benchmarks, and datasets, these resources often fall short in multi-aspect and fine-grained annotations, are constrained to specific task types and domains, or lack underlying real academic graphs.… ▽ More

    Submitted 20 June, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

    Comments: KDD'24, 9 pages, 5 appendix pages

    Journal ref: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '24), August 25--29, 2024, Barcelona, Spain

  18. arXiv:2402.14204  [pdf, ps, other

    math.DG

    A sufficient condition for the height function to be constant in $ I_g\times_ρ\mathbb{P}^n $

    Authors: Kaijian Cao

    Abstract: This paper makes some modifications to the warped product space. Based on Alias,Impera and Rigoli, a warping function is added to the warped product space. This new function affects the Riemannian metric of the warped product space. In this new warped product space, we continue to discuss the sufficient condition for calculating the height of the immersed surface.

    Submitted 21 February, 2024; originally announced February 2024.

  19. arXiv:2402.12192  [pdf, other

    cs.CV

    Pan-Mamba: Effective pan-sharpening with State Space Model

    Authors: Xuanhua He, Ke Cao, Keyu Yan, Rui Li, Chengjun Xie, Jie Zhang, Man Zhou

    Abstract: Pan-sharpening involves integrating information from low-resolution multi-spectral and high-resolution panchromatic images to generate high-resolution multi-spectral counterparts. While recent advancements in the state space model, particularly the efficient long-range dependency modeling achieved by Mamba, have revolutionized computer vision community, its untapped potential in pan-sharpening mot… ▽ More

    Submitted 8 March, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  20. arXiv:2401.16923  [pdf, other

    cs.CV cs.RO eess.IV

    Fourier Prompt Tuning for Modality-Incomplete Scene Segmentation

    Authors: Ruiping Liu, Jiaming Zhang, Kunyu Peng, Yufan Chen, Ke Cao, Junwei Zheng, M. Saquib Sarfraz, Kailun Yang, Rainer Stiefelhagen

    Abstract: Integrating information from multiple modalities enhances the robustness of scene perception systems in autonomous vehicles, providing a more comprehensive and reliable sensory framework. However, the modality incompleteness in multi-modal segmentation remains under-explored. In this work, we establish a task called Modality-Incomplete Scene Segmentation (MISS), which encompasses both system-level… ▽ More

    Submitted 10 April, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted to IEEE IV 2024. The source code is publicly available at https://github.com/RuipingL/MISS

  21. arXiv:2401.12635  [pdf, other

    cond-mat.supr-con cond-mat.str-el

    Neutron Scattering Studies on the High-$T_c$ Superconductor La$_3$Ni$_2$O$_{7-δ}$ at Ambient Pressure

    Authors: Tao Xie, Mengwu Huo, Xiaosheng Ni, Feiran Shen, Xing Huang, Hualei Sun, Helen C. Walker, Devashibhai Adroja, Dehong Yu, Bing Shen, Lunhua He, Kun Cao, Meng Wang

    Abstract: After several decades of studies of high-temperature superconductivity, there is no compelling theory for the mechanism yet; however, the spin fluctuations have been widely believed to play a crucial role in forming the superconducting Cooper pairs. The recent discovery of high-temperature superconductivity near 80 K in the bilayer nickelate La$_3$Ni$_2$O$_7$ under pressure provides a new platform… ▽ More

    Submitted 4 April, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: 10 pages, 9 figures with supplementary information

  22. arXiv:2401.10685  [pdf, other

    cs.LG cs.AI eess.SP

    Towards End-to-End GPS Localization with Neural Pseudorange Correction

    Authors: Xu Weng, KV Ling, Haochen Liu, Kun Cao

    Abstract: Pseudorange errors are the root cause of localization inaccuracy in GPS. Previous data-driven methods regress and eliminate pseudorange errors using handcrafted intermediate labels. Unlike them, we propose an end-to-end GPS localization framework, E2E-PrNet, to train a neural network for pseudorange correction (PrNet) directly using the final task loss calculated with the ground truth of GPS recei… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

  23. arXiv:2401.01031  [pdf, other

    cond-mat.stat-mech

    Quantum phase transitions in the alternating XY chain with three-site interactions

    Authors: Kaiyuan Cao, Hao Fu, Xue Liu, Ming Zhong, Peiqing Tong

    Abstract: We investigate the quantum phase transition in the alternating XY chain with the XZX+YZY type of three-spin interactions. We present the exact solution derived by means of the Jordan-Wigner transformation and study the average magnetization, spin correlations, and von Neumann entropy to establish the phase diagram. The phase diagram consists of the ferromagnetic phases, the paramagnetic phases, an… ▽ More

    Submitted 4 January, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

    Comments: 10 pages,12 figures

  24. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  25. arXiv:2312.04693  [pdf, other

    cs.LG

    GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned Experts

    Authors: Shirley Wu, Kaidi Cao, Bruno Ribeiro, James Zou, Jure Leskovec

    Abstract: Graph data are inherently complex and heterogeneous, leading to a high natural diversity of distributional shifts. However, it remains unclear how to build machine learning architectures that generalize to complex non-synthetic distributional shifts naturally occurring in the real world. Here we develop GraphMETRO, a Graph Neural Network architecture, that reliably models natural diversity and cap… ▽ More

    Submitted 5 February, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: Graph Neural Networks, Mixture-of-experts, Distribution Shifts, Generalization

  26. arXiv:2311.09906  [pdf, ps, other

    math.DG

    Fino-Vezzoni conjecture on Lie algebras with abelian ideals of codimension two

    Authors: Kexiang Cao, Fangyang Zheng

    Abstract: In this paper, we confirm the Fino-Vezzoni Conjecture for unimodular Lie algebras which contain abelian ideals of codimension two, a natural generalization to the class of almost abelian Lie algebras. This provides new evidence towards the validity of the conjecture on a very special type of $3$-step solvmanifolds.

    Submitted 18 March, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: 20 pages, updated some reference info and added a couple of appendices

    MSC Class: 53C55

  27. arXiv:2311.08025  [pdf, other

    cond-mat.stat-mech

    Relaxation dynamics in the alternating XY chain following a quantum quench

    Authors: Kaiyuan Cao, Yayun Hu, Peiqing Tong, Guangwen Yang, Peng Liu

    Abstract: We investigate the relaxation dynamics of the fermion two-point correlation function $C_{mn}(t)=\langleψ(t)|c_{m}^†c_{n}|ψ(t)\rangle$ in the XY chain with staggered nearest-neighbor hopping interaction after a quench. We find that the deviation $δC_{mn}(t)=C_{mn}(t)-C_{mn}(\infty)$ decays with time following the power law behavior $t^{-μ}$, where the exponent $μ$ depends on whether the quench is t… ▽ More

    Submitted 4 January, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: 12 pages, 8 figures

  28. arXiv:2311.03043  [pdf, ps, other

    quant-ph cond-mat.other

    Topological phases of many-body non-Hermitian systems

    Authors: Kui Cao, Su-Peng Kou

    Abstract: We show that many-body fermionic non-Hermitian systems require two distinct sets of topological invariants to describe the topology of energy bands and quantum states respectively, with the latter yet to be explored. We identify 10 symmetry classes -- determined by particle-hole, linearized time-reversal, and linearized chiral symmetries. Each class has topological invariant associated with each d… ▽ More

    Submitted 23 April, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

  29. arXiv:2311.00266  [pdf, other

    cond-mat.supr-con

    Constructing the Fulde-Ferrell-Larkin-Ovchinnikov state in antiferromagnetic insulator CrOCl

    Authors: Yifan Ding, Jiadian He, Shihao Zhang, Huakun Zuo, Pingfan Gu, Jiliang Cai, Xiaohui Zeng, Pu Yan, Kecheng Cao, Kenji Watanabe, Takashi Taniguchi, Peng Dong, Yiwen Zhang, Yueshen Wu, Xiang Zhou, Jinghui Wang, Yulin Chen, Yu Ye, Jianpeng Liu, Jun Li

    Abstract: Time reversal symmetry breaking in superconductors, resulting from external magnetic fields or spontaneous magnetization, often leads to unconventional superconducting properties. In this way, a conventional Fulde-Ferrell-Larkin-Ovchinnikov (FFLO) state, characterized by the Cooper pairs with nonzero total momentum, may be realized by the Zeeman effect caused from external magnetic fields. Here, w… ▽ More

    Submitted 31 October, 2023; originally announced November 2023.

  30. arXiv:2309.14097  [pdf, other

    cs.OH

    How do users design scientific workflows? The Case of Snakemake

    Authors: Sebastian Pohl, Nourhan Elfaramawy, Kedi Cao, Birte Kehr, Matthias Weidlich

    Abstract: Scientific workflows automate the analysis of large-scale scientific data, fostering the reuse of data processing operators as well as the reproducibility and traceability of analysis results. In exploratory research, however, workflows are continuously adapted, utilizing a wide range of tools and software libraries, to test scientific hypotheses. Script-based workflow engines cater to the require… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

  31. arXiv:2309.13035  [pdf, other

    cs.RO

    PyPose v0.6: The Imperative Programming Interface for Robotics

    Authors: Zitong Zhan, Xiangfu Li, Qihang Li, Haonan He, Abhinav Pandey, Haitao Xiao, Yangmengfei Xu, Xiangyu Chen, Kuan Xu, Kun Cao, Zhipeng Zhao, Zihan Wang, Huan Xu, Zihang Fang, Yutian Chen, Wentao Wang, Xu Fang, Yi Du, Tianhao Wu, Xiao Lin, Yuheng Qiu, Fan Yang, Jingnan Shi, Shaoshu Su, Yiren Lu , et al. (11 additional authors not shown)

    Abstract: PyPose is an open-source library for robot learning. It combines a learning-based approach with physics-based optimization, which enables seamless end-to-end robot learning. It has been used in many tasks due to its meticulously designed application programming interface (API) and efficient implementation. From its initial launch in early 2022, PyPose has experienced significant enhancements, inco… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  32. arXiv:2309.02686  [pdf, other

    cond-mat.stat-mech

    Dynamical relaxation behavior of extended XY chain with gapless phase following a quantum quench

    Authors: Kaiyuan Cao, Yayun Hu, Peiqing Tong, Guangwen Yang

    Abstract: We investigate the dynamical relaxation behavior of the two-point correlation in extended XY models with a gapless phase after quenches from various initial states. Specifically, we study the XY chain with gapless phase induced by the additional interactions: Dzyaloshinskii-Moriya interaction and XZY-YZX type of three-site interaction. When quenching from the gapped phase, we observe that the addi… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: 12 pages, 10 figures

  33. arXiv:2308.13490  [pdf, other

    cs.LG cs.AR cs.SI

    TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs

    Authors: Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, Kaidi Cao, Bahare Fatemi, Mike Burrows, Charith Mendis, Bryan Perozzi

    Abstract: Precise hardware performance models play a crucial role in code optimizations. They can assist compilers in making heuristic decisions or aid autotuners in identifying the optimal configuration for a given program. For example, the autotuner for XLA, a machine learning compiler, discovered 10-20% speedup on state-of-the-art models serving substantial production traffic at Google. Although there ex… ▽ More

    Submitted 5 December, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

  34. arXiv:2308.03209  [pdf, other

    cs.LG

    Communication-Free Distributed GNN Training with Vertex Cut

    Authors: Kaidi Cao, Rui Deng, Shirley Wu, Edward W Huang, Karthik Subbian, Jure Leskovec

    Abstract: Training Graph Neural Networks (GNNs) on real-world graphs consisting of billions of nodes and edges is quite challenging, primarily due to the substantial memory needed to store the graph and its intermediate node and edge features, and there is a pressing need to speed up the training process. A common approach to achieve speed up is to divide the graph into many smaller subgraphs, which are the… ▽ More

    Submitted 6 August, 2023; originally announced August 2023.

  35. arXiv:2307.12857  [pdf, ps, other

    cond-mat.supr-con cond-mat.str-el

    Coincidence detection probability of $(γ, 2e)$ photoemission measurement

    Authors: Yuehua Su, Kun Cao, Chao Zhang

    Abstract: In the study of the strongly correlated electrons, one of the challenging core tasks is to develop the potential techniques for direct detection of the many-body correlations of the strongly correlated electrons. $(γ, 2e)$ photoemission technique has been developed to investigate the two-body correlations of the target correlated electrons. In this article, we will focus on this technique for the… ▽ More

    Submitted 24 July, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: 10 pages, 3 figures

  36. arXiv:2307.07763  [pdf, other

    cs.RO cs.CV eess.IV

    Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents

    Authors: Ke Cao, Ruiping Liu, Ze Wang, Kunyu Peng, Jiaming Zhang, Junwei Zheng, Zhifeng Teng, Kailun Yang, Rainer Stiefelhagen

    Abstract: The mobile robot relies on SLAM (Simultaneous Localization and Mapping) to provide autonomous navigation and task execution in complex and unknown environments. However, it is hard to develop a dedicated algorithm for mobile robots due to dynamic and challenging situations, such as poor lighting conditions and motion blur. To tackle this issue, we propose a tightly-coupled LiDAR-visual SLAM based… ▽ More

    Submitted 25 December, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

    Comments: Accepted to ROBIO 2023

  37. arXiv:2307.07757  [pdf, other

    cs.CV cs.HC cs.RO eess.IV

    Open Scene Understanding: Grounded Situation Recognition Meets Segment Anything for Helping People with Visual Impairments

    Authors: Ruiping Liu, Jiaming Zhang, Kunyu Peng, Junwei Zheng, Ke Cao, Yufan Chen, Kailun Yang, Rainer Stiefelhagen

    Abstract: Grounded Situation Recognition (GSR) is capable of recognizing and interpreting visual scenes in a contextually intuitive way, yielding salient activities (verbs) and the involved entities (roles) depicted in images. In this work, we focus on the application of GSR in assisting people with visual impairments (PVI). However, precise localization information of detected objects is often required to… ▽ More

    Submitted 15 July, 2023; originally announced July 2023.

    Comments: Code will be available at https://github.com/RuipingL/OpenSU

  38. arXiv:2305.12322  [pdf, other

    cs.LG cs.SI

    Learning Large Graph Property Prediction via Graph Segment Training

    Authors: Kaidi Cao, Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, Dustin Zelle, Yanqi Zhou, Charith Mendis, Jure Leskovec, Bryan Perozzi

    Abstract: Learning to predict properties of large graphs is challenging because each prediction requires the knowledge of an entire graph, while the amount of memory available during training is bounded. Here we propose Graph Segment Training (GST), a general framework that utilizes a divide-and-conquer approach to allow learning large graph property prediction with a constant memory footprint. GST first di… ▽ More

    Submitted 5 November, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

  39. arXiv:2305.05461  [pdf, other

    cs.CL

    What is the best recipe for character-level encoder-only modelling?

    Authors: Kris Cao

    Abstract: This paper aims to benchmark recent progress in language understanding models that output contextualised representations at the character level. Many such modelling architectures and methods to train those architectures have been proposed, but it is currently unclear what the relative contributions of the architecture vs. the pretraining objective are to final model performance. We explore the des… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: accepted at ACL 2023

  40. arXiv:2305.00271  [pdf, other

    cs.RO

    Path Planning for Multiple Tethered Robots Using Topological Braids

    Authors: Muqing Cao, Kun Cao, Shenghai Yuan, Kangcheng Liu, Yan Loi Wong, Lihua Xie

    Abstract: Path planning for multiple tethered robots is a challenging problem due to the complex interactions among the cables and the possibility of severe entanglements. Previous works on this problem either consider idealistic cable models or provide no guarantee for entanglement-free paths. In this work, we present a new approach to address this problem using the theory of braids. By establishing a topo… ▽ More

    Submitted 15 June, 2023; v1 submitted 29 April, 2023; originally announced May 2023.

    Comments: Accepted for presentation in Robotics: Science and Systems 2023

  41. arXiv:2304.07511  [pdf, other

    cs.HC

    Pilgrimage to Pureland: Art, Perception and the Wutai Mural VR Reconstruction

    Authors: Rongxuan Mu, Yuhe Nie, Kent Cao, Ruoxin You, Yinzong Wei, Xin Tong

    Abstract: Virtual reality (VR) supports audiences to engage with cultural heritage proactively. We designed an easy-to-access and guided Pilgrimage To Pureland VR reconstruction of Dunhuang Mogao Grottoes to offer the general public an accessible and engaging way to explore the Dunhuang murals. We put forward an immersive VR reconstruction paradigm that can efficiently convert complex 2D artwork into a VR e… ▽ More

    Submitted 15 April, 2023; originally announced April 2023.

  42. Statistical mechanics for non-Hermitian quantum systems

    Authors: Kui Cao, Su-Peng Kou

    Abstract: We present a systematic study of statistical mechanics for non-Hermitian quantum systems. Our work reveals that the stability of a non-Hermitian system necessitates the existence of a single path-dependent conserved quantity, which, in conjunction with the system's Hamiltonian, dictates the equilibrium state. By elucidating the relationship between the Hamiltonian and the supported conserved quant… ▽ More

    Submitted 1 December, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

  43. arXiv:2304.03854  [pdf, other

    cs.LG

    Revisiting Deep Learning for Variable Type Recovery

    Authors: Kevin Cao, Kevin Leach

    Abstract: Compiled binary executables are often the only available artifact in reverse engineering, malware analysis, and software systems maintenance. Unfortunately, the lack of semantic information like variable types makes comprehending binaries difficult. In efforts to improve the comprehensibility of binaries, researchers have recently used machine learning techniques to predict semantic information co… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: In The 31st International Conference on Program Comprehension(ICPC 2023 RENE)

  44. arXiv:2304.01422  [pdf, ps, other

    quant-ph cond-mat.other

    Non-Hermitian Chiral Skin Effect

    Authors: Xinran Ma, Kui Cao, Xiaoran Wang, Zheng Wei, Supeng Kou

    Abstract: The interplay between non-Hermitian effects and topological insulators has become a frontier of research in non-Hermitian physics. However, the existence of a non-Hermitian skin effect for topological-protected edge states remains controversial. In this paper, we discover an alternative form of the non-Hermitian skin effect called the non-Hermitian chiral skin effect (NHCSE). NHCSE is a non-Hermit… ▽ More

    Submitted 24 July, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

  45. arXiv:2303.15966  [pdf, other

    cond-mat.stat-mech

    Aperiodic dynamical quantum phase transitions in multi-band Bloch Hamiltonian and its origin

    Authors: Kaiyuan Cao, Hao Guo, Guangwen Yang

    Abstract: We investigate the dynamical quantum phase transition (DQPT) in the multi-band Bloch Hamiltonian of the one-dimensional periodic Kitaev model, focusing on quenches from a Bloch band. By analyzing the dynamical free energy and Pancharatnam geometric phase, we show that the critical times of DQPTs deviate from periodic spacing due to the multi-band effect, contrasting with results from two-band mode… ▽ More

    Submitted 27 July, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: 11 pages, 11 figures

  46. arXiv:2303.11910  [pdf, other

    cs.CV

    360BEV: Panoramic Semantic Mapping for Indoor Bird's-Eye View

    Authors: Zhifeng Teng, Jiaming Zhang, Kailun Yang, Kunyu Peng, Hao Shi, Simon Reiß, Ke Cao, Rainer Stiefelhagen

    Abstract: Seeing only a tiny part of the whole is not knowing the full circumstance. Bird's-eye-view (BEV) perception, a process of obtaining allocentric maps from egocentric views, is restricted when using a narrow Field of View (FoV) alone. In this work, mapping from 360° panoramas to BEV semantics, the 360BEV task, is established for the first time to achieve holistic representations of indoor scenes in… ▽ More

    Submitted 4 September, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: Code and datasets are available at the project page: https://jamycheung.github.io/360BEV.html. Accepted to WACV 2024

  47. arXiv:2303.08750  [pdf, other

    astro-ph.IM astro-ph.CO

    Simulating image coaddition with the Nancy Grace Roman Space Telescope: II. Analysis of the simulated images and implications for weak lensing

    Authors: Masaya Yamamoto, Katherine Laliotis, Emily Macbeth, Tianqing Zhang, Christopher M. Hirata, M. A. Troxel, Kaili Cao, Ami Choi, Jahmour Givans, Katrin Heitmann, Mustapha Ishak, Mike Jarvis, Eve Kovacs, Heyang Long, Rachel Mandelbaum, Andy Park, Anna Porredon, Christopher W. Walter, W. Michael Wood-Vasey

    Abstract: One challenge for applying current weak lensing analysis tools to the Nancy Grace Roman Space Telescope is that individual images will be undersampled. Our companion paper presented an initial application of Imcom - an algorithm that builds an optimal mapping from input to output pixels to reconstruct a fully sampled combined image - on the Roman image simulations. In this paper, we measure the ou… ▽ More

    Submitted 12 January, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 25 pages, 20 figures. Submitted to MNRAS

  48. arXiv:2303.08749  [pdf, other

    astro-ph.IM astro-ph.CO

    Simulating image coaddition with the Nancy Grace Roman Space Telescope: I. Simulation methodology and general results

    Authors: Christopher M. Hirata, Masaya Yamamoto, Katherine Laliotis, Emily Macbeth, M. A. Troxel, Tianqing Zhang, Kaili Cao, Ami Choi, Jahmour Givans, Katrin Heitmann, Mustapha Ishak, Mike Jarvis, Eve Kovacs, Heyang Long, Rachel Mandelbaum, Andy Park, Anna Porredon, Christopher W. Walter, W. Michael Wood-Vasey

    Abstract: The upcoming Nancy Grace Roman Space Telescope will carry out a wide-area survey in the near infrared. A key science objective is the measurement of cosmic structure via weak gravitational lensing. Roman data will be undersampled, which introduces new challenges in the measurement of source galaxy shapes; a potential solution is to use linear algebra-based coaddition techniques such as Imcom that… ▽ More

    Submitted 12 January, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 28 pages, 19 figures, matches version accepted by MNRAS

  49. arXiv:2303.07669  [pdf, other

    cs.LG

    AutoTransfer: AutoML with Knowledge Transfer -- An Application to Graph Neural Networks

    Authors: Kaidi Cao, Jiaxuan You, Jiaju Liu, Jure Leskovec

    Abstract: AutoML has demonstrated remarkable success in finding an effective neural architecture for a given machine learning task defined by a specific dataset and an evaluation metric. However, most present AutoML techniques consider each task independently from scratch, which requires exploring many architectures, leading to high computational cost. Here we propose AutoTransfer, an AutoML solution that i… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: ICLR 2023

  50. arXiv:2303.07666  [pdf, other

    cs.LG

    Relational Multi-Task Learning: Modeling Relations between Data and Tasks

    Authors: Kaidi Cao, Jiaxuan You, Jure Leskovec

    Abstract: A key assumption in multi-task learning is that at the inference time the multi-task model only has access to a given data point but not to the data point's labels from other tasks. This presents an opportunity to extend multi-task learning to utilize data point's labels from other auxiliary tasks, and this way improves performance on the new task. Here we introduce a novel relational multi-task l… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: ICLR 2022 Spotlight