Skip to main content

Showing 1–50 of 1,183 results for author: He, S

  1. arXiv:2407.08440  [pdf, other

    cs.CL cs.AI

    Beyond Instruction Following: Evaluating Rule Following of Large Language Models

    Authors: Wangtao Sun, Chenxiang Zhang, Xueyou Zhang, Ziyang Huang, Haotian Xu, Pei Chen, Shizhu He, Jun Zhao, Kang Liu

    Abstract: Although Large Language Models (LLMs) have demonstrated strong instruction-following ability to be helpful, they are further supposed to be controlled and guided by rules in real-world scenarios to be safe, and accurate in responses. This demands the possession of rule-following capability of LLMs. However, few works have made a clear evaluation of the rule-following capability of LLMs. Previous s… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2407.07554  [pdf, other

    cs.GR cs.SD eess.AS

    Beat-It: Beat-Synchronized Multi-Condition 3D Dance Generation

    Authors: Zikai Huang, Xuemiao Xu, Cheng Xu, Huaidong Zhang, Chenxi Zheng, Jing Qin, Shengfeng He

    Abstract: Dance, as an art form, fundamentally hinges on the precise synchronization with musical beats. However, achieving aesthetically pleasing dance sequences from music is challenging, with existing methods often falling short in controllability and beat alignment. To address these shortcomings, this paper introduces Beat-It, a novel framework for beat-specific, key pose-guided dance generation. Unlike… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  3. arXiv:2407.05092  [pdf, other

    cs.CL

    Exploring Sound Change Over Time: A Review of Computational and Human Perception

    Authors: Siqi He, Wei Zhao

    Abstract: Computational and human perception are often considered separate approaches for studying sound changes over time; few works have touched on the intersection of both. To fill this research gap, we provide a pioneering review contrasting computational with human perception from the perspectives of methods and tasks. Overall, computational approaches rely on computer-driven models to perceive histori… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: LChange24 Camera Ready

  4. arXiv:2407.04997  [pdf, other

    cs.SE cs.AI cs.HC

    Achieving Tool Calling Functionality in LLMs Using Only Prompt Engineering Without Fine-Tuning

    Authors: Shengtao He

    Abstract: Currently, the vast majority of locally deployed open-source large language models (LLMs) and some commercial model interfaces do not support stable tool calling functionality. The existing solution involves fine-tuning LLMs, which results in significant time and computational resource consumption. This paper proposes a method that enables LLMs to achieve stable tool calling capabilities using onl… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: 5 pages, 2 figures,review comments welcome

    ACM Class: I.2.7

  5. arXiv:2407.04948  [pdf, other

    cs.CV

    Zero-shot Object Counting with Good Exemplars

    Authors: Huilin Zhu, Jingling Yuan, Zhengwei Yang, Yu Guo, Zheng Wang, Xian Zhong, Shengfeng He

    Abstract: Zero-shot object counting (ZOC) aims to enumerate objects in images using only the names of object classes during testing, without the need for manual annotations. However, a critical challenge in current ZOC methods lies in their inability to identify high-quality exemplars effectively. This deficiency hampers scalability across diverse classes and undermines the development of strong visual asso… ▽ More

    Submitted 9 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

  6. arXiv:2407.04621  [pdf, other

    cs.CV

    OneRestore: A Universal Restoration Framework for Composite Degradation

    Authors: Yu Guo, Yuan Gao, Yuxu Lu, Huilin Zhu, Ryan Wen Liu, Shengfeng He

    Abstract: In real-world scenarios, image impairments often manifest as composite degradations, presenting a complex interplay of elements such as low light, haze, rain, and snow. Despite this reality, existing restoration methods typically target isolated degradation types, thereby falling short in environments where multiple degrading factors coexist. To bridge this gap, our study proposes a versatile imag… ▽ More

    Submitted 10 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

  7. arXiv:2407.04152  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    VoxAct-B: Voxel-Based Acting and Stabilizing Policy for Bimanual Manipulation

    Authors: I-Chun Arthur Liu, Sicheng He, Daniel Seita, Gaurav Sukhatme

    Abstract: Bimanual manipulation is critical to many robotics applications. In contrast to single-arm manipulation, bimanual manipulation tasks are challenging due to higher-dimensional action spaces. Prior works leverage large amounts of data and primitive actions to address this problem, but may suffer from sample inefficiency and limited generalization across various tasks. To this end, we propose VoxAct-… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  8. arXiv:2407.03698  [pdf, ps, other

    hep-th

    Generalized $T\overline{T}$-like Deformations in Duality-Invariant Nonlinear Electrodynamic Theories

    Authors: H. Babaei-Aghbolagh, Song He, Hao Ouyang

    Abstract: This study introduces a high-order perturbation methodology to categorize two primary solution types within duality-invariant nonlinear electrodynamic theories, adhering to the differential self-duality criterion. The first solution type aligns with irrelevant stress tensor flows, resembling $T\bar{T}$ dynamics, and the second involves a blend of irrelevant $T\bar{T}$-like and marginal root-… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 25 pages, 1 figure

  9. arXiv:2407.01688  [pdf, other

    cs.SE

    How We Built Cedar: A Verification-Guided Approach

    Authors: Craig Disselkoen, Aaron Eline, Shaobo He, Kyle Headley, Michael Hicks, Kesha Hietala, John Kastner, Anwar Mamat, Matt McCutchen, Neha Rungta, Bhakti Shah, Emina Torlak, Andrew Wells

    Abstract: This paper presents verification-guided development (VGD), a software engineering process we used to build Cedar, a new policy language for expressive, fast, safe, and analyzable authorization. Developing a system with VGD involves writing an executable model of the system and mechanically proving properties about the model; writing production code for the system and using differential random test… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  10. arXiv:2407.00987  [pdf, other

    cs.NI eess.SY

    Exploiting Dependency-Aware Priority Adjustment for Mixed-Criticality TSN Flow Scheduling

    Authors: Miao Guo, Yifei Sun, Chaojie Gu, Shibo He, Zhiguo Shi

    Abstract: Time-Sensitive Networking (TSN) serves as a one-size-fits-all solution for mixed-criticality communication, in which flow scheduling is vital to guarantee real-time transmissions. Traditional approaches statically assign priorities to flows based on their associated applications, resulting in significant queuing delays. In this paper, we observe that assigning different priorities to a flow leads… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: This paper has been accepted by IWQoS'24

  11. arXiv:2406.19708  [pdf, other

    cs.NE cs.AI cs.CE q-bio.NC

    A Differentiable Approach to Multi-scale Brain Modeling

    Authors: Chaoming Wang, Muyang Lyu, Tianqiu Zhang, Sichao He, Si Wu

    Abstract: We present a multi-scale differentiable brain modeling workflow utilizing BrainPy, a unique differentiable brain simulator that combines accurate brain simulation with powerful gradient-based optimization. We leverage this capability of BrainPy across different brain scales. At the single-neuron level, we implement differentiable neuron models and employ gradient methods to optimize their fit to e… ▽ More

    Submitted 1 July, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

    Comments: 2nd Differentiable Almost Everything Workshop at ICML 2024

  12. arXiv:2406.18548  [pdf

    eess.IV cs.CV

    Exploration of Multi-Scale Image Fusion Systems in Intelligent Medical Image Analysis

    Authors: Yuxiang Hu, Haowei Yang, Ting Xu, Shuyao He, Jiajie Yuan, Haozhang Deng

    Abstract: The diagnosis of brain cancer relies heavily on medical imaging techniques, with MRI being the most commonly used. It is necessary to perform automatic segmentation of brain tumors on MRI images. This project intends to build an MRI algorithm based on U-Net. The residual network and the module used to enhance the context information are combined, and the void space convolution pooling pyramid is a… ▽ More

    Submitted 23 May, 2024; originally announced June 2024.

  13. arXiv:2406.18085  [pdf, other

    cs.CL

    Multilingual Knowledge Graph Completion from Pretrained Language Models with Knowledge Constraints

    Authors: Ran Song, Shizhu He, Shengxiang Gao, Li Cai, Kang Liu, Zhengtao Yu, Jun Zhao

    Abstract: Multilingual Knowledge Graph Completion (mKGC) aim at solving queries like (h, r, ?) in different languages by reasoning a tail entity t thus improving multilingual knowledge graphs. Previous studies leverage multilingual pretrained language models (PLMs) and the generative paradigm to achieve mKGC. Although multilingual pretrained language models contain extensive knowledge of different languages… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 11 pages, ACL 2023

  14. arXiv:2406.17739  [pdf, other

    cs.CL cs.AI

    Find Parent then Label Children: A Two-stage Taxonomy Completion Method with Pre-trained Language Model

    Authors: Fei Xia, Yixuan Weng, Shizhu He, Kang Liu, Jun Zhao

    Abstract: Taxonomies, which organize domain concepts into hierarchical structures, are crucial for building knowledge systems and downstream applications. As domain knowledge evolves, taxonomies need to be continuously updated to include new concepts. Previous approaches have mainly focused on adding concepts to the leaf nodes of the existing hierarchical tree, which does not fully utilize the taxonomy's kn… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  15. arXiv:2406.17005  [pdf, other

    cs.CV

    PVUW 2024 Challenge on Complex Video Understanding: Methods and Results

    Authors: Henghui Ding, Chang Liu, Yunchao Wei, Nikhila Ravi, Shuting He, Song Bai, Philip Torr, Deshui Miao, Xin Li, Zhenyu He, Yaowei Wang, Ming-Hsuan Yang, Zhensong Xu, Jiangtao Yao, Chengjing Wu, Ting Liu, Luoqi Liu, Xinyu Liu, Jing Zhang, Kexin Zhang, Yuting Yang, Licheng Jiao, Shuyuan Yang, Mingqi Gao, Jingnan Luo , et al. (12 additional authors not shown)

    Abstract: Pixel-level Video Understanding in the Wild Challenge (PVUW) focus on complex video understanding. In this CVPR 2024 workshop, we add two new tracks, Complex Video Object Segmentation Track based on MOSE dataset and Motion Expression guided Video Segmentation track based on MeViS dataset. In the two new tracks, we provide additional videos and annotations that feature challenging elements, such as… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: MOSE Challenge: https://henghuiding.github.io/MOSE/ChallengeCVPR2024, MeViS Challenge: https://henghuiding.github.io/MeViS/ChallengeCVPR2024

  16. arXiv:2406.16242  [pdf, other

    math.DG

    Foliation of area minimizing hypersurfaces in asymptotically flat manifolds and Schoen's conjecture

    Authors: Shihang He, Yuguang Shi, Haobin Yu

    Abstract: In this paper, we demonstrate that any asymptotically flat manifold $(M^n, g)$ with $4\leq n\leq 7$ can be foliated by a family of area-minimizing hypersurfaces, each of which is asymptotic to Cartesian coordinate hyperplanes defined at an end of $(M^n, g)$. As an application of this foliation, we show that for any asymptotically flat manifold $(M^n, g)$ with $4\leq n\leq 7$, nonnegative scalar cu… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 39pages, 8 figures. Comments are welcome!

  17. arXiv:2406.15786  [pdf, other

    cs.LG cs.AI cs.CL

    What Matters in Transformers? Not All Attention is Needed

    Authors: Shwai He, Guoheng Sun, Zheyu Shen, Ang Li

    Abstract: Scaling Transformer-based large language models (LLMs) has demonstrated promising performance across various tasks. However, this scaling also introduces redundant structures, posing challenges for real-world deployment. Despite some recognition of redundancy in LLMs, the variability of redundancy across different structures, such as MLP and Attention layers, is under-explored. In this work, we in… ▽ More

    Submitted 7 July, 2024; v1 submitted 22 June, 2024; originally announced June 2024.

    Comments: 15 pages, 13 figures, 6 tables

  18. arXiv:2406.12772  [pdf, other

    hep-th gr-qc hep-ph nucl-th

    QCD Phase Diagram at finite Magnetic Field and Chemical Potential: A Holographic Approach Using Machine Learning

    Authors: Rong-Gen Cai, Song He, Li Li, Hong-An Zeng

    Abstract: By leveraging neural networks, we address the inverse problem of constructing a quantitative 2+1-flavor holographic QCD model based on state-of-the-art lattice QCD data. Our model demonstrates quantitative agreement with the latest lattice QCD results. We construct the full phase diagram at finite magnetic field $B$, baryon chemical potential $μ_B$ and temperature $T$. We uncover rich phase struct… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 10 pages, 13 figures

  19. arXiv:2406.12382  [pdf, other

    cs.CL

    From Instance Training to Instruction Learning: Task Adapters Generation from Instructions

    Authors: Huanxuan Liao, Yao Xu, Shizhu He, Yuanzhe Zhang, Yanchao Hao, Shengping Liu, Kang Liu, Jun Zhao

    Abstract: Large language models (LLMs) have acquired the ability to solve general tasks by utilizing instruction finetuning (IFT). However, IFT still relies heavily on instance training of extensive task data, which greatly limits the adaptability of LLMs to real-world scenarios where labeled task instances are scarce and broader task generalization becomes paramount. Contrary to LLMs, humans acquire skills… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  20. arXiv:2406.12100  [pdf, other

    cs.LG cs.RO

    Adaptive Uncertainty Quantification for Trajectory Prediction Under Distributional Shift

    Authors: Huiqun Huang, Sihong He, Fei Miao

    Abstract: Trajectory prediction models that can infer both finite future trajectories and their associated uncertainties of the target vehicles in an online setting (e.g., real-world application scenarios) is crucial for ensuring the safe and robust navigation and path planning of autonomous vehicle motion. However, the majority of existing trajectory prediction models have neither considered reducing the u… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 9 pages, 2 figures

  21. arXiv:2406.08756  [pdf, other

    cs.DC cs.LG

    Optimizing Large Model Training through Overlapped Activation Recomputation

    Authors: Ping Chen, Wenjie Zhang, Shuibing He, Yingjie Gu, Zhuwei Peng, Kexin Huang, Xuan Zhan, Weijian Chen, Yi Zheng, Zhefeng Wang, Yanlong Yin, Gang Chen

    Abstract: Large model training has been using recomputation to alleviate the memory pressure and pipelining to exploit the parallelism of data, tensor, and devices. The existing recomputation approaches may incur up to 40% overhead when training real-world models, e.g., the GPT model with 22B parameters. This is because they are executed on demand in the critical training path. In this paper, we design a ne… ▽ More

    Submitted 27 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: 13 pages

  22. arXiv:2406.08538  [pdf, other

    hep-th

    Supergluon scattering in AdS: constructibility, spinning amplitudes, and new structures

    Authors: Qu Cao, Song He, Xiang Li, Yichao Tang

    Abstract: We elaborate on a new recursive method proposed in arXiv:2312.15484 for computing tree-level $n$-point supergluon amplitudes as well as those with one gluon, i.e., spinning amplitudes, in ${\rm AdS}_5 \times S^3$. We present an improved proof for the so-called "constructibility" of supergluon and spinning amplitudes based on their factorizations and flat-space limit, which allows us to determine t… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 46 pages, multiple figures

  23. arXiv:2406.08114  [pdf

    cond-mat.mes-hall cond-mat.str-el cond-mat.supr-con

    Massive 1D Dirac Line, Solitons and Reversible Manipulation on the Surface of a Prototype Obstructed Atomic Insulator, Silicon

    Authors: Zhongkai Liu, Peng Deng, Yuanfeng Xu, Haifeng Yang, Ding Pei, Cheng Chen, Shanmei He, Defa Liu, Sung-Kwan Mo, Timur Kim, Cephise Cacho, Hong Yao, Zhi-Da Song, Xi Chen, Zhong Wang, Binghai Yan, Lexian Yang, Bogdan A. Bernevig, Yulin Chen

    Abstract: Topologically trivial insulators can be classified into atomic insulators (AIs) and obstructed atomic insulators (OAIs) depending on whether the Wannier charge centers are localized or not at spatial positions occupied by atoms. An OAI can possess unusual properties such as surface states along certain crystalline surfaces, which advantageously appear in materials with much larger bulk energy gap… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  24. arXiv:2406.07147  [pdf

    cs.HC cs.AI cs.CY

    Wearable Device-Based Real-Time Monitoring of Physiological Signals: Evaluating Cognitive Load Across Different Tasks

    Authors: Ling He, Yanxin Chen, Wenqi Wang, Shuting He, Xiaoqiang Hu

    Abstract: This study employs cutting-edge wearable monitoring technology to conduct high-precision, high-temporal-resolution (1-second interval) cognitive load assessment on electroencephalogram (EEG) data from the FP1 channel and heart rate variability (HRV) data of secondary vocational students. By jointly analyzing these two critical physiological indicators, the research delves into their application va… ▽ More

    Submitted 3 July, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

  25. arXiv:2406.05977  [pdf, other

    cs.IR

    Weighted KL-Divergence for Document Ranking Model Refinement

    Authors: Yingrui Yang, Yifan Qiao, Shanxiu He, Tao Yang

    Abstract: Transformer-based retrieval and reranking models for text document search are often refined through knowledge distillation together with contrastive learning. A tight distribution matching between the teacher and student models can be hard as over-calibration may degrade training effectiveness when a teacher does not perform well. This paper contrastively reweights KL divergence terms to prioritiz… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  26. arXiv:2406.05392  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Deconstructing The Ethics of Large Language Models from Long-standing Issues to New-emerging Dilemmas

    Authors: Chengyuan Deng, Yiqun Duan, Xin Jin, Heng Chang, Yijun Tian, Han Liu, Henry Peng Zou, Yiqiao Jin, Yijia Xiao, Yichen Wang, Shenghao Wu, Zongxing Xie, Kuofeng Gao, Sihong He, Jun Zhuang, Lu Cheng, Haohan Wang

    Abstract: Large Language Models (LLMs) have achieved unparalleled success across diverse language modeling tasks in recent years. However, this progress has also intensified ethical concerns, impacting the deployment of LLMs in everyday contexts. This paper provides a comprehensive survey of ethical challenges associated with LLMs, from longstanding issues such as copyright infringement, systematic bias, an… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  27. arXiv:2406.04042  [pdf, ps, other

    hep-th gr-qc math-ph

    Holographic stress tensor correlators on higher genus Riemann surfaces

    Authors: Song He, Yun-ze Li, Yunfei Xie

    Abstract: In this work, we present a comprehensive study of holographic stress tensor correlators on general Riemann surfaces, extending beyond the previously well-studied torus cases to explore higher genus conformal field theories (CFTs) within the framework of the Anti-de Sitter/conformal field theory (AdS/CFT) correspondence. We develop a methodological approach to compute holographic stress tensor corr… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 37 pages, 1 figure

  28. arXiv:2406.03838  [pdf, other

    hep-th

    On universal splittings of tree-level particle and string scattering amplitudes

    Authors: Qu Cao, Jin Dong, Song He, Canxin Shi, Fanky Zhu

    Abstract: In this paper, we study the newly discovered universal splitting behavior for tree-level scattering amplitudes of particles and strings~\cite{Cao:2024gln}: when a set of Mandelstam variables (and Lorentz products involving polarizations for gluons/gravitons) vanish, the $n$-point amplitude factorizes as the product of two lower-point {\it currents} with $n{+}3$ external legs in total. We refer to… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 37 pages, 3 figures

  29. arXiv:2406.03097  [pdf, other

    cs.LG cs.AI

    Enhancing the Resilience of Graph Neural Networks to Topological Perturbations in Sparse Graphs

    Authors: Shuqi He, Jun Zhuang, Ding Wang, Luyao Peng, Jun Song

    Abstract: Graph neural networks (GNNs) have been extensively employed in node classification. Nevertheless, recent studies indicate that GNNs are vulnerable to topological perturbations, such as adversarial attacks and edge disruptions. Considerable efforts have been devoted to mitigating these challenges. For example, pioneering Bayesian methodologies, including GraphSS and LlnDT, incorporate Bayesian labe… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  30. arXiv:2406.02542  [pdf, other

    cs.LG

    Loki: Low-Rank Keys for Efficient Sparse Attention

    Authors: Prajwal Singhania, Siddharth Singh, Shwai He, Soheil Feizi, Abhinav Bhatele

    Abstract: Inference on large language models can be expensive in terms of the compute and memory costs involved, especially when long sequence lengths are used. In particular, the self-attention mechanism used in such models contributes significantly to these costs, which has resulted in several recent works that propose sparse attention approximations for inference. In this work, we propose to approximate… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  31. arXiv:2406.02500  [pdf, other

    cs.LG cs.AI

    Demystifying the Compression of Mixture-of-Experts Through a Unified Framework

    Authors: Shwai He, Daize Dong, Liang Ding, Ang Li

    Abstract: Scaling large language models has revolutionized the performance across diverse domains, yet the continual growth in model size poses significant challenges for real-world deployment. The Mixture of Experts (MoE) approach addresses this by dynamically selecting and activating only a subset of experts, significantly reducing computational costs while maintaining high performance. However, MoE intro… ▽ More

    Submitted 24 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: 20 pages, 15 figures, 5 tables

  32. arXiv:2406.00492  [pdf, other

    eess.IV cs.CV cs.LG

    SAM-VMNet: Deep Neural Networks For Coronary Angiography Vessel Segmentation

    Authors: Xueying Zeng, Baixiang Huang, Yu Luo, Guangyu Wei, Songyan He, Yushuang Shao

    Abstract: Coronary artery disease (CAD) is one of the most prevalent diseases in the cardiovascular field and one of the major contributors to death worldwide. Computed Tomography Angiography (CTA) images are regarded as the authoritative standard for the diagnosis of coronary artery disease, and by performing vessel segmentation and stenosis detection on CTA images, physicians are able to diagnose coronary… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  33. arXiv:2405.20292  [pdf, other

    hep-th

    All-Loop Geometry for Four-Point Correlation Function

    Authors: Song He, Yu-tin Huang, Chia-Kai Kuo

    Abstract: In this letter, we consider a positive geometry conjectured to encode the loop integrand of four-point stress-energy correlators in planar $\mathcal{N}=4$ super Yang-Mills. Beginning with four lines in twistor space, we characterize a positive subspace to which an $\ell$-loop geometry is attached. The loop geometry then consists of $\ell$ lines in twistor space satisfying positivity conditions amo… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 7 pages + 2 figures

  34. arXiv:2405.19766  [pdf, other

    astro-ph.GA

    Hierarchical Structure and Self-gravity in the Maddalena Giant Molecular Cloud

    Authors: Renjie Shen, Yuehui Ma, Hongchi Wang, Suziye He, Miaomiao Zhang

    Abstract: In this work, we present the data from the Milky Way Imaging Scroll Painting (MWISP) project for the Maddalena giant molecular cloud (GMC). We decompose the 13CO emission datacube of the observed region into hierarchical substructures using a modified Dendrogram algorithm. We investigate the statistical properties of these substructures and examine the role that self-gravity plays on various spati… ▽ More

    Submitted 31 May, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  35. arXiv:2405.19499  [pdf, other

    cs.LG cs.MA math.OC

    Momentum for the Win: Collaborative Federated Reinforcement Learning across Heterogeneous Environments

    Authors: Han Wang, Sihong He, Zhili Zhang, Fei Miao, James Anderson

    Abstract: We explore a Federated Reinforcement Learning (FRL) problem where $N$ agents collaboratively learn a common policy without sharing their trajectory data. To date, existing FRL work has primarily focused on agents operating in the same or ``similar" environments. In contrast, our problem setup allows for arbitrarily large levels of environment heterogeneity. To obtain the optimal policy which maxim… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, 2024 Learning

  36. arXiv:2405.19323  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Are Large Language Models Chameleons?

    Authors: Mingmeng Geng, Sihong He, Roberto Trotta

    Abstract: Do large language models (LLMs) have their own worldviews and personality tendencies? Simulations in which an LLM was asked to answer subjective questions were conducted more than 1 million times. Comparison of the responses from different LLMs with real data from the European Social Survey (ESS) suggests that the effect of prompts on bias and variability is fundamental, highlighting major cultura… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 16 pages,8 figures

  37. arXiv:2405.19249  [pdf, ps, other

    math.AP

    Uniform Inviscid Damping and Inviscid Limit of the 2D Navier-Stokes equation with Navier Boundary Conditions

    Authors: Jacob Bedrossian, Siming He, Sameer Iyer, Fei Wang

    Abstract: We consider the 2D, incompressible Navier-Stokes equations near the Couette flow, $ω^{(NS)} = 1 + εω$, set on the channel $\mathbb{T} \times [-1, 1]$, supplemented with Navier boundary conditions on the perturbation, $ω|_{y = \pm 1} = 0$. We are simultaneously interested in two asymptotic regimes that are classical in hydrodynamic stability: the long time, $t \rightarrow \infty$, stability of back… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 157 pages

  38. arXiv:2405.19233  [pdf, ps, other

    math.AP

    Pseudo-Gevrey Smoothing for the Passive Scalar Equations near Couette

    Authors: Jacob Bedrossian, Siming He, Sameer Iyer, Fei Wang

    Abstract: In this article, we study the regularity theory for two linear equations that are important in fluid dynamics: the passive scalar equation for (time-varying) shear flows close to Couette in $\mathbb T \times [-1,1]$ with vanishing diffusivity $ν\to 0$ and the Poisson equation with right-hand side behaving in similar function spaces to such a passive scalar. The primary motivation for this work is… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 130 pages

  39. arXiv:2405.17460  [pdf

    cs.LG cs.AI cs.CV

    Investigation of Customized Medical Decision Algorithms Utilizing Graph Neural Networks

    Authors: Yafeng Yan, Shuyao He, Zhou Yu, Jiajie Yuan, Ziang Liu, Yan Chen

    Abstract: Aiming at the limitations of traditional medical decision system in processing large-scale heterogeneous medical data and realizing highly personalized recommendation, this paper introduces a personalized medical decision algorithm utilizing graph neural network (GNN). This research innovatively integrates graph neural network technology into the medical and health field, aiming to build a high-pr… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  40. arXiv:2405.15472  [pdf, other

    math.DS

    Stability Analysis of Biochemical Reaction Networks Linearly Conjugated to complex balanced Systems with Time Delays Added

    Authors: Xiaoyu Zhang, Shibo He, Chuanhou Gao, Denis Dochain

    Abstract: Linear conjugacy offers a new perspective to broaden the scope of stable biochemical reaction networks to the systems linearly conjugated to the well-established complex balanced mass action systems ($\ell$cCBMASs). This paper addresses the challenge posed by time delay, which can disrupt the linear conjugacy relationship and complicate stability analysis for delayed versions of $\ell$cCBMASs (D… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  41. arXiv:2405.15269  [pdf, other

    cs.CV cs.LG

    BDetCLIP: Multimodal Prompting Contrastive Test-Time Backdoor Detection

    Authors: Yuwei Niu, Shuo He, Qi Wei, Feng Liu, Lei Feng

    Abstract: Multimodal contrastive learning methods (e.g., CLIP) have shown impressive zero-shot classification performance due to their strong ability to joint representation learning for visual and textual modalities. However, recent research revealed that multimodal contrastive learning on poisoned pre-training data with a small proportion of maliciously backdoored data can induce backdoored CLIP that coul… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  42. arXiv:2405.09717  [pdf, other

    cs.CV

    From NeRFs to Gaussian Splats, and Back

    Authors: Siming He, Zach Osman, Pratik Chaudhari

    Abstract: For robotics applications where there is a limited number of (typically ego-centric) views, parametric representations such as neural radiance fields (NeRFs) generalize better than non-parametric ones such as Gaussian splatting (GS) to views that are very different from those in the training data; GS however can render much faster than NeRFs. We develop a procedure to convert back and forth betwee… ▽ More

    Submitted 10 June, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  43. arXiv:2405.07148  [pdf, other

    physics.flu-dyn cs.CE

    Investigate the efficiency of incompressible flow simulations on CPUs and GPUs with BSAMR

    Authors: Dewen Liu, Shuai He, Haoran Cheng, Yadong Zeng

    Abstract: Adaptive mesh refinement (AMR) is a classical technique about local refinement in space where needed, thus effectively reducing computational costs for HPC-based physics simulations. Although AMR has been used for many years, little reproducible research discusses the impact of software-based parameters on block-structured AMR (BSAMR) efficiency and how to choose them. This article primarily does… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: 22 pages include reference, 9 figures

  44. arXiv:2405.06929  [pdf, other

    cs.CV

    PRENet: A Plane-Fit Redundancy Encoding Point Cloud Sequence Network for Real-Time 3D Action Recognition

    Authors: Shenglin He, Xiaoyang Qu, Jiguang Wan, Guokuan Li, Changsheng Xie, Jianzong Wang

    Abstract: Recognizing human actions from point cloud sequence has attracted tremendous attention from both academia and industry due to its wide applications. However, most previous studies on point cloud action recognition typically require complex networks to extract intra-frame spatial features and inter-frame temporal features, resulting in an excessive number of redundant computations. This leads to hi… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: Accepted by the 2024 International Joint Conference on Neural Networks (IJCNN 2024)

  45. arXiv:2405.06048  [pdf, ps, other

    math.AP

    Time-dependent Flows and Their Applications in Parabolic-parabolic Patlak-Keller-Segel Systems Part II: Shear Flows

    Authors: Siming He

    Abstract: In this study, we investigate the behavior of three-dimensional parabolic-parabolic Patlak-Keller-Segel (PKS) systems in the presence of ambient shear flows. Our findings demonstrate that when the total mass of the cell density is below a specific threshold, the solution remains globally regular as long as the flow is sufficiently strong. The primary difficulty in our analysis stems from the fast… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  46. arXiv:2405.03465  [pdf, ps, other

    hep-th

    Geometric formulation of generalized root-$T\bar{T}$ deformations

    Authors: H. Babaei-Aghbolagh, Song He, Tommaso Morone, Hao Ouyang, Roberto Tateo

    Abstract: We develop a generic geometric formalism that incorporates both $T\bar{T}$-like and root-$T\bar{T}$-like deformations in arbitrary dimensions. This framework applies to a wide family of stress-energy tensor perturbations and encompasses various well-known field theories. Building upon the recently proposed correspondence between Ricci-based gravity and $T\bar{T}$-like deformations, we further exte… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 11 pages, no figures

  47. arXiv:2405.02562  [pdf, other

    math.AP

    Time-dependent Flows and Their Applications in Parabolic-parabolic Patlak-Keller-Segel Systems Part I: Alternating Flows

    Authors: Siming He

    Abstract: We consider the three-dimensional parabolic-parabolic Patlak-Keller-Segel equations (PKS) subject to ambient flows. Without the ambient fluid flow, the equation is super-critical in three-dimension and has finite-time blow-up solutions with arbitrarily small $L^1$-mass. In this study, we show that a family of time-dependent alternating shear flows, inspired by the clever ideas of Tarek Elgindi, ca… ▽ More

    Submitted 9 May, 2024; v1 submitted 4 May, 2024; originally announced May 2024.

    Comments: Corrected typos and added a citation

  48. arXiv:2405.01570  [pdf

    cond-mat.supr-con

    Superconductivity of Bulk Abnormal Magic-stoichiometric Na3Cl Salt Crystals at Normal Pressure

    Authors: Shuqiang He, Yi-Feng Zheng, Guosheng Shi, Yi-Jie Xiang, Meihui Xiao, Qituan Zhang, Yue-Yu Zhang, Haiping Fang

    Abstract: The identification of new materials with superconducting properties is the pursuit in the realm of superconductivity research. Here, excitedly, we show that the simplest salt daily used can be made a superconductor at normal pressure only by adjusting its stoichiometry of Na and Cl as Na3Cl at normal pressure based on first-principles calculations. This bulk stable abnormal Na-Cl stoichiometric cr… ▽ More

    Submitted 17 April, 2024; originally announced May 2024.

  49. arXiv:2405.01327  [pdf, other

    cs.LG

    Constrained Reinforcement Learning Under Model Mismatch

    Authors: Zhongchang Sun, Sihong He, Fei Miao, Shaofeng Zou

    Abstract: Existing studies on constrained reinforcement learning (RL) may obtain a well-performing policy in the training environment. However, when deployed in a real environment, it may easily violate constraints that were originally satisfied during training because there might be model mismatch between the training and real environments. To address the above challenge, we formulate the problem as constr… ▽ More

    Submitted 3 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  50. arXiv:2405.01255  [pdf, ps, other

    hep-th gr-qc

    Note on holographic torus stress tensor correlators in $AdS_3$ gravity

    Authors: Song He, Yi Li, Yun-Ze Li, Yunda Zhang

    Abstract: In the AdS$_3$/CFT$_2$ framework, the Euclidean BTZ black hole corresponds to the dominant high-temperature phase of its dual field theory. We initially employ perturbative methods to solve the Einstein equations as boundary value problems, providing correlators for the energy-momentum tensor operator at low points. Utilizing operator equations established in our previous work, we further compute… ▽ More

    Submitted 25 June, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: 46 pages