Skip to main content

Showing 1–50 of 421 results for author: Gao, T

  1. arXiv:2407.03114  [pdf, other

    quant-ph

    Strong quantum nonlocality without entanglement in every $(n-1)$-partition

    Authors: Huaqi Zhou, Ting Gao, Fengli Yan

    Abstract: Orthogonal product sets that are locally irreducible in every bipartition have the strongest nonlocality while also need a large number of quantum states. In this paper, we construct the orthogonal product sets with strong quantum nonlocality in any possible $n$-partite systems, where $n$ is greater than three. Rigorous proofs show that these sets are locally irreducible in every $(n-1)$-partition… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 13 pages, 6 figures

  2. arXiv:2407.01548  [pdf, ps, other

    q-bio.OT cs.AI cs.LG

    From Cognition to Computation: A Comparative Review of Human Attention and Transformer Architectures

    Authors: Minglu Zhao, Dehong Xu, Tao Gao

    Abstract: Attention is a cornerstone of human cognition that facilitates the efficient extraction of information in everyday life. Recent developments in artificial intelligence like the Transformer architecture also incorporate the idea of attention in model designs. However, despite the shared fundamental principle of selectively attending to information, human attention and the Transformer model display… ▽ More

    Submitted 25 April, 2024; originally announced July 2024.

  3. arXiv:2406.19247  [pdf, other

    cs.CV

    Local Manifold Learning for No-Reference Image Quality Assessment

    Authors: Timin Gao, Wensheng Pan, Yan Zhang, Sicheng Zhao, Shengchuan Zhang, Xiawu Zheng, Ke Li, Liujuan Cao, Rongrong Ji

    Abstract: Contrastive learning has considerably advanced the field of Image Quality Assessment (IQA), emerging as a widely adopted technique. The core mechanism of contrastive learning involves minimizing the distance between quality-similar (positive) examples while maximizing the distance between quality-dissimilar (negative) examples. Despite its successes, current contrastive learning methods often negl… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  4. arXiv:2406.19080  [pdf, other

    quant-ph

    G_q-concurrence and entanglement constraints in multiqubit systems

    Authors: Hui Li, Ting Gao, Fengli Yan

    Abstract: In this paper, we introduce a category of one-parameter bipartite entanglement quantifiers, termed $G_q$-concurrence ($q>1$), and show rigorously that they satisfy all the axiomatic conditions of an entanglement measure and can be considered as a generalization of concurrence. In addition, we establish an analytic formula relating $G_q$-concurrence to concurrence for $1<q\leq2$ in two-qubit system… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 11 pages, 7 figures

  5. arXiv:2406.15686  [pdf, other

    cs.CR cs.NI

    The Case for Transport-Level Encryption in Datacenter Networks

    Authors: Tianyi Gao, Xinshu Ma, Suhas Narreddy, Eugenio Luo, Steven W. D. Chien, Michio Honda

    Abstract: Cloud applications need network data encryption to isolate from other tenants and protect their data from potential eavesdroppers in the network infrastructure. This paper presents SDP, a protocol design for emerging datacenter transport protocols, such as pHost, NDP, and Homa, to integrate data encryption with the use of existing NIC offloading of cryptographic operations designed for TLS over TC… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  6. arXiv:2406.10462  [pdf, other

    cs.CV

    CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation

    Authors: Wei Chen, Lin Li, Yongqi Yang, Bin Wen, Fan Yang, Tingting Gao, Yu Wu, Long Chen

    Abstract: Interleaved image-text generation has emerged as a crucial multimodal task, aiming at creating sequences of interleaved visual and textual content given a query. Despite notable advancements in recent multimodal large language models (MLLMs), generating integrated image-text sequences that exhibit narrative coherence and entity and style consistency remains challenging due to poor training data qu… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 22 pages

  7. arXiv:2405.14705  [pdf, other

    cs.CV

    Learning Multi-dimensional Human Preference for Text-to-Image Generation

    Authors: Sixian Zhang, Bohan Wang, Junqiang Wu, Yan Li, Tingting Gao, Di Zhang, Zhongyuan Wang

    Abstract: Current metrics for text-to-image models typically rely on statistical metrics which inadequately represent the real preference of humans. Although recent work attempts to learn these preferences via human annotated images, they reduce the rich tapestry of human preference to a single overall score. However, the preference results vary when humans evaluate images with different aspects. Therefore,… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  8. arXiv:2405.09394  [pdf, other

    cs.LG cs.DC

    SA-FedLora: Adaptive Parameter Allocation for Efficient Federated Learning with LoRA Tuning

    Authors: Yuning Yang, Xiaohong Liu, Tianrun Gao, Xiaodong Xu, Guangyu Wang

    Abstract: Fine-tuning large-scale pre-trained models via transfer learning is an emerging important paradigm for a wide range of downstream tasks, with performance heavily reliant on extensive data. Federated learning (FL), as a distributed framework, provides a secure solution to train models on local datasets while safeguarding raw sensitive data. However, FL networks encounter high communication costs du… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  9. arXiv:2405.08856  [pdf, ps, other

    hep-ph hep-ex nucl-th

    Chiral properties of the nucleon interpolating current and $θ$-dependent observables

    Authors: Yohei Ema, Ting Gao, Maxim Pospelov, Adam Ritz

    Abstract: We revisit the chiral properties of nucleon interpolating currents, and show that of the two leading order currents $j_1$ and $j_2$, only two linear combinations $j_1\pm j_2$ transform covariantly under the anomalous $U(1)_A$ symmetry. As a result, calculations of quantities which vanish by symmetry in the chiral limit may produce unphysical results if carried out with different linear combination… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 10 pages

    Report number: UMN-TH-4319/24, FTPI-MINN-24-10

  10. arXiv:2405.07518  [pdf, other

    cs.AR cs.AI

    SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts

    Authors: Raghu Prabhakar, Ram Sivaramakrishnan, Darshan Gandhi, Yun Du, Mingran Wang, Xiangyu Song, Kejie Zhang, Tianren Gao, Angela Wang, Karen Li, Yongning Sheng, Joshua Brot, Denis Sokolov, Apurv Vivek, Calvin Leung, Arjun Sabnis, Jiayu Bai, Tuowen Zhao, Mark Gottscho, David Jackson, Mark Luttrell, Manish K. Shah, Edison Chen, Kaizhao Liang, Swayambhoo Jain , et al. (5 additional authors not shown)

    Abstract: Monolithic large language models (LLMs) like GPT-4 have paved the way for modern generative AI applications. Training, serving, and maintaining monolithic LLMs at scale, however, remains prohibitively expensive and challenging. The disproportionate increase in compute-to-memory ratio of modern AI accelerators have created a memory wall, necessitating new methods to deploy AI. Composition of Expert… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  11. arXiv:2405.03163  [pdf

    cond-mat.mtrl-sci cond-mat.other

    Magnetic Ordering of Ammonium Cations in NH$_4$I, NH$_4$Br and NH$_4$Cl

    Authors: Fei Yen, Lei Meng, Tian Gao, Sixia Hu

    Abstract: The different types of magnetism arise mainly from how electrons move and interact with each other. In this work, we show how protons (H$^+$) also exhibit magnetic behavior. We measured the magnetic susceptibility of the ammonium halides and identified pronounced increases at 232 K, 233 K and 243 K for NH$_4$I, NH$_4$Br and NH$_4$Cl, respectively, which all coincide to the geometric ordering of it… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Manuscript + Supporting Information file (19 + 4 pages, 5 + 3 figures). Sorry for not uploading this back in 2020!

    Journal ref: J. Phys. Chem. C 123, 23655-23660 (2019)

  12. arXiv:2404.19525  [pdf, other

    cs.CV

    MicroDreamer: Zero-shot 3D Generation in $\sim$20 Seconds by Score-based Iterative Reconstruction

    Authors: Luxi Chen, Zhengyi Wang, Zihan Zhou, Tingting Gao, Hang Su, Jun Zhu, Chongxuan Li

    Abstract: Optimization-based approaches, such as score distillation sampling (SDS), show promise in zero-shot 3D generation but suffer from low efficiency, primarily due to the high number of function evaluations (NFEs) required for each sample. In this paper, we introduce score-based iterative reconstruction (SIR), an efficient and general algorithm mimicking a differentiable 3D reconstruction process to r… ▽ More

    Submitted 27 May, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

  13. arXiv:2404.19412  [pdf

    cs.RO eess.SY

    Enhancing Robotic Adaptability: Integrating Unsupervised Trajectory Segmentation and Conditional ProMPs for Dynamic Learning Environments

    Authors: Tianci Gao

    Abstract: We propose a novel framework for enhancing robotic adaptability and learning efficiency, which integrates unsupervised trajectory segmentation with adaptive probabilistic movement primitives (ProMPs). By employing a cutting-edge deep learning architecture that combines autoencoders and Recurrent Neural Networks (RNNs), our approach autonomously pinpoints critical transitional points in continuous,… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  14. arXiv:2404.19377  [pdf

    cond-mat.mes-hall

    Toroidic phase transitions in a direct-kagome artificial spin ice

    Authors: Wen-Cheng Yue, Zixiong Yuan, Peiyuan Huang, Yizhe Sun, Tan Gao, Yang-Yang Lyu, Xuecou Tu, Sining Dong, Liang He, Ying Dong, Xun Cao, Lin Kang, Huabing Wang, Peiheng Wu, Cristiano Nisoli, Yong-Lei Wang

    Abstract: Ferrotoroidicity, the fourth form of primary ferroic order, breaks both space and time inversion symmetry. So far, direct observation of ferrotoroidicity in natural materials remains elusive, which impedes the exploration of ferrotoroidic phase transitions. Here, we overcome the limitations of natural materials using an artificial nanomagnet system that can be characterized at the constituent leve… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Journal ref: Nature Nanotechnology (2024)

  15. arXiv:2404.16033  [pdf, other

    cs.CV cs.CL

    Cantor: Inspiring Multimodal Chain-of-Thought of MLLM

    Authors: Timin Gao, Peixian Chen, Mengdan Zhang, Chaoyou Fu, Yunhang Shen, Yan Zhang, Shengchuan Zhang, Xiawu Zheng, Xing Sun, Liujuan Cao, Rongrong Ji

    Abstract: With the advent of large language models(LLMs) enhanced by the chain-of-thought(CoT) methodology, visual reasoning problem is usually decomposed into manageable sub-tasks and tackled sequentially with various external tools. However, such a paradigm faces the challenge of the potential "determining hallucinations" in decision-making due to insufficient visual information and the limitation of low-… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: The project page is available at https://ggg0919.github.io/cantor/

  16. arXiv:2404.15013  [pdf, other

    quant-ph

    Quantifying multipartite quantum states by ($k+1$)-partite entanglement measures

    Authors: Hui Li, Ting Gao, Fengli Yan

    Abstract: In this paper, we investigate how to quantify the quantum states of $n$-particles from the point of $(k+1)$-partite entanglement $(1\leq k\leq n-1)$, which plays an instrumental role in quantum nonlocality and quantum metrology. We put forward two families of entanglement measures termed $q$-$(k+1)$-PE concurrence $(q>1)$ and $α$-$(k+1)$-PE concurrence $(0\leqα<1)$, respectively. As far as the pur… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 10 pages,2 figures

  17. arXiv:2404.14949  [pdf, other

    cs.CV

    Multi-Modal Prompt Learning on Blind Image Quality Assessment

    Authors: Wensheng Pan, Timin Gao, Yan Zhang, Runze Hu, Xiawu Zheng, Enwei Zhang, Yuting Gao, Yutao Liu, Yunhang Shen, Ke Li, Shengchuan Zhang, Liujuan Cao, Rongrong Ji

    Abstract: Image Quality Assessment (IQA) models benefit significantly from semantic information, which allows them to treat different types of objects distinctly. Currently, leveraging semantic information to enhance IQA is a crucial research direction. Traditional methods, hindered by a lack of sufficiently annotated data, have employed the CLIP image-text pretraining model as their backbone to gain semant… ▽ More

    Submitted 18 May, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

  18. arXiv:2404.07458  [pdf, other

    physics.plasm-ph

    I-mode Plasma Confinement Improvement by Real-time Lithium Injection and its Classification on EAST Tokamak

    Authors: X. M. Zhong, X. L. Zou, A. D. Liu, Y. T. Song, G. Zhuang, H. Q. Liu, L. Q. Xu, E. Z. Li, B. Zhang, G. Z. Zuo, Z. Wang, C. Zhou, J. Zhang, W. X. Shi, L. T. Gao, S. F. Wang, W. Gao, T. Q. Jia, Q. Zang, H. L. Zhao, M. Wang, H. D. Xu, X. J. Wang, X. Gao, X. D. Lin , et al. (3 additional authors not shown)

    Abstract: I-mode is a promising regime for future fusion reactors due to the high energy confinement and the moderate particle confinement. However, the effect of lithium, which has been widely applied for particle recycling and impurity control, on I-mode plasma is still unclear. Recently, experiments of real-time lithium powder injection on I-mode plasma have been carried out in EAST Tokamak. It was found… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  19. arXiv:2404.01960  [pdf, ps, other

    quant-ph

    $(n,m,p)$-type quantum network configuration and its nonlocality

    Authors: Zan-Jia Li, Ying-Qiu He, Dong Ding, Ming-Xing Yu, Ting Gao, Feng-Li Yan

    Abstract: A quantum network shared entangled sources among distant nodes enables us to distribute entanglement along the network by suitable measurements. Network nonlocality means that it does not admit a network model involving local variables emitted from independent sources. In this work, we construct an $(n,m,p)$-type quantum network configuration and then derive the corresponding $n$-local correlation… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 7 pages, 2 figures

  20. arXiv:2403.15538  [pdf, other

    hep-ph hep-th

    Momentum shift and on-shell constructible massive amplitudes

    Authors: Yohei Ema, Ting Gao, Wenqi Ke, Zhen Liu, Kun-Feng Lyu, Ishmam Mahbub

    Abstract: We construct tree-level amplitude for massive particles using on-shell recursion relations based on two classes of momentum shifts: an all-line transverse shift that deforms momentum by its transverse polarization vector, and a massive BCFW-type shift. We illustrate that these shifts allow us to correctly calculate four-point and five-point amplitudes in massive QED, without an ambiguity associate… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 26 pages, 1 figure and comments are welcome

    Report number: UMN-TH-4316/24, FTPI-MINN-24-07

  21. arXiv:2403.15133  [pdf, other

    physics.optics

    Observation of sub-Poissonian correlation in spin-orbit coupled polariton vortex pairs at room temperature

    Authors: Xiaokun Zhai, Ying Gao, Xuekai Ma, Chunzi Xing, Xiao Wang, Anlian Pan, Marc Assmann, Stefan Schumacher, Tingge Gao

    Abstract: Coupling of orbital and spin degrees of freedom gives rise to intriguing physical phenomena in bosonic condensates, such as formation of stripe phases and domains with vortex arrays. However, the robust locking of spin and orbital degrees of freedom of the nonlinear topological objects such as vortex pairs with sub-Poissonian fluctuation in bosonic condensates remains challenging. In the present w… ▽ More

    Submitted 5 June, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

  22. arXiv:2403.11091  [pdf, other

    cs.SD cs.CV eess.AS

    Multitask frame-level learning for few-shot sound event detection

    Authors: Liang Zou, Genwei Yan, Ruoyu Wang, Jun Du, Meng Lei, Tian Gao, Xin Fang

    Abstract: This paper focuses on few-shot Sound Event Detection (SED), which aims to automatically recognize and classify sound events with limited samples. However, prevailing methods methods in few-shot SED predominantly rely on segment-level predictions, which often providing detailed, fine-grained predictions, particularly for events of brief duration. Although frame-level prediction strategies have been… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 6 pages, 4 figures, conference

  23. arXiv:2403.10405  [pdf, other

    math.DS

    Action Functional as an Early Warning Indicator in the Space of Probability Measures via Schrödinger Bridge

    Authors: Peng Zhang, Ting Gao, Jin Guo, Jinqiao Duan

    Abstract: Critical transition and tipping phenomena between two meta-stable states in stochastic dynamical systems represents an important problem. In this work, we expand the methodology from the traditional Onsager-Machlup action functional, which typically identifies the most probable transition pathway between two meta-stable states, to investigate the evolutionary transition dynamics between two meta-s… ▽ More

    Submitted 8 May, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: 18pages

  24. arXiv:2403.07420  [pdf, other

    cs.CV

    DragAnything: Motion Control for Anything using Entity Representation

    Authors: Weijia Wu, Zhuang Li, Yuchao Gu, Rui Zhao, Yefei He, David Junhao Zhang, Mike Zheng Shou, Yan Li, Tingting Gao, Di Zhang

    Abstract: We introduce DragAnything, which utilizes a entity representation to achieve motion control for any object in controllable video generation. Comparison to existing motion control methods, DragAnything offers several advantages. Firstly, trajectory-based is more userfriendly for interaction, when acquiring other guidance signals (e.g., masks, depth maps) is labor-intensive. Users only need to draw… ▽ More

    Submitted 15 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: The project website is at: https://weijiawu.github.io/draganything_page/ . The code is at: https://github.com/showlab/DragAnything

  25. arXiv:2403.01391  [pdf, other

    quant-ph

    Planar two-region multi-partite maximally entangled states

    Authors: Yanwen Liang, Fengli Yan, Ting Gao

    Abstract: In entanglement theory, there are different methods to consider one state being more entangled than another. The "maximally" entangled states in a multipartite system can be defined from an axiomatic perspective. According to different criteria for selection, there are many specific types of quantum maximally entangled states, such as absolutely maximally entangled state, planar maximally entangle… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 12 pages, 8 figures

  26. arXiv:2403.00929  [pdf, other

    cs.RO cs.AI cs.LG

    PRIME: Scaffolding Manipulation Tasks with Behavior Primitives for Data-Efficient Imitation Learning

    Authors: Tian Gao, Soroush Nasiriany, Huihan Liu, Quantao Yang, Yuke Zhu

    Abstract: Imitation learning has shown great potential for enabling robots to acquire complex manipulation behaviors. However, these algorithms suffer from high sample complexity in long-horizon tasks, where compounding errors accumulate over the task horizons. We present PRIME (PRimitive-based IMitation with data Efficiency), a behavior primitive-based framework designed for improving the data efficiency o… ▽ More

    Submitted 10 March, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

  27. arXiv:2402.16617  [pdf, other

    cs.CL

    Long-Context Language Modeling with Parallel Context Encoding

    Authors: Howard Yen, Tianyu Gao, Danqi Chen

    Abstract: Extending large language models (LLMs) to process longer inputs is crucial for a wide range of applications. However, the substantial computational cost of transformers and limited generalization of positional encoding restrict the size of their context window. We introduce Context Expansion with Parallel Encoding (CEPE), a framework that can be applied to any existing decoder-only LLMs to extend… ▽ More

    Submitted 11 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: ACL 2024. Code, models, and data are available at https://github.com/princeton-nlp/CEPE. arXiv admin note: text overlap with arXiv:1912.01214 by other authors

  28. arXiv:2402.14073  [pdf, other

    cs.CL cs.CV cs.LG

    Improving Language Understanding from Screenshots

    Authors: Tianyu Gao, Zirui Wang, Adithya Bhaskar, Danqi Chen

    Abstract: An emerging family of language models (LMs), capable of processing both text and images within a single visual view, has the promise to unlock complex tasks such as chart understanding and UI navigation. We refer to these models as screenshot language models. Despite their appeal, existing screenshot LMs substantially lag behind text-only models on language understanding tasks. To close this gap,… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Our model and code are available at https://github.com/princeton-nlp/PTP

  29. arXiv:2402.04111  [pdf, ps, other

    cs.IT

    Vector Approximate Message Passing With Arbitrary I.I.D. Noise Priors

    Authors: Mohamed Akrout, Tiancheng Gao, Faouzi Bellili, Amine Mezghani

    Abstract: Approximate message passing (AMP) algorithms are devised under the Gaussianity assumption of the measurement noise vector. In this work, we relax this assumption within the vector AMP (VAMP) framework to arbitrary independent and identically distributed (i.i.d.) noise priors. We do so by rederiving the linear minimum mean square error (LMMSE) to accommodate both the noise and signal estimations wi… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Accepted to the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  30. arXiv:2402.02462  [pdf, ps, other

    quant-ph

    Quantum teleportation based on the elegant joint measurement

    Authors: Dong Ding, Ming-Xing Yu, Ying-Qiu He, Hao-Sen Ji, Ting Gao, Feng-Li Yan

    Abstract: As a generalization of the well-known Bell state measurement (BSM), the elegant joint measurement (EJM) is a kind of novel two-qubit joint measurement, parameterized by a subtle phase factor $θ\in [0,π/2]$. We explore quantum teleportation based on the EJM, inspired by Gisin's idea that quantum entanglement not only provides quantum channel and also quantum joint measurement for quantum teleportat… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 8 pages, 3 figures

  31. arXiv:2402.01168  [pdf, other

    hep-ex

    Measurement of transverse polarization of $Λ$/$\barΛ$ within jet in $pp$ collisions at STAR

    Authors: Taoya Gao

    Abstract: Spontaneous polarization of $Λ/\barΛ$ in unpolarized hadron interactions has been observed experimentally for nearly half a century and still eludes a definitive explanation. One possible origin is the effect arising from polarizing fragmentation functions (pFFs), which describe the production of polarized hadrons from the fragmentation of an unpolarized parton. Recently, significant transverse po… ▽ More

    Submitted 5 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: SPIN2023 proceeding

  32. arXiv:2402.00987  [pdf, other

    cs.LG

    Self-Supervised Contrastive Pre-Training for Multivariate Point Processes

    Authors: Xiao Shou, Dharmashankar Subramanian, Debarun Bhattacharjya, Tian Gao, Kristin P. Bennet

    Abstract: Self-supervision is one of the hallmarks of representation learning in the increasingly popular suite of foundation models including large language models such as BERT and GPT-3, but it has not been pursued in the context of multivariate event streams, to the best of our knowledge. We introduce a new paradigm for self-supervised learning for multivariate point processes using a transformer encoder… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  33. arXiv:2402.00330  [pdf, other

    cs.RO

    Night-Rider: Nocturnal Vision-aided Localization in Streetlight Maps Using Invariant Extended Kalman Filtering

    Authors: Tianxiao Gao, Mingle Zhao, Chengzhong Xu, Hui Kong

    Abstract: Vision-aided localization for low-cost mobile robots in diverse environments has attracted widespread attention recently. Although many current systems are applicable in daytime environments, nocturnal visual localization is still an open problem owing to the lack of stable visual information. An insight from most nocturnal scenes is that the static and bright streetlights are reliable visual info… ▽ More

    Submitted 3 March, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

  34. arXiv:2401.16254  [pdf, ps, other

    physics.ao-ph

    YingLong: Skillful High Resolution Regional Short Term Forecasting with Boundary Smoothing

    Authors: Pengbo Xu, Tianyan Gao, Yu Wang, Junping Yin, Juan Zhang, Xiaogu Zheng, Zhimin Zhang, Xiaoguang Hu, Xiaoxu Chen

    Abstract: In the realm of numerical weather forecasting, achieving higher resolution demands increased computational resources and time investment, and leveraging deep learning networks trained solely on data significantly reduces the time expenditure during forecasting. Recently, several global forecasting artificial-intelligence-based models are developed, which are mainly trained on reanalysis dataset wi… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  35. arXiv:2401.06428  [pdf, ps, other

    nucl-ex

    First Exploration of Monopole-Driven Shell Evolution above the N = 126 shell closure: new Millisecond Isomers in 213Tl and 215Tl

    Authors: T. T. Yeung, A. I. Morales, J. Wu, M. Liu, C. Yuan, S. Nishimura, V. H. Phong, N. Fukuda, J. L. Tain, T. Davinson, K. P. Rykaczewski, R. Yokoyama, T. Isobe, M. Niikura, Zs. Podolyak, G. Alcala, A. Algora, J. Agramunt, C. Appleton, H. Baba, R. Caballero-Folch, P. Calvino, M. P. Carpenter, I. Dillmann, A. Estrade , et al. (30 additional authors not shown)

    Abstract: Isomer spectroscopy of heavy neutron-rich nuclei beyond the N=126 closed shell has been performed for the first time at the Radioactive Isotope Beam Factory of the RIKEN Nishina Center. New millisecond isomers have been identified at low excitation energies, 985.3(19) keV in 213Tl and 874(5) keV in 215Tl. The measured half-lives of 1.34(5) ms in 213Tl and 3.0(3) ms in 215Tl suggest spins and parit… ▽ More

    Submitted 25 April, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

    Comments: 9 pages, 3 figures, 1 table

  36. arXiv:2401.03625  [pdf, other

    physics.optics cond-mat.quant-gas

    Optically controllable localization of exciton polariton condensates in a potential lattice

    Authors: Qiang Ai, Jan Wingenbach, Xinmiao Yang, Jing Wei, Zaharias Hatzopoulos, Pavlos G. Savvidis, Stefan Schumacher, Xuekai Ma, Tingge Gao

    Abstract: Exciton polaritons are inherently non-Hermitian systems with adjustable gain and loss coefficients. In this work we show that exciton polariton condensates can be selectively localized in an optically-induced lattice with equal potential depth by judiciously controlling a second focused pump with a very small size. Specifically, the localized polariton condensate can be tuned among different poten… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

  37. arXiv:2401.02311  [pdf, other

    math.DS

    Fourier neural operator based fluid-structure interaction for predicting the vesicle dynamics

    Authors: Wang Xiao, Ting Gao, Kai Liu, Jinqiao Duan, Meng Zhao

    Abstract: Solving complex fluid-structure interaction (FSI) problems, characterized by nonlinear partial differential equations, is crucial in various scientific and engineering applications. Traditional computational fluid dynamics (CFD) solvers are insufficient to meet the growing requirements for large-scale and long-period simulations. Fortunately, the rapid advancement in neural networks, especially ne… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  38. arXiv:2401.01184  [pdf

    astro-ph.SR astro-ph.EP physics.atom-ph physics.chem-ph

    A high resolution rovibronic molecular cross-section of MgH+ molecular cation

    Authors: Huagang Xiao, Tao Gao

    Abstract: The high resolution rovibronic line list of MgH+ molecular cation are presented in our work. The potential energy curves are calculated by the method of multireference configuration interaction plus Davidson correction (MRCI+Q) and spin-orbit coupling (SOC) effect. Spectroscopy constants are fitted and the results are in good agreement with the experiment, ensuring the accuracy of the electronic s… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

  39. arXiv:2401.01065  [pdf, other

    cs.CV cs.AI

    BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous Driving

    Authors: Tao Tang, Dafeng Wei, Zhengyu Jia, Tian Gao, Changwei Cai, Chengkai Hou, Peng Jia, Kun Zhan, Haiyang Sun, Jingchen Fan, Yixing Zhao, Fu Liu, Xiaodan Liang, Xianpeng Lang, Yang Wang

    Abstract: The rapid development of the autonomous driving industry has led to a significant accumulation of autonomous driving data. Consequently, there comes a growing demand for retrieving data to provide specialized optimization. However, directly applying previous image retrieval methods faces several challenges, such as the lack of global feature representation and inadequate text retrieval ability for… ▽ More

    Submitted 18 June, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

  40. arXiv:2401.01014  [pdf, other

    quant-ph

    Entanglement hierarchies in multipartite scenarios

    Authors: Hui Li, Ting Gao, Fengli Yan

    Abstract: In this paper, we investigate the hierarchical structure of the $n$-partite quantum states. We present a whole set of hierarchical quantifications as a method of characterizing quantum states, which go beyond genuine multipartite entanglement measures and allow for fine identification among distinct entanglement contributions. This kind of quantifications, termed $k$-GM concurrence, can unambiguou… ▽ More

    Submitted 23 May, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

    Comments: 10 pages, 3 figures

  41. arXiv:2401.00744  [pdf, other

    physics.comp-ph cond-mat.mtrl-sci cs.LG

    Towards Harmonization of SO(3)-Equivariance and Expressiveness: a Hybrid Deep Learning Framework for Electronic-Structure Hamiltonian Prediction

    Authors: Shi Yin, Xinyang Pan, Xudong Zhu, Tianyu Gao, Haochong Zhang, Feng Wu, Lixin He

    Abstract: Deep learning for predicting the electronic-structure Hamiltonian of quantum systems necessitates satisfying the covariance laws, among which achieving SO(3)-equivariance without sacrificing the non-linear expressive capability of networks remains unsolved. To navigate the harmonization between equivariance and expressiveness, we propose a deep learning method synergizing two distinct categories o… ▽ More

    Submitted 21 June, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

  42. arXiv:2312.12844  [pdf, other

    cs.LG cs.AI stat.ME

    Effective Causal Discovery under Identifiable Heteroscedastic Noise Model

    Authors: Naiyu Yin, Tian Gao, Yue Yu, Qiang Ji

    Abstract: Capturing the underlying structural causal relations represented by Directed Acyclic Graphs (DAGs) has been a fundamental task in various AI disciplines. Causal DAG learning via the continuous optimization framework has recently achieved promising performance in terms of both accuracy and efficiency. However, most methods make strong assumptions of homoscedastic noise, i.e., exogenous noises have… ▽ More

    Submitted 9 June, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

  43. arXiv:2312.10593  [pdf, other

    cs.CR eess.SP

    A Novel RFID Authentication Protocol Based on A Block-Order-Modulus Variable Matrix Encryption Algorithm

    Authors: Yan Wang, Ruiqi Liu, Tong Gao, Feng Shu, Xuemei Lei, Guan Gui, Jiangzhou Wang

    Abstract: In this paper, authentication for mobile radio frequency identification (RFID) systems with low-cost tags is studied. Firstly, an adaptive modulus (AM) encryption algorithm is proposed. Subsequently, in order to enhance the security without additional storage of new key matrices, a self-updating encryption order (SUEO) algorithm is designed. Furthermore, a diagonal block local transpose key matrix… ▽ More

    Submitted 9 May, 2024; v1 submitted 16 December, 2023; originally announced December 2023.

  44. arXiv:2312.07849  [pdf, other

    cs.CV

    Encoder-minimal and Decoder-minimal Framework for Remote Sensing Image Dehazing

    Authors: Yuanbo Wen, Tao Gao, Ziqi Li, Jing Zhang, Ting Chen

    Abstract: Haze obscures remote sensing images, hindering valuable information extraction. To this end, we propose RSHazeNet, an encoder-minimal and decoder-minimal framework for efficient remote sensing image dehazing. Specifically, regarding the process of merging features within the same level, we develop an innovative module called intra-level transposed fusion module (ITFM). This module employs adaptive… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  45. arXiv:2312.06158  [pdf, other

    cs.CV

    Adaptive Feature Selection for No-Reference Image Quality Assessment by Mitigating Semantic Noise Sensitivity

    Authors: Xudong Li, Timin Gao, Runze Hu, Yan Zhang, Shengchuan Zhang, Xiawu Zheng, Jingyuan Zheng, Yunhang Shen, Ke Li, Yutao Liu, Pingyang Dai, Rongrong Ji

    Abstract: The current state-of-the-art No-Reference Image Quality Assessment (NR-IQA) methods typically rely on feature extraction from upstream semantic backbone networks, assuming that all extracted features are relevant. However, we make a key observation that not all features are beneficial, and some may even be harmful, necessitating careful selection. Empirically, we find that many image pairs with sm… ▽ More

    Submitted 26 May, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  46. arXiv:2312.00962  [pdf, other

    cs.RO

    MBot: A Modular Ecosystem for Scalable Robotics Education

    Authors: Peter Gaskell, Jana Pavlasek, Tom Gao, Abhishek Narula, Stanley Lewis, Odest Chadwicke Jenkins

    Abstract: The Michigan Robotics MBot is a low-cost mobile robot platform that has been used to train over 1,400 students in autonomous navigation since 2014 at the University of Michigan and our collaborating colleges. The MBot platform was designed to meet the needs of teaching robotics at scale to match the growth of robotics as a field and an academic discipline. Transformative advancements in robot navi… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  47. arXiv:2311.14294  [pdf, other

    cs.CV

    Decouple Content and Motion for Conditional Image-to-Video Generation

    Authors: Cuifeng Shen, Yulu Gan, Chen Chen, Xiongwei Zhu, Lele Cheng, Tingting Gao, Jinzhi Wang

    Abstract: The goal of conditional image-to-video (cI2V) generation is to create a believable new video by beginning with the condition, i.e., one image and text.The previous cI2V generation methods conventionally perform in RGB pixel space, with limitations in modeling motion consistency and visual continuity. Additionally, the efficiency of generating videos in pixel space is quite low.In this paper, we pr… ▽ More

    Submitted 14 December, 2023; v1 submitted 24 November, 2023; originally announced November 2023.

  48. arXiv:2311.14284  [pdf, other

    cs.CV

    Paragraph-to-Image Generation with Information-Enriched Diffusion Model

    Authors: Weijia Wu, Zhuang Li, Yefei He, Mike Zheng Shou, Chunhua Shen, Lele Cheng, Yan Li, Tingting Gao, Di Zhang, Zhongyuan Wang

    Abstract: Text-to-image (T2I) models have recently experienced rapid development, achieving astonishing performance in terms of fidelity and textual alignment capabilities. However, given a long paragraph (up to 512 words), these generation models still struggle to achieve strong alignment and are unable to generate images depicting complex scenes. In this paper, we introduce an information-enriched diffusi… ▽ More

    Submitted 29 November, 2023; v1 submitted 24 November, 2023; originally announced November 2023.

    Comments: The project website is at: https://weijiawu.github.io/ParaDiffusionPage/. Code: https://github.com/weijiawu/ParaDiffusion

  49. arXiv:2311.12320  [pdf, other

    cs.AI

    A Survey on Multimodal Large Language Models for Autonomous Driving

    Authors: Can Cui, Yunsheng Ma, Xu Cao, Wenqian Ye, Yang Zhou, Kaizhao Liang, Jintai Chen, Juanwu Lu, Zichong Yang, Kuei-Da Liao, Tianren Gao, Erlong Li, Kun Tang, Zhipeng Cao, Tong Zhou, Ao Liu, Xinrui Yan, Shuqi Mei, Jianguo Cao, Ziran Wang, Chao Zheng

    Abstract: With the emergence of Large Language Models (LLMs) and Vision Foundation Models (VFMs), multimodal AI systems benefiting from large models have the potential to equally perceive the real world, make decisions, and control tools as humans. In recent months, LLMs have shown widespread attention in autonomous driving and map systems. Despite its immense potential, there is still a lack of a comprehen… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  50. arXiv:2311.09539  [pdf, ps, other

    quant-ph

    Entanglement constraint on wave-particle duality for tripartite systems

    Authors: Zanjia Li, Yingqiu He, Dong Ding, Ting Gao, Fengli Yan

    Abstract: A global multi-partite entanglement may place a constraint on the wave-particle duality. We investigate this constraint relation of the global entanglement and the quantitative wave-particle duality in tripartite systems. We perform quantum state tomography to reconstruct the reduced density matrix by using the OriginQ quantum computing cloud platform. As a result, we show that, theoretically and… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 10 pages, 3 figures