Skip to main content

Showing 1–50 of 363 results for author: Nie, Y

  1. arXiv:2407.05840  [pdf, other

    cs.ET physics.optics

    A 103-TOPS/mm$^2$ Integrated Photonic Computing Engine Enabling Next-Generation Reservoir Computing

    Authors: Dongliang Wang, Yikun Nie, Gaolei Hu, Hon Ki Tsang, Chaoran Huang

    Abstract: Reservoir computing (RC) is a leading machine learning algorithm for information processing due to its rich expressiveness. A new RC paradigm has recently emerged, showcasing superior performance and delivering more interpretable results with shorter training data sets and training times, representing the next generation of RC computing. This work presents the first realization of a high-speed nex… ▽ More

    Submitted 31 May, 2024; originally announced July 2024.

  2. arXiv:2407.04277  [pdf, other

    cs.CV

    Research, Applications and Prospects of Event-Based Pedestrian Detection: A Survey

    Authors: Han Wang, Yuman Nie, Yun Li, Hongjie Liu, Min Liu, Wen Cheng, Yaoxiong Wang

    Abstract: Event-based cameras, inspired by the biological retina, have evolved into cutting-edge sensors distinguished by their minimal power requirements, negligible latency, superior temporal resolution, and expansive dynamic range. At present, cameras used for pedestrian detection are mainly frame-based imaging sensors, which have suffered from lethargic response times and hefty data redundancy. In contr… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  3. arXiv:2407.02301  [pdf, other

    cs.CL

    CFinBench: A Comprehensive Chinese Financial Benchmark for Large Language Models

    Authors: Ying Nie, Binwei Yan, Tianyu Guo, Hao Liu, Haoyu Wang, Wei He, Binfan Zheng, Weihao Wang, Qiang Li, Weijian Sun, Yunhe Wang, Dacheng Tao

    Abstract: Large language models (LLMs) have achieved remarkable performance on various NLP tasks, yet their potential in more challenging and domain-specific task, such as finance, has not been fully explored. In this paper, we present CFinBench: a meticulously crafted, the most comprehensive evaluation benchmark to date, for assessing the financial knowledge of LLMs under Chinese context. In practice, to b… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  4. arXiv:2406.11903  [pdf, other

    q-fin.GN cs.AI q-fin.CP

    A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges

    Authors: Yuqi Nie, Yaxuan Kong, Xiaowen Dong, John M. Mulvey, H. Vincent Poor, Qingsong Wen, Stefan Zohren

    Abstract: Recent advances in large language models (LLMs) have unlocked novel opportunities for machine learning applications in the financial domain. These models have demonstrated remarkable capabilities in understanding context, processing vast amounts of data, and generating human-preferred contents. In this survey, we explore the application of LLMs on various financial tasks, focusing on their potenti… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  5. arXiv:2406.10589  [pdf, other

    physics.soc-ph

    Resilience patterns in higher-order meta-population networks

    Authors: Yanyi Nie, Yanbing Liu, Qixuan Cao, Tao Lin, Wei Wang

    Abstract: Meta-population networks are effective tools for capturing population movement across distinct regions, but the assumption of well-mixed regions fails to capture the reality of population higher-order interactions. As a multidimensional system capturing mobility characteristics, meta-population networks are inherently complex and difficult to interpret when subjected to resilience analysis based o… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  6. arXiv:2406.08725  [pdf, other

    cs.CR

    RL-JACK: Reinforcement Learning-powered Black-box Jailbreaking Attack against LLMs

    Authors: Xuan Chen, Yuzhou Nie, Lu Yan, Yunshu Mao, Wenbo Guo, Xiangyu Zhang

    Abstract: Modern large language model (LLM) developers typically conduct a safety alignment to prevent an LLM from generating unethical or harmful content. Recent studies have discovered that the safety alignment of LLMs can be bypassed by jailbreaking prompts. These prompts are designed to create specific conversation scenarios with a harmful question embedded. Querying an LLM with such prompts can mislead… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  7. arXiv:2406.08705  [pdf, other

    cs.CR

    When LLM Meets DRL: Advancing Jailbreaking Efficiency via DRL-guided Search

    Authors: Xuan Chen, Yuzhou Nie, Wenbo Guo, Xiangyu Zhang

    Abstract: Recent studies developed jailbreaking attacks, which construct jailbreaking prompts to ``fool'' LLMs into responding to harmful questions. Early-stage jailbreaking attacks require access to model internals or significant human efforts. More advanced attacks utilize genetic algorithms for automatic and black-box attacks. However, the random nature of genetic algorithms significantly limits the effe… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  8. On Onsager's type conjecture for the inviscid Boussinesq equations

    Authors: Changxing Miao, Yao Nie, Weikui Ye

    Abstract: In this paper, we investigate the Cauchy problem for the three dimensional inviscid Boussinesq system in the periodic setting. For $1\le p\le \infty$, we show that the threshold regularity exponent for $L^p$-norm conservation of temperature of this system is $1/3$, consistent with Onsager exponent. More precisely, for $1\le p\le\infty$, every weak solution $(v,θ)\in C_tC^β_x$ to the inviscid Bouss… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Journal ref: Journal of Functional Analysis 287 (2024) 110527

  9. arXiv:2406.00256  [pdf, other

    cs.IT cs.CR

    Over-the-Air Collaborative Inference with Feature Differential Privacy

    Authors: Mohamed Seif, Yuqi Nie, Andrea Goldsmith, Vincent Poor

    Abstract: Collaborative inference in next-generation networks can enhance Artificial Intelligence (AI) applications, including autonomous driving, personal identification, and activity classification. This method involves a three-stage process: a) data acquisition through sensing, b) feature extraction, and c) feature encoding for transmission. Transmission of the extracted features entails the potential ri… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

  10. Correctable Landmark Discovery via Large Models for Vision-Language Navigation

    Authors: Bingqian Lin, Yunshuang Nie, Ziming Wei, Yi Zhu, Hang Xu, Shikui Ma, Jianzhuang Liu, Xiaodan Liang

    Abstract: Vision-Language Navigation (VLN) requires the agent to follow language instructions to reach a target position. A key factor for successful navigation is to align the landmarks implied in the instruction with diverse visual observations. However, previous VLN agents fail to perform accurate modality alignment especially in unexplored scenes, since they learn from limited navigation data and lack s… ▽ More

    Submitted 5 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted by TPAMI 2024

  11. arXiv:2405.16783  [pdf, other

    cs.CR cs.AI cs.LG

    TrojFM: Resource-efficient Backdoor Attacks against Very Large Foundation Models

    Authors: Yuzhou. Nie, Yanting. Wang, Jinyuan. Jia, Michael J. De Lucia, Nathaniel D. Bastian, Wenbo. Guo, Dawn. Song

    Abstract: One key challenge in backdoor attacks against large foundation models is the resource limits. Backdoor attacks usually require retraining the target model, which is impractical for very large foundation models. Existing backdoor attacks are mainly designed for supervised classifiers or small foundation models (e.g., BERT). None of these attacks has successfully compromised a very large foundation… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  12. arXiv:2405.13581  [pdf, other

    cs.CV cs.AI

    Safety Alignment for Vision Language Models

    Authors: Zhendong Liu, Yuanbi Nie, Yingshui Tan, Xiangyu Yue, Qiushi Cui, Chongjun Wang, Xiaoyong Zhu, Bo Zheng

    Abstract: Benefiting from the powerful capabilities of Large Language Models (LLMs), pre-trained visual encoder models connected to an LLMs can realize Vision Language Models (VLMs). However, existing research shows that the visual modality of VLMs is vulnerable, with attackers easily bypassing LLMs' safety alignment through visual modality features to launch attacks. To address this issue, we enhance the e… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 23 pages, 15 figures

  13. arXiv:2405.08633  [pdf, other

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    On the superconducting gap structure of the miassite Rh17S15: Nodal or nodeless?

    Authors: J. Y. Nie, C. C. Zhao, C. Q. Xu, B. Li, C. P. Tu, X. Zhang, D. Z. Dai, H. R. Wang, S. Xu, Wenhe Jiao, B. M. Wang, Zhu'an Xu, Xiaofeng Xu, S. Y. Li

    Abstract: Recent penetration depth measurement claimed the observation of unconventional superconductivity in the miassite Rh$_{17}$S$_{15}$ single crystals, evidenced by the linear-in-temperature penetration depth at low temperatures, thereby arguing for the presence of the lines of node in its superconducting gap structure. Here we measure the thermal conductivity of Rh$_{17}$S$_{15}$ single crystals down… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 7 pages, 6 figures

  14. arXiv:2405.07303  [pdf, other

    hep-ex hep-ph physics.ins-det

    Search for solar axions by Primakoff effect with the full dataset of the CDEX-1B Experiment

    Authors: L. T. Yang, S. K. Liu, Q. Yue, K. J. Kang, Y. J. Li, H. P. An, Greeshma C., J. P. Chang, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, T. Guo, X. Y. Guo, L. He, J. R. He, J. W. Hu, H. X. Huang, T. C. Huang, L. Jiang, S. Karmakar , et al. (61 additional authors not shown)

    Abstract: We present the first limit on $g_{Aγ}$ coupling constant using the Bragg-Primakoff conversion based on an exposure of 1107.5 kg days of data from the CDEX-1B experiment at the China Jinping Underground Laboratory. The data are consistent with the null signal hypothesis, and no excess signals are observed. Limits of the coupling $g_{Aγ}<2.08\times10^{-9}$ GeV$^{-1}$ (95\% C.L.) are derived for axio… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 7 pages, 5 figures

  15. arXiv:2405.04390  [pdf, other

    cs.CV

    DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving

    Authors: Chen Min, Dawei Zhao, Liang Xiao, Jian Zhao, Xinli Xu, Zheng Zhu, Lei Jin, Jianshu Li, Yulan Guo, Junliang Xing, Liping Jing, Yiming Nie, Bin Dai

    Abstract: Vision-centric autonomous driving has recently raised wide attention due to its lower cost. Pre-training is essential for extracting a universal representation. However, current vision-centric pre-training typically relies on either 2D or 3D pre-text tasks, overlooking the temporal characteristics of autonomous driving as a 4D scene understanding task. In this paper, we address this challenge by i… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Accepted by CVPR2024

  16. arXiv:2405.02357  [pdf, other

    cs.LG

    Large Language Models for Mobility in Transportation Systems: A Survey on Forecasting Tasks

    Authors: Zijian Zhang, Yujie Sun, Zepu Wang, Yuqi Nie, Xiaobo Ma, Peng Sun, Ruolin Li

    Abstract: Mobility analysis is a crucial element in the research area of transportation systems. Forecasting traffic information offers a viable solution to address the conflict between increasing transportation demands and the limitations of transportation infrastructure. Predicting human travel is significant in aiding various transportation and urban management tasks, such as taxi dispatch and urban plan… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 9 pages

  17. arXiv:2404.15538  [pdf, other

    cs.GR cs.AI cs.CL cs.LG

    DreamCraft: Text-Guided Generation of Functional 3D Environments in Minecraft

    Authors: Sam Earle, Filippos Kokkinos, Yuhe Nie, Julian Togelius, Roberta Raileanu

    Abstract: Procedural Content Generation (PCG) algorithms enable the automatic generation of complex and diverse artifacts. However, they don't provide high-level control over the generated content and typically require domain expertise. In contrast, text-to-3D methods allow users to specify desired characteristics in natural language, offering a high amount of flexibility and expressivity. But unlike PCG, s… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 16 pages, 9 figures, accepted to Foundation of Digital Games 2024

  18. arXiv:2404.15127  [pdf, other

    cs.CV cs.CL

    MedDr: Diagnosis-Guided Bootstrapping for Large-Scale Medical Vision-Language Learning

    Authors: Sunan He, Yuxiang Nie, Zhixuan Chen, Zhiyuan Cai, Hongmei Wang, Shu Yang, Hao Chen

    Abstract: The rapid advancement of large-scale vision-language models has showcased remarkable capabilities across various tasks. However, the lack of extensive and high-quality image-text data in medicine has greatly hindered the development of large-scale medical vision-language models. In this work, we present a diagnosis-guided bootstrapping strategy that exploits both image and label information to con… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  19. Sharp ill-posedness for the non-resistive MHD equations in Sobolev spaces

    Authors: Qionglei Chen, Yao Nie, Weikui Ye

    Abstract: In this paper, we prove a sharp ill-posedness result for the incompressible non-resistive MHD equations. In any dimension $d\ge 2$, we show the ill-posedness of the non-resistive MHD equations in $H^{\frac{d}{2}-1}(\mathbb{R}^d)\times H^{\frac{d}{2}}(\mathbb{R}^d)$, which is sharp in view of the results of the local well-posedness in… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 20 pages

    Journal ref: Journal of Functional Analysis, 286(2024)110302

  20. arXiv:2404.09793  [pdf, other

    hep-ex hep-ph physics.ins-det

    First Search for Light Fermionic Dark Matter Absorption on Electrons Using Germanium Detector in CDEX-10 Experiment

    Authors: J. X. Liu, L. T. Yang, Q. Yue, K. J. Kang, Y. J. Li, H. P. An, Greeshma C., J. P. Chang, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, T. Guo, X. Y. Guo, L. He, J. R. He, J. W. Hu, H. X. Huang, T. C. Huang, L. Jiang, S. Karmakar , et al. (61 additional authors not shown)

    Abstract: We present the first results of the search for sub-MeV fermionic dark matter absorbed by electron targets of Germanium using the 205.4~kg$\cdot$day data collected by the CDEX-10 experiment, with the analysis threshold of 160~eVee. No significant dark matter (DM) signals over the background are observed. Results are presented as limits on the cross section of DM--electron interaction. We present ne… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 6 pages, 4 figures

  21. arXiv:2404.03264  [pdf, other

    cs.CY cs.AI

    Foundation Model for Advancing Healthcare: Challenges, Opportunities, and Future Directions

    Authors: Yuting He, Fuxiang Huang, Xinrui Jiang, Yuxiang Nie, Minghao Wang, Jiguang Wang, Hao Chen

    Abstract: Foundation model, which is pre-trained on broad data and is able to adapt to a wide range of tasks, is advancing healthcare. It promotes the development of healthcare artificial intelligence (AI) models, breaking the contradiction between limited AI models and diverse healthcare practices. Much more widespread healthcare scenarios will benefit from the development of a healthcare foundation model… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  22. arXiv:2403.20276  [pdf, other

    hep-ex hep-ph physics.ins-det

    Constraints on the Blazar-Boosted Dark Matter from the CDEX-10 Experiment

    Authors: R. Xu, L. T. Yang, Q. Yue, K. J. Kang, Y. J. Li, H. P. An, Greeshma C., J. P. Chang, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, T. Guo, X. Y. Guo, L. He, S. M. He, J. W. Hu, H. X. Huang, T. C. Huang, L. Jiang, S. Karmakar , et al. (59 additional authors not shown)

    Abstract: We report new constraints on light dark matter (DM) boosted by blazars using the 205.4 kg day data from the CDEX-10 experiment located at the China Jinping Underground Laboratory. Two representative blazars, TXS 0506+56 and BL Lacertae are studied. The results derived from TXS 0506+56 exclude DM-nucleon elastic scattering cross sections from $4.6\times 10^{-33}\ \rm cm^2$ to… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 7 pages, 4 figures

  23. arXiv:2403.20263  [pdf, other

    hep-ex hep-ph physics.ins-det

    Probing Dark Matter Particles from Evaporating Primordial Black Holes via Electron Scattering in the CDEX-10 Experiment

    Authors: Z. H. Zhang, L. T. Yang, Q. Yue, K. J. Kang, Y. J. Li, H. P. An, Greeshma C., J. P. Chang, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, T. Guo, X. Y. Guo, L. He, S. M. He, J. W. Hu, H. X. Huang, T. C. Huang, L. Jiang, S. Karmakar , et al. (59 additional authors not shown)

    Abstract: Dark matter (DM) is a major constituent of the Universe. However, no definite evidence of DM particles (denoted as ``$χ$") has been found in DM direct detection (DD) experiments to date. There is a novel concept that detecting $χ$ from evaporating primordial black holes (PBHs). We search for $χ$ emitted from PBHs by investigating their interaction with target electrons. The examined PBH masses ran… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 8 pages, 6 figures

  24. arXiv:2403.19632  [pdf, other

    cs.CV

    GauStudio: A Modular Framework for 3D Gaussian Splatting and Beyond

    Authors: Chongjie Ye, Yinyu Nie, Jiahao Chang, Yuantao Chen, Yihao Zhi, Xiaoguang Han

    Abstract: We present GauStudio, a novel modular framework for modeling 3D Gaussian Splatting (3DGS) to provide standardized, plug-and-play components for users to easily customize and implement a 3DGS pipeline. Supported by our framework, we propose a hybrid Gaussian representation with foreground and skyball background models. Experiments demonstrate this representation reduces artifacts in unbounded outdo… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Code: https://github.com/GAP-LAB-CUHK-SZ/gaustudio

  25. arXiv:2403.19319  [pdf, other

    cs.CV

    Mesh2NeRF: Direct Mesh Supervision for Neural Radiance Field Representation and Generation

    Authors: Yujin Chen, Yinyu Nie, Benjamin Ummenhofer, Reiner Birkl, Michael Paulitsch, Matthias Müller, Matthias Nießner

    Abstract: We present Mesh2NeRF, an approach to derive ground-truth radiance fields from textured meshes for 3D generation tasks. Many 3D generative approaches represent 3D scenes as radiance fields for training. Their ground-truth radiance fields are usually fitted from multi-view renderings from a large-scale synthetic 3D dataset, which often results in artifacts due to occlusions or under-fitting issues.… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Project page: https://terencecyj.github.io/projects/Mesh2NeRF/ Video: https://youtu.be/oufv1N3f7iY

  26. arXiv:2403.17636  [pdf, other

    cs.CL

    Mix-Initiative Response Generation with Dynamic Prefix Tuning

    Authors: Yuxiang Nie, Heyan Huang, Xian-Ling Mao, Lizi Liao

    Abstract: Mixed initiative serves as one of the key factors in controlling conversation directions. For a speaker, responding passively or leading proactively would result in rather different responses. However, most dialogue systems focus on training a holistic response generation model without any distinction among different initiatives. It leads to the cross-contamination problem, where the model confuse… ▽ More

    Submitted 27 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted to the main conference of NAACL 2024

  27. arXiv:2403.16902  [pdf

    physics.optics cond-mat.mtrl-sci physics.bio-ph

    Multi-Convergence-Angle Ptychography with Simultaneous Strong Contrast and High Resolution

    Authors: Wei Mao, Weiyang Zhang, Chen Huang, Liqi Zhou, Judy. S. Kim, Si Gao, Yu Lei, Xiaopeng Wu, Yiming Hu, Xudong Pei, Weina Fang, Xiaoguo Liu, Jingdong Song, Chunhai Fan, Yuefeng Nie, Angus. I. Kirkland, Peng Wang

    Abstract: Advances in bioimaging methods and hardware facilities have revolutionised the determination of numerous biological structures at atomic or near-atomic resolution. Among these developments, electron ptychography has recently attracted considerable attention because of its superior resolution, remarkable sensitivity to light elements, and high electron dose efficiency. Here, we introduce an innovat… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  28. arXiv:2403.16558  [pdf, other

    cs.CV

    Elysium: Exploring Object-level Perception in Videos via MLLM

    Authors: Han Wang, Yanjie Wang, Yongjie Ye, Yuxiang Nie, Can Huang

    Abstract: Multi-modal Large Language Models (MLLMs) have demonstrated their ability to perceive objects in still images, but their application in video-related tasks, such as object tracking, remains understudied. This lack of exploration is primarily due to two key challenges. Firstly, extensive pretraining on large-scale video datasets is required to equip MLLMs with the capability to perceive objects acr… ▽ More

    Submitted 29 March, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  29. Foundation Models for Time Series Analysis: A Tutorial and Survey

    Authors: Yuxuan Liang, Haomin Wen, Yuqi Nie, Yushan Jiang, Ming Jin, Dongjin Song, Shirui Pan, Qingsong Wen

    Abstract: Time series analysis stands as a focal point within the data mining community, serving as a cornerstone for extracting valuable insights crucial to a myriad of real-world applications. Recent advances in Foundation Models (FMs) have fundamentally reshaped the paradigm of model design for time series analysis, boosting various downstream tasks in practice. These innovative approaches often leverage… ▽ More

    Submitted 18 June, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: In Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'24)

  30. arXiv:2403.11401  [pdf, other

    cs.CV cs.AI

    Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning

    Authors: Rao Fu, Jingyu Liu, Xilun Chen, Yixin Nie, Wenhan Xiong

    Abstract: This paper introduces Scene-LLM, a 3D-visual-language model that enhances embodied agents' abilities in interactive 3D indoor environments by integrating the reasoning strengths of Large Language Models (LLMs). Scene-LLM adopts a hybrid 3D visual feature representation, that incorporates dense spatial information and supports scene state updates. The model employs a projection layer to efficiently… ▽ More

    Submitted 22 March, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

  31. arXiv:2403.10023  [pdf, other

    quant-ph

    Measurement-device-independent quantum random number generation over 23 Mbps with imperfect single-photon sources

    Authors: You-Qi Nie, Hongyi Zhou, Bing Bai, Qi Xu, Xiongfeng Ma, Jun Zhang, Jian-Wei Pan

    Abstract: Quantum randomness relies heavily on the accurate characterization of the generator implementation, where the device imperfection or inaccurate characterization can lead to incorrect entropy estimation and practical bias, significantly affecting the reliability of the generated randomness. Measurement-device-independent (MDI) quantum random number generation (QRNG) endeavors to produce certified r… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 20 pages, 9 figures, including appendixes. Accepted for publication in Quantum Science and Technology

  32. arXiv:2403.07376  [pdf, other

    cs.CV cs.AI cs.CL cs.RO

    NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning

    Authors: Bingqian Lin, Yunshuang Nie, Ziming Wei, Jiaqi Chen, Shikui Ma, Jianhua Han, Hang Xu, Xiaojun Chang, Xiaodan Liang

    Abstract: Vision-and-Language Navigation (VLN), as a crucial research problem of Embodied AI, requires an embodied agent to navigate through complex 3D environments following natural language instructions. Recent research has highlighted the promising capacity of large language models (LLMs) in VLN by improving navigational reasoning accuracy and interpretability. However, their predominant use in an offlin… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  33. arXiv:2403.07344  [pdf

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    Electronic Structure of Superconducting Infinite-Layer Lanthanum Nickelates

    Authors: Wenjie Sun, Zhicheng Jiang, Chengliang Xia, Bo Hao, Yueying Li, Shengjun Yan, Maosen Wang, Hongquan Liu, Jianyang Ding, Jiayu Liu, Zhengtai Liu, Jishan Liu, Hanghui Chen, Dawei Shen, Yuefeng Nie

    Abstract: Revealing the momentum-resolved electronic structure of infinite-layer nickelates is essential for understanding this new class of unconventional superconductors, but has been hindered by the formidable challenges in improving the sample quality. In this work, we report for the first time the angle-resolved photoemission spectroscopy of superconducting La$_{0.8}$Sr$_{0.2}$NiO$_{2}$ films prepared… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 29 pages,13 figures

  34. arXiv:2403.03066  [pdf, ps, other

    eess.SY

    Tracking-in-range Formulations for Numerical Optimal Control

    Authors: Nikilesh Ramesh, Eric C. Kerrigan, Yuanbo Nie

    Abstract: In contrast to set-point tracking which aims to reduce the tracking error between the tracker and the reference, tracking-in-range problems only focus on whether the tracker is within a given range around the reference, making it more suitable for the mission specifications of many practical applications. In this work, we present novel optimal control formulations to solve tracking-in-range proble… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  35. arXiv:2402.08148  [pdf, other

    math.OC

    Model Predictive Bang-Bang Controller Synthesis via Approximate Value Functions

    Authors: Morgan Jones, Yuanbo Nie, Matthew M. Peet

    Abstract: In this paper, we propose a novel method for addressing Optimal Control Problems (OCPs) with input-affine dynamics and cost functions. This approach adopts a Model Predictive Control (MPC) strategy, wherein a controller is synthesized to handle an approximated OCP within a finite time horizon. Upon reaching this horizon, the controller is re-calibrated to tackle another approximation of the OCP, w… ▽ More

    Submitted 16 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  36. arXiv:2402.02074  [pdf, other

    cs.CV

    Multiple-Crop Human Mesh Recovery with Contrastive Learning and Camera Consistency in A Single Image

    Authors: Yongwei Nie, Changzhen Liu, Chengjiang Long, Qing Zhang, Guiqing Li, Hongmin Cai

    Abstract: We tackle the problem of single-image Human Mesh Recovery (HMR). Previous approaches are mostly based on a single crop. In this paper, we shift the single-crop HMR to a novel multiple-crop HMR paradigm. Cropping a human from image multiple times by shifting and scaling the original bounding box is feasible in practice, easy to implement, and incurs neglectable cost, but immediately enriches availa… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

  37. arXiv:2402.01253  [pdf, other

    cs.IR

    RimiRec: Modeling Refined Multi-interest in Hierarchical Structure for Recommendation

    Authors: Haolei Pei, Yuanyuan Xu, Yangping Zhu, Yuan Nie

    Abstract: Industrial recommender systems usually consist of the retrieval stage and the ranking stage, to handle the billion-scale of users and items. The retrieval stage retrieves candidate items relevant to user interests for recommendations and has attracted much attention. Frequently, a user shows refined multi-interests in a hierarchical structure. For example, a user likes Conan and Kuroba Kaito, whic… ▽ More

    Submitted 5 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 4 pages, 4 figures

  38. arXiv:2401.15980  [pdf

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    Superconductivity in freestanding infinite-layer nickelate membranes

    Authors: Shengjun Yan, Wei Mao, Wenjie Sun, Yueying Li, Haoying Sun, Jiangfeng Yang, Bo Hao, Wei Guo, Leyan Nian, Zhengbin Gu, Peng Wang, Yuefeng Nie

    Abstract: The observation of superconductivity in infinite-layer nickelates has attracted significant attention due to its potential as a new platform for exploring high $ \mathrm{\textit{T}}_{c} $ superconductivity. However, thus far, superconductivity has only been observed in epitaxial thin films, which limits the manipulation capabilities and modulation methods compared to two-dimensional exfoliated mat… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 20 pages, 9 figures

  39. arXiv:2401.15979  [pdf, other

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    ${\mathrm{\textit{In situ}}}$ preparation of superconducting infinite-layer nickelate thin films with atomically flat surface

    Authors: Wenjie Sun, Zhichao Wang, Bo Hao, Shengjun Yan, Haoying Sun, Zhengbin Gu, Yu Deng, Yuefeng Nie

    Abstract: Since their discovery, the infinite-layer nickelates have been regarded as an appealing system for gaining deeper insights into high temperature superconductivity (HTSC). However, the synthesis of superconducting samples has been proved to be challenging. Here, we develop an ultrahigh vacuum (UHV) ${\mathrm{\textit{in situ}}}$ reduction method using atomic hydrogen as reducing agent and apply it i… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 5 pages, 4 figures

    Journal ref: Adv. Mater. 2024, 2401342

  40. arXiv:2401.14121  [pdf, other

    cs.CV

    Incorporating Exemplar Optimization into Training with Dual Networks for Human Mesh Recovery

    Authors: Yongwei Nie, Mingxian Fan, Chengjiang Long, Qing Zhang, Jian Zhu, Xuemiao Xu

    Abstract: We propose a novel optimization-based human mesh recovery method from a single image. Given a test exemplar, previous approaches optimize the pre-trained regression network to minimize the 2D re-projection loss, which however suffer from over-/under-fitting problems. This is because the ``exemplar optimization'' at testing time has too weak relation to the pre-training process, and the exemplar op… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  41. arXiv:2401.13551  [pdf, other

    cs.CV

    Interleaving One-Class and Weakly-Supervised Models with Adaptive Thresholding for Unsupervised Video Anomaly Detection

    Authors: Yongwei Nie, Hao Huang, Chengjiang Long, Qing Zhang, Pradipta Maji, Hongmin Cai

    Abstract: Without human annotations, a typical Unsupervised Video Anomaly Detection (UVAD) method needs to train two models that generate pseudo labels for each other. In previous work, the two models are closely entangled with each other, and it is not known how to upgrade their method without modifying their training framework significantly. Second, previous work usually adopts fixed thresholding to obtai… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  42. arXiv:2401.12681  [pdf, other

    cs.LG cs.AI

    Non-Neighbors Also Matter to Kriging: A New Contrastive-Prototypical Learning

    Authors: Zhishuai Li, Yunhao Nie, Ziyue Li, Lei Bai, Yisheng Lv, Rui Zhao

    Abstract: Kriging aims at estimating the attributes of unsampled geo-locations from observations in the spatial vicinity or physical connections, which helps mitigate skewed monitoring caused by under-deployed sensors. Existing works assume that neighbors' information offers the basis for estimating the attributes of the unobserved target while ignoring non-neighbors. However, non-neighbors could also offer… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted in AISTATS 2024

  43. arXiv:2401.08013  [pdf, other

    cs.GT cs.MA econ.GN

    A Day-to-Day Dynamical Approach to the Most Likely User Equilibrium Problem

    Authors: Jiayang Li, Qianni Wang, Liyang Feng, Jun Xie, Yu Marco Nie

    Abstract: The lack of a unique user equilibrium (UE) route flow in traffic assignment has posed a significant challenge to many transportation applications. The maximum-entropy principle, which advocates for the consistent selection of the most likely solution as a representative, is often used to address the challenge. Built on a recently proposed day-to-day (DTD) discrete-time dynamical model called cumul… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  44. arXiv:2401.04719  [pdf

    cond-mat.mtrl-sci

    Inelastic electron scattering at large angles: the phonon polariton contribution

    Authors: Hongbin Yang, Paul Zeiger, Andrea Konečná, Lu Han, Guangyao Miao, Yinong Zhou, Yifeng Huang, Xingxu Yan, Weihua Wang, Jiandong Guo, Yuefeng Nie, Ruqian Wu, Jan Rusz, Xiaoqing Pan

    Abstract: We explore the inelastic electron scattering in SrTiO3, PbTiO3, and SiC in their phonon energy range, challenging the assumption that phonon polaritons are excluded at large angles in high-resolution transmission electron energy-loss spectroscopy. We demonstrate that through multiple scattering, the electron beam can excite both phonons and phonon polaritons, and the relative proportion of each va… ▽ More

    Submitted 9 January, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

  45. arXiv:2312.17276  [pdf, other

    cs.CL cs.LG

    PanGu-$π$: Enhancing Language Model Architectures via Nonlinearity Compensation

    Authors: Yunhe Wang, Hanting Chen, Yehui Tang, Tianyu Guo, Kai Han, Ying Nie, Xutao Wang, Hailin Hu, Zheyuan Bai, Yun Wang, Fangcheng Liu, Zhicheng Liu, Jianyuan Guo, Sinan Zeng, Yinchen Zhang, Qinghua Xu, Qun Liu, Jun Yao, Chao Xu, Dacheng Tao

    Abstract: The recent trend of large language models (LLMs) is to increase the scale of both model size (\aka the number of parameters) and dataset to achieve better generative ability, which is definitely proved by a lot of work such as the famous GPT and Llama. However, large models often involve massive computational costs, and practical applications cannot afford such high prices. However, the method of… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

  46. arXiv:2312.14456  [pdf, other

    cond-mat.mtrl-sci cond-mat.str-el

    Spontaneous gap opening and potential excitonic states in an ideal Dirac semimetal Ta$_2$Pd$_3$Te$_5$

    Authors: Peng Zhang, Yuyang Dong, Dayu Yan, Bei Jiang, Tao Yang, Jun Li, Zhaopeng Guo, Yong Huang, Bo Hao, Qing Li, Yupeng Li, Kifu Kurokawa, Rui Wang, Yuefeng Nie, Makoto Hashimoto, Donghui Lu, Wen-He Jiao, Jie Shen, Tian Qian, Zhijun Wang, Youguo Shi, Takeshi Kondo

    Abstract: The opening of an energy gap in the electronic structure generally indicates the presence of interactions. In materials with low carrier density and short screening length, long-range Coulomb interaction favors the spontaneous formation of electron-hole pairs, so-called excitons, opening an excitonic gap at the Fermi level. Excitonic materials host unique phenomenons associated with pair excitatio… ▽ More

    Submitted 15 March, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: 9 pages, 5 figures

    Journal ref: Phys. Rev. X 14, 011047 (2024)

  47. arXiv:2312.12418  [pdf, other

    cs.CV

    LASA: Instance Reconstruction from Real Scans using A Large-scale Aligned Shape Annotation Dataset

    Authors: Haolin Liu, Chongjie Ye, Yinyu Nie, Yingfan He, Xiaoguang Han

    Abstract: Instance shape reconstruction from a 3D scene involves recovering the full geometries of multiple objects at the semantic instance level. Many methods leverage data-driven learning due to the intricacies of scene complexity and significant indoor occlusions. Training these methods often requires a large-scale, high-quality dataset with aligned and paired shape annotations with real-world scans. Ex… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: homepage: https://gap-lab-cuhk-sz.github.io/LASA/

  48. arXiv:2312.06428  [pdf, other

    cs.CV cs.AI cs.IR cs.LG

    VisionTraj: A Noise-Robust Trajectory Recovery Framework based on Large-scale Camera Network

    Authors: Zhishuai Li, Ziyue Li, Xiaoru Hu, Guoqing Du, Yunhao Nie, Feng Zhu, Lei Bai, Rui Zhao

    Abstract: Trajectory recovery based on the snapshots from the city-wide multi-camera network facilitates urban mobility sensing and driveway optimization. The state-of-the-art solutions devoted to such a vision-based scheme typically incorporate predefined rules or unsupervised iterative feedback, struggling with multi-fold challenges such as lack of open-source datasets for training the whole pipeline, and… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  49. arXiv:2312.05798  [pdf, other

    cs.CV

    Disentangled Representation Learning for Controllable Person Image Generation

    Authors: Wenju Xu, Chengjiang Long, Yongwei Nie, Guanghui Wang

    Abstract: In this paper, we propose a novel framework named DRL-CPG to learn disentangled latent representation for controllable person image generation, which can produce realistic person images with desired poses and human attributes (e.g., pose, head, upper clothes, and pants) provided by various source persons. Unlike the existing works leveraging the semantic masks to obtain the representation of each… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

  50. arXiv:2312.03441  [pdf, other

    cs.CV

    UFineBench: Towards Text-based Person Retrieval with Ultra-fine Granularity

    Authors: Jialong Zuo, Hanyu Zhou, Ying Nie, Feng Zhang, Tianyu Guo, Nong Sang, Yunhe Wang, Changxin Gao

    Abstract: Existing text-based person retrieval datasets often have relatively coarse-grained text annotations. This hinders the model to comprehend the fine-grained semantics of query texts in real scenarios. To address this problem, we contribute a new benchmark named \textbf{UFineBench} for text-based person retrieval with ultra-fine granularity. Firstly, we construct a new \textbf{dataset} named UFine6… ▽ More

    Submitted 6 June, 2024; v1 submitted 6 December, 2023; originally announced December 2023.