Skip to main content

Showing 1–50 of 394 results for author: Meng, L

  1. arXiv:2407.08551  [pdf, other

    cs.CL cs.SD eess.AS

    Autoregressive Speech Synthesis without Vector Quantization

    Authors: Lingwei Meng, Long Zhou, Shujie Liu, Sanyuan Chen, Bing Han, Shujie Hu, Yanqing Liu, Jinyu Li, Sheng Zhao, Xixin Wu, Helen Meng, Furu Wei

    Abstract: We present MELLE, a novel continuous-valued tokens based language modeling approach for text to speech synthesis (TTS). MELLE autoregressively generates continuous mel-spectrogram frames directly from text condition, bypassing the need for vector quantization, which are originally designed for audio compression and sacrifice fidelity compared to mel-spectrograms. Specifically, (i) instead of cross… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2407.06861  [pdf, other

    cs.CV

    Window-to-Window BEV Representation Learning for Limited FoV Cross-View Geo-localization

    Authors: Lei Cheng, Teng Wang, Lingquan Meng, Changyin Sun

    Abstract: Cross-view geo-localization confronts significant challenges due to large perspective changes, especially when the ground-view query image has a limited field of view with unknown orientation. To bridge the cross-view domain gap, we for the first time explore to learn a BEV representation directly from the ground query image. However, the unknown orientation between ground and aerial images combin… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  3. arXiv:2407.06730  [pdf, other

    cs.CV

    LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition

    Authors: Teng Wang, Lingquan Meng, Lei Cheng, Changyin Sun

    Abstract: Visual place recognition (VPR) remains challenging due to significant viewpoint changes and appearance variations. Mainstream works tackle these challenges by developing various feature aggregation methods to transform deep features into robust and compact global representations. Unfortunately, satisfactory results cannot be achieved under challenging conditions. We start from a new perspective an… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  4. arXiv:2407.04649  [pdf, other

    hep-ph hep-ex hep-lat nucl-th

    Internal structure of the $T_{cc}(3875)^+$ from its light-quark mass dependence

    Authors: Michael Abolnikov, Vadim Baru, Evgeny Epelbaum, Arseniy A. Filin, Christoph Hanhart, Lu Meng

    Abstract: We employ a chiral effective field theory-based approach to connect $DD^*$ scattering observables at the physical and variable pion masses accessible in lattice QCD simulations. We incorporate all relevant scales associated with three-body $DDπ$ dynamics and the left-hand cut induced by the one-pion exchange for pion masses higher than the physical one, as required by analyticity and unitarity. By… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 11 pages, 7 figures

  5. arXiv:2406.17824  [pdf, other

    hep-ph hep-ex hep-lat

    Fully heavy tetraquark resonant states with different flavors

    Authors: Wei-Lin Wu, Yao Ma, Yan-Ke Chen, Lu Meng, Shi-Lin Zhu

    Abstract: We use the quark potential model to calculate the mass spectrum of the S-wave fully heavy tetraquark systems with different flavors, including the $ bc\bar b\bar c, bb\bar c\bar c, cc\bar c\bar b $ and $ bb\bar b\bar c $ systems. We employ the Gaussian expansion method to solve the four-body Schrödinger equation, and the complex scaling method to identify resonant states. The… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 10 pages,7 figures,8 tables

  6. arXiv:2406.14123  [pdf

    cs.CY

    Mapping AI Ethics Narratives: Evidence from Twitter Discourse Between 2015 and 2022

    Authors: Mengyi Wei, Puzhen Zhang, Chuan Chen, Dongsheng Chen, Chenyu Zuo, Liqiu Meng

    Abstract: Public participation is indispensable for an insightful understanding of the ethics issues raised by AI technologies. Twitter is selected in this paper to serve as an online public sphere for exploring discourse on AI ethics, facilitating broad and equitable public engagement in the development of AI technology. A research framework is proposed to demonstrate how to transform AI ethics-related dis… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 22 pages, 6 figures

  7. arXiv:2406.11739  [pdf, other

    cs.CV

    V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results

    Authors: Jiaqi Wang, Yuhang Zang, Pan Zhang, Tao Chu, Yuhang Cao, Zeyi Sun, Ziyu Liu, Xiaoyi Dong, Tong Wu, Dahua Lin, Zeming Chen, Zhi Wang, Lingchen Meng, Wenhao Yao, Jianwei Yang, Sihong Wu, Zhineng Chen, Zuxuan Wu, Yu-Gang Jiang, Peixi Wu, Bosong Chai, Xuan Nie, Longquan Yan, Zeyu Wang, Qifan Zhou , et al. (9 additional authors not shown)

    Abstract: Detecting objects in real-world scenes is a complex task due to various challenges, including the vast range of object categories, and potential encounters with previously unknown or unseen objects. The challenges necessitate the development of public benchmarks and challenges to advance the field of object detection. Inspired by the success of previous COCO and LVIS Challenges, we organize the V3… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  8. arXiv:2406.07855  [pdf, other

    cs.CL cs.SD eess.AS

    VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment

    Authors: Bing Han, Long Zhou, Shujie Liu, Sanyuan Chen, Lingwei Meng, Yanming Qian, Yanqing Liu, Sheng Zhao, Jinyu Li, Furu Wei

    Abstract: With the help of discrete neural audio codecs, large language models (LLM) have increasingly been recognized as a promising methodology for zero-shot Text-to-Speech (TTS) synthesis. However, sampling based decoding strategies bring astonishing diversity to generation, but also pose robustness issues such as typos, omissions and repetition. In addition, the high sampling rate of audio also brings h… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 15 pages, 5 figures

  9. arXiv:2406.06993  [pdf, ps, other

    hep-ph hep-ex

    Spectrum of the molecular hexaquarks

    Authors: Bo Wang, Kan Chen, Lu Meng, Shi-Lin Zhu

    Abstract: We investigate the mass spectra of molecular-type hexaquark states in the dibaryon systems. These systems are composed of the charmed baryons $[Σ_c^{(\ast)}$, $Ξ_c^{(\prime,\ast)}]$, doubly charmed baryons $[Ξ_{cc}^{(\ast)}]$, and hyperons $[Σ^{(\ast)}$, $Ξ^{(\ast)}]$. We consider all possible combinations of particle-particle and particle-antiparticle pairs, including the S-wave spin multiplets i… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 17 pages, 13 tables

  10. arXiv:2406.06909  [pdf, other

    cs.LG cond-mat.dis-nn stat.ML

    Training Dynamics of Nonlinear Contrastive Learning Model in the High Dimensional Limit

    Authors: Lineghuan Meng, Chuang Wang

    Abstract: This letter presents a high-dimensional analysis of the training dynamics for a single-layer nonlinear contrastive learning model. The empirical distribution of the model weights converges to a deterministic measure governed by a McKean-Vlasov nonlinear partial differential equation (PDE). Under L2 regularization, this PDE reduces to a closed set of low-dimensional ordinary differential equations… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 21 pages, 11 figures

  11. arXiv:2406.06592  [pdf, other

    cs.CL cs.LG

    Improve Mathematical Reasoning in Language Models by Automated Process Supervision

    Authors: Liangchen Luo, Yinxiao Liu, Rosanne Liu, Samrat Phatale, Harsh Lara, Yunxuan Li, Lei Shu, Yun Zhu, Lei Meng, Jiao Sun, Abhinav Rastogi

    Abstract: Complex multi-step reasoning tasks, such as solving mathematical problems or generating code, remain a significant hurdle for even the most advanced large language models (LLMs). Verifying LLM outputs with an Outcome Reward Model (ORM) is a standard inference-time technique aimed at enhancing the reasoning performance of LLMs. However, this still proves insufficient for reasoning tasks with a leng… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 18 pages, 5 figures, 1 table

  12. arXiv:2406.04523  [pdf, other

    cs.CL cs.LG

    Proofread: Fixes All Errors with One Tap

    Authors: Renjie Liu, Yanxiang Zhang, Yun Zhu, Haicheng Sun, Yuanbo Zhang, Michael Xuelin Huang, Shanqing Cai, Lei Meng, Shumin Zhai

    Abstract: The impressive capabilities in Large Language Models (LLMs) provide a powerful approach to reimagine users' typing experience. This paper demonstrates Proofread, a novel Gboard feature powered by a server-side LLM in Gboard, enabling seamless sentence-level and paragraph-level corrections with a single tap. We describe the complete system in this paper, from data generation, metrics design to mode… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 8 pages, 3 figures, 2 tables

  13. arXiv:2406.04334  [pdf, other

    cs.CV

    DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs

    Authors: Lingchen Meng, Jianwei Yang, Rui Tian, Xiyang Dai, Zuxuan Wu, Jianfeng Gao, Yu-Gang Jiang

    Abstract: Most large multimodal models (LMMs) are implemented by feeding visual tokens as a sequence into the first layer of a large language model (LLM). The resulting architecture is simple but significantly increases computation and memory costs, as it has to handle a large number of additional tokens in its input layer. This paper presents a new architecture DeepStack for LMMs. Considering $N$ layers in… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Project Page: https://deepstack-vl.github.io/

  14. arXiv:2406.01151  [pdf, other

    cs.AR

    A 0.96pJ/SOP, 30.23K-neuron/mm^2 Heterogeneous Neuromorphic Chip With Fullerene-like Interconnection Topology for Edge-AI Computing

    Authors: P. J. Zhou, Q. Yu, M. Chen, Y. C. Wang, L. W. Meng, Y. Zuo, N. Ning, Y. Liu, S. G. Hu, G. C. Qiao

    Abstract: Edge-AI computing requires high energy efficiency, low power consumption, and relatively high flexibility and compact area, challenging the AI-chip design. This work presents a 0.96 pJ/SOP heterogeneous neuromorphic system-on-chip (SoC) with fullerene-like interconnection topology for edge-AI computing. The neuromorphic core integrates different technologies to augment computing energy efficiency,… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 5 pages, 8 figures

  15. arXiv:2405.20046  [pdf, other

    cs.AI

    Cross-Training with Multi-View Knowledge Fusion for Heterogenous Federated Learning

    Authors: Zhuang Qi, Lei Meng, Weihao He, Ruohan Zhang, Yu Wang, Xin Qi, Xiangxu Meng

    Abstract: Federated learning benefits from cross-training strategies, which enables models to train on data from distinct sources to improve the generalization capability. However, the data heterogeneity between sources may lead models to gradually forget previously acquired knowledge when undergoing cross-training to adapt to new tasks or data sources. We argue that integrating personalized and global know… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  16. arXiv:2405.19618  [pdf, other

    physics.optics

    Spectral multiplexing based on multi-distance lensless imaging

    Authors: Qijun You, Lingshuo Meng, Yun Gao, Qing Liao, Wei Cao, Peixiang Lu

    Abstract: We have demonstrated the capability of spectral multiplexing in multi-distance diffractive imaging, enabling the reconstruction of samples with diverse spectral responses. While previous methods like ptychography utilize redundancy in radial diffraction data to achieve information multiplexing, they typically require capturing a substantial amount of diffraction data. In contrast, our approach eff… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 4 pages, 5 figures

  17. arXiv:2405.16995  [pdf, other

    hep-ph nucl-th

    Electron form factors in Basis Light-front Quantization

    Authors: Lingdi Meng, Shuo Tang, Zhi Hu, Guo-Li Wang, Yang Li, Xingbo Zhao, James P. Vary

    Abstract: In this paper, we evaluate the electromagnetic and gravitational form factors as well as the corresponding generalized parton distributions of the electron using the Basis Light-front Quantization approach to QED. We compare our results with those from light-front perturbation theory. We adopt a novel basis with its scale depending on the constituents' longitudinal momentum fraction. We show that… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  18. arXiv:2405.16178  [pdf, other

    cs.CL

    Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection

    Authors: Yun Zhu, Jia-Chen Gu, Caitlin Sikora, Ho Ko, Yinxiao Liu, Chu-Cheng Lin, Lei Shu, Liangchen Luo, Lei Meng, Bang Liu, Jindong Chen

    Abstract: Large language models (LLMs) augmented with retrieval exhibit robust performance and extensive versatility by incorporating external contexts. However, the input length grows linearly in the number of retrieved documents, causing a dramatic increase in latency. In this paper, we propose a novel paradigm named Sparse RAG, which seeks to cut computation costs through sparsity. Specifically, Sparse R… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  19. arXiv:2405.15277  [pdf

    cond-mat.mtrl-sci

    Inducing ferroelectricity in NH$_4$I and NH$_4$Br via partial replacement of protons by deuterons

    Authors: Miao Miao Zhao, Lei Meng, Yi Yang Xu, Na Du, Fei Yen

    Abstract: While all of the polymorphs of NH$_4$I and NH$_4$Br are non-polar, a reversible electric polarization is established in the ordered $γ$ phases of (NH$_4$)$_{0.73}$(ND$_4$)$_{0.27}$I and (NH$_4$)$_{0.84}$(ND$_4$)$_{0.16}$Br (where D is $^2$H) via $dc$ electric fields. The presence of two groups of orbital magnetic moments appears to be responsible for the asymmetric lattice distortions. Our finding… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 14 pages, 3 figures

    Journal ref: J. Phys. Chem. C 127, 20951-20955 (2023)

  20. Electric Polarization and Magnetic Properties of (NH$_4$)$_{1-x}$K$_x$I (x = 0.05-0.17)

    Authors: Yi Yang Xu, Lei Meng, Miao Miao Zhao, Chu Xin Peng, Fei Yen

    Abstract: While all of the polymorphs of pure NH$_4$I and KI are non-polar, we identify that (NH$_4$)$_{0.95}$K$_{0.05}$I is ferroelectric and (NH$_4$)$_{0.87}$K$_{0.13}$I and (NH$_4$)$_{0.83}$K$_{0.17}$I are pyroelectric through measurements of their pyroelectric current and complex dielectric constant. The order to disorder phase transitions occur near 245 K. Magnetic susceptibility measurements indicate… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 13 pages, 5 figures

    Journal ref: Journal of Alloys and Compounds 960, 170685 (2023)

  21. arXiv:2405.13848  [pdf, other

    cs.LG

    Maximum Manifold Capacity Representations in State Representation Learning

    Authors: Li Meng, Morten Goodwin, Anis Yazidi, Paal Engelstad

    Abstract: The expanding research on manifold-based self-supervised learning (SSL) builds on the manifold hypothesis, which suggests that the inherent complexity of high-dimensional data can be unraveled through lower-dimensional manifold embeddings. Capitalizing on this, DeepInfomax with an unbalanced atlas (DIM-UA) has emerged as a powerful tool and yielded impressive results for state representations in r… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  22. arXiv:2405.09759  [pdf

    cond-mat.mtrl-sci physics.chem-ph

    Ferroelectricity Driven by Orbital Resonance of Protons in CH$_3$NH$_3$Cl and CH$_3$NH$_3$Br

    Authors: Chu Xin Peng, Lei Meng, Yi Yang Xu, Tian Tian Xing, Miao Miao Zhao, Peng Ren, Fei Yen

    Abstract: The $β$ and $γ$ phases of methylammonium chloride CH$_3$NH$_3$Cl and methylammonium bromide CH$_3$NH$_3$Br are identified to be ferroelectric $via$ pyroelectric current and dielectric constant measurements. The magnetic susceptibility also exhibits pronounced discontinuities at the Curie temperatures. We attribute the origin of spontaneous polarization to the emergence of two groups of proton orbi… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 5 pages, 5 figures

    Journal ref: J. Mater. Chem. C, 10, 1334-1338 (2022)

  23. arXiv:2405.09375  [pdf, other

    cs.RO

    VascularPilot3D: Toward a 3D fully autonomous navigation for endovascular robotics

    Authors: Song Jingwei, Yang Keke, Chen Han, Liu Jiayi, Gu Yinan, Hui Qianxin, Huang Yanqi, Li Meng, Zhang Zheng, Cao Tuoyu, Ghaffari Maani

    Abstract: This research reports VascularPilot3D, the first 3D fully autonomous endovascular robot navigation system. As an exploration toward autonomous guidewire navigation, VascularPilot3D is developed as a complete navigation system based on intra-operative imaging systems (fluoroscopic X-ray in this study) and typical endovascular robots. VascularPilot3D adopts previously researched fast 3D-2D vessel re… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: Submitted to MICCAI2024

  24. Magnetic interactions based on proton orbital motion in CH$_3$NH$_3$PbI$_3$ and CH$_3$NH$_3$PbBr$_3$

    Authors: Lei Meng, Miao Miao Zhao, Yi Yang Xu, Chu Xin Peng, Yang Yang, Tian Tian Xing, Peng Ren, Fei Yen

    Abstract: The microscopic origin of the remarkable optoelectronic properties of one of the most studied contemporary materials remains unclear. Here, we identify the existence of magnetic interactions between intermolecular proton orbitals in CH$_3$NH$_3$PbI$_3$ and CH$_3$NH$_3$PbBr$_3$. In particular, a unique sharp drop and a pronounced step-up discontinuity in the magnetic susceptibility at the tetragona… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: Manuscript + Supplementary Material file (17 + 6 pages, 4 + 2 figures)

    Journal ref: Scripta Mater. 226, 115229 (2023)

  25. Magnetic Properties of NH$_4$H$_2$PO$_4$ and KH$_2$PO$_4$: Emergence of Multiferroic Salts

    Authors: Lei Meng, Chen He, Wei Ji, Fei Yen

    Abstract: We observe sharp step-down discontinuities in the magnetic susceptibility of NH$_4$H$_2$PO$_4$ and NH$_4$H$_2$PO$_4$-$d$$_{60}$ (60% deuterated) along the $a$ and $c$-axes occurring exactly at their antiferroelectric transition temperatures. For the case of KH$_2$PO$_4$, less pronounced discontinuities occur at the ferroelectric transition temperature. To explain this, we treat the acid protons as… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 14 pages with 5 figures

    Journal ref: J. Phys. Chem. Lett. 11, 8297-8301 (2020)

  26. arXiv:2405.07687  [pdf, other

    cs.RO

    Highly Efficient Observation Process based on FFT Filtering for Robot Swarm Collaborative Navigation in Unknown Environments

    Authors: Chenxi Li, Weining Lu, Zhihao Ma, Litong Meng, Bin Liang

    Abstract: Collaborative path planning for robot swarms in complex, unknown environments without external positioning is a challenging problem. This requires robots to find safe directions based on real-time environmental observations, and to efficiently transfer and fuse these observations within the swarm. This study presents a filtering method based on Fast Fourier Transform (FFT) to address these two iss… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 8 pages, 8 figures, 1 table

  27. arXiv:2405.07036  [pdf, other

    hep-ph

    Dark Matter Physics in General NMSSM

    Authors: Lei Meng, Junjie Cao, Shenshen Yang

    Abstract: In the General Next-to-Minimal Supersymmetric Standard Model (GNMSSM), singlet particles may form a secluded sector of dark matter (DM), in which Singlino-like DM could achieve the observed relic abundance through various channels such as $\tildeχ_1^0 \tildeχ_1^0 \to h_s h_s, A_s A_s, h_s A_s$, where $h_s$ and $A_s$ represent singlet-dominated CP-even and CP-odd Higgs bosons. We provide analytical… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

  28. arXiv:2405.03171  [pdf

    cond-mat.mtrl-sci

    Magnetoelectric Coupling Based on Protons in Ammonium Sulfate

    Authors: Lei Meng, Chen He, Fei Yen

    Abstract: Most ferroelectric crystals have their own set of unique characteristics and ammonium sulfate (NH$_4$)$_2$SO$_4$ is no exception. We report on two previously unidentified features in ammonium sulfate: 1) that there are at least two successive transitions instead of one occurring at the Curie temperature $T$$_C$ = 223 K according to dielectric constant measurements; and 2) pronounced step-like anom… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Manuscript + Supporting Information (19 + 2 pages, 6 + 2 figures)

    Journal ref: J. Phys. Chem. C 124, 17255-17261 (2020)

  29. arXiv:2405.03163  [pdf

    cond-mat.mtrl-sci cond-mat.other

    Magnetic Ordering of Ammonium Cations in NH$_4$I, NH$_4$Br and NH$_4$Cl

    Authors: Fei Yen, Lei Meng, Tian Gao, Sixia Hu

    Abstract: The different types of magnetism arise mainly from how electrons move and interact with each other. In this work, we show how protons (H$^+$) also exhibit magnetic behavior. We measured the magnetic susceptibility of the ammonium halides and identified pronounced increases at 232 K, 233 K and 243 K for NH$_4$I, NH$_4$Br and NH$_4$Cl, respectively, which all coincide to the geometric ordering of it… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Manuscript + Supporting Information file (19 + 4 pages, 5 + 3 figures). Sorry for not uploading this back in 2020!

    Journal ref: J. Phys. Chem. C 123, 23655-23660 (2019)

  30. arXiv:2404.16575  [pdf, other

    hep-ph hep-ex hep-lat

    Probing the pole origin of $X(3872)$ with the coupled-channel dynamics

    Authors: Jun-Zhang Wang, Zi-Yang Lin, Yan-Ke Chen, Lu Meng, Shi-Lin Zhu

    Abstract: The $X(3872)$, as the first and the most crucial member in the exotic charmoniumlike $XYZ$ family, has been studied for a long time. However, its dynamical origin, whether stemming from a $D\bar{D}^*$ hadronic molecule or the first excited $P$-wave charmonium $χ_{c1}(2P)$, remains controversial. In this Letter, we demonstrate that the $X(3872)$ definitely does not result from the mass shift of the… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 11 pages, 5 figures

  31. arXiv:2404.13899  [pdf, other

    cs.CL cs.AI cs.MM

    Towards Better Text-to-Image Generation Alignment via Attention Modulation

    Authors: Yihang Wu, Xiao Cao, Kaixin Li, Zitan Chen, Haonan Wang, Lei Meng, Zhiyong Huang

    Abstract: In text-to-image generation tasks, the advancements of diffusion models have facilitated the fidelity of generated results. However, these models encounter challenges when processing text prompts containing multiple entities and attributes. The uneven distribution of attention results in the issues of entity leakage and attribute misalignment. Training from scratch to address this issue requires n… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  32. arXiv:2404.06037  [pdf, other

    cs.DC

    A Survey of Distributed Graph Algorithms on Massive Graphs

    Authors: Lingkai Meng, Yu Shao, Long Yuan, Longbin Lai, Peng Cheng, Xue Li, Wenyuan Yu, Wenjie Zhang, Xuemin Lin, Jingren Zhou

    Abstract: Distributed processing of large-scale graph data has many practical applications and has been widely studied. In recent years, a lot of distributed graph processing frameworks and algorithms have been proposed. While many efforts have been devoted to analyzing these, with most analyzing them based on programming models, less research focuses on understanding their challenges in distributed environ… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  33. arXiv:2403.17828  [pdf, other

    astro-ph.HE

    The Relativistic Spin Precession in the Compact Double Neutron Star System PSR~J1946+2052

    Authors: Lingqi Meng, Weiwei Zhu, Michael Kramer, Xueli Miao, Gregory Desvignes, Lijing Shao, Huanchen Hu, Paulo C. C. Freire, Yongkun Zhang, Mengyao Xue, Ziyao Fang, David J. Champion, Mao Yuan, Chenchen Miao, Jiarui Niu, Qiuyang Fu, Jumei Yao, Yanjun Guo, Chengmin Zhang

    Abstract: We observe systematic profile changes in the visible pulsar of the compact double neutron star system PSR~J1946+2052 using observations with the Five-hundred-meter Aperture Spherical radio Telescope (FAST). The interpulse of PSR~J1946+2052 changed from single-peak to double-peak shape from 2018 to 2021. We attribute this evolution as the result of the relativistic spin precession of the pulsar. Wi… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 12 pages, 9 figures, accepted for publication in ApJ

  34. arXiv:2403.14941  [pdf, other

    cs.LG cs.AI

    Unifying Lane-Level Traffic Prediction from a Graph Structural Perspective: Benchmark and Baseline

    Authors: Shuhao Li, Yue Cui, Jingyi Xu, Libin Li, Lingkai Meng, Weidong Yang, Fan Zhang, Xiaofang Zhou

    Abstract: Traffic prediction has long been a focal and pivotal area in research, witnessing both significant strides from city-level to road-level predictions in recent years. With the advancement of Vehicle-to-Everything (V2X) technologies, autonomous driving, and large-scale models in the traffic domain, lane-level traffic prediction has emerged as an indispensable direction. However, further progress in… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  35. arXiv:2403.10056  [pdf, other

    cs.CL cs.AI

    Don't Half-listen: Capturing Key-part Information in Continual Instruction Tuning

    Authors: Yongquan He, Xuancheng Huang, Minghao Tang, Lingxun Meng, Xiang Li, Wei Lin, Wenyuan Zhang, Yifu Gao

    Abstract: Instruction tuning for large language models (LLMs) can drive them to produce results consistent with human goals in specific downstream tasks. However, the process of continual instruction tuning (CIT) for LLMs may bring about the catastrophic forgetting (CF) problem, where previously learned abilities are degraded. Recent methods try to alleviate the CF problem by modifying models or replaying d… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 18 pages, 4 figures

  36. arXiv:2403.08216  [pdf, other

    cs.LG cs.CV

    PaddingFlow: Improving Normalizing Flows with Padding-Dimensional Noise

    Authors: Qinglong Meng, Chongkun Xia, Xueqian Wang

    Abstract: Normalizing flow is a generative modeling approach with efficient sampling. However, Flow-based models suffer two issues: 1) If the target distribution is manifold, due to the unmatch between the dimensions of the latent target distribution and the data distribution, flow-based models might perform badly. 2) Discrete data might make flow-based models collapse into a degenerate mixture of point mas… ▽ More

    Submitted 23 April, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  37. arXiv:2403.06798  [pdf, other

    eess.IV cs.CV cs.LG

    Dynamic Perturbation-Adaptive Adversarial Training on Medical Image Classification

    Authors: Shuai Li, Xiaoguang Ma, Shancheng Jiang, Lu Meng

    Abstract: Remarkable successes were made in Medical Image Classification (MIC) recently, mainly due to wide applications of convolutional neural networks (CNNs). However, adversarial examples (AEs) exhibited imperceptible similarity with raw data, raising serious concerns on network robustness. Although adversarial training (AT), in responding to malevolent AEs, was recognized as an effective approach to im… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 9 pages, 4 figures, 2 tables

  38. arXiv:2403.03739  [pdf, other

    cs.LG cs.AI

    A&B BNN: Add&Bit-Operation-Only Hardware-Friendly Binary Neural Network

    Authors: Ruichen Ma, Guanchao Qiao, Yian Liu, Liwei Meng, Ning Ning, Yang Liu, Shaogang Hu

    Abstract: Binary neural networks utilize 1-bit quantized weights and activations to reduce both the model's storage demands and computational burden. However, advanced binary architectures still incorporate millions of inefficient and nonhardware-friendly full-precision multiplication operations. A&B BNN is proposed to directly remove part of the multiplication operations in a traditional BNN and replace th… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: CVPR 2024 Accepted

  39. arXiv:2403.01727  [pdf, other

    hep-ph hep-ex hep-lat nucl-th

    Identify the new state $Y(3872)$ as the P-wave $D\bar{D}^*/\bar{D}D^*$ resonance

    Authors: Zi-Yang Lin, Jun-Zhang Wang, Jian-Bo Cheng, Lu Meng, Shi-Lin Zhu

    Abstract: The BESIII Collaboration recently observed a new charmonium-like vector state $Y(3872)$ in $e^+e^-\rightarrow D\bar{D}$, which should be the first P-wave $D\bar{D}^*/\bar{D}D^*$ molecular resonance. The experimental and theoretical identification of the P-wave dimeson state holds paramount importance in enhancing our comprehension of the non-perturbative QCD and few-body physics. Its existence is… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: 9 pages, 7 figures

  40. arXiv:2402.07595  [pdf, other

    eess.IV cs.LG

    Comparative Analysis of ImageNet Pre-Trained Deep Learning Models and DINOv2 in Medical Imaging Classification

    Authors: Yuning Huang, Jingchen Zou, Lanxi Meng, Xin Yue, Qing Zhao, Jianqiang Li, Changwei Song, Gabriel Jimenez, Shaowu Li, Guanghui Fu

    Abstract: Medical image analysis frequently encounters data scarcity challenges. Transfer learning has been effective in addressing this issue while conserving computational resources. The recent advent of foundational models like the DINOv2, which uses the vision transformer architecture, has opened new opportunities in the field and gathered significant interest. However, DINOv2's performance on clinical… ▽ More

    Submitted 13 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  41. arXiv:2402.00534  [pdf, other

    cs.CV cs.LG

    A Manifold Representation of the Key in Vision Transformers

    Authors: Li Meng, Morten Goodwin, Anis Yazidi, Paal Engelstad

    Abstract: Vision Transformers implement multi-head self-attention via stacking multiple attention blocks. The query, key, and value are often intertwined and generated within those blocks via a single, shared linear transformation. This paper explores the concept of disentangling the key from the query and value, and adopting a manifold representation for the key. Our experiments reveal that decoupling and… ▽ More

    Submitted 7 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  42. arXiv:2402.00455  [pdf, ps, other

    cs.IT eess.SP

    New Lower Bounds on Aperiodic Ambiguity Function of Unimodular Sequences

    Authors: Lingsheng Meng, Yong Liang Guan, Yao Ge, Zilong Liu, Pingzhi Fan

    Abstract: This paper presents new aperiodic ambiguity function (AF) lower bounds of unimodular sequences under certain low ambiguity zone. Our key idea, motivated by the Levenshtein correlation bound, is to introduce two weight vectors associated to the delay and Doppler shifts, respectively, and then exploit the upper and lower bounds on the Frobenius norm of the weighted auto- and cross-AF matrices to der… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 5 pages, 1 figure

  43. Smart Fitting Room: A One-stop Framework for Matching-aware Virtual Try-on

    Authors: Mingzhe Yu, Yunshan Ma, Lei Wu, Kai Cheng, Xue Li, Lei Meng, Tat-Seng Chua

    Abstract: The development of virtual try-on has revolutionized online shopping by allowing customers to visualize themselves in various fashion items, thus extending the in-store try-on experience to the cyber space. Although virtual try-on has attracted considerable research initiatives, existing systems only focus on the quality of image generation, overlooking whether the fashion item is a good match to… ▽ More

    Submitted 20 April, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

  44. arXiv:2401.14899  [pdf, other

    hep-ph hep-ex hep-lat nucl-th

    Benchmark calculations of fully heavy compact and molecular tetraquark states

    Authors: Wei-Lin Wu, Yan-Ke Chen, Lu Meng, Shi-Lin Zhu

    Abstract: We calculate the mass spectrum of the S-wave fully heavy tetraquark systems $ QQ\bar Q\bar Q~(Q=c,b) $ with both normal $ (J^{PC}=0^{++},1^{+-},2^{++}) $ and exotic $ (J^{PC}=0^{+-},1^{++},2^{+-}) $ C-parities using three different quark potential models (AL1, AP1, BGS). The exotic C-parity systems refer to the ones that cannot be composed of two S-wave ground heavy quarkonia. We incorporate the m… ▽ More

    Submitted 26 March, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: 16 pages, 6 figures, 10 tables. Version accepted by PRD

    Journal ref: Phys. Rev. D 109, 054034 (2024)

  45. arXiv:2401.14664  [pdf, other

    cs.SD cs.CL eess.AS

    UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization

    Authors: Yuejiao Wang, Xixin Wu, Disong Wang, Lingwei Meng, Helen Meng

    Abstract: Dysarthric speech reconstruction (DSR) systems aim to automatically convert dysarthric speech into normal-sounding speech. The technology eases communication with speakers affected by the neuromotor disorder and enhances their social inclusion. NED-based (Neural Encoder-Decoder) systems have significantly improved the intelligibility of the reconstructed speech as compared with GAN-based (Generati… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: Accepted to ICASSP 2024

  46. arXiv:2401.11482  [pdf, ps, other

    gr-qc

    Destroying the Event Horizon of Cold Dark Matter-Black Hole System

    Authors: Liping Meng, Zhaoyi Xu, Meirong Tang

    Abstract: Since the Weak Cosmic Censorship Conjecture was proposed, research on this conjecture has been ongoing. This paper explores the conjecture in black holes that are closer to those existing in the real universe (i.e., rotating black holes enveloped by dark matter). We conduct our study by introducing a test particle and a scalar field into the black hole. Our conclusions show that, in extremal case,… ▽ More

    Submitted 25 June, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

  47. arXiv:2401.09819  [pdf, other

    cs.RO cs.AI cs.LG

    PPNet: A Two-Stage Neural Network for End-to-end Path Planning

    Authors: Qinglong Meng, Chongkun Xia, Xueqian Wang, Songping Mai, Bin Liang

    Abstract: The classical path planners, such as sampling-based path planners, can provide probabilistic completeness guarantees in the sense that the probability that the planner fails to return a solution if one exists, decays to zero as the number of samples approaches infinity. However, finding a near-optimal feasible solution in a given period is challenging in many applications such as the autonomous ve… ▽ More

    Submitted 23 April, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  48. arXiv:2401.08433  [pdf, other

    cs.RO

    Autonomous Multiple-Trolley Collection System with Nonholonomic Robots: Design, Control, and Implementation

    Authors: Peijia Xie, Bingyi Xia, Anjun Hu, Ziqi Zhao, Lingxiao Meng, Zhirui Sun, Xuheng Gao, Jiankun Wang, Max Q. -H. Meng

    Abstract: The intricate and multi-stage task in dynamic public spaces like luggage trolley collection in airports presents both a promising opportunity and an ongoing challenge for automated service robots. Previous research has primarily focused on handling a single trolley or individual functional components, creating a gap in providing cost-effective and efficient solutions for practical scenarios. In th… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  49. arXiv:2401.07382  [pdf, other

    cs.CL cs.AI

    Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation

    Authors: Meng Cao, Lei Shu, Lei Yu, Yun Zhu, Nevan Wichers, Yinxiao Liu, Lei Meng

    Abstract: Reinforcement learning (RL) can align language models with non-differentiable reward signals, such as human preferences. However, a major challenge arises from the sparsity of these reward signals - typically, there is only a single reward for an entire output. This sparsity of rewards can lead to inefficient and unstable learning. To address this challenge, our paper introduces an novel framework… ▽ More

    Submitted 19 February, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

  50. arXiv:2401.04152  [pdf, other

    cs.SD cs.AI cs.CL eess.AS

    Cross-Speaker Encoding Network for Multi-Talker Speech Recognition

    Authors: Jiawen Kang, Lingwei Meng, Mingyu Cui, Haohan Guo, Xixin Wu, Xunying Liu, Helen Meng

    Abstract: End-to-end multi-talker speech recognition has garnered great interest as an effective approach to directly transcribe overlapped speech from multiple speakers. Current methods typically adopt either 1) single-input multiple-output (SIMO) models with a branched encoder, or 2) single-input single-output (SISO) models based on attention-based encoder-decoder architecture with serialized output train… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: Accepted by ICASSP2024