Showing 1–50 of 11,970 results for author: Li, X

  1. arXiv:2407.09278  [pdf, ps, other]

    math-ph math.DS math.SP

    Exact local distribution of the absolutely continuous spectral measure

    Authors: Xianzhe Li, Jiangong You, Qi Zhou

    Abstract: It is well-established that the spectral measure for one-frequency Schrödinger operators with Diophantine frequencies exhibits optimal $1/2$-Hölder continuity within the absolutely continuous spectrum. This study extends these findings by precisely characterizing the local distribution of the spectral measure for dense small potentials, including a notable result for any subcritical almost Mathieu…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 49 pages

  2. arXiv:2407.09227  [pdf, other]

    cond-mat.quant-gas quant-ph

    Stability and decay of subradiant patterns in a quantum gas with photon-mediated interactions

    Authors: Alexander Baumgärtner, Simon Hertlein, Tom Schmit, Davide Dreon, Carlos Máximo, Xiangliang Li, Giovanna Morigi, Tobias Donner

    Abstract: The phenomenon of subradiance, marked by its surprising suppression of spontaneous emission, challenges conventional expectations of the collective behavior of scatterers. We study subradiance in the experimental setting of a Bose-Einstein condensate positioned at the mode crossing of two optical cavities. In this setup, subradiance manifests in the form of metastable density structures that suppr…

    Submitted 12 July, 2024; originally announced July 2024.

  3. arXiv:2407.09139  [pdf, other]

    hep-ex

    Measurement of $CP$ asymmetries in $B^0 \to K^0_S π^0 γ$ decays at Belle II

    Authors: Belle II Collaboration, I. Adachi, L. Aggarwal, H. Ahmed, H. Aihara, N. Akopov, A. Aloisio, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien, F. Becherer, et al. (414 additional authors not shown)

    Abstract: We report measurements of time-dependent $CP$ asymmetries in $B^0 \to K^0_S π^0 γ$ decays based on a data sample of $(388\pm6)\times10^6$ $B\bar{B}$ events collected at the $Υ(4S)$ resonance with the Belle II detector. The Belle II experiment operates at the SuperKEKB asymmetric-energy $e^+e^-$ collider. We measure decay-time distributions to determine $CP$-violating parameters $S$ and $C$. We det…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 10 pages, 4 figures

    Report number: Belle II Preprint 2024-009, KEK Preprint 2024-1

  4. arXiv:2407.09088  [pdf, other]

    eess.IV cs.AI cs.CV

    FD-SOS: Vision-Language Open-Set Detectors for Bone Fenestration and Dehiscence Detection from Intraoral Images

    Authors: Marawan Elbatel, Keyuan Liu, Yanqi Yang, Xiaomeng Li

    Abstract: Accurate detection of bone fenestration and dehiscence (FD) is crucial for effective treatment planning in dentistry. While cone-beam computed tomography (CBCT) is the gold standard for evaluating FD, it comes with limitations such as radiation exposure, limited accessibility, and higher cost compared to intraoral images. In intraoral images, dentists face challenges in the differential diagnosis…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: MICCAI 2024

  5. arXiv:2407.08944  [pdf, other]

    cs.CV eess.IV

    Bora: Biomedical Generalist Video Generation Model

    Authors: Weixiang Sun, Xiaocao You, Ruizhe Zheng, Zhengqing Yuan, Xiang Li, Lifang He, Quanzheng Li, Lichao Sun

    Abstract: Generative models hold promise for revolutionizing medical education, robot-assisted surgery, and data augmentation for medical AI development. Diffusion models can now generate realistic images from text prompts, while recent advancements have demonstrated their ability to create diverse, high-quality videos. However, these models often struggle with generating accurate representations of medical…

    Submitted 11 July, 2024; originally announced July 2024.

  6. arXiv:2407.08928  [pdf, ps, other]

    math.AP

    Dynamics for a diffusive epidemic model with a free boundary: spreading speed

    Authors: Xueping Li, Lei Li, Mingxin Wang

    Abstract: We study the spreading speed of a diffusive epidemic model proposed by Li et al. \cite{LL}, where the Stefan boundary condition is imposed at the right boundary, and the left boundary is subject to the homogeneous Dirichlet and Neumann condition, respectively. A spreading-vanishing dichotomy and some sharp criteria were obtained in \cite{LL}. In this paper, when spreading happens, we not only obta…

    Submitted 11 July, 2024; originally announced July 2024.

  7. arXiv:2407.08903  [pdf, other]

    cs.CR cs.AI cs.AR

    TensorTEE: Unifying Heterogeneous TEE Granularity for Efficient Secure Collaborative Tensor Computing

    Authors: Husheng Han, Xinyao Zheng, Yuanbo Wen, Yifan Hao, Erhu Feng, Ling Liang, Jianan Mu, Xiaqing Li, Tianyun Ma, Pengwei Jin, Xinkai Song, Zidong Du, Qi Guo, Xing Hu

    Abstract: Heterogeneous collaborative computing with NPU and CPU has received widespread attention due to its substantial performance benefits. To ensure data confidentiality and integrity during computing, Trusted Execution Environments (TEE) is considered a promising solution because of its comparatively lower overhead. However, existing heterogeneous TEE designs are inefficient for collaborative computin…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted by ASPLOS 2024

  8. arXiv:2407.08661  [pdf, other]

    cond-mat.str-el cond-mat.mes-hall

    Self-consistent theory for the fractional quantum anomalous Hall effect in rhombohedral pentalayer graphene

    Authors: Ke Huang, Xiao Li, Sankar Das Sarma, Fan Zhang

    Abstract: The fractional quantum anomalous Hall (FQAH) effect in rhombohedral pentalayer graphene (PLG) has attracted significant attention due to its potential for observing exotic quantum states. In this work, we present a self-consistent Hartree-Fock theory for the FQAH effect in rhombohedral PLG. In particular, we focus on the convergence of the Hartree-Fock calculation with various reference fields and…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 18 pages, 12 figures. Comments are welcome

  9. arXiv:2407.08516  [pdf, other]

    cs.AI

    Converging Paradigms: The Synergy of Symbolic and Connectionist AI in LLM-Empowered Autonomous Agents

    Authors: Haoyi Xiong, Zhiyuan Wang, Xuhong Li, Jiang Bian, Zeke Xie, Shahid Mumtaz, Laura E. Barnes

    Abstract: This article explores the convergence of connectionist and symbolic artificial intelligence (AI), from historical debates to contemporary advancements. Traditionally considered distinct paradigms, connectionist AI focuses on neural networks, while symbolic AI emphasizes symbolic representation and logic. Recent advancements in large language models (LLMs), exemplified by ChatGPT and GPT-4, highlig…

    Submitted 11 July, 2024; originally announced July 2024.

  10. arXiv:2407.08468  [pdf, other]

    math.ST

    Matching-Based Policy Learning

    Authors: Xuqiao Li, Ying Yan

    Abstract: Treatment heterogeneity is ubiquitous in many areas, motivating practitioners to search for the optimal policy that maximizes the expected outcome based on individualized characteristics. However, most existing policy learning methods rely on weighting-based approaches, which may suffer from high instability in observational studies. To enhance the robustness of the estimated policy, we propose a…

    Submitted 11 July, 2024; originally announced July 2024.

  11. arXiv:2407.08351  [pdf, other]

    cs.CL cs.LG

    AutoBencher: Creating Salient, Novel, Difficult Datasets for Language Models

    Authors: Xiang Lisa Li, Evan Zheran Liu, Percy Liang, Tatsunori Hashimoto

    Abstract: Evaluation is critical for assessing capabilities, tracking scientific progress, and informing model selection. In this paper, we present three desiderata for a good benchmark for language models: (i) salience (e.g., knowledge about World War II is more salient than a random day in history), (ii) novelty (i.e., the benchmark reveals new trends in model rankings not shown by previous benchmarks), a…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: preprint

  12. arXiv:2407.08303  [pdf, other]

    cs.CV cs.AI

    DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception

    Authors: Xiaotong Li, Fan Zhang, Haiwen Diao, Yueze Wang, Xinlong Wang, Ling-Yu Duan

    Abstract: Existing Multimodal Large Language Models (MLLMs) increasingly emphasize complex understanding of various visual elements, including multiple objects, text information, and spatial relations. Their development for comprehensive visual perception hinges on the availability of high-quality image-text datasets that offer diverse visual elements and thorough image descriptions. However, the scarcity…

    Submitted 11 July, 2024; originally announced July 2024.

  13. arXiv:2407.08273   

    cs.CL

    RB-SQL: A Retrieval-based LLM Framework for Text-to-SQL

    Authors: Zhenhe Wu, Zhongqiu Li, Jie Zhang, Mengxiang Li, Yu Zhao, Ruiyu Fang, Zhongjiang He, Xuelong Li, Zhoujun Li, Shuangyong Song

    Abstract: Large language models (LLMs) with in-context learning have significantly improved performance on the text-to-SQL task. Previous works generally focus on using an exclusive SQL generation prompt to improve the LLMs' reasoning ability. However, they mostly struggle to handle large databases with numerous tables and columns, and usually ignore the significance of pre-processing database and extracting v…

    Submitted 12 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: Further improvement and modification are needed.

  14. arXiv:2407.08154  [pdf, other]

    cs.CE

    Bayesian uncertainty analysis for underwater 3D reconstruction with neural radiance fields

    Authors: Haojie Lian, Xinhao Li, Yilin Qu, Jing Du, Zhuxuan Meng, Jie Liu, Leilei Chen

    Abstract: Neural radiance fields (NeRFs) are a deep learning technique that can generate novel views of 3D scenes using sparse 2D images from different viewing directions and camera poses. As an extension of conventional NeRFs to underwater environments, where light can get absorbed and scattered by water, SeaThru-NeRF was proposed to separate the clean appearance and geometric structure of underwater scene…

    Submitted 10 July, 2024; originally announced July 2024.

  15. arXiv:2407.08101  [pdf, other]

    cs.CV

    Live Fitness Coaching as a Testbed for Situated Interaction

    Authors: Sunny Panchal, Apratim Bhattacharyya, Guillaume Berger, Antoine Mercier, Cornelius Bohm, Florian Dietrichkeit, Reza Pourreza, Xuanlin Li, Pulkit Madan, Mingu Lee, Mark Todorovich, Ingo Bax, Roland Memisevic

    Abstract: Tasks at the intersection of vision and language have had a profound impact in advancing the capabilities of vision-language models such as dialog-based assistants. However, models trained on existing tasks are largely limited to turn-based interactions, where each turn must be stepped (i.e., prompted) by the user. Open-ended, asynchronous interactions where an AI model may proactively deliver tim…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: The benchmark and dataset are available here: https://developer.qualcomm.com/software/ai-datasets/qevd

  16. arXiv:2407.08031  [pdf, ps, other]

    math.DG

    Coarse extrinsic curvature of Riemannian submanifolds

    Authors: Marc Arnaudon, Xue-Mei Li, Benedikt Petko

    Abstract: Inspired by Y. Ollivier's coarse Ricci curvature, we introduce a novel concept of coarse extrinsic curvature on Riemannian submanifolds. This is defined through Wasserstein distances between test probability measures supported in the tubular neighbourhood of the submanifold. This framework provides an understanding of the geometric properties of embeddings, offering valuable insights into their cu…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 52 pages, 6 figures

    MSC Class: 53B25 (Primary) 49Q22 (Secondary)

  17. arXiv:2407.07863  [pdf, other]

    astro-ph.IM physics.space-ph

    Intensity-sensitive quality assessment of extended sources in astronomical images

    Authors: X. Li, K. Adamek, W. Armour

    Abstract: Radio astronomy studies the Universe by observing the radio emissions of celestial bodies. Different methods can be used to recover the sky brightness distribution (SBD), which describes the distribution of celestial sources from recorded data, with the output dependent on the method used. Image quality assessment (IQA) indexes can be used to compare the differences between restored SBDs produced…

    Submitted 10 July, 2024; originally announced July 2024.

  18. arXiv:2407.07807  [pdf, other]

    astro-ph.HE

    Revisiting the dead time effects of Insight-HXMT/ME on timing analysis

    Authors: Youli Tuo, Xiaobo Li, Ying Tan, Baiyang Wu, Weichun Jiang, Liming Song, Jinlu Qu, Sudeep Gogate, Shuang-Nan Zhang, Andrea Santangelo

    Abstract: Dead time is a common instrumental effect of X-ray detectors which would alter the behavior of timing properties of astronomical signals, such as distorting the shape of power density spectra (PDS), affecting the root-mean-square of potential quasi-periodic oscillation signals, etc. We revisit the effects of the dead time of the Medium Energy X-ray telescope (ME) onboard Insight-HXMT, based on the sim…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 9 pages, 8 figures, accepted for publication in MNRAS main journal

  19. arXiv:2407.07763  [pdf, other]

    cs.CV

    S&D Messenger: Exchanging Semantic and Domain Knowledge for Generic Semi-Supervised Medical Image Segmentation

    Authors: Qixiang Zhang, Haonan Wang, Xiaomeng Li

    Abstract: Semi-supervised medical image segmentation (SSMIS) has emerged as a promising solution to tackle the challenges of time-consuming manual labeling in the medical field. However, in practical scenarios, there are often domain variations within the datasets, leading to derivative scenarios like semi-supervised medical domain generalization (Semi-MDG) and unsupervised medical domain adaptation (UMDA).…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 10 pages, under review at IEEE Transactions on Medical Imaging

  20. arXiv:2407.07760  [pdf, other]

    cs.CV cs.AI

    Learning Spatial-Semantic Features for Robust Video Object Segmentation

    Authors: Xin Li, Deshui Miao, Zhenyu He, Yaowei Wang, Huchuan Lu, Ming-Hsuan Yang

    Abstract: Tracking and segmenting multiple similar objects with complex or separate parts in long-term videos is inherently challenging due to the ambiguity of target parts and identity confusion caused by occlusion, background clutter, and long-term variations. In this paper, we propose a robust video object segmentation framework equipped with spatial-semantic features and discriminative object queries to…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Winning solution of the VOTS2024 Challenge

  21. arXiv:2407.07651  [pdf, other]

    hep-ex physics.data-an

    Study of the decay and production properties of $D_{s1}(2536)$ and $D_{s2}^*(2573)$

    Authors: M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, et al. (645 additional authors not shown)

    Abstract: The $e^+e^-\rightarrow D_s^+D_{s1}(2536)^-$ and $e^+e^-\rightarrow D_s^+D^*_{s2}(2573)^-$ processes are studied using data samples collected with the BESIII detector at center-of-mass energies from 4.530 to 4.946~GeV. The absolute branching fractions of $D_{s1}(2536)^- \rightarrow \bar{D}^{*0}K^-$ and $D_{s2}^*(2573)^- \rightarrow \bar{D}^0K^-$ are measured for the first time to be…

    Submitted 10 July, 2024; originally announced July 2024.

  22. arXiv:2407.07478  [pdf, other]

    cs.CV

    EA-VTR: Event-Aware Video-Text Retrieval

    Authors: Zongyang Ma, Ziqi Zhang, Yuxin Chen, Zhongang Qi, Chunfeng Yuan, Bing Li, Yingmin Luo, Xu Li, Xiaojuan Qi, Ying Shan, Weiming Hu

    Abstract: Understanding the content of events occurring in the video and their inherent temporal logic is crucial for video-text retrieval. However, web-crawled pre-training datasets often lack sufficient event information, and the widely adopted video-level cross-modal contrastive learning also struggles to capture detailed and complex video-text event alignment. To address these challenges, we make improv…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV 2024

  23. arXiv:2407.07436  [pdf, other]

    cs.IT math.OC

    Alternating Subspace Approximate Message Passing

    Authors: Xu Zhu, Yufei Ma, Xiaoguang Li, Tiejun Li

    Abstract: Numerous renowned algorithms for tackling the compressed sensing problem employ an alternating strategy, which typically involves data matching in one module and denoising in another. Based on an in-depth analysis of the connection between the message passing and operator splitting, we present a novel approach, the Alternating Subspace Method (ASM), which intuitively combines the principles of the…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 19 pages, 6 figures

    MSC Class: 94A12; 65F10; 90C06

  24. arXiv:2407.07397  [pdf, other]

    cs.SD eess.AS

    SimuSOE: A Simulated Snoring Dataset for Obstructive Sleep Apnea-Hypopnea Syndrome Evaluation during Wakefulness

    Authors: Jie Lin, Xiuping Yang, Li Xiao, Xinhong Li, Weiyan Yi, Yuhong Yang, Weiping Tu, Xiong Chen

    Abstract: Obstructive Sleep Apnea-Hypopnea Syndrome (OSAHS) is a prevalent chronic breathing disorder caused by upper airway obstruction. Previous studies advanced OSAHS evaluation through machine learning-based systems trained on sleep snoring or speech signal datasets. However, constructing datasets for training a precise and rapid OSAHS evaluation system poses a challenge, since 1) it is time-consuming t…

    Submitted 10 July, 2024; originally announced July 2024.

  25. arXiv:2407.06985  [pdf, other]

    cs.AI

    PEER: Expertizing Domain-Specific Tasks with a Multi-Agent Framework and Tuning Methods

    Authors: Yiying Wang, Xiaojing Li, Binzhu Wang, Yueyang Zhou, Han Ji, Hong Chen, Jinshi Zhang, Fei Yu, Zewei Zhao, Song Jin, Renji Gong, Wanqing Xu

    Abstract: In domain-specific applications, GPT-4, augmented with precise prompts or Retrieval-Augmented Generation (RAG), shows notable potential but faces the critical tri-lemma of performance, cost, and data privacy. High performance requires sophisticated processing techniques, yet managing multiple agents within a complex workflow often proves costly and challenging. To address this, we introduce the PE…

    Submitted 9 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

  26. arXiv:2407.06880  [pdf, other]

    math.AP

    Sharp non-uniqueness for the 2D hyper-dissipative Navier-Stokes equations

    Authors: Lili Du, Xinliang Li

    Abstract: In this article, we study the non-uniqueness of weak solutions for the two-dimensional hyper-dissipative Navier-Stokes equations in the super-critical spaces $L_{t}^γW_{x}^{s,p}$ when $α\in[1,\frac{3}{2})$, and obtain the conclusion that the non-uniqueness of the weak solutions at the two endpoints is sharp in view of the generalized Ladyženskaja-Prodi-Serrin condition with the triplet…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 61 pages, 2 figures. arXiv admin note: text overlap with arXiv:2405.20754

    MSC Class: 35A02; 35D30; 76D05

  27. arXiv:2407.06833  [pdf, other]

    q-bio.QM cs.CV eess.IV

    Training-free CryoET Tomogram Segmentation

    Authors: Yizhou Zhao, Hengwei Bian, Michael Mu, Mostofa R. Uddin, Zhenyang Li, Xiang Li, Tianyang Wang, Min Xu

    Abstract: Cryogenic Electron Tomography (CryoET) is a useful imaging technology in structural biology that is hindered by its need for manual annotations, especially in particle picking. Recent works have endeavored to remedy this issue with few-shot learning or contrastive learning techniques. However, supervised training is still inevitable for them. We instead choose to leverage the power of existing 2D…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: This preprint has not undergone peer review or any post-submission improvements or corrections. The Version of Record of this contribution will be published in MICCAI 2024

  28. arXiv:2407.06735  [pdf, ps, other]

    math.AP

    Existence of positive solutions for Kirchhoff type problems with critical exponent in exterior domains

    Authors: Liqian Jia, Xinfu Li, Shiwang Ma

    Abstract: In this paper, by using variational methods we study the existence of positive solutions for the following Kirchhoff type problem: $$ \left\{ \begin{array}{ll} -\left(a+b\mathlarger{\int}_Ω|\nabla u|^{2}dx\right)Δu+V(x)u=u^{5}, \ & x\inΩ,\\ \\ u=0,\ & x\in\partial Ω, \end{array}\right. $$ where $a>0$, $b\geq0$, $Ω\subset\mathbb R^3$ is an unbounded exterior domain, $\partialΩ\neq\emptyset$,…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 36 Pages

  29. arXiv:2407.06491  [pdf, other]

    cs.CV

    VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model

    Authors: Xinhao Li, Zhenpeng Huang, Jing Wang, Kunchang Li, Limin Wang

    Abstract: With the growth of high-quality data and advancement in visual pre-training paradigms, Video Foundation Models (VFMs) have made significant progress recently, demonstrating their remarkable performance on traditional video understanding benchmarks. However, the existing benchmarks (e.g. Kinetics) and their evaluation protocols are often limited by relatively poor diversity, high evaluation costs,…

    Submitted 8 July, 2024; originally announced July 2024.

  30. arXiv:2407.06309  [pdf, other]

    cs.CY cs.AI

    Multimodal Chain-of-Thought Reasoning via ChatGPT to Protect Children from Age-Inappropriate Apps

    Authors: Chuanbo Hu, Bin Liu, Minglei Yin, Yilu Zhou, Xin Li

    Abstract: Mobile applications (Apps) could expose children to inappropriate themes such as sexual content, violence, and drug use. Maturity rating offers a quick and effective method for potential users, particularly guardians, to assess the maturity levels of apps. Determining accurate maturity ratings for mobile apps is essential to protect children's health in today's saturated digital marketplace. Exist…

    Submitted 8 July, 2024; originally announced July 2024.

  31. arXiv:2407.06253  [pdf, other]

    cond-mat.dis-nn quant-ph

    Unsupervised machine learning for detecting mutual independence among eigenstate regimes in interacting quasiperiodic chains

    Authors: Colin Beveridge, Cassio Rodrigo Cristani, Xiao Li, Enrico Barbierato, Yi-Ting Hsu

    Abstract: Many-body eigenstates that are neither thermal nor many-body-localized (MBL) were numerically found in certain interacting chains with moderate quasiperiodic potentials. The energy regime consisting of these non-ergodic but extended (NEE) eigenstates has been extensively studied for being a possible many-body mobility edge between the energy-resolved MBL and thermal phases. Recently, the NEE regim…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 6 pages, 5 figures

  32. arXiv:2407.06199  [pdf]

    cs.DB

    Data Governance and Data Management in Operations and Supply Chain: A Literature Review

    Authors: Xuejiao Li, Yang Cheng, Xiaoning Xia, Charles Møller

    Abstract: In the dynamic landscape of contemporary business, the wave in data and technological advancements has directed companies toward embracing data-driven decision-making processes. Despite the vast potential that data holds for strategic insights and operational efficiencies, substantial challenges arise in the form of data issues. Recognizing these obstacles, the imperative for effective data govern…

    Submitted 21 June, 2024; originally announced July 2024.

  33. arXiv:2407.06159  [pdf, other]

    cs.CV

    A Semantic-Aware and Multi-Guided Network for Infrared-Visible Image Fusion

    Authors: Xiaoli Zhang, Liying Wang, Libo Zhao, Xiongfei Li, Siwei Ma

    Abstract: Multi-modality image fusion aims at fusing specific-modality and shared-modality information from two source images. To tackle the problem of insufficient feature extraction and lack of semantic awareness for complex scenes, this paper focuses on how to model correlation-driven decomposing features and reason high-level graph representation by efficiently extracting complementary features and mult…

    Submitted 11 June, 2024; originally announced July 2024.

  34. arXiv:2407.06136  [pdf, other]

    cs.CV

    Mamba-FSCIL: Dynamic Adaptation with Selective State Space Model for Few-Shot Class-Incremental Learning

    Authors: Xiaojie Li, Yibo Yang, Jianlong Wu, Bernard Ghanem, Liqiang Nie, Min Zhang

    Abstract: Few-shot class-incremental learning (FSCIL) confronts the challenge of integrating new classes into a model with minimal training samples while preserving the knowledge of previously learned classes. Traditional methods widely adopt static adaptation relying on a fixed parameter space to learn from data that arrive sequentially, prone to overfitting to the current session. Existing dynamic strateg…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Code: https://github.com/xiaojieli0903/Mamba-FSCIL

  35. arXiv:2407.05963  [pdf, ps, other]

    cs.SE cs.AI cs.NI cs.SI

    6GSoft: Software for Edge-to-Cloud Continuum

    Authors: Muhammad Azeem Akbar, Matteo Esposito, Sami Hyrynsalmi, Karthikeyan Dinesh Kumar, Valentina Lenarduzzi, Xiaozhou Li, Ali Mehraj, Tommi Mikkonen, Sergio Moreschini, Niko Mäkitalo, Markku Oivo, Anna-Sofia Paavonen, Risha Parveen, Kari Smolander, Ruoyu Su, Kari Systä, Davide Taibi, Nan Yang, Zheying Zhang, Muhammad Zohaib

    Abstract: In the era of 6G, developing and managing software requires cutting-edge software engineering (SE) theories and practices tailored for such complexity across a vast number of connected edge devices. Our project aims to lead the development of sustainable methods and energy-efficient orchestration models specifically for edge environments, enhancing architectural support driven by AI for contempora…

    Submitted 9 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

  36. arXiv:2407.05905  [pdf, other]

    eess.SP

    Deep Learning-based CSI Feedback in Wi-Fi Systems

    Authors: Fan Qi, Jiajia Guo, Yiming Cui, Xiangyi Li, Chao-Kai Wen, Shi Jin

    Abstract: In Wi-Fi systems, channel state information (CSI) plays a crucial role in enabling access points to execute beamforming operations. However, the feedback overhead associated with CSI significantly hampers the throughput improvements. Recent advancements in deep learning (DL) have transformed the approach to CSI feedback in cellular systems. Drawing inspiration from the successes witnessed in the r…

    Submitted 8 July, 2024; originally announced July 2024.

  37. arXiv:2407.05609  [pdf, other]

    cs.CL

    Open-world Multi-label Text Classification with Extremely Weak Supervision

    Authors: Xintong Li, Jinya Jiang, Ria Dharmani, Jayanth Srinivasa, Gaowen Liu, Jingbo Shang

    Abstract: We study open-world multi-label text classification under extremely weak supervision (XWS), where the user only provides a brief description for classification objectives without any labels or ground-truth label space. Similar single-label XWS settings have been explored recently, however, these methods cannot be easily adapted for multi-label. We observe that (1) most documents have a dominant cl…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Preprint

  38. arXiv:2407.05455  [pdf, other]

    cond-mat.str-el cond-mat.quant-gas cond-mat.stat-mech quant-ph

    Quantum Supercriticality in the Ising Model and Rydberg Atom Array

    Authors: Junsen Wang, Enze Lv, Xinyang Li, Yuliang Jin, Wei Li

    Abstract: Supercriticality, featured with universal scaling behaviors, emerges as an intriguing phenomenon proximate to the classical liquid-gas critical point. In this study, we extend this significant concept to quantum many-body systems near the quantum critical point (QCP), employing tensor network calculations and scaling analyses of the Ising model and Rydberg atom array. The supercritical, fluid-like…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 8 pages, 5 figures (SM 5 pages, 6 figures)

  39. arXiv:2407.05361  [pdf, other]

    eess.AS cs.CL

    Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation

    Authors: Haorui He, Zengqiang Shang, Chaoren Wang, Xuyuan Li, Yicheng Gu, Hua Hua, Liwei Liu, Chen Yang, Jiaqi Li, Peiyang Shi, Yuancheng Wang, Kai Chen, Pengyuan Zhang, Zhizheng Wu

    Abstract: Recently, speech generation models have made significant progress by using large-scale training data. However, the research community struggles to produce highly spontaneous and human-like speech due to the lack of large-scale, diverse, and spontaneous speech data. This paper presents \textit{Emilia}, the first multilingual speech generation dataset from in-the-wild speech data, and Emilia-Pipe, th…

    Submitted 7 July, 2024; originally announced July 2024.

  40. arXiv:2407.05342  [pdf, other]

    cs.CV

    Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models

    Authors: Longxiang Tang, Zhuotao Tian, Kai Li, Chunming He, Hantao Zhou, Hengshuang Zhao, Xiu Li, Jiaya Jia

    Abstract: This study addresses the Domain-Class Incremental Learning problem, a realistic but challenging continual learning scenario where both the domain distribution and target classes vary across tasks. To handle these diverse tasks, pre-trained Vision-Language Models (VLMs) are introduced for their strong generalizability. However, this incurs a new problem: the knowledge encoded in the pre-trained VLM…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  41. arXiv:2407.05320  [pdf, other]

    cs.AI

    KAE: A Property-based Method for Knowledge Graph Alignment and Extension

    Authors: Daqian Shi, Xiaoyue Li, Fausto Giunchiglia

    Abstract: A common solution to the semantic heterogeneity problem is to perform knowledge graph (KG) extension exploiting the information encoded in one or more candidate KGs, where the alignment between the reference KG and candidate KGs is considered the critical procedure. However, existing KG alignment methods mainly rely on entity type (etype) label matching as a prerequisite, which is poorly performin…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2405.02463

  42. arXiv:2407.05286  [pdf, other]

    cs.LG math.OC

    Stability and Generalization for Stochastic Recursive Momentum-based Algorithms for (Strongly-)Convex One to $K$-Level Stochastic Optimizations

    Authors: Xiaokang Pan, Xingyu Li, Jin Liu, Tao Sun, Kai Sun, Lixing Chen, Zhe Qu

    Abstract: STOchastic Recursive Momentum (STORM)-based algorithms have been widely developed to solve one to $K$-level ($K \geq 3$) stochastic optimization problems. Specifically, they use estimators to mitigate the biased gradient issue and achieve near-optimal convergence results. However, there is relatively little work on understanding their generalization performance, particularly evident during the tra… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  43. arXiv:2407.05276  [pdf, other]

    cs.DC

    BFLN: A Blockchain-based Federated Learning Model for Non-IID Data

    Authors: Yang Li, Chunhe Xia, Dongchi Huang, Xiaojian Li, Tianbo Wang

    Abstract: As the application of federated learning becomes increasingly widespread, the issue of imbalanced training data distribution has emerged as a significant challenge. Federated learning utilizes local data stored on different training clients for model training, rather than centralizing data on a server, thereby greatly enhancing the privacy and security of training data. However, the distribution o… ▽ More

    Submitted 10 July, 2024; v1 submitted 7 July, 2024; originally announced July 2024.

  44. arXiv:2407.05013  [pdf, other]

    cs.CL

    Progress or Regress? Self-Improvement Reversal in Post-training

    Authors: Ting Wu, Xuefeng Li, Pengfei Liu

    Abstract: Self-improvement through post-training methods such as iterative preference learning has been acclaimed for enhancing the problem-solving capabilities (e.g., mathematical reasoning) of Large Language Models (LLMs) without human intervention. However, as exploration deepens, it becomes crucial to assess whether these improvements genuinely signify progress in solving more challenging problems or if… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  45. arXiv:2407.04697  [pdf, other]

    cs.CV cs.MM

    VCoME: Verbal Video Composition with Multimodal Editing Effects

    Authors: Weibo Gong, Xiaojie Jin, Xin Li, Dongliang He, Xinglong Wu

    Abstract: Verbal videos, featuring voice-overs or text overlays, provide valuable content but present significant challenges in composition, especially when incorporating editing effects to enhance clarity and visual appeal. In this paper, we introduce the novel task of verbal video composition with editing effects. This task aims to generate coherent and visually appealing verbal videos by integrating mult… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  46. arXiv:2407.04675  [pdf, other]

    eess.AS cs.SD

    Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition

    Authors: Ye Bai, Jingping Chen, Jitong Chen, Wei Chen, Zhuo Chen, Chuang Ding, Linhao Dong, Qianqian Dong, Yujiao Du, Kepan Gao, Lu Gao, Yi Guo, Minglun Han, Ting Han, Wenchao Hu, Xinying Hu, Yuxiang Hu, Deyu Hua, Lu Huang, Mingkun Huang, Youjia Huang, Jishuo Jin, Fanliu Kong, Zongwei Lan, Tianyu Li , et al. (30 additional authors not shown)

    Abstract: A modern automatic speech recognition (ASR) model is required to accurately transcribe diverse speech signals (from different domains, languages, accents, etc.) given the specific contextual information in various application scenarios. Classic end-to-end models fused with extra language models perform well, but mainly in data-matching scenarios, and are gradually approaching a bottleneck. In this wor… ▽ More

    Submitted 10 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

  47. arXiv:2407.04663  [pdf, other]

    cs.CV cs.LG

    Unsupervised 4D Cardiac Motion Tracking with Spatiotemporal Optical Flow Networks

    Authors: Long Teng, Wei Feng, Menglong Zhu, Xinchao Li

    Abstract: Cardiac motion tracking from echocardiography can be used to estimate and quantify myocardial motion within a cardiac cycle. It is a cost-efficient and effective approach for assessing myocardial function. However, ultrasound imaging is inherently characterized by low spatial resolution and temporally random noise, which make it difficult to obtain reliable annotations. Thus it is dif… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  48. arXiv:2407.04620  [pdf, other]

    cs.LG cs.AI cs.CL

    Learning to (Learn at Test Time): RNNs with Expressive Hidden States

    Authors: Yu Sun, Xinhao Li, Karan Dalal, Jiarui Xu, Arjun Vikram, Genghan Zhang, Yann Dubois, Xinlei Chen, Xiaolong Wang, Sanmi Koyejo, Tatsunori Hashimoto, Carlos Guestrin

    Abstract: Self-attention performs well in long context but has quadratic complexity. Existing RNN layers have linear complexity, but their performance in long context is limited by the expressive power of their hidden state. We propose a new class of sequence modeling layers with linear complexity and an expressive hidden state. The key idea is to make the hidden state a machine learning model itself, and t… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  49. Exploration of Class Center for Fine-Grained Visual Classification

    Authors: Hang Yao, Qiguang Miao, Peipei Zhao, Chaoneng Li, Xin Li, Guanwen Feng, Ruyi Liu

    Abstract: Different from large-scale classification tasks, fine-grained visual classification is a challenging task due to two critical problems: 1) evident intra-class variances and subtle inter-class differences, and 2) overfitting owing to fewer training samples in datasets. Most existing methods extract key features to reduce intra-class variances, but pay no attention to subtle inter-class differences… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Accepted by TCSVT. Code and trained models are here: https://github.com/hyao1/ECC

  50. arXiv:2407.04179  [pdf, other]

    cs.CL

    Defense Against Syntactic Textual Backdoor Attacks with Token Substitution

    Authors: Xinglin Li, Xianwen He, Yao Li, Minhao Cheng

    Abstract: Textual backdoor attacks present a substantial security risk to Large Language Models (LLMs). Such an attack embeds carefully chosen triggers into a victim model at the training stage, and makes the model erroneously predict inputs containing the same triggers as a certain class. Prior backdoor defense methods primarily target special token-based triggers, leaving syntax-based triggers insufficiently addressed… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.