Showing 1–50 of 10,406 results for author: Zhang, Z

  1. arXiv:2407.09359  [pdf, other]

    cs.CV

    A Unified Anomaly Synthesis Strategy with Gradient Ascent for Industrial Anomaly Detection and Localization

    Authors: Qiyu Chen, Huiyuan Luo, Chengkan Lv, Zhengtao Zhang

    Abstract: Anomaly synthesis strategies can effectively enhance unsupervised anomaly detection. However, existing strategies have limitations in the coverage and controllability of anomaly synthesis, particularly for weak defects that are very similar to normal regions. In this paper, we propose Global and Local Anomaly co-Synthesis Strategy (GLASS), a novel unified framework designed to synthesize a broader…

    Submitted 12 July, 2024; originally announced July 2024.

  2. arXiv:2407.09250  [pdf]

    cs.NI cs.LG

    FedsLLM: Federated Split Learning for Large Language Models over Communication Networks

    Authors: Kai Zhao, Zhaohui Yang, Chongwen Huang, Xiaoming Chen, Zhaoyang Zhang

    Abstract: Addressing the challenges of deploying large language models in wireless communication networks, this paper combines low-rank adaptation technology (LoRA) with the splitfed learning framework to propose the federated split learning for large language models (FedsLLM) framework. The method introduced in this paper utilizes LoRA technology to reduce processing loads by dividing the network into clie…

    Submitted 12 July, 2024; originally announced July 2024.

  3. arXiv:2407.08596  [pdf, other]

    astro-ph.HE astro-ph.GA

    Modeling X-Ray Multi-Reflection in Super-Eddington Winds

    Authors: Zijian Zhang, Lars Lund Thomsen, Lixin Dai, Christopher S. Reynolds, Javier A. García, Erin Kara, Riley Connors, Megan Masterson, Yuhan Yao, Thomas Dauser

    Abstract: It has been recently discovered that a few super-Eddington sources undergoing black hole super-Eddington accretion exhibit X-ray reflection signatures. In such new systems, one expects that the coronal X-ray emissions are mainly reflected by optically thick super-Eddington winds instead of thin disks. In this paper, we conduct a series of general relativistic ray-tracing and Monte Carlo radiative…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 23 pages, 21 figures, 2 tables. Comments are welcome

  4. arXiv:2407.08561  [pdf, other]

    cs.CV

    MapLocNet: Coarse-to-Fine Feature Registration for Visual Re-Localization in Navigation Maps

    Authors: Hang Wu, Zhenghao Zhang, Siyuan Lin, Xiangru Mu, Qiang Zhao, Ming Yang, Tong Qin

    Abstract: Robust localization is the cornerstone of autonomous driving, especially in challenging urban environments where GPS signals suffer from multipath errors. Traditional localization approaches rely on high-definition (HD) maps, which consist of precisely annotated landmarks. However, building HD maps is expensive and challenging to scale up. Given these limitations, leveraging navigation maps has eme…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: IROS 2024 (Oral)

  5. arXiv:2407.08554  [pdf, other]

    cs.AI cs.HC

    Establishing Rigorous and Cost-effective Clinical Trials for Artificial Intelligence Models

    Authors: Wanling Gao, Yunyou Huang, Dandan Cui, Zhuoming Yu, Wenjing Liu, Xiaoshuang Liang, Jiahui Zhao, Jiyue Xie, Hao Li, Li Ma, Ning Ye, Yumiao Kang, Dingfeng Luo, Peng Pan, Wei Huang, Zhongmou Liu, Jizhong Hu, Gangyuan Zhao, Chongrong Jiang, Fan Huang, Tianyi Wei, Suqin Tang, Bingjie Xia, Zhifei Zhang, Jianfeng Zhan

    Abstract: A profound gap persists between artificial intelligence (AI) and clinical practice in medicine, primarily due to the lack of rigorous and cost-effective evaluation methodologies. State-of-the-art and state-of-the-practice AI model evaluations are limited to laboratory studies on medical datasets or direct clinical trials with no or solely patient-centered controls. Moreover, the crucial role of cl…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 23 pages

  6. arXiv:2407.08526  [pdf, other]

    cs.CV

    BLOS-BEV: Navigation Map Enhanced Lane Segmentation Network, Beyond Line of Sight

    Authors: Hang Wu, Zhenghao Zhang, Siyuan Lin, Tong Qin, Jin Pan, Qiang Zhao, Chunjing Xu, Ming Yang

    Abstract: Bird's-eye-view (BEV) representation is crucial for the perception function in autonomous driving tasks. It is difficult to balance the accuracy, efficiency and range of BEV representation. The existing works are restricted to a limited perception range within 50 meters. Extending the BEV representation range can greatly benefit downstream tasks such as topology reasoning, scene understanding, and…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: IEEE IV 2024

  7. arXiv:2407.08458  [pdf, other]

    cs.LG cs.NI eess.SP

    Joint Optimization of Age of Information and Energy Consumption in NR-V2X System based on Deep Reinforcement Learning

    Authors: Shulin Song, Zheng Zhang, Qiong Wu, Qiang Fan, Pingyi Fan

    Abstract: Autonomous driving may be the most important application scenario of the next generation, so the development of wireless access technologies enabling reliable and low-latency vehicle communication becomes crucial. To address this, 3GPP has developed Vehicle-to-Everything (V2X) specifications based on 5G New Radio (NR) technology, where Mode 2 Side-Link (SL) communication resembles Mode 4 in LTE-V2X, allo…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: This paper has been accepted by Sensors. The source code has been released at: https://github.com/qiongwu86/Joint-Optimization-of-AoI-and-Energy-Consumption-in-NR-V2X-System-based-on-DRL

  8. arXiv:2407.08394  [pdf, other]

    cs.CV

    Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers

    Authors: Zhengbo Zhang, Li Xu, Duo Peng, Hossein Rahmani, Jun Liu

    Abstract: We introduce Diff-Tracker, a novel approach for the challenging unsupervised visual tracking task leveraging the pre-trained text-to-image diffusion model. Our main idea is to leverage the rich knowledge encapsulated within the pre-trained diffusion model, such as the understanding of image semantics and structural information, to address unsupervised visual tracking. To this end, we design an ini…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  9. arXiv:2407.08296  [pdf, other]

    cs.LG

    Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients

    Authors: Zhenyu Zhang, Ajay Jaiswal, Lu Yin, Shiwei Liu, Jiawei Zhao, Yuandong Tian, Zhangyang Wang

    Abstract: Training Large Language Models (LLMs) is memory-intensive due to the large number of parameters and associated optimization states. GaLore, a recent method, reduces memory usage by projecting weight gradients into a low-rank subspace without compromising performance. However, GaLore relies on time-consuming Singular Value Decomposition (SVD) operations to identify the subspace, and the frequent su…

    Submitted 11 July, 2024; originally announced July 2024.

  10. arXiv:2407.08201  [pdf, other]

    astro-ph.CO astro-ph.GA

    Masses of Sunyaev-Zel'dovich Galaxy Clusters Detected by The Atacama Cosmology Telescope: Stacked Lensing Measurements with Subaru HSC Year 3 data

    Authors: Masato Shirasaki, Cristóbal Sifón, Hironao Miyatake, Erwin Lau, Zhuowen Zhang, Neta Bahcall, Mark Devlin, Jo Dunkley, Arya Farahi, Matt Hilton, Yen-Ting Lin, Daisuke Nagai, Suzanne T. Staggs, Tomomi Sunayama, David Spergel, Edward J. Wollack

    Abstract: We present a stacked lensing analysis of 96 galaxy clusters selected by the thermal Sunyaev-Zel'dovich (SZ) effect in maps of the cosmic microwave background (CMB). We select foreground galaxy clusters with a $5σ$-level SZ threshold in CMB observations from the Atacama Cosmology Telescope, while we define background source galaxies for the lensing analysis with secure photometric redshift cuts in…

    Submitted 12 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: 34 pages, 17 figures

  11. arXiv:2407.08196  [pdf, other]

    cs.AI

    SoupLM: Model Integration in Large Language and Multi-Modal Models

    Authors: Yue Bai, Zichen Zhang, Jiasen Lu, Yun Fu

    Abstract: Training large language models (LLMs) and multimodal LLMs necessitates significant computing resources, and existing publicly available LLMs are typically pre-trained on diverse, privately curated datasets spanning various tasks. For instance, LLaMA, Vicuna, and LLaVA are three LLM variants trained with LLaMA base models using very different training recipes, tasks, and data modalities. The traini…

    Submitted 11 July, 2024; originally announced July 2024.

  12. arXiv:2407.08021  [pdf, other]

    cs.MA

    Field Deployment of Multi-Agent Reinforcement Learning Based Variable Speed Limit Controllers

    Authors: Yuhang Zhang, Zhiyao Zhang, Marcos Quiñones-Grueiro, William Barbour, Clay Weston, Gautam Biswas, Daniel Work

    Abstract: This article presents the first field deployment of a multi-agent reinforcement-learning (MARL) based variable speed limit (VSL) control system on the I-24 freeway near Nashville, Tennessee. We describe how we train MARL agents in a traffic simulator and directly deploy the simulation-based policy on a 17-mile stretch of Interstate 24 with 67 VSL controllers. We use invalid action masking and seve…

    Submitted 10 July, 2024; originally announced July 2024.

  13. arXiv:2407.07859  [pdf, other]

    cond-mat.mtrl-sci

    Domains in ferroelectric nitrides superlattices

    Authors: Zhijun Jiang, Zhenlong Zhang, Charles Paillard, Hongjun Xiang, Laurent Bellaiche

    Abstract: Ferroelectric nitrides have emerged as promising semiconductor materials for modern electronics. However, their domain structures and associated properties are basically unknown, despite their potential to result in optimized or new phenomena. Density functional theory calculations are performed to investigate the effect of epitaxial strain on multidomains of (Al,Sc)N nitride systems and to compar…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 9 pages, 4 figures

  14. arXiv:2407.07791  [pdf, other]

    cs.CL

    Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities

    Authors: Tianjie Ju, Yiting Wang, Xinbei Ma, Pengzhou Cheng, Haodong Zhao, Yulong Wang, Lifeng Liu, Jian Xie, Zhuosheng Zhang, Gongshen Liu

    Abstract: The rapid adoption of large language models (LLMs) in multi-agent systems has highlighted their impressive capabilities in various applications, such as collaborative problem-solving and autonomous negotiation. However, the security implications of these LLM-based multi-agent systems have not been thoroughly investigated, particularly concerning the spread of manipulated knowledge. In this paper,…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 18 pages, work in progress

  15. arXiv:2407.07651  [pdf, other]

    hep-ex physics.data-an

    Study of the decay and production properties of $D_{s1}(2536)$ and $D_{s2}^*(2573)$

    Authors: M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, et al. (645 additional authors not shown)

    Abstract: The $e^+e^-\rightarrow D_s^+D_{s1}(2536)^-$ and $e^+e^-\rightarrow D_s^+D^*_{s2}(2573)^-$ processes are studied using data samples collected with the BESIII detector at center-of-mass energies from 4.530 to 4.946 GeV. The absolute branching fractions of $D_{s1}(2536)^- \rightarrow \bar{D}^{*0}K^-$ and $D_{s2}^*(2573)^- \rightarrow \bar{D}^0K^-$ are measured for the first time to be…

    Submitted 10 July, 2024; originally announced July 2024.

  16. arXiv:2407.07638  [pdf, other]

    cs.CV cs.AI

    Tuning Vision-Language Models with Candidate Labels by Prompt Alignment

    Authors: Zhifang Zhang, Beibei Li

    Abstract: Vision-language models (VLMs) can learn high-quality representations from a large-scale training dataset of image-text pairs. Prompt learning is a popular approach to fine-tuning VLMs to adapt them to downstream tasks. Despite the satisfying performance, a major limitation of prompt learning is the demand for labelled data. In real-world scenarios, we may only obtain candidate labels (where the tru…

    Submitted 11 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

  17. arXiv:2407.07600  [pdf, ps, other]

    math.AP

    Pointwise and Oscillation Estimates via Riesz Potentials for Mixed Local and Nonlocal Parabolic Equations

    Authors: Lingwei Ma, Qi Xiong, Zhenqiu Zhang

    Abstract: We establish a class of pointwise estimates for weak solutions to mixed local and nonlocal parabolic equations involving measure data and merely measurable coefficients via caloric Riesz potentials. Such estimates effectively bound the sizes and oscillations of weak solutions, respectively. The proof relies on demonstrating a new local Hölder estimate with an optimal $L^q$-Tail for weak solutions…

    Submitted 10 July, 2024; originally announced July 2024.

  18. arXiv:2407.07513  [pdf, other]

    quant-ph

    High-rate quantum digital signatures network with integrated silicon photonics

    Authors: Yongqiang Du, Bing-Hong Li, Xin Hua, Xiao-Yu Cao, Zhengeng Zhao, Feng Xie, Zhenrong Zhang, Hua-Lei Yin, Xi Xiao, Kejin Wei

    Abstract: The development of quantum networks is paramount towards practical and secure communications. Quantum digital signatures (QDS) offer an information-theoretically secure solution for ensuring data integrity, authenticity, and non-repudiation, rapidly growing from proof-of-concept to robust demonstrations. However, previous QDS systems relied on expensive and bulky optical equipment, limiting large-…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 11 pages, 6 figures

  19. arXiv:2407.07503  [pdf, other]

    cs.CV cs.IR

    Metasurface-based Snapshot Shortwave-Infrared Hyperspectral Image Reconstruction with Inter and Intra Prior Learning Network

    Authors: Linqiang Li, Jinglei Hao, Yongqiang Zhao, Pan Liu, Haofang Yan, Ziqin Zhang, Seong G. Kong

    Abstract: Shortwave-infrared (SWIR) spectral information, ranging from 1 μm to 2.5 μm, breaks the limitations of traditional color cameras in acquiring scene information and has been used in many fields. However, conventional SWIR hyperspectral imaging systems face challenges due to their bulky setups and low acquisition speed. In this work, we introduce a snapshot SWIR hyperspectral imaging system based on a…

    Submitted 10 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: 10 pages, 5 figures

  20. arXiv:2407.07479  [pdf, other]

    cs.CV

    How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?

    Authors: Yuxin Chen, Zongyang Ma, Ziqi Zhang, Zhongang Qi, Chunfeng Yuan, Bing Li, Junfu Pu, Ying Shan, Xiaojuan Qi, Weiming Hu

    Abstract: Dominant dual-encoder models enable efficient image-text retrieval but suffer from limited accuracy while the cross-encoder models offer higher accuracy at the expense of efficiency. Distilling cross-modality matching knowledge from cross-encoder to dual-encoder provides a natural approach to harness their strengths. Thus we investigate the following valuable question: how to make cross-encoder a…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted by CVPR 2024

  21. arXiv:2407.07478  [pdf, other]

    cs.CV

    EA-VTR: Event-Aware Video-Text Retrieval

    Authors: Zongyang Ma, Ziqi Zhang, Yuxin Chen, Zhongang Qi, Chunfeng Yuan, Bing Li, Yingmin Luo, Xu Li, Xiaojuan Qi, Ying Shan, Weiming Hu

    Abstract: Understanding the content of events occurring in the video and their inherent temporal logic is crucial for video-text retrieval. However, web-crawled pre-training datasets often lack sufficient event information, and the widely adopted video-level cross-modal contrastive learning also struggles to capture detailed and complex video-text event alignment. To address these challenges, we make improv…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV 2024

  22. arXiv:2407.07476  [pdf, other]

    cs.DC

    A Transverse-Read-assisted Valid-Bit Collection to Accelerate Stochastic Computing MAC for Energy-Efficient in-RTM DNNs

    Authors: Jihe Wang, Zhiying Zhang, Xingwu Dong, Danghui Wang

    Abstract: It looks very attractive to coordinate racetrack-memory (RM) and stochastic-computing (SC) jointly to build an ultra-low power neuron-architecture. However, this combination has always been questioned for a fatal weakness: the narrow bit-view of the RM-MTJ structure, a.k.a. the shift-and-access pattern, cannot physically match the great throughput of direct-stored stochastic sequences. Fortunately, a…

    Submitted 10 July, 2024; originally announced July 2024.

  23. arXiv:2407.07475  [pdf, ps, other]

    cs.NI

    Learning-based Power Control for Secure Covert Semantic Communication

    Authors: Yansheng Liu, Jinbo Wen, Zongyao Zhang, Kun Zhu, Jiawen Kang

    Abstract: Despite progress in semantic communication (SemCom), research on SemCom security is still in its infancy. To bridge this gap, we propose a general covert SemCom framework for wireless networks, reducing eavesdropping risk. Our approach transmits semantic information covertly, making it difficult for wardens to detect. Given the aim of maximizing covert SemCom performance, we formulate a power cont…

    Submitted 10 July, 2024; originally announced July 2024.

  24. arXiv:2407.07465  [pdf, other]

    cs.CV

    Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining

    Authors: Tianfang Sun, Zhizhong Zhang, Xin Tan, Yanyun Qu, Yuan Xie

    Abstract: LiDAR-camera 3D representation pretraining has shown significant promise for 3D perception tasks and related applications. However, two issues widely exist in this framework: 1) Only keyframes are used for training. For example, in nuScenes, a substantial quantity of unpaired LiDAR and camera frames remain unutilized, limiting the representation capabilities of the pretrained network. 2) The con…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: preprint, version 1

  25. arXiv:2407.07447  [pdf]

    physics.app-ph

    Spin Splitting in Altermagnetic RuO$_2$ Enables Field-free Spin-Orbit Torque Switching via Dominant Out-of-Plane Spin Polarization

    Authors: Zhuoyi Li, Zhe Zhang, Xianyang Lu, Yongbing Xu

    Abstract: Researchers have recently identified a novel class of magnetism, termed "altermagnetism", which exhibits characteristics of both ferromagnetism and antiferromagnetism. Here, we report a groundbreaking discovery of efficient field-free spin-orbit torque (SOT) switching in a RuO$_2$ (101)/Co/Pt/Co/Pt/Ta structure. Our results demonstrate that the spin current flows along the [100] axis, induced by t…

    Submitted 10 July, 2024; originally announced July 2024.

  26. arXiv:2407.07311  [pdf]

    cs.LG cs.AI cs.CV

    ViTime: A Visual Intelligence-Based Foundation Model for Time Series Forecasting

    Authors: Luoxiao Yang, Yun Wang, Xinqi Fan, Israel Cohen, Yue Zhao, Zijun Zhang

    Abstract: The success of large pretrained models in natural language processing (NLP) and computer vision (CV) has opened new avenues for constructing foundation models for time series forecasting (TSF). Traditional TSF foundation models rely heavily on numerical data fitting. In contrast, the human brain is inherently skilled at processing visual information and prefers predicting future trends by observing vi…

    Submitted 9 July, 2024; originally announced July 2024.

  27. arXiv:2407.07299  [pdf, ps, other]

    cs.IT cs.DS math.CO

    Random Reed-Solomon Codes Achieve the Half-Singleton Bound for Insertions and Deletions over Linear-Sized Alphabets

    Authors: Roni Con, Zeyu Guo, Ray Li, Zihan Zhang

    Abstract: In this paper, we prove that with high probability, random Reed-Solomon codes approach the half-Singleton bound - the optimal rate versus error tradeoff for linear insdel codes - with linear-sized alphabets. More precisely, we prove that, for any $ε>0$ and positive integers $n$ and $k$, with high probability, random Reed-Solomon codes of length $n$ and dimension $k$ can correct…

    Submitted 9 July, 2024; originally announced July 2024.

  28. arXiv:2407.07099  [pdf, other]

    cs.CL cs.AI cs.GT cs.LG

    Nash CoT: Multi-Path Inference with Preference Equilibrium

    Authors: Ziqi Zhang, Cunxiang Wang, Xiong Xiao, Yue Zhang, Donglin Wang

    Abstract: Chain-of-thought (CoT) prompting has emerged as a powerful technique for enhancing the reasoning capabilities of Large Language Models (LLMs) on complex problems. Among CoT-related studies, self-consistency (Multi-path inference with answer filtering through voting) involves generating multiple reasoning paths using the CoT framework and then selecting the most frequently produced outputs standing…

    Submitted 18 June, 2024; originally announced July 2024.

  29. arXiv:2407.07094  [pdf, other]

    cs.CL cs.AI

    AnyTaskTune: Advanced Domain-Specific Solutions through Task-Fine-Tuning

    Authors: Jiaxi Cui, Wentao Zhang, Jing Tang, Xudong Tong, Zhenwei Zhang, Amie, Jing Wen, Rongsheng Wang, Pengfei Wu

    Abstract: The pervasive deployment of Large Language Models (LLMs) in various sectors often neglects the nuanced requirements of individuals and small organizations, who benefit more from models precisely tailored to their specific business contexts rather than those with broadly superior general capabilities. This work introduces AnyTaskTune, a novel fine-tuning methodology coined as Task-Fi…

    Submitted 9 July, 2024; originally announced July 2024.

  30. arXiv:2407.07003  [pdf, other]

    cs.CV cs.AI

    Learning to Complement and to Defer to Multiple Users

    Authors: Zheng Zhang, Wenjie Ai, Kevin Wells, David Rosewarne, Thanh-Toan Do, Gustavo Carneiro

    Abstract: With the development of Human-AI Collaboration in Classification (HAI-CC), integrating users and AI predictions becomes challenging due to the complex decision-making process. This process has three options: 1) AI autonomously classifies, 2) learning to complement, where AI collaborates with users, and 3) learning to defer, where AI defers to users. Despite their interconnected nature, these optio…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  31. arXiv:2407.06948  [pdf, other]

    eess.SY

    Detection-Triggered Recursive Impact Mitigation against Secondary False Data Injection Attacks in Microgrids

    Authors: Mengxiang Liu, Xin Zhang, Rui Zhang, Zhuoran Zhou, Zhenyong Zhang, Ruilong Deng

    Abstract: The cybersecurity of microgrids has received widespread attention due to the frequently reported attack incidents against distributed energy resource (DER) manufacturers. Numerous impact mitigation schemes have been proposed to reduce or eliminate the impacts of false data injection attacks (FDIAs). Nevertheless, the existing methods either require at least one neighboring trustworthy agent or may…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Submitted to IEEE Transactions on Smart Grid

  32. arXiv:2407.06534  [pdf, other]

    quant-ph

    Lamb Shift Breaks the Heat Current Limit

    Authors: Zi-chen Zhang, Chang-shui Yu

    Abstract: We study the Lamb shift by considering the steady-state heat current through two coupled two-level atoms, which, respectively, interact with a heat reservoir at a certain temperature. It is found that the Lamb shift significantly alters the energy levels. In particular, it is shown that the heat current will approach an upper bound if the Lamb shift isn't considered, while the heat current will br…

    Submitted 9 July, 2024; originally announced July 2024.

  33. CrowdTransfer: Enabling Crowd Knowledge Transfer in AIoT Community

    Authors: Yan Liu, Bin Guo, Nuo Li, Yasan Ding, Zhouyangzi Zhang, Zhiwen Yu

    Abstract: Artificial Intelligence of Things (AIoT) is an emerging frontier based on the deep fusion of Internet of Things (IoT) and Artificial Intelligence (AI) technologies. Although advanced deep learning techniques enhance the efficient data processing and intelligent analysis of complex IoT data, they still suffer from notable challenges when deployed to practical AIoT applications, such as constrained…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: This paper has been accepted for publication in IEEE Communications Surveys & Tutorials. Copyright will be transferred without notice, after which this version may no longer be accessible

  34. arXiv:2407.06358  [pdf, other]

    cs.CV

    MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions

    Authors: Xuan Ju, Yiming Gao, Zhaoyang Zhang, Ziyang Yuan, Xintao Wang, Ailing Zeng, Yu Xiong, Qiang Xu, Ying Shan

    Abstract: Sora's high-motion intensity and long consistent videos have significantly impacted the field of video generation, attracting unprecedented attention. However, existing publicly available datasets are inadequate for generating Sora-like videos, as they mainly contain short videos with low motion intensity and brief captions. To address these issues, we propose MiraData, a high-quality video datase…

    Submitted 8 July, 2024; originally announced July 2024.

  35. arXiv:2407.06128  [pdf]

    cs.CV

    Towards SAR Automatic Target Recognition MultiCategory SAR Image Classification Based on Light Weight Vision Transformer

    Authors: Guibin Zhao, Pengfei Li, Zhibo Zhang, Fusen Guo, Xueting Huang, Wei Xu, Jinyin Wang, Jianlong Chen

    Abstract: Synthetic Aperture Radar has been extensively used in numerous fields and can gather a wealth of information about the area of interest. This large-scene, data-intensive technology puts a high value on automatic target recognition, which can free up users and boost efficiency. Recent advances in artificial intelligence have made it possible to create a deep learning based SAR ATR that can a…

    Submitted 9 July, 2024; v1 submitted 18 May, 2024; originally announced July 2024.

  36. Tight Quantum Depth Lower Bound for Solving Systems of Linear Equations

    Authors: Qisheng Wang, Zhicheng Zhang

    Abstract: Since Harrow, Hassidim, and Lloyd (2009) showed that a system of linear equations with $N$ variables and condition number $κ$ can be solved on a quantum computer in $\operatorname{poly}(\log(N), κ)$ time, exponentially faster than any classical algorithm, its improvements and applications have been extensively investigated. The state-of-the-art quantum algorithm for this problem is due to Costa,…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 22 pages, 1 table. Close to the official version

    Journal ref: Physical Review A, 110(1): 012422, 2024

  37. arXiv:2407.05963  [pdf, ps, other]

    cs.SE cs.AI cs.NI cs.SI

    6GSoft: Software for Edge-to-Cloud Continuum

    Authors: Muhammad Azeem Akbar, Matteo Esposito, Sami Hyrynsalmi, Karthikeyan Dinesh Kumar, Valentina Lenarduzzi, Xiaozhou Li, Ali Mehraj, Tommi Mikkonen, Sergio Moreschini, Niko Mäkitalo, Markku Oivo, Anna-Sofia Paavonen, Risha Parveen, Kari Smolander, Ruoyu Su, Kari Systä, Davide Taibi, Nan Yang, Zheying Zhang, Muhammad Zohaib

    Abstract: In the era of 6G, developing and managing software requires cutting-edge software engineering (SE) theories and practices tailored for such complexity across a vast number of connected edge devices. Our project aims to lead the development of sustainable methods and energy-efficient orchestration models specifically for edge environments, enhancing architectural support driven by AI for contempora…

    Submitted 9 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

  38. arXiv:2407.05791  [pdf, other]

    eess.SP

    Joint Beamforming and Antenna Design for Near-Field Fluid Antenna System

    Authors: Yixuan Chen, Mingzhe Chen, Hao Xu, Zhaohui Yang, Kai-Kit Wong, Zhaoyang Zhang

    Abstract: In this letter, we study the energy efficiency maximization problem for a fluid antenna system (FAS) in near field communications. Specifically, we consider a point-to-point near-field system where the base station (BS) transmitter has multiple fixed-position antennas and the user receives the signals with multiple fluid antennas. Our objective is to jointly optimize the transmit beamforming of th…

    Submitted 8 July, 2024; originally announced July 2024.

  39. arXiv:2407.05626  [pdf, other]

    math.NA

    A Stochastic Interacting Particle-Field Algorithm for a Haptotaxis Advection-Diffusion System Modeling Cancer Cell Invasion

    Authors: Boyi Hu, Zhongjian Wang, Jack Xin, Zhiwen Zhang

    Abstract: The investigation of tumor invasion and metastasis dynamics is crucial for advancements in cancer biology and treatment. Many mathematical models have been developed to study the invasion of host tissue by tumor cells. In this paper, we develop a novel stochastic interacting particle-field (SIPF) algorithm that accurately simulates the cancer cell invasion process within the haptotaxis advection-d…

    Submitted 8 July, 2024; originally announced July 2024.

  40. arXiv:2407.05521  [pdf, other]

    cs.AR cs.AI

    Accelerating MRI Uncertainty Estimation with Mask-based Bayesian Neural Network

    Authors: Zehuan Zhang, Matej Genci, Hongxiang Fan, Andreas Wetscherek, Wayne Luk

    Abstract: Accurate and reliable Magnetic Resonance Imaging (MRI) analysis is particularly important for adaptive radiotherapy, a recent medical advance capable of improving cancer diagnosis and treatment. Recent studies have shown that IVIM-NET, a deep neural network (DNN), can achieve high accuracy in MRI analysis, indicating the potential of deep learning to enhance diagnostic capabilities in healthcare.…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: The 35th IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP) 2024

  41. arXiv:2407.05458  [pdf, other]

    cs.AI

    A Survey of Models for Cognitive Diagnosis: New Developments and Future Directions

    Authors: Fei Wang, Weibo Gao, Qi Liu, Jiatong Li, Guanhao Zhao, Zheng Zhang, Zhenya Huang, Mengxiao Zhu, Shijin Wang, Wei Tong, Enhong Chen

    Abstract: Cognitive diagnosis has been developed for decades as an effective measurement tool to evaluate human cognitive status such as ability level and knowledge mastery. It has been applied to a wide range of fields including education, sport, psychological diagnosis, etc. By providing better awareness of cognitive status, it can serve as the basis for personalized services such as well-designed medical…

    Submitted 7 July, 2024; originally announced July 2024.

  42. arXiv:2407.05249  [pdf, ps, other]

    cs.IT eess.SP

    RIS-assisted Coverage Enhancement in mmWave Integrated Sensing and Communication Networks

    Authors: Xu Gan, Chongwen Huang, Zhaohui Yang, Xiaoming Chen, Faouzi Bader, Zhaoyang Zhang, Chau Yuen, Yong Liang Guan, Merouane Debbah

    Abstract: Integrated sensing and communication (ISAC) has emerged as a promising technology to facilitate high-rate communications and super-resolution sensing, particularly operating in the millimeter wave (mmWave) band. However, the vulnerability of mmWave signals to blockages severely impairs ISAC capabilities and coverage. To tackle this, an efficient and low-cost solution is to deploy distributed recon…

    Submitted 6 July, 2024; originally announced July 2024.

  43. arXiv:2407.05165  [pdf, other]

    cs.SE

    Feedback-Driven Automated Whole Bug Report Reproduction for Android Apps

    Authors: Dingbang Wang, Yu Zhao, Sidong Feng, Zhaoxu Zhang, William G. J. Halfond, Chunyang Chen, Xiaoxia Sun, Jiangfan Shi, Tingting Yu

    Abstract: In software development, bug report reproduction is a challenging task. This paper introduces ReBL, a novel feedback-driven approach that leverages GPT-4, a large-scale language model, to automatically reproduce Android bug reports. Unlike traditional methods, ReBL bypasses the use of Step to Reproduce (S2R) entities. Instead, it leverages the entire textual bug report and employs innovative promp…

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: Accepted by ISSTA 2024

  44. arXiv:2407.05135  [pdf, other]

    cs.RO cs.CG

    Theory and Explicit Design of a Path Planner for an SE(3) Robot

    Authors: Zhaoqi Zhang, Yi-Jen Chiang, Chee Yap

    Abstract: We consider path planning for a rigid spatial robot with 6 degrees of freedom (6 DOFs), moving amidst polyhedral obstacles. A correct, complete and practical path planner for such a robot has never been achieved, although this is widely recognized as a key challenge in robotics. This paper provides a complete "explicit" design, down to explicit geometric primitives that are easily implementable.…

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: An earlier version has been submitted for a conference publication and is under review. This is a full version, 39 pages, including appendices

  45. arXiv:2407.04944  [pdf, other]

    eess.SP cs.IT

    Flexible Antenna Arrays for Wireless Communications: Modeling and Performance Evaluation

    Authors: Songjie Yang, Jiancheng An, Yue Xiu, Wanting Lyu, Boyu Ning, Zhongpei Zhang, Merouane Debbah, Chau Yuen

    Abstract: Flexible antenna arrays (FAAs), distinguished by their rotatable, bendable, and foldable properties, are extensively employed in flexible radio systems to achieve customized radiation patterns. This paper aims to illustrate that FAAs, capable of dynamically adjusting surface shapes, can enhance communication performances with both omni-directional and directional antenna patterns, in terms of mult…

    Submitted 5 July, 2024; originally announced July 2024.

  46. arXiv:2407.04919  [pdf, ps, other]

    math.RA

    An Anick type wild automorphism of free Poisson algebras

    Authors: Ivan Shestakov, Zerui Zhang

    Abstract: We construct an Anick type wild automorphism $δ$ in a 3-generated free Poisson algebra which induces a tame automorphism in a 3-generated polynomial algebra. We also show that $δ$ is stably tame. Dedicated to the memory of professor V.A.Roman'kov

    Submitted 5 July, 2024; originally announced July 2024.

    MSC Class: 08A35; 17A30; 17A50

  47. arXiv:2407.04813  [pdf, other]

    astro-ph.EP astro-ph.GA astro-ph.SR

    FAUST XVII: Super deuteration in the planet forming system IRS 63 where the streamer strikes the disk

    Authors: L. Podio, C. Ceccarelli, C. Codella, G. Sabatini, D. Segura-Cox, N. Balucani, A. Rimola, P. Ugliengo, C. J. Chandler, N. Sakai, B. Svoboda, J. Pineda, M. De Simone, E. Bianchi, P. Caselli, A. Isella, Y. Aikawa, M. Bouvier, E. Caux, L. Chahine, S. B. Charnley, N. Cuello, F. Dulieu, L. Evans, D. Fedele, et al. (33 additional authors not shown)

    Abstract: Recent observations suggest that planet formation starts early, in protostellar disks of $\le10^5$ yrs, which are characterized by strong interactions with the environment, e.g., through accretion streamers and molecular outflows. To investigate the impact of such phenomena on disk physical and chemical properties it is key to understand what chemistry planets inherit from their natal environment…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 12 pages, 10 figures, accepted for publication in A&A

  48. arXiv:2407.04752  [pdf, other]

    cs.LG cs.CL cs.NE

    SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking

    Authors: Xingrun Xing, Boyan Gao, Zheng Zhang, David A. Clifton, Shitao Xiao, Li Du, Guoqi Li, Jiajun Zhang

    Abstract: The recent advancements in large language models (LLMs) with billions of parameters have significantly boosted their performance across various real-world applications. However, the inference processes for these models require substantial energy and computational resources, presenting considerable deployment challenges. In contrast, human brains, which contain approximately 86 billion biological n…

    Submitted 5 July, 2024; originally announced July 2024.

  49. arXiv:2407.04451  [pdf, other]

    cs.LG cs.AI

    Hindsight Preference Learning for Offline Preference-based Reinforcement Learning

    Authors: Chen-Xiao Gao, Shengjun Fang, Chenjun Xiao, Yang Yu, Zongzhang Zhang

    Abstract: Offline preference-based reinforcement learning (RL), which focuses on optimizing policies using human preferences between pairs of trajectory segments selected from an offline dataset, has emerged as a practical avenue for RL applications. Existing works rely on extracting step-wise reward signals from trajectory-wise preference annotations, assuming that preferences correlate with the cumulative…

    Submitted 5 July, 2024; originally announced July 2024.

  50. arXiv:2407.04297  [pdf, other]

    cs.CR

    HuntFUZZ: Enhancing Error Handling Testing through Clustering Based Fuzzing

    Authors: Jin Wei, Ping Chen, Jun Dai, Xiaoyan Sun, Zhihao Zhang, Chang Xu, Yi Wanga

    Abstract: Testing a program's capability to effectively handle errors is a significant challenge, given that program errors are relatively uncommon. To solve this, Software Fault Injection (SFI)-based fuzzing integrates SFI and traditional fuzzing, injecting and triggering errors for testing (error handling) code. However, we observe that current SFI-based fuzzing approaches have overlooked the correlatio…

    Submitted 5 July, 2024; originally announced July 2024.