Skip to main content

Showing 1–50 of 11,296 results for author: Zhang, X

  1. arXiv:2407.09336  [pdf, other

    cs.LG cs.AI

    Guidelines for Augmentation Selection in Contrastive Learning for Time Series Classification

    Authors: Ziyu Liu, Azadeh Alavi, Minyi Li, Xiang Zhang

    Abstract: Self-supervised contrastive learning has become a key technique in deep learning, particularly in time series analysis, due to its ability to learn meaningful representations without explicit supervision. Augmentation is a critical component in contrastive learning, where different augmentations can dramatically impact performance, sometimes influencing accuracy by over 30%. However, the selection… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 20 pages, 11 figures

  2. arXiv:2407.09331  [pdf, other

    quant-ph

    Suppression of quantum dissipation: A cooperative effect of quantum squeezing and quantum measurement

    Authors: Yi-Ming Xia, Yi-Fei Wang, Xiao-Yun Zhang, Hai-Chao Li, Wei Xiong

    Abstract: The ability to isolate a quantum system from its environment is of fundamental interest and importance in optical quantum science and technology. Here we propose an experimentally feasible scheme for beating environment-induced dissipation in an open two-level system coupled to a parametrically driven cavity. The mechanism relies on a novel cooperation between light-matter coupling enhancement and… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  3. arXiv:2407.09274  [pdf, other

    cs.LG cs.AI q-bio.BM

    Unifying Sequences, Structures, and Descriptions for Any-to-Any Protein Generation with the Large Multimodal Model HelixProtX

    Authors: Zhiyuan Chen, Tianhao Chen, Chenggang Xie, Yang Xue, Xiaonan Zhang, Jingbo Zhou, Xiaomin Fang

    Abstract: Proteins are fundamental components of biological systems and can be represented through various modalities, including sequences, structures, and textual descriptions. Despite the advances in deep learning and scientific large language models (LLMs) for protein research, current methodologies predominantly focus on limited specialized tasks -- often predicting one protein modality from another. Th… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  4. arXiv:2407.09026  [pdf, other

    cs.CV cs.LG cs.MM eess.IV

    HPC: Hierarchical Progressive Coding Framework for Volumetric Video

    Authors: Zihan Zheng, Houqiang Zhong, Qiang Hu, Xiaoyun Zhang, Li Song, Ya Zhang, Yanfeng Wang

    Abstract: Volumetric video based on Neural Radiance Field (NeRF) holds vast potential for various 3D applications, but its substantial data volume poses significant challenges for compression and transmission. Current NeRF compression lacks the flexibility to adjust video quality and bitrate within a single model for various network and device capacities. To address these issues, we propose HPC, a novel hie… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 11 pages, 7 figures

  5. arXiv:2407.09009  [pdf, other

    astro-ph.EP

    Probing Cold-to-Temperate Exoplanetary Atmospheres: The Role of Water Condensation on Surface Identification with JWST

    Authors: Ziyu Huang, Xinting Yu, Shang-Min Tsai, Julianne I. Moses, Kazumasa Ohno, Joshua Krissansen-Totton, Xi Zhang, Jonathan Fortney

    Abstract: Understanding the surface temperature and interior structure of cold-to-temperate sub-Neptunes is critical for assessing their habitability, yet direct observations are challenging. In this study, we investigate the impact of water condensation on the atmospheric compositions of sub-Neptunes, focusing on the implications for JWST spectroscopic observations. By modeling the atmospheric photochemist… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 21 pages, 7 figures

  6. arXiv:2407.08990  [pdf, other

    cs.AR cs.AI cs.ET cs.NE

    Dynamic neural network with memristive CIM and CAM for 2D and 3D vision

    Authors: Yue Zhang, Woyu Zhang, Shaocong Wang, Ning Lin, Yifei Yu, Yangu He, Bo Wang, Hao Jiang, Peng Lin, Xiaoxin Xu, Xiaojuan Qi, Zhongrui Wang, Xumeng Zhang, Dashan Shang, Qi Liu, Kwang-Ting Cheng, Ming Liu

    Abstract: The brain is dynamic, associative and efficient. It reconfigures by associating the inputs with past experiences, with fused memory and processing. In contrast, AI models are static, unable to associate inputs with past experiences, and run on digital computers with physically separated memory and processing. We propose a hardware-software co-design, a semantic memory-based dynamic neural network… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: In press

  7. arXiv:2407.08956  [pdf, other

    cs.CR cs.SE

    DeCE: Deceptive Cross-Entropy Loss Designed for Defending Backdoor Attacks

    Authors: Guang Yang, Yu Zhou, Xiang Chen, Xiangyu Zhang, Terry Yue Zhuo, David Lo, Taolue Chen

    Abstract: Code Language Models (CLMs), particularly those leveraging deep learning, have achieved significant success in code intelligence domain. However, the issue of security, particularly backdoor attacks, is often overlooked in this process. The previous research has focused on designing backdoor attacks for CLMs, but effective defenses have not been adequately addressed. In particular, existing defens… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Under Review; Waiting for updates

  8. arXiv:2407.08546  [pdf, other

    cs.CV cs.LG q-bio.QM

    Quantitative Evaluation of the Saliency Map for Alzheimer's Disease Classifier with Anatomical Segmentation

    Authors: Yihan Zhang, Xuanshuo Zhang, Wei Wu, Haohan Wang

    Abstract: Saliency maps have been widely used to interpret deep learning classifiers for Alzheimer's disease (AD). However, since AD is heterogeneous and has multiple subtypes, the pathological mechanism of AD remains not fully understood and may vary from patient to patient. Due to the lack of such understanding, it is difficult to comprehensively and effectively assess the saliency map of AD classifier. I… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  9. arXiv:2407.08440  [pdf, other

    cs.CL cs.AI

    Beyond Instruction Following: Evaluating Rule Following of Large Language Models

    Authors: Wangtao Sun, Chenxiang Zhang, Xueyou Zhang, Ziyang Huang, Haotian Xu, Pei Chen, Shizhu He, Jun Zhao, Kang Liu

    Abstract: Although Large Language Models (LLMs) have demonstrated strong instruction-following ability to be helpful, they are further supposed to be controlled and guided by rules in real-world scenarios to be safe, and accurate in responses. This demands the possession of rule-following capability of LLMs. However, few works have made a clear evaluation of the rule-following capability of LLMs. Previous s… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  10. arXiv:2407.08241  [pdf, other

    gr-qc hep-th

    Thermodynamic bounce effect in quantum BTZ black hole

    Authors: Zhen-Ming Xu, Pan-Pan Zhang, Bin Wu, Xing Zhang

    Abstract: A novel thermodynamic phenomenon has been observed in the quantum Bañados-Teitelboim-Zanelli (qBTZ) black hole, utilizing generalized free energy and Kramer escape rate. This phenomenon also reveals the unique property of the quantum black hole. The stochastic thermal motion of various thermodynamic states within the black hole system induces phase transitions, under the influence of generalized f… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 8 pages, 3 figures

  11. arXiv:2407.08239  [pdf, other

    cs.SD cs.LG eess.AS

    An Unsupervised Domain Adaptation Method for Locating Manipulated Region in partially fake Audio

    Authors: Siding Zeng, Jiangyan Yi, Jianhua Tao, Yujie Chen, Shan Liang, Yong Ren, Xiaohui Zhang

    Abstract: When the task of locating manipulation regions in partially-fake audio (PFA) involves cross-domain datasets, the performance of deep learning models drops significantly due to the shift between the source and target domains. To address this issue, existing approaches often employ data augmentation before training. However, they overlook the characteristics in target domain that are absent in sourc… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  12. Sulphur dioxide in the mid-infrared transmission spectrum of WASP-39b

    Authors: Diana Powell, Adina D. Feinstein, Elspeth K. H. Lee, Michael Zhang, Shang-Min Tsai, Jake Taylor, James Kirk, Taylor Bell, Joanna K. Barstow, Peter Gao, Jacob L. Bean, Jasmina Blecic, Katy L. Chubb, Ian J. M. Crossfield, Sean Jordan, Daniel Kitzmann, Sarah E. Moran, Giuseppe Morello, Julianne I. Moses, Luis Welbanks, Jeehyun Yang, Xi Zhang, Eva-Maria Ahrer, Aaron Bello-Arufe, Jonathan Brande , et al. (48 additional authors not shown)

    Abstract: The recent inference of sulphur dioxide (SO$_2$) in the atmosphere of the hot ($\sim$1100 K), Saturn-mass exoplanet WASP-39b from near-infrared JWST observations suggests that photochemistry is a key process in high temperature exoplanet atmospheres. This is due to the low ($<$1 ppb) abundance of SO$_2$ under thermochemical equilibrium, compared to that produced from the photochemistry of H$_2$O a… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Published in Nature

    Journal ref: Nature 626, 979-983 (2024)

  13. arXiv:2407.07931  [pdf, other

    cs.IR cs.AI cs.CL cs.LG

    Search, Examine and Early-Termination: Fake News Detection with Annotation-Free Evidences

    Authors: Yuzhou Yang, Yangming Zhou, Qichao Ying, Zhenxing Qian, Xinpeng Zhang

    Abstract: Pioneer researches recognize evidences as crucial elements in fake news detection apart from patterns. Existing evidence-aware methods either require laborious pre-processing procedures to assure relevant and high-quality evidence data, or incorporate the entire spectrum of available evidences in all news cases, regardless of the quality and quantity of the retrieved data. In this paper, we propos… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: ECAI 2024 paper. Fudan University & NVIDIA. To appear

  14. arXiv:2407.07930  [pdf

    q-bio.BM cs.LG

    Token-Mol 1.0: Tokenized drug design with large language model

    Authors: Jike Wang, Rui Qin, Mingyang Wang, Meijing Fang, Yangyang Zhang, Yuchen Zhu, Qun Su, Qiaolin Gou, Chao Shen, Odin Zhang, Zhenxing Wu, Dejun Jiang, Xujun Zhang, Huifeng Zhao, Xiaozhe Wan, Zhourui Wu, Liwei Liu, Yu Kang, Chang-Yu Hsieh, Tingjun Hou

    Abstract: Significant interests have recently risen in leveraging sequence-based large language models (LLMs) for drug design. However, most current applications of LLMs in drug discovery lack the ability to comprehend three-dimensional (3D) structures, thereby limiting their effectiveness in tasks that explicitly involve molecular conformations. In this study, we introduced Token-Mol, a token-only 3D drug… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  15. arXiv:2407.07672  [pdf, other

    cs.HC

    StoryDiffusion: How to Support UX Storyboarding With Generative-AI

    Authors: Zhaohui Liang, Xiaoyu Zhang, Kevin Ma, Zhao Liu, Xipei Ren, Kosa Goucher-Lambert, Can Liu

    Abstract: Storyboarding is an established method for designing user experiences. Generative AI can support this process by helping designers quickly create visual narratives. However, existing tools only focus on accurate text-to-image generation. Currently, it is not clear how to effectively support the entire creative process of storyboarding and how to develop AI-powered tools to support designers' indiv… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  16. arXiv:2407.07651  [pdf, other

    hep-ex physics.data-an

    Study of the decay and production properties of $D_{s1}(2536)$ and $D_{s2}^*(2573)$

    Authors: M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (645 additional authors not shown)

    Abstract: The $e^+e^-\rightarrow D_s^+D_{s1}(2536)^-$ and $e^+e^-\rightarrow D_s^+D^*_{s2}(2573)^-$ processes are studied using data samples collected with the BESIII detector at center-of-mass energies from 4.530 to 4.946~GeV. The absolute branching fractions of $D_{s1}(2536)^- \rightarrow \bar{D}^{*0}K^-$ and $D_{s2}^*(2573)^- \rightarrow \bar{D}^0K^-$ are measured for the first time to be… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  17. arXiv:2407.07580  [pdf, other

    cs.CV

    InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior

    Authors: Chenguo Lin, Yuchen Lin, Panwang Pan, Xuanyang Zhang, Yadong Mu

    Abstract: Comprehending natural language instructions is a charming property for both 2D and 3D layout synthesis systems. Existing methods implicitly model object joint distributions and express object relations, hindering generation's controllability. We introduce InstructLayout, a novel generative framework that integrates a semantic graph prior and a layout decoder to improve controllability and fidelity… ▽ More

    Submitted 10 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: This paper is an extension of ICLR 2024 "InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior". arXiv admin note: substantial text overlap with arXiv:2402.04717

  18. arXiv:2407.07453  [pdf, other

    physics.optics eess.SP

    Waveguide Superlattices with Artificial Gauge Field Towards Colorless and Crosstalkless Ultrahigh-Density Photonic Integration

    Authors: Xuelin Zhang, Jiangbing Du, Ke Xu, Zuyuan He

    Abstract: Dense waveguide arrays with low crosstalk and ultra-broadband remain a vital issue for chip-scale integrated photonics. However, the sub-wavelength regime of such devices has not been adequately explored in practice. Herein, we propose the advanced waveguide superlattices leveraging the artificial gauge field mechanism. This approach achieves remarkable -24 dB crosstalk suppression with an ultra-b… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  19. arXiv:2407.07427  [pdf, other

    cs.CV

    Unified Embedding Alignment for Open-Vocabulary Video Instance Segmentation

    Authors: Hao Fang, Peng Wu, Yawei Li, Xinxin Zhang, Xiankai Lu

    Abstract: Open-Vocabulary Video Instance Segmentation (VIS) is attracting increasing attention due to its ability to segment and track arbitrary objects. However, the recent Open-Vocabulary VIS attempts obtained unsatisfactory results, especially in terms of generalization ability of novel categories. We discover that the domain gap between the VLM features (e.g., CLIP) and the instance queries and the unde… ▽ More

    Submitted 11 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  20. arXiv:2407.07372  [pdf, other

    eess.IV cs.CV

    Trustworthy Contrast-enhanced Brain MRI Synthesis

    Authors: Jiyao Liu, Yuxin Li, Shangqi Gao, Yuncheng Zhou, Xin Gao, Ningsheng Xu, Xiao-Yong Zhang, Xiahai Zhuang

    Abstract: Contrast-enhanced brain MRI (CE-MRI) is a valuable diagnostic technique but may pose health risks and incur high costs. To create safer alternatives, multi-modality medical image translation aims to synthesize CE-MRI images from other available modalities. Although existing methods can generate promising predictions, they still face two challenges, i.e., exhibiting over-confidence and lacking inte… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 11 pages, 3 figures

  21. arXiv:2407.07343  [pdf

    physics.optics physics.app-ph

    Electrically Tuning Quasi-Bound States in the Continuum with Hybrid Graphene-Silicon Metasurfaces

    Authors: Ziqiang Cai, Xianzhe Zhang, Tushar Sanjay Karnik, Yihao Xu, Tae Yoon Kim, Juejun Hu, Yongmin Liu

    Abstract: Metasurfaces have become one of the most prominent research topics in the field of optics owing to their unprecedented properties and novel applications on an ultrathin platform. By combining graphene with metasurfaces, electrical tunable functions can be achieved with fast tuning speed, large modulation depth and broad tuning range. However, the tuning efficiency of hybrid graphene metasurfaces w… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 14 pages, 5 figures

  22. arXiv:2407.07336  [pdf, other

    astro-ph.CO gr-qc hep-ph

    Testing the cosmic distance duality relation using strong gravitational lensing time delays and Type Ia supernovae

    Authors: Jing-Zhao Qi, Yi-Fan Jiang, Wan-Ting Hou, Xin Zhang

    Abstract: We present a comprehensive test of the cosmic distance duality relation (DDR) using a combination of strong gravitational lensing (SGL) time delay measurements and Type Ia supernovae (SNe Ia) data. We investigate three different parameterizations of potential DDR violations. To bridge the gap between SGL and SNe Ia datasets, we implement an artificial neural network (ANN) approach to reconstruct t… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 11 pages, 3 figures

  23. arXiv:2407.07303  [pdf, other

    cond-mat.mes-hall physics.app-ph

    Bimerons create bimerons: proliferation and aggregation induced by currents and magnetic fields

    Authors: Xichao Zhang, Yan Zhou, Xiuzhen Yu, Masahito Mochizuki

    Abstract: The aggregation of topological spin textures at nano and micro scales has practical applications in spintronic technologies. Here, the authors report the in-plane current-induced proliferation and aggregation of bimerons in a bulk chiral magnet. It is found that the spin-transfer torques can induce the proliferation and aggregation of bimerons only in the presence of an appropriate out-of-plane ma… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 13 pages, 8 figures

  24. arXiv:2407.07298  [pdf, other

    cond-mat.mes-hall physics.app-ph

    Transformation of a cellular skyrmion to polyomino-like structures

    Authors: Jing Xia, Xichao Zhang, Yan Zhou, Xiaoxi Liu, Guoping Zhao, Masahito Mochizuki

    Abstract: Topological spin structures with transformable shapes may have potential implications on data storage and computation. Here, we demonstrate that a square cellular skyrmion on an artificial grid pinning pattern can be manipulated by programmed current pulses. We find that parallel short pulses could result in the elongation of the skyrmion mainly in the current direction, while parallel long pulses… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 7 pages, 5 figures

  25. arXiv:2407.07209  [pdf

    cond-mat.mes-hall

    Electrical switching of spin-polarized light-emitting diodes based on a 2D CrI3/hBN/WSe2 heterostructure

    Authors: Jianchen Dang, Tongyao Wu, Shuohua Yan, Kenji Watanabe, Takashi Taniguchi, Hechang Lei, Xiao-Xiao Zhang

    Abstract: Spin-polarized light-emitting diodes (spin-LEDs) convert the electronic spin information to photon circular polarization, offering potential applications including spin amplification, optical communications, and advanced imaging. The conventional control of the emitted light's circular polarization requires a change in the external magnetic field, limiting the operation conditions of spin-LEDs. He… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  26. arXiv:2407.06988  [pdf, other

    math.SP

    Limiting Over-Smoothing and Over-Squashing of Graph Message Passing by Deep Scattering Transforms

    Authors: Yuanhong Jiang, Dongmian Zou, Xiaoqun Zhang, Yu Guang Wang

    Abstract: Graph neural networks (GNNs) have become pivotal tools for processing graph-structured data, leveraging the message passing scheme as their core mechanism. However, traditional GNNs often grapple with issues such as instability, over-smoothing, and over-squashing, which can degrade performance and create a trade-off dilemma. In this paper, we introduce a discriminatively trained, multi-layer Deep… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 35 pages, 6 figures

  27. arXiv:2407.06948  [pdf, other

    eess.SY

    Detection-Triggered Recursive Impact Mitigation against Secondary False Data Injection Attacks in Microgrids

    Authors: Mengxiang Liu, Xin Zhang, Rui Zhang, Zhuoran Zhou, Zhenyong Zhang, Ruilong Deng

    Abstract: The cybersecurity of microgrid has received widespread attentions due to the frequently reported attack accidents against distributed energy resource (DER) manufactures. Numerous impact mitigation schemes have been proposed to reduce or eliminate the impacts of false data injection attacks (FDIAs). Nevertheless, the existing methods either requires at least one neighboring trustworthy agent or may… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Submitted to IEEE Transactions on Smart Grid

  28. arXiv:2407.06503  [pdf, other

    cs.LG

    Preference-Guided Reinforcement Learning for Efficient Exploration

    Authors: Guojian Wang, Faguo Wu, Xiao Zhang, Tianyuan Chen, Xuyang Chen, Lin Zhao

    Abstract: In this paper, we investigate preference-based reinforcement learning (PbRL) that allows reinforcement learning (RL) agents to learn from human feedback. This is particularly valuable when defining a fine-grain reward function is not feasible. However, this approach is inefficient and impractical for promoting deep exploration in hard-exploration tasks with long horizons and sparse rewards. To tac… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 13 pages, 17 figures

  29. arXiv:2407.06192  [pdf, other

    cs.CV cs.AI cs.CL

    Multi-Object Hallucination in Vision-Language Models

    Authors: Xuweiyi Chen, Ziqiao Ma, Xuejun Zhang, Sihan Xu, Shengyi Qian, Jianing Yang, David F. Fouhey, Joyce Chai

    Abstract: Large vision language models (LVLMs) often suffer from object hallucination, producing objects not present in the given images. While current benchmarks for object hallucination primarily concentrate on the presence of a single object class rather than individual entities, this work systematically investigates multi-object hallucination, examining how models misperceive (e.g., invent nonexistent o… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted to ALVR @ ACL 2024 | Project page: https://multi-object-hallucination.github.io/

  30. arXiv:2407.06159  [pdf, other

    cs.CV

    A Semantic-Aware and Multi-Guided Network for Infrared-Visible Image Fusion

    Authors: Xiaoli Zhang, Liying Wang, Libo Zhao, Xiongfei Li, Siwei Ma

    Abstract: Multi-modality image fusion aims at fusing specific-modality and shared-modality information from two source images. To tackle the problem of insufficient feature extraction and lack of semantic awareness for complex scenes, this paper focuses on how to model correlation-driven decomposing features and reason high-level graph representation by efficiently extracting complementary features and mult… ▽ More

    Submitted 11 June, 2024; originally announced July 2024.

  31. arXiv:2407.06083  [pdf, other

    cs.LG cs.IR

    A Survey of Controllable Learning: Methods and Applications in Information Retrieval

    Authors: Chenglei Shen, Xiao Zhang, Teng Shi, Changshuo Zhang, Guofu Xie, Jun Xu

    Abstract: Controllable learning (CL) emerges as a critical component in trustworthy machine learning, ensuring that learners meet predefined targets and can adaptively adjust without retraining according to the changes in those targets. We provide a formal definition of CL, and discuss its applications in information retrieval (IR) where information needs are often complex and dynamic. The survey categorize… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  32. arXiv:2407.05954  [pdf, other

    cs.LG stat.ME

    Causality-driven Sequence Segmentation for Enhancing Multiphase Industrial Process Data Analysis and Soft Sensing

    Authors: Yimeng He, Le Yao, Xinmin Zhang, Xiangyin Kong, Zhihuan Song

    Abstract: The dynamic characteristics of multiphase industrial processes present significant challenges in the field of industrial big data modeling. Traditional soft sensing models frequently neglect the process dynamics and have difficulty in capturing transient phenomena like phase transitions. To address this issue, this article introduces a causality-driven sequence segmentation (CDSS) model. This mode… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  33. arXiv:2407.05878  [pdf, other

    cs.CV

    HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution

    Authors: Xiang Zhang, Yulun Zhang, Fisher Yu

    Abstract: Transformers have exhibited promising performance in computer vision tasks including image super-resolution (SR). However, popular transformer-based SR methods often employ window self-attention with quadratic computational complexity to window sizes, resulting in fixed small windows with limited receptive fields. In this paper, we present a general strategy to convert transformer-based SR network… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  34. arXiv:2407.05758  [pdf, other

    eess.IV cs.AI cs.CV

    Potential of Multimodal Large Language Models for Data Mining of Medical Images and Free-text Reports

    Authors: Yutong Zhang, Yi Pan, Tianyang Zhong, Peixin Dong, Kangni Xie, Yuxiao Liu, Hanqi Jiang, Zhengliang Liu, Shijie Zhao, Tuo Zhang, Xi Jiang, Dinggang Shen, Tianming Liu, Xin Zhang

    Abstract: Medical images and radiology reports are crucial for diagnosing medical conditions, highlighting the importance of quantitative analysis for clinical decision-making. However, the diversity and cross-source heterogeneity of these data challenge the generalizability of current data-mining methods. Multimodal large language models (MLLMs) have recently transformed many domains, significantly affecti… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  35. arXiv:2407.05700  [pdf, other

    cs.CL cs.AI cs.SE

    InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct

    Authors: Yutong Wu, Di Huang, Wenxuan Shi, Wei Wang, Lingzhe Gao, Shihao Liu, Ziyuan Nan, Kaizhao Yuan, Rui Zhang, Xishan Zhang, Zidong Du, Qi Guo, Yewen Pu, Dawei Yin, Xing Hu, Yunji Chen

    Abstract: Recent advancements in open-source code large language models (LLMs) have demonstrated remarkable coding abilities by fine-tuning on the data generated from powerful closed-source LLMs such as GPT-3.5 and GPT-4 for instruction tuning. This paper explores how to further improve an instruction-tuned code LLM by generating data from itself rather than querying closed-source LLMs. Our key observation… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  36. arXiv:2407.05591  [pdf, other

    cs.LG cs.CL cs.NE

    On the Power of Convolution Augmented Transformer

    Authors: Mingchen Li, Xuechen Zhang, Yixiao Huang, Samet Oymak

    Abstract: The transformer architecture has catalyzed revolutionary advances in language modeling. However, recent architectural recipes, such as state-space models, have bridged the performance gap. Motivated by this, we examine the benefits of Convolution-Augmented Transformer (CAT) for recall, copying, and length generalization tasks. CAT incorporates convolutional filters in the K/Q/V embeddings of an at… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  37. arXiv:2407.05384  [pdf, other

    nlin.SI

    A modified Korteweg-de Vries equation soliton gas under the nonzero background

    Authors: Xiaoen Zhang, Liming Ling

    Abstract: In this paper, we consider a soliton gas of the focusing modified Korteweg-de Vries generated from the $N$-soliton solutions under the nonzero background. The spectral soliton density is chosen on the pure imaginary axis, excluding the branch cut $Σ_{c}=\left[-i, i\right]$. In the limit $N\to\infty$, we establish the Riemann-Hilbert problem of the soliton gas. Using the Deift-Zhou nonlinear steepe… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 29pages,6 figures

    MSC Class: 35Q55; 35Q51; 35Q15; 37K40; 37K15; 37K10

  38. arXiv:2407.05233  [pdf, other

    cs.CL cs.AI

    Advancing Prompt Recovery in NLP: A Deep Dive into the Integration of Gemma-2b-it and Phi2 Models

    Authors: Jianlong Chen, Wei Xu, Zhicheng Ding, Jinxin Xu, Hao Yan, Xinyu Zhang

    Abstract: Prompt recovery, a crucial task in natural language processing, entails the reconstruction of prompts or instructions that language models use to convert input text into a specific output. Although pivotal, the design and effectiveness of prompts represent a challenging and relatively untapped field within NLP research. This paper delves into an exhaustive investigation of prompt recovery methodol… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  39. arXiv:2407.05229  [pdf, other

    cs.LG

    HiDe-PET: Continual Learning via Hierarchical Decomposition of Parameter-Efficient Tuning

    Authors: Liyuan Wang, Jingyi Xie, Xingxing Zhang, Hang Su, Jun Zhu

    Abstract: The deployment of pre-trained models (PTMs) has greatly advanced the field of continual learning (CL), enabling positive knowledge transfer and resilience to catastrophic forgetting. To sustain these advantages for sequentially arriving tasks, a promising direction involves keeping the pre-trained backbone frozen while employing parameter-efficient tuning (PET) techniques to instruct representatio… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: This is a generalized version of our HiDe-Prompt (NeurIPS 2023, Spotlight)

  40. arXiv:2407.05074  [pdf, other

    quant-ph

    Decoherence without einselection

    Authors: Xiao Zhang

    Abstract: Decoherence in a quantum measurement is typically explained as an interaction with the environment that destroys coherence between the system's eigenstates, a phenomenon known as environment-induced superselection (einselection). In this work, we demonstrate that einselection and the associated envariance are actually artifacts resulting from neglecting the non-equilibrium dynamics of the apparatu… ▽ More

    Submitted 9 July, 2024; v1 submitted 6 July, 2024; originally announced July 2024.

  41. arXiv:2407.04877  [pdf

    cond-mat.mtrl-sci cs.LG physics.chem-ph

    Leveraging Data Mining, Active Learning, and Domain Adaptation in a Multi-Stage, Machine Learning-Driven Approach for the Efficient Discovery of Advanced Acidic Oxygen Evolution Electrocatalysts

    Authors: Rui Ding, Jianguo Liu, Kang Hua, Xuebin Wang, Xiaoben Zhang, Minhua Shao, Yuxin Chen, Junhong Chen

    Abstract: Developing advanced catalysts for acidic oxygen evolution reaction (OER) is crucial for sustainable hydrogen production. This study introduces a novel, multi-stage machine learning (ML) approach to streamline the discovery and optimization of complex multi-metallic catalysts. Our method integrates data mining, active learning, and domain adaptation throughout the materials discovery process. Unlik… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 95 pages (main text 37 pages; supplementary materials 58 pages); 38 figures (main text 6 figures; supplementary materials 32 figures)

  42. arXiv:2407.04751  [pdf, ps, other

    cs.LG cs.AI cs.CR

    A Unified Learn-to-Distort-Data Framework for Privacy-Utility Trade-off in Trustworthy Federated Learning

    Authors: Xiaojin Zhang, Mingcong Xu, Wei Chen

    Abstract: In this paper, we first give an introduction to the theoretical basis of the privacy-utility equilibrium in federated learning based on Bayesian privacy definitions and total variation distance privacy definitions. We then present the \textit{Learn-to-Distort-Data} framework, which provides a principled approach to navigate the privacy-utility equilibrium by explicitly modeling the distortion intr… ▽ More

    Submitted 9 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

  43. arXiv:2407.04313  [pdf, other

    math.DS math.AP math.PR

    Poisson stability of solutions for stochastic evolution equations driven by fractional Brownian motion

    Authors: Xinze Zhang, Xue Yang

    Abstract: In this paper, we study the problem of Poisson stability of solutions for stochastic semi-linear evolution equation driven by fractional Brownian motion $$\mathrm{d} X(t)= \left( AX(t) + f(t,X(t)) \right) \mathrm{d}t + g\left(t,X(t)\right)\mathrm{d}B^H_{Q}(t),$$ where $A$ is an exponentially stable linear operator acting on a separable Hilbert space $\mathbb{H}$, coefficients $f$ and $g$ are Poiss… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 25 pages, 1 figures. arXiv admin note: substantial text overlap with arXiv:1702.02718 by other authors

    MSC Class: 60G22; 34C25; 34C27; 37B20; 60H10; 34D20

  44. arXiv:2407.04290  [pdf, other

    math.PR

    Onsager-Machlup functional for stochastic differential equations with time-varying noise

    Authors: Xinze Zhang, Yong Li

    Abstract: This paper is devoted to studying the Onsager-Machlup functional for stochastic differential equations with time-varying noise of the α-Hölder, 0<α<1/4, dXt =f(t,Xt)dt+g(t)dWt. Our study focuses on scenarios where the diffusion coefficient g(t) exhibits temporal variability, starkly contrasting the conventional assumption of a constant diffusion coefficient in the existing literature. This var… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 17 pages, 4 figures

    MSC Class: 82C35; 60H10; 37H10

  45. arXiv:2407.04224  [pdf, other

    cs.RO

    PA-LOCO: Learning Perturbation-Adaptive Locomotion for Quadruped Robots

    Authors: Zhiyuan Xiao, Xinyu Zhang, Xiang Zhou, Qingrui Zhang

    Abstract: Numerous locomotion controllers have been designed based on Reinforcement Learning (RL) to facilitate blind quadrupedal locomotion traversing challenging terrains. Nevertheless, locomotion control is still a challenging task for quadruped robots traversing diverse terrains amidst unforeseen disturbances. Recently, privileged learning has been employed to learn reliable and robust quadrupedal locom… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 8 pages, Accepted by IROS 2024

  46. arXiv:2407.04093  [pdf, other

    cs.CL

    Stephanie: Step-by-Step Dialogues for Mimicking Human Interactions in Social Conversations

    Authors: Hao Yang, Hongyuan Lu, Xinhua Zeng, Yang Liu, Xiang Zhang, Haoran Yang, Yumeng Zhang, Shan Huang, Yiran Wei, Wai Lam

    Abstract: In the rapidly evolving field of natural language processing, dialogue systems primarily employ a single-step dialogue paradigm. Although this paradigm is efficient, it lacks the depth and fluidity of human interactions and does not appear natural. We introduce a novel \textbf{Step}-by-Step Dialogue Paradigm (Stephanie), designed to mimic the ongoing dynamic nature of human conversations. By emplo… ▽ More

    Submitted 12 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

  47. arXiv:2407.04055  [pdf, other

    q-bio.QM cs.AI cs.LG

    Benchmark on Drug Target Interaction Modeling from a Structure Perspective

    Authors: Xinnan Zhang, Jialin Wu, Junyi Xie, Tianlong Chen, Kaixiong Zhou

    Abstract: The prediction modeling of drug-target interactions is crucial to drug discovery and design, which has seen rapid advancements owing to deep learning technologies. Recently developed methods, such as those based on graph neural networks (GNNs) and Transformers, demonstrate exceptional performance across various datasets by effectively extracting structural information. However, the benchmarking of… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Submitted to NIPS 2024 Dataset and Benchmark

  48. arXiv:2407.03971  [pdf, other

    cs.CV

    MineNetCD: A Benchmark for Global Mining Change Detection on Remote Sensing Imagery

    Authors: Weikang Yu, Xiaokang Zhang, Xiao Xiang Zhu, Richard Gloaguen, Pedram Ghamisi

    Abstract: Monitoring changes triggered by mining activities is crucial for industrial controlling, environmental management and regulatory compliance, yet it poses significant challenges due to the vast and often remote locations of mining sites. Remote sensing technologies have increasingly become indispensable to detect and analyze these changes over time. We thus introduce MineNetCD, a comprehensive benc… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  49. arXiv:2407.03883  [pdf, other

    cs.CR cs.LG cs.SE

    Protecting Deep Learning Model Copyrights with Adversarial Example-Free Reuse Detection

    Authors: Xiaokun Luan, Xiyue Zhang, Jingyi Wang, Meng Sun

    Abstract: Model reuse techniques can reduce the resource requirements for training high-performance deep neural networks (DNNs) by leveraging existing models. However, unauthorized reuse and replication of DNNs can lead to copyright infringement and economic loss to the model owner. This underscores the need to analyze the reuse relation between DNNs and develop copyright protection techniques to safeguard… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 12 pages, 5 figures

  50. arXiv:2407.03704  [pdf, other

    cs.AI cs.LG

    Neural Probabilistic Logic Learning for Knowledge Graph Reasoning

    Authors: Fengsong Sun, Jinyu Wang, Zhiqing Wei, Xianchao Zhang

    Abstract: Knowledge graph (KG) reasoning is a task that aims to predict unknown facts based on known factual samples. Reasoning methods can be divided into two categories: rule-based methods and KG-embedding based methods. The former possesses precise reasoning capabilities but finds it challenging to reason efficiently over large-scale knowledge graphs. While gaining the ability to reason over large-scale… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.