Skip to main content

Showing 1–50 of 1,665 results for author: Jiang, L

  1. arXiv:2407.06975  [pdf

    cond-mat.mtrl-sci

    Optimization of noncollinear magnetic ordering temperature in Y-type hexaferrite by machine learning

    Authors: Yonghong Li, Jing Zhang, Linfeng Jiang, Long Zhang, Yugang Zhang, Xueliang Wu, Yisheng Chai, Xiaoyuan Zhou, Zizhen Zhou

    Abstract: Searching the optimal doping compositions of the Y-type hexaferrite Ba2Mg2Fe12O22 remains a long-standing challenge for enhanced non-collinear magnetic transition temperature (TNC). Instead of the conventional trial-and-error approach, the composition-property descriptor is established via a data driven machine learning method named SISSO (sure independence screening and sparsifying operator). Bas… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: accepted by Applied Physics Letters in 2024

  2. Style Alignment based Dynamic Observation Method for UAV-View Geo-localization

    Authors: Jie Shao, LingHao Jiang

    Abstract: The task of UAV-view geo-localization is to estimate the localization of a query satellite/drone image by matching it against a reference dataset consisting of drone/satellite images. Though tremendous strides have been made in feature alignment between satellite and drone views, vast differences in both inter and intra-class due to changes in viewpoint, altitude, and lighting remain a huge challe… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: has published on IEEE Transactions on Geoscience and Remote Sensing, 2023

  3. arXiv:2407.02234  [pdf, other

    physics.flu-dyn

    How turbulence increases the bubble-particle collision rate

    Authors: Linfeng Jiang, Dominik Krug

    Abstract: We study the effect of turbulence on collisions between a finite-size bubble and small inertial particles based on interface-resolved simulations. Our results show that the interaction with the flow field around the bubble remains the dominant effect. Nonlinear dependencies in this process can enhance the turbulent collision rate by up to 100\% compared to quiescent flow. Fluctuations in the bubbl… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  4. arXiv:2407.01429  [pdf, other

    quant-ph

    Generalized quantum repeater graph states

    Authors: Bikun Li, Kenneth Goodenough, Filip Rozpędek, Liang Jiang

    Abstract: All-photonic quantum repeaters are essential for establishing long-range quantum entanglement. Within repeater nodes, reliably performing entanglement swapping is a key component of scalable quantum communication. To tackle the challenge of probabilistic Bell state measurement in linear optics, which often leads to information loss, various approaches have been proposed to ensure the loss toleranc… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  5. arXiv:2406.19465  [pdf, other

    cs.CL

    Can Large Language Models Generate High-quality Patent Claims?

    Authors: Lekang Jiang, Caiqi Zhang, Pascal A Scherz, Stephan Goetz

    Abstract: Large language models (LLMs) have shown exceptional performance across various text generation tasks but remain under-explored in the patent domain, which offers highly structured and precise language. This paper constructs a dataset to investigate the performance of current LLMs in patent claim generation. Our results demonstrate that generating claims based on patent descriptions outperforms pre… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 13 pages

  6. arXiv:2406.18510  [pdf, other

    cs.CL

    WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models

    Authors: Liwei Jiang, Kavel Rao, Seungju Han, Allyson Ettinger, Faeze Brahman, Sachin Kumar, Niloofar Mireshghallah, Ximing Lu, Maarten Sap, Yejin Choi, Nouha Dziri

    Abstract: We introduce WildTeaming, an automatic LLM safety red-teaming framework that mines in-the-wild user-chatbot interactions to discover 5.7K unique clusters of novel jailbreak tactics, and then composes multiple tactics for systematic exploration of novel jailbreaks. Compared to prior work that performed red-teaming via recruited human workers, gradient-based optimization, or iterative revision with… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  7. arXiv:2406.18495  [pdf, other

    cs.CL

    WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

    Authors: Seungju Han, Kavel Rao, Allyson Ettinger, Liwei Jiang, Bill Yuchen Lin, Nathan Lambert, Yejin Choi, Nouha Dziri

    Abstract: We introduce WildGuard -- an open, light-weight moderation tool for LLM safety that achieves three goals: (1) identifying malicious intent in user prompts, (2) detecting safety risks of model responses, and (3) determining model refusal rate. Together, WildGuard serves the increasing needs for automatic safety moderation and evaluation of LLM interactions, providing a one-stop tool with enhanced a… ▽ More

    Submitted 9 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: First two authors contributed equally. Third and fourth authors contributed equally

  8. arXiv:2406.17443  [pdf, other

    cs.CV

    Using joint angles based on the international biomechanical standards for human action recognition and related tasks

    Authors: Kevin Schlegel, Lei Jiang, Hao Ni

    Abstract: Keypoint data has received a considerable amount of attention in machine learning for tasks like action detection and recognition. However, human experts in movement such as doctors, physiotherapists, sports scientists and coaches use a notion of joint angles standardised by the International Society of Biomechanics to precisely and efficiently communicate static body poses and movements. In this… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  9. arXiv:2406.16328  [pdf, other

    cs.CE

    Convolutional neural network based reduced order modeling for multiscale problems

    Authors: Xuhan Zhang, Lijian Jiang

    Abstract: In this paper, we combine convolutional neural networks (CNNs) with reduced order modeling (ROM) for efficient simulations of multiscale problems. These problems are modeled by partial differential equations with high-dimensional random inputs. The proposed method involves two separate CNNs: Basis CNNs and Coefficient CNNs (Coef CNNs), which correspond to two main parts of ROM. The method is calle… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 35 pages, 29 figures

  10. arXiv:2406.13987  [pdf

    cs.CV cs.LG

    Image anomaly detection and prediction scheme based on SSA optimized ResNet50-BiGRU model

    Authors: Qianhui Wan, Zecheng Zhang, Liheng Jiang, Zhaoqi Wang, Yan Zhou

    Abstract: Image anomaly detection is a popular research direction, with many methods emerging in recent years due to rapid advancements in computing. The use of artificial intelligence for image anomaly detection has been widely studied. By analyzing images of athlete posture and movement, it is possible to predict injury status and suggest necessary adjustments. Most existing methods rely on convolutional… ▽ More

    Submitted 20 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  11. arXiv:2406.13203  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci physics.app-ph

    Dynamical phase-field model of cavity electromagnonic systems

    Authors: Shihao Zhuang, Yujie Zhu, Changchun Zhong, Liang Jiang, Xufeng Zhang, Jia-Mian Hu

    Abstract: Cavity electromagnonic system, which simultaneously consists of cavities for photons, magnons (quanta of spin waves), and acoustic phonons, provides an exciting platform to achieve coherent energy transduction among different physical systems down to single quantum level. Here we report a dynamical phase-field model that allows simulating the coupled dynamics of the electromagnetic waves, magnetiz… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  12. arXiv:2406.10744  [pdf, other

    cs.CV

    Technique Report of CVPR 2024 PBDL Challenges

    Authors: Ying Fu, Yu Li, Shaodi You, Boxin Shi, Linwei Chen, Yunhao Zou, Zichun Wang, Yichen Li, Yuze Han, Yingkai Zhang, Jianan Wang, Qinglin Liu, Wei Yu, Xiaoqian Lv, Jianing Li, Shengping Zhang, Xiangyang Ji, Yuanpei Chen, Yuhan Zhang, Weihang Peng, Liwen Zhang, Zhe Xu, Dingyong Gou, Cong Li, Senyan Xu , et al. (75 additional authors not shown)

    Abstract: The intersection of physics-based vision and deep learning presents an exciting frontier for advancing computer vision technologies. By leveraging the principles of physics to inform and enhance deep learning models, we can develop more robust and accurate vision systems. Physics-based vision aims to invert the processes to recover scene properties such as shape, reflectance, light distribution, a… ▽ More

    Submitted 12 July, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

    Comments: CVPR 2024 PBDL Challenges: https://pbdl-ws.github.io/pbdl2024/challenge/index.html

  13. arXiv:2406.10148  [pdf, other

    math.OC cs.LG stat.ML

    A Primal-Dual-Assisted Penalty Approach to Bilevel Optimization with Coupled Constraints

    Authors: Liuyuan Jiang, Quan Xiao, Victor M. Tenorio, Fernando Real-Rojas, Antonio Marques, Tianyi Chen

    Abstract: Interest in bilevel optimization has grown in recent years, partially due to its applications to tackle challenging machine-learning problems. Several exciting recent works have been centered around developing efficient gradient-based algorithms that can solve bilevel optimization problems with provable guarantees. However, the existing literature mainly focuses on bilevel problems either without… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  14. arXiv:2406.09991  [pdf, other

    astro-ph.HE astro-ph.SR

    On the Interacting/Active Lifetime of Supernova Fallback Disk around Isolated Neutron Stars

    Authors: Kun Xu, Hao-Ran Yang, Long Jiang, Wen-Cong Chen, Xiang-Dong Li, Jifeng Liu

    Abstract: The fallback disk model is widely accepted to explain long-period neutron stars (NSs) which can't be simulated by magnetic dipole radiation. However, no confirmed detection of disk was found from the newly discovered long period pulsars GLEAM-X 162759.5-523504.3, GPM J1839-10 and the known slowest isolated NSs 1E 161348-5055. This might be that the disks have either been in noninteracting/inactive… ▽ More

    Submitted 16 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted in ApJ, comments are welcome

  15. arXiv:2406.09789  [pdf, ps, other

    math.NA

    Localized subspace iteration methods for elliptic multiscale problems

    Authors: Xiaofei Guan, Lijian Jiang, Yajun Wang, Zihao Yang

    Abstract: This paper proposes localized subspace iteration (LSI) methods to construct generalized finite element basis functions for elliptic problems with multiscale coefficients. The key components of the proposed method consist of the localization of the original differential operator and the subspace iteration of the corresponding local spectral problems, where the localization is conducted by enforcing… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 23 pages

    MSC Class: 65N99; 65N30; 34E13

  16. arXiv:2406.07961  [pdf, other

    cs.CV cs.AI

    Accurate Explanation Model for Image Classifiers using Class Association Embedding

    Authors: Ruitao Xie, Jingbang Chen, Limai Jiang, Rui Xiao, Yi Pan, Yunpeng Cai

    Abstract: Image classification is a primary task in data analysis where explainable models are crucially demanded in various applications. Although amounts of methods have been proposed to obtain explainable knowledge from the black-box classifiers, these approaches lack the efficiency of extracting global knowledge regarding the classification task, thus is vulnerable to local traps and often leads to poor… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 40th IEEE International Conference on Data Engineering

  17. arXiv:2406.06701  [pdf, other

    astro-ph.CO

    The XMM-SERVS X-ray eXtended Galaxy Cluster (XVXGC) catalog

    Authors: Weiwei Xu, Linhua Jiang, Ran Li, Bin Luo, W. Nielsen Brandt, Chaoli Zhang, Thomas Erben

    Abstract: To explain the well-known tension between cosmological parameter constraints obtained from the primary CMB and those drawn from galaxy cluster samples, we propose a possible explanation for the incompleteness of detected clusters are higher than estimated. We aim to search for galaxy groups and clusters with particularly extended surface brightness distributions by creating a new X-ray-selected ca… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 16pages, 11 figures, 5 tables, submit to A&A. This entire sample is available at https://github.com/wwxu/xvxgc.github.io together with the paper publication

  18. arXiv:2406.05673  [pdf, other

    cs.AI cs.CL

    Flow of Reasoning: Efficient Training of LLM Policy with Divergent Thinking

    Authors: Fangxu Yu, Lai Jiang, Haoqiang Kang, Shibo Hao, Lianhui Qin

    Abstract: Divergent thinking, the cognitive process of generating diverse solutions, is a hallmark of human creativity and problem-solving. For machines, sampling diverse solution trajectories in complex reasoning problems is crucial for robust outcomes, data augmentation, and enhanced model generalization. Large language models (LLMs) often struggle with generating high-quality, diverse reasoning. While su… ▽ More

    Submitted 24 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  19. arXiv:2406.05637  [pdf, ps, other

    math.OC cs.LG math.PR stat.ML

    A Generalized Version of Chung's Lemma and its Applications

    Authors: Li Jiang, Xiao Li, Andre Milzarek, Junwen Qiu

    Abstract: Chung's lemma is a classical tool for establishing asymptotic convergence rates of (stochastic) optimization methods under strong convexity-type assumptions and appropriate polynomial diminishing step sizes. In this work, we develop a generalized version of Chung's lemma, which provides a simple non-asymptotic convergence framework for a more general family of step size rules. We demonstrate broad… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 43 pages, 5 figures

    MSC Class: 90C15; 90C30; 90C26

  20. arXiv:2406.04275  [pdf, other

    quant-ph

    Interfacing Gottesman-Kitaev-Preskill Qubits to Quantum Memories

    Authors: Prajit Dhara, Liang Jiang, Saikat Guha

    Abstract: Gottesman-Kitaev-Preskill (GKP) states have been demonstrated to pose significant advantages when utilized for fault-tolerant all optical continuous-variable quantum computing as well as for quantum communications links for entanglement distribution. However interfacing these systems to long-lived solid-state quantum memories has remained an open problem. Here we propose an interface between quant… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 17 pages; 8 figures; Comments are welcome!

  21. arXiv:2406.04272  [pdf, other

    quant-ph

    Entangling Quantum Memories at Channel Capacity

    Authors: Prajit Dhara, Liang Jiang, Saikat Guha

    Abstract: Entangling quantum memories, mediated by optical-frequency or microwave channels, at high rates and fidelities is key for linking qubits across short and long ranges. All well-known protocols encode up to one qubit per optical mode, hence entangling one pair of memory qubits per transmitted mode over the channel, with probability $η$, the channel's transmissivity. The rate is proportional to $η$ i… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 14 pages; 8 figures; Comments are welcome!

  22. arXiv:2406.03271  [pdf, other

    cs.CV

    Image Copy-Move Forgery Detection and Localization Scheme: How to Avoid Missed Detection and False Alarm

    Authors: Li Jiang, Zhaowei Lu, Yuebing Gao, Yifan Wang

    Abstract: Image copy-move is an operation that replaces one part of the image with another part of the same image, which can be used for illegal purposes due to the potential semantic changes. Recent studies have shown that keypoint-based algorithms achieved excellent and robust localization performance even when small or smooth tampered areas were involved. However, when the input image is low-resolution,… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  23. arXiv:2406.02856  [pdf, other

    cs.CL cs.AI

    Xmodel-LM Technical Report

    Authors: Yichuan Wang, Yang Liu, Yu Yan, Qun Wang, Xucheng Huang, Ling Jiang

    Abstract: We introduce Xmodel-LM, a compact and efficient 1.1B language model pre-trained on around 2 trillion tokens. Trained on our self-built dataset (Xdata), which balances Chinese and English corpora based on downstream task optimization, Xmodel-LM exhibits remarkable performance despite its smaller size. It notably surpasses existing open-source language models of similar scale. Our model checkpoints… ▽ More

    Submitted 26 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  24. arXiv:2406.02746  [pdf, other

    cs.CL

    RATT: A Thought Structure for Coherent and Correct LLM Reasoning

    Authors: Jinghan Zhang, Xiting Wang, Weijieying Ren, Lu Jiang, Dongjie Wang, Kunpeng Liu

    Abstract: Large Language Models (LLMs) gain substantial reasoning and decision-making capabilities from thought structures. However, existing methods such as Tree of Thought and Retrieval Augmented Thoughts often fall short in complex tasks due to the limitations of insufficient local retrieval of factual knowledge and inadequate global selection of strategies. These limitations make it challenging for thes… ▽ More

    Submitted 11 July, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  25. arXiv:2406.02669  [pdf, other

    quant-ph

    A generalized cycle benchmarking algorithm for characterizing mid-circuit measurements

    Authors: Zhihan Zhang, Senrui Chen, Yunchao Liu, Liang Jiang

    Abstract: Mid-circuit measurement (MCM) is a crucial ingredient in the development of fault-tolerant quantum computation. While there have been rapid experimental progresses in realizing MCM, a systematic method for characterizing noisy MCM is still under exploration. In this work we develop an algorithm to characterize noisy MCM, via a generalization of cycle benchmarking -- a standard approach for charact… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 27 pages, 9 figures

  26. arXiv:2406.01281  [pdf

    physics.med-ph cs.HC

    Extraction of Maternal and fetal ECG in a non-invasive way from abdominal ECG recordings using modified Progressive FastICA Peel-off

    Authors: Yao Li, Xuanyu Luo, Haowen Zhao, Jiawen Cui, Yangfan She, Dongfang Li, Lai Jiang, Xu Zhang

    Abstract: The non-invasive abdominal electrocardiogram (AECG) gives a non-invasive way to monitor fetal well-being during pregnancy. Due to the overlap with maternal ECG (MECG) as well as potential noises from other sources, it is challenging to extract weak fetal ECG (FECG) using surface electrodes. Taking advantage of precise source separation capability of the FastICA approach combined with its constrain… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  27. arXiv:2406.00593  [pdf

    physics.optics

    Low threshold optical bistability based on MoS2 in asymmetric Fabry-Perot cavity structure in visible light band

    Authors: Songqing Tang, Mengjiao Ren, Zhiheng Li, Zhiwei Zheng, Leyong Jiang

    Abstract: This article theoretically proposes a multi-layer Fabry-Perot cavity structure based on nonlinear MoS2, whose cavity is composed of asymmetric photonic crystals. In this structure, we observed a low threshold optical bistability phenomenon on the order of a in the visible light band, which is caused by the large third-order nonlinear conductivity of the bilayer MoS2 and the Fabry-Perot cavity reso… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 22 pages, 6 figures

  28. arXiv:2406.00590  [pdf

    physics.optics

    MoS2-based optical bistability in silver-Bragg reflector multilayer structure at visible light band

    Authors: Songqing Tang, Mengjiao Ren, Zhiheng Li, Zhiwei Zheng, Leyong Jiang

    Abstract: In this paper, we present a theoretical analysis of the optical bistability in a metallic silver-Bragg reflector structure by embedding bilayer MoS2 at the visible band. The nonlinear OB is achieved due to the nonlinear conductivity of the bilayer MoS2 and the excitation of the optical Tamm state at the interface between the silver and the Bragg reflector. It is found that the hysteresis behaviour… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 23 pages, 6 figures

  29. arXiv:2406.00420  [pdf

    physics.optics physics.app-ph

    Realization of type-II double-zero-index photonic crystals

    Authors: Zebin Zhu, Dong Zhao, Ziyao Wang, Xucheng Yang, Liyong Jiang, Zhen Gao

    Abstract: Some photonic crystals (PCs) with Dirac-like conical dispersions exhibit the property of double zero refractive index (that is, both epsilon and mu near zero (EMNZ)), wherein the electromagnetic waves have an infinite effective wavelength and do not experience any spatial phase change. The Dirac-like cones that support EMNZ are previously thought to present only at the center of the Brillouin zone… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 38 pages, 13 figures

  30. arXiv:2405.20000  [pdf, other

    math.NA

    Combining physics-informed graph neural network and finite difference for solving forward and inverse spatiotemporal PDEs

    Authors: Hao Zhang, Longxiang Jiang, Xinkun Chu, Yong Wen, Luxiong Li, Yonghao Xiao, Liyuan Wang

    Abstract: The great success of Physics-Informed Neural Networks (PINN) in solving partial differential equations (PDEs) has significantly advanced our simulation and understanding of complex physical systems in science and engineering. However, many PINN-like methods are poorly scalable and are limited to in-sample scenarios. To address these challenges, this work proposes a novel discrete approach termed P… ▽ More

    Submitted 14 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  31. arXiv:2405.18676  [pdf

    physics.med-ph

    Exploring Automated Contouring Across Institutional Boundaries: A Deep Learning Approach with Mouse Micro-CT Datasets

    Authors: Lu Jiang, Di Xu, Qifan Xu, Arion Chatziioannou, Keisuke S. Iwamoto, Susanta Hui, Ke Sheng

    Abstract: Image-guided mouse irradiation is essential to understand interventions involving radiation prior to human studies. Our objective is to employ Swin UNEt Transformers (Swin UNETR) to segment native micro-CT and contrast-enhanced micro-CT scans and benchmark the results against 3D no-new-Net (nnU-Net). Swin UNETR reformulates mouse organ segmentation as a sequence-to-sequence prediction task, using… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  32. arXiv:2405.15236  [pdf, other

    quant-ph

    Detecting Errors in a Quantum Network with Pauli Checks

    Authors: Alvin Gonzales, Daniel Dilley, Bikun Li, Liang Jiang, Zain H. Saleem

    Abstract: We apply the quantum error detection scheme Pauli check sandwiching (PCS) to quantum networks by turning it into a distributed multiparty protocol. PCS is a distance 1 code and requires less resource overhead than standard quantum error correction and detection methods. We provide analytical equations for the final fidelity and postselection rate. We also introduce a recursive version of PCS for e… ▽ More

    Submitted 3 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: Comments are welcome!

  33. arXiv:2405.14231  [pdf, other

    cs.CL

    From Role-Play to Drama-Interaction: An LLM Solution

    Authors: Weiqi Wu, Hongqiu Wu, Lai Jiang, Xingyuan Liu, Jiale Hong, Hai Zhao, Min Zhang

    Abstract: Drama is a form of storytelling inspired by human creativity, proceeding with a predefined storyline, carrying emotions and thoughts. This paper introduces \emph{LLM-based interactive drama}, which endows traditional drama with an unprecedented immersion, where a person is allowed to walk into it and interact with the characters and scenes. We define this new artistic genre by 6 essential elements… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Accepted by ACL 2024 Findings

  34. arXiv:2405.13762  [pdf, other

    cs.CV cs.LG cs.MM cs.SD eess.AS

    A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation

    Authors: Gwanghyun Kim, Alonso Martinez, Yu-Chuan Su, Brendan Jou, José Lezama, Agrim Gupta, Lijun Yu, Lu Jiang, Aren Jansen, Jacob Walker, Krishna Somandepalli

    Abstract: Training diffusion models for audiovisual sequences allows for a range of generation tasks by learning conditional distributions of various input-output combinations of the two modalities. Nevertheless, this strategy often requires training a separate model for each task which is expensive. Here, we propose a novel training approach to effectively learn arbitrary conditional distributions in the a… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  35. arXiv:2405.11647  [pdf, other

    cs.AI cs.LG

    Hummer: Towards Limited Competitive Preference Dataset

    Authors: Li Jiang, Yusen Wu, Junwu Xiong, Jingqing Ruan, Yichuan Ding, Qingpei Guo, Zujie Wen, Jun Zhou, Xiaotie Deng

    Abstract: Preference datasets are essential for incorporating human preferences into pre-trained language models, playing a key role in the success of Reinforcement Learning from Human Feedback. However, these datasets often demonstrate conflicting alignment objectives, leading to increased vulnerability to jailbreak attacks and challenges in adapting downstream tasks to prioritize specific alignment object… ▽ More

    Submitted 20 May, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

    Comments: 9 pages, 5 figures

  36. arXiv:2405.11607  [pdf, other

    cs.CR cs.AR

    OFHE: An Electro-Optical Accelerator for Discretized TFHE

    Authors: Mengxin Zheng, Cheng Chu, Qian Lou, Nathan Youngblood, Mo Li, Sajjad Moazeni, Lei Jiang

    Abstract: This paper presents \textit{OFHE}, an electro-optical accelerator designed to process Discretized TFHE (DTFHE) operations, which encrypt multi-bit messages and support homomorphic multiplications, lookup table operations and full-domain functional bootstrappings. While DTFHE is more efficient and versatile than other fully homomorphic encryption schemes, it requires 32-, 64-, and 128-bit polynomia… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  37. arXiv:2405.11549  [pdf

    nucl-ex

    Experimental Study on Deuterium-Deuterium Thermonuclear Fusion with Interface Confinement

    Authors: Darong Chen, Liang Jiang, Shuai Chen, Bao Wang, Dangguo Li, Peng Liang

    Abstract: Nuclear fusion is recognized as the energy of the future, and huge efforts and capitals have been put into the research of controlled nuclear fusion in the past decades. The most challenging thing for controlled nuclear fusion is to generate and keep a super high temperature. Here, a sonication system, combining with micro-scale fluid control techniques, was built to generate cavitation within a l… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  38. arXiv:2405.11464  [pdf, other

    cs.CL cs.AI cs.LG

    Efficient Prompt Tuning by Multi-Space Projection and Prompt Fusion

    Authors: Pengxiang Lan, Enneng Yang, Yuting Liu, Guibing Guo, Linying Jiang, Jianzhe Zhao, Xingwei Wang

    Abstract: Prompt tuning is a promising method to fine-tune a pre-trained language model without retraining its large-scale parameters. Instead, it attaches a soft prompt to the input text, whereby downstream tasks can be well adapted by merely learning the embeddings of prompt tokens. Nevertheless, existing methods still suffer from two challenges: (i) they are hard to balance accuracy and efficiency. A lon… ▽ More

    Submitted 1 July, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

  39. arXiv:2405.10825  [pdf, other

    eess.SY cs.LG

    Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities

    Authors: Hao Zhou, Chengming Hu, Ye Yuan, Yufei Cui, Yili Jin, Can Chen, Haolun Wu, Dun Yuan, Li Jiang, Di Wu, Xue Liu, Charlie Zhang, Xianbin Wang, Jiangchuan Liu

    Abstract: Large language models (LLMs) have received considerable attention recently due to their outstanding comprehension and reasoning capabilities, leading to great progress in many fields. The advancement of LLM techniques also offers promising opportunities to automate many tasks in the telecommunication (telecom) field. After pre-training and fine-tuning, LLMs can perform diverse downstream tasks bas… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  40. arXiv:2405.09215  [pdf, other

    cs.CV cs.AI

    Xmodel-VLM: A Simple Baseline for Multimodal Vision Language Model

    Authors: Wanting Xu, Yang Liu, Langping He, Xucheng Huang, Ling Jiang

    Abstract: We introduce Xmodel-VLM, a cutting-edge multimodal vision language model. It is designed for efficient deployment on consumer GPU servers. Our work directly confronts a pivotal industry issue by grappling with the prohibitive service costs that hinder the broad adoption of large-scale multimodal systems. Through rigorous training, we have developed a 1B-scale language model from the ground up, emp… ▽ More

    Submitted 20 June, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  41. arXiv:2405.08977  [pdf, other

    astro-ph.CO

    Constraints on the variation of the fine-structure constant at 3<z<10 with JWST emission-line galaxies

    Authors: Linhua Jiang, Shuqi Fu, Feige Wang, Sarah E. I. Bosman, Zheng Cai, Hyunsung D. Jun, Zhiwei Pan, Fengwu Sun, Jinyi Yang, Huanian Zhang

    Abstract: We present constraints on the spacetime variation of the fine-structure constant $α$ at redshifts $3<z<10$ using JWST emission-line galaxies. The galaxy sample consists of 572 high-quality spectra with strong and narrow [O III] $λλ$4959,5007 doublet emission lines from 522 galaxies, including 267 spectra at $z>5$. The [O III] doublet lines are arguably the best emission lines to probe the variatio… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 9 pages, 6 figures, submitted to ApJ

  42. arXiv:2405.08403  [pdf, other

    cs.LG

    TFWT: Tabular Feature Weighting with Transformer

    Authors: Xinhao Zhang, Zaitian Wang, Lu Jiang, Wanfu Gao, Pengfei Wang, Kunpeng Liu

    Abstract: In this paper, we propose a novel feature weighting method to address the limitation of existing feature processing methods for tabular data. Typically the existing methods assume equal importance across all samples and features in one dataset. This simplified processing methods overlook the unique contributions of each feature, and thus may miss important feature information. As a result, it lead… ▽ More

    Submitted 17 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: Accepted by IJCAI 2024

  43. arXiv:2405.07530  [pdf, other

    cs.SE

    Prompt-based Code Completion via Multi-Retrieval Augmented Generation

    Authors: Hanzhuo Tan, Qi Luo, Ling Jiang, Zizheng Zhan, Jing Li, Haotian Zhang, Yuqun Zhang

    Abstract: Automated code completion, aiming at generating subsequent tokens from unfinished code, has been significantly benefited from recent progress in pre-trained Large Language Models (LLMs). However, these models often suffer from coherence issues and hallucinations when dealing with complex code logic or extrapolating beyond their training data. Existing Retrieval Augmented Generation (RAG) technique… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  44. arXiv:2405.07303  [pdf, other

    hep-ex hep-ph physics.ins-det

    Search for solar axions by Primakoff effect with the full dataset of the CDEX-1B Experiment

    Authors: L. T. Yang, S. K. Liu, Q. Yue, K. J. Kang, Y. J. Li, H. P. An, Greeshma C., J. P. Chang, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, T. Guo, X. Y. Guo, L. He, J. R. He, J. W. Hu, H. X. Huang, T. C. Huang, L. Jiang, S. Karmakar , et al. (61 additional authors not shown)

    Abstract: We present the first limit on $g_{Aγ}$ coupling constant using the Bragg-Primakoff conversion based on an exposure of 1107.5 kg days of data from the CDEX-1B experiment at the China Jinping Underground Laboratory. The data are consistent with the null signal hypothesis, and no excess signals are observed. Limits of the coupling $g_{Aγ}<2.08\times10^{-9}$ GeV$^{-1}$ (95\% C.L.) are derived for axio… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 7 pages, 5 figures

  45. arXiv:2405.05671  [pdf, other

    cond-mat.mes-hall quant-ph

    Self-correcting GKP qubit and gates in a driven-dissipative circuit

    Authors: Frederik Nathan, Liam O'Brien, Kyungjoo Noh, Matthew H. Matheny, Arne L. Grimsmo, Liang Jiang, Gil Refael

    Abstract: We propose a circuit architecture for a dissipatively error-corrected GKP qubit. The device consists of a high-impedance LC circuit coupled to a Josephson junction and a resistor via a controllable switch. When the switch is activated via a particular family of stepwise protocols, the resistor absorbs all noise-induced entropy, resulting in dissipative error correction of both phase and amplitude… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 12 pages + 8 figures in the main text

  46. arXiv:2405.04032  [pdf, other

    cs.CR cs.AI

    Locally Differentially Private In-Context Learning

    Authors: Chunyan Zheng, Keke Sun, Wenhao Zhao, Haibo Zhou, Lixin Jiang, Shaoyang Song, Chunlai Zhou

    Abstract: Large pretrained language models (LLMs) have shown surprising In-Context Learning (ICL) ability. An important application in deploying large language models is to augment LLMs with a private database for some specific task. The main problem with this promising commercial use is that LLMs have been shown to memorize their training data and their prompt data are vulnerable to membership inference at… ▽ More

    Submitted 8 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Comments: This paper was published at LREC-Coling 2024

  47. arXiv:2405.03781  [pdf, other

    astro-ph.GA astro-ph.CO

    Large Scale Overdensity of Lyman Break Galaxies Around the z=6.3 Ultraluminous Quasar J0100+2802

    Authors: Maria Pudoka, Feige Wang, Xiaohui Fan, Jinyi Yang, Jaclyn Champagne, Victoria Jones, Fuyan Bian, Zheng Cai, Linhua Jiang, Dezi Liu, Xue-Bing Wu

    Abstract: We study the environment of the z=6.33 ultraluminous quasar SDSS J010013.02+280225.8 (J0100) to understand its association with large-scale structure. Theoretical models propose high-redshift quasars as markers of galaxy overdensities residing in the most massive dark matter halos (DMHs) in the early universe. J0100 is an ultraluminous quasar with the most massive black hole known at z>6, suggesti… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 21 pages, 11 figures, 3 tables, to be published in The Astrophysical Journal (ApJ)

  48. arXiv:2405.03280  [pdf, other

    cs.CV cs.AI

    Animate Your Thoughts: Decoupled Reconstruction of Dynamic Natural Vision from Slow Brain Activity

    Authors: Yizhuo Lu, Changde Du, Chong Wang, Xuanliu Zhu, Liuyun Jiang, Huiguang He

    Abstract: Reconstructing human dynamic vision from brain activity is a challenging task with great scientific significance. The difficulty stems from two primary issues: (1) vision-processing mechanisms in the brain are highly intricate and not fully revealed, making it challenging to directly learn a mapping between fMRI and video; (2) the temporal resolution of fMRI is significantly lower than that of nat… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  49. arXiv:2405.02798  [pdf, other

    cs.SI

    Structural Balance in Real-World Social Networks: Incorporating Direction and Transitivity in Measuring Partial Balance

    Authors: Rezvaneh Rezapour, Ly Dinh, Lan Jiang, Jana Diesner

    Abstract: Structural balance theory predicts that triads in networks gravitate towards stable configurations. The theory has been verified for undirected graphs. Since real-world networks are often directed, we introduce a novel method for considering both transitivity and sign consistency for evaluating partial balance in signed digraphs. We test our approach on graphs constructed by using different method… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2006.02565

  50. arXiv:2405.02155  [pdf, other

    cs.CV

    Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification

    Authors: Siqi Yin, Lifan Jiang

    Abstract: This paper introduces a novel framework for zero-shot learning (ZSL), i.e., to recognize new categories that are unseen during training, by using a multi-model and multi-alignment integration method. Specifically, we propose three strategies to enhance the model's performance to handle ZSL: 1) Utilizing the extensive knowledge of ChatGPT and the powerful image generation capabilities of DALL-E to… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.