Skip to main content

Showing 1–50 of 1,336 results for author: Huang, M

  1. arXiv:2407.06677  [pdf, other

    cs.CL

    Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules

    Authors: Zhuocheng Gong, Ang Lv, Jian Guan, Junxi Yan, Wei Wu, Huishuai Zhang, Minlie Huang, Dongyan Zhao, Rui Yan

    Abstract: Is it always necessary to compute tokens from shallow to deep layers in Transformers? The continued success of vanilla Transformers and their variants suggests an undoubted "yes". In this work, however, we attempt to break the depth-ordered convention by proposing a novel architecture dubbed mixture-of-modules (MoM), which is motivated by an intuition that any layer, regardless of its position, ca… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  2. arXiv:2407.04675  [pdf, other

    eess.AS cs.SD

    Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition

    Authors: Ye Bai, Jingping Chen, Jitong Chen, Wei Chen, Zhuo Chen, Chuang Ding, Linhao Dong, Qianqian Dong, Yujiao Du, Kepan Gao, Lu Gao, Yi Guo, Minglun Han, Ting Han, Wenchao Hu, Xinying Hu, Yuxiang Hu, Deyu Hua, Lu Huang, Mingkun Huang, Youjia Huang, Jishuo Jin, Fanliu Kong, Zongwei Lan, Tianyu Li , et al. (30 additional authors not shown)

    Abstract: Modern automatic speech recognition (ASR) model is required to accurately transcribe diverse speech signals (from different domains, languages, accents, etc) given the specific contextual information in various application scenarios. Classic end-to-end models fused with extra language models perform well, but mainly in data matching scenarios and are gradually approaching a bottleneck. In this wor… ▽ More

    Submitted 10 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

  3. arXiv:2407.03978  [pdf, other

    cs.CL cs.AI

    Benchmarking Complex Instruction-Following with Multiple Constraints Composition

    Authors: Bosi Wen, Pei Ke, Xiaotao Gu, Lindong Wu, Hao Huang, Jinfeng Zhou, Wenchuang Li, Binxin Hu, Wendy Gao, Jiaxin Xu, Yiming Liu, Jie Tang, Hongning Wang, Minlie Huang

    Abstract: Instruction following is one of the fundamental capabilities of large language models (LLMs). As the ability of LLMs is constantly improving, they have been increasingly applied to deal with complex human instructions in real-world scenarios. Therefore, how to evaluate the ability of complex instruction-following of LLMs has become a critical research problem. Existing benchmarks mainly focus on m… ▽ More

    Submitted 11 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

    Comments: 20 pages, 7 figures

  4. arXiv:2407.02855  [pdf, other

    cs.CR cs.CL cs.LG

    Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks

    Authors: Zhexin Zhang, Junxiao Yang, Pei Ke, Shiyao Cui, Chujie Zheng, Hongning Wang, Minlie Huang

    Abstract: LLMs are known to be vulnerable to jailbreak attacks, even after safety alignment. An important observation is that, while different types of jailbreak attacks can generate significantly different queries, they mostly result in similar responses that are rooted in the same harmful knowledge (e.g., detailed steps to make a bomb). Therefore, we conjecture that directly unlearn the harmful knowledge… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 15 pages

  5. arXiv:2407.00167  [pdf, other

    cs.CL cs.AI cs.ET cs.HC cs.SI

    Can GPT-4 Help Detect Quit Vaping Intentions? An Exploration of Automatic Data Annotation Approach

    Authors: Sai Krishna Revanth Vuruma, Dezhi Wu, Saborny Sen Gupta, Lucas Aust, Valerie Lookingbill, Wyatt Bellamy, Yang Ren, Erin Kasson, Li-Shiun Chen, Patricia Cavazos-Rehg, Dian Hu, Ming Huang

    Abstract: In recent years, the United States has witnessed a significant surge in the popularity of vaping or e-cigarette use, leading to a notable rise in cases of e-cigarette and vaping use-associated lung injury (EVALI) that caused hospitalizations and fatalities during the EVALI outbreak in 2019, highlighting the urgency to comprehend vaping behaviors and develop effective strategies for cessation. Due… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: Accepted for the AI Applications in Public Health and Social Services workshop at the 22nd International Conference on Artificial Intelligence in Medicine (AIME 2024)

  6. arXiv:2406.18696  [pdf, other

    cs.CL

    Sequence Graph Network for Online Debate Analysis

    Authors: Quan Mai, Susan Gauch, Douglas Adams, Miaoqing Huang

    Abstract: Online debates involve a dynamic exchange of ideas over time, where participants need to actively consider their opponents' arguments, respond with counterarguments, reinforce their own points, and introduce more compelling arguments as the discussion unfolds. Modeling such a complex process is not a simple task, as it necessitates the incorporation of both sequential characteristics and the capab… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 8 pages, 4 figures

  7. arXiv:2406.16714  [pdf, other

    cs.CL cs.AI cs.LG

    AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models

    Authors: Jiale Cheng, Yida Lu, Xiaotao Gu, Pei Ke, Xiao Liu, Yuxiao Dong, Hongning Wang, Jie Tang, Minlie Huang

    Abstract: Although Large Language Models (LLMs) are becoming increasingly powerful, they still exhibit significant but subtle weaknesses, such as mistakes in instruction-following or coding tasks. As these unexpected errors could lead to severe consequences in practical deployments, it is crucial to investigate the limitations within LLMs systematically. Traditional benchmarking approaches cannot thoroughly… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  8. arXiv:2406.14491  [pdf, other

    cs.CL

    Instruction Pre-Training: Language Models are Supervised Multitask Learners

    Authors: Daixuan Cheng, Yuxian Gu, Shaohan Huang, Junyu Bi, Minlie Huang, Furu Wei

    Abstract: Unsupervised multitask pre-training has been the critical method behind the recent success of language models (LMs). However, supervised multitask learning still holds significant promise, as scaling it in the post-training stage trends towards better generalization. In this paper, we explore supervised multitask pre-training by proposing Instruction Pre-Training, a framework that scalably augment… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  9. arXiv:2406.13326  [pdf

    cond-mat.soft cond-mat.mes-hall cond-mat.mtrl-sci

    Chiral π Domain Walls Composed of Twin Half-Integer Surface Disclinations in Ferroelectric Nematic Liquid Crystals

    Authors: Shengzhu Yi, Zening Hong, Zhongjie Ma, Chao Zhou, Miao Jiang, Xiang Huang, Mingjun Huang, Satoshi Aya, Rui Zhang, Qi-Huo Wei

    Abstract: Ferroelectric nematic liquid crystals are polar fluids characterized by microscopic orientational ordering and macroscopic spontaneous polarizations. Within these fluids, walls that separate domains of different polarizations are ubiquitous. We demonstrate that the π walls in films of polar fluids consist of twin half-integer surface disclinations spaced horizontally, enclosing a subdomain where t… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  10. arXiv:2406.12899  [pdf

    physics.ins-det hep-ex

    Structural design of the acrylic vessel for the Jinping Neutrino Experiment

    Authors: Zongyi Wang, Yuhao Liua, Shaomin Chen, Yuanqing Wang, Zhe Wang, Ming Huang

    Abstract: The Jinping neutrino experiment is designed to have multiple purposes in the China Jinping Underground Laboratory. Following the acrylic vessel design requirements proposal, a structural scheme has been developed and optimized. Subsequently, the stability of the acrylic shell structure was calculated using finite element analysis, as well as the load-bearing capacities under various working condit… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: 27 pages, 11 figures,7 tables

  11. arXiv:2406.12793  [pdf, other

    cs.CL

    ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

    Authors: Team GLM, :, Aohan Zeng, Bin Xu, Bowen Wang, Chenhui Zhang, Da Yin, Diego Rojas, Guanyu Feng, Hanlin Zhao, Hanyu Lai, Hao Yu, Hongning Wang, Jiadai Sun, Jiajie Zhang, Jiale Cheng, Jiayi Gui, Jie Tang, Jing Zhang, Juanzi Li, Lei Zhao, Lindong Wu, Lucen Zhong, Mingdao Liu, Minlie Huang , et al. (32 additional authors not shown)

    Abstract: We introduce ChatGLM, an evolving family of large language models that we have been developing over time. This report primarily focuses on the GLM-4 language series, which includes GLM-4, GLM-4-Air, and GLM-4-9B. They represent our most capable models that are trained with all the insights and lessons gained from the preceding three generations of ChatGLM. To date, the GLM-4 models are pre-trained… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  12. arXiv:2406.12569  [pdf, other

    cs.LG

    MOYU: A Theoretical Study on Massive Over-activation Yielded Uplifts in LLMs

    Authors: Chi Ma, Mincong Huang, Chao Wang, Yujie Wang, Lei Yu

    Abstract: Massive Over-activation Yielded Uplifts(MOYU) is an inherent property of large language models, and dynamic activation(DA) based on the MOYU property is a clever yet under-explored strategy designed to accelerate inference in these models. Existing methods that utilize MOYU often face a significant 'Impossible Trinity': struggling to simultaneously maintain model performance, enhance inference spe… ▽ More

    Submitted 28 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  13. arXiv:2406.12250  [pdf

    cond-mat.mes-hall

    Observation of stacking engineered magnetic phase transitions within moiré supercells of twisted van der Waals magnets

    Authors: Senlei Li, Zeliang Sun, Nathan J. McLaughlin, Afsana Sharmin, Nishkarsh Agarwal, Mengqi Huang, Suk Hyun Sung, Hanyi Lu, Shaohua Yan, Hechang Lei, Robert Hovden, Hailong Wang, Hua Chen, Liuyan Zhao, Chunhui Rita Du

    Abstract: Twist engineering of magnetic van der Waals (vdW) moiré superlattices provides an attractive way to achieve precise nanoscale control over the spin degree of freedom on two-dimensional flatland. Despite the very recent demonstrations of moiré magnetism featuring exotic phases with noncollinear spin order in twisted vdW magnet chromium triiodide CrI3, the local magnetic interactions, spin dynamics,… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  14. arXiv:2406.10261  [pdf, other

    cs.CL cs.AI

    FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination

    Authors: Pengfei Zhou, Weiqing Min, Chaoran Fu, Ying Jin, Mingyu Huang, Xiangyang Li, Shuhuan Mei, Shuqiang Jiang

    Abstract: Food is foundational to human life, serving not only as a source of nourishment but also as a cornerstone of cultural identity and social interaction. As the complexity of global dietary needs and preferences grows, food intelligence is needed to enable food perception and reasoning for various tasks, ranging from recipe generation and dietary recommendation to diet-disease correlation discovery a… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 32 pages, 19 figures

  15. arXiv:2406.09904  [pdf, other

    cs.LG

    QQQ: Quality Quattuor-Bit Quantization for Large Language Models

    Authors: Ying Zhang, Peng Zhang, Mincong Huang, Jingyang Xiang, Yujie Wang, Chao Wang, Yineng Zhang, Lei Yu, Chuan Liu, Wei Lin

    Abstract: Quantization is a proven effective method for compressing large language models. Although popular techniques like W8A8 and W4A16 effectively maintain model performance, they often fail to concurrently speed up the prefill and decoding stages of inference. W4A8 is a promising strategy to accelerate both of them while usually leads to a significant performance degradation. To address these issues, w… ▽ More

    Submitted 28 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

  16. arXiv:2406.09903  [pdf, ps, other

    math.NA cs.IT

    Asymptotic quadratic convergence of the Gauss-Newton method for complex phase retrieval

    Authors: Meng Huang

    Abstract: In this paper, we introduce a Gauss-Newton method for solving the complex phase retrieval problem. In contrast to the real-valued setting, the Gauss-Newton matrix for complex-valued signals is rank-deficient and, thus, non-invertible. To address this, we utilize a Gauss-Newton step that moves orthogonally to certain trivial directions. We establish that this modified Gauss-Newton step has a closed… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 54 pages

  17. arXiv:2406.04604  [pdf, other

    cs.CL cs.PL

    Learning Task Decomposition to Assist Humans in Competitive Programming

    Authors: Jiaxin Wen, Ruiqi Zhong, Pei Ke, Zhihong Shao, Hongning Wang, Minlie Huang

    Abstract: When using language models (LMs) to solve complex problems, humans might struggle to understand the LM-generated solutions and repair the flawed ones. To assist humans in repairing them, we propose to automatically decompose complex solutions into multiple simpler pieces that correspond to specific subtasks. We introduce a novel objective for learning task decomposition, termed assistive value (As… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: ACL 2024 Main Conference

  18. arXiv:2406.04523  [pdf, other

    cs.CL cs.LG

    Proofread: Fixes All Errors with One Tap

    Authors: Renjie Liu, Yanxiang Zhang, Yun Zhu, Haicheng Sun, Yuanbo Zhang, Michael Xuelin Huang, Shanqing Cai, Lei Meng, Shumin Zhai

    Abstract: The impressive capabilities in Large Language Models (LLMs) provide a powerful approach to reimagine users' typing experience. This paper demonstrates Proofread, a novel Gboard feature powered by a server-side LLM in Gboard, enabling seamless sentence-level and paragraph-level corrections with a single tap. We describe the complete system in this paper, from data generation, metrics design to mode… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 8 pages, 3 figures, 2 tables

  19. arXiv:2406.03362  [pdf, other

    math.RA math.CO math.QA math.RT

    Positivity for quantum cluster algebras from orbifolds

    Authors: Min Huang

    Abstract: Let $(S,M,U)$ be a marked orbifold with or without punctures and let $\mathcal A_v$ be a quantum cluster algebra from $(S,M,U)$ with arbitrary coefficients and quantization. We provide combinatorial formulas for quantum Laurent expansion of quantum cluster variables of $\mathcal A_v$ concerning an arbitrary quantum seed. Consequently, the positivity for the quantum cluster algebra $\mathcal A_v$ i… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Comments are welcome!

    MSC Class: 13F60; 05E15; 05E40

  20. arXiv:2406.03104  [pdf, other

    cond-mat.quant-gas physics.atom-ph

    Dark state transport between unitary Fermi superfluids

    Authors: Mohsen Talebi, Simon Wili, Jeffrey Mohan, Philipp Fabritius, Meng-Zi Huang, Tilman Esslinger

    Abstract: The formation of dark states is an important concept in quantum sciences, but its compatibility with strong interparticle interactions, for example, in a quantum degenerate gas is hardly explored. Here, we realize a dark state in one of the spins of a two-component, resonantly-interacting Fermi gas using a $Λ$ system within the $D_2$ transitions of $^6$Li at high magnetic field. The dark state is… ▽ More

    Submitted 6 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: 18 pages, 10 figures

  21. arXiv:2406.00994  [pdf

    cond-mat.soft

    Half-integer Vortices Paired via String Micelles in Ferroelectric Liquid Crystals Facilitated by Ionic Polymer Doping

    Authors: Zhongjie Ma, Miao Jiang, Yaohao Song, Aile Sun, Shengzhu Yi, Chao Zhou, Xiang Huang, Mingjun Huang, Satoshi Aya, Qi-Huo Wei

    Abstract: Ferroelectric nematic (NF) liquid crystals are an intriguing polar system for exploring topological defects, and their properties are subject to significant influence by ionic doping. A prior theory based on a modified XY model predicts that string defects with half-integer vortex-antivortex pairs can be excited, while such stable string defects have not been directly observed in polar materials.… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  22. arXiv:2406.00985  [pdf, other

    cs.CV

    MultiEdits: Simultaneous Multi-Aspect Editing with Text-to-Image Diffusion Models

    Authors: Mingzhen Huang, Jialing Cai, Shan Jia, Vishnu Suresh Lokhande, Siwei Lyu

    Abstract: Text-driven image synthesis has made significant advancements with the development of diffusion models, transforming how visual content is generated from text prompts. Despite these advances, text-driven image editing, a key area in computer graphics, faces unique challenges. A major challenge is making simultaneous edits across multiple objects or attributes. Applying these methods sequentially f… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  23. arXiv:2406.00857  [pdf, other

    astro-ph.IM

    Modeling the refractive index profile n(z) of polar ice for ultra-high energy neutrino experiments

    Authors: S. Ali, P. Allison, S. Archambault, J. J. Beatty, D. Z. Besson, A. Bishop, P. Chen, Y. C. Chen, B. A. Clark, W. Clay, A. Connolly, K. Couberly, L. Cremonesi, A. Cummings, P. Dasgupta, R. Debolt, S. de Kockere, K. D. de Vries, C. Deaconu, M. A. DuVernois, J. Flaherty, E. Friedman, R. Gaior, P. Giri, J. Hanson , et al. (45 additional authors not shown)

    Abstract: We develop an in-situ index of refraction profile using the transit time of radio signals broadcast from an englacial transmitter to 2-5 km distant radio-frequency receivers, deployed at depths up to 200 m. Maxwell's equations generally admit two ray propagation solutions from a given transmitter, corresponding to a direct path (D) and a refracted path (R); the measured D vs. R (dt(D,R)) timing di… ▽ More

    Submitted 11 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

  24. arXiv:2406.00353  [pdf, other

    hep-ph hep-th

    Form factors of $Λ_b^0 \to Λ_c(2595)^+$ within light-cone QCD sum rules

    Authors: Hui-Hui Duan, Yong-Lu Liu, Qin Chang, Ming-Qiu Huang

    Abstract: In this work, we calculated the form factors of the weak decay process $Λ_b^0 \to Λ_c(2595)^+$, where the final charm baryon represents an excited state with spin-parity $\frac{1}{2}^-$. Utilizing the light-cone QCD sum rules approach, we incorporated the contributions of the lowest two charm baryon states: the ground state $Λ_c$ with $J^P=\frac{1}{2}^+$ and the excited state $Λ_c(2595)^+$ with… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 15 pages, 3 figures and 6 tables

  25. arXiv:2405.14722  [pdf, other

    cs.CL

    CAPE: Context-Adaptive Positional Encoding for Length Extrapolation

    Authors: Chuanyang Zheng, Yihang Gao, Han Shi, Minbin Huang, Jingyao Li, Jing Xiong, Xiaozhe Ren, Michael Ng, Xin Jiang, Zhenguo Li, Yu Li

    Abstract: Positional encoding plays a crucial role in transformers, significantly impacting model performance and length generalization. Prior research has introduced absolute positional encoding (APE) and relative positional encoding (RPE) to distinguish token positions in given sequences. However, both APE and RPE remain fixed after model training regardless of input data, limiting their adaptability and… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Technical Report

  26. arXiv:2405.14383  [pdf, other

    cs.CL cs.AI

    Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question Answering

    Authors: Zhihua Wen, Zhiliang Tian, Zexin Jian, Zhen Huang, Pei Ke, Yifu Gao, Minlie Huang, Dongsheng Li

    Abstract: Large Language Models (LLMs) are widely used for knowledge-seeking yet suffer from hallucinations. The knowledge boundary (KB) of an LLM limits its factual understanding, beyond which it may begin to hallucinate. Investigating the perception of LLMs' KB is crucial for detecting hallucinations and LLMs' reliable generation. Current studies perceive LLMs' KB on questions with a concrete answer (clos… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  27. arXiv:2405.10197  [pdf, other

    physics.chem-ph

    Forte: A Suite of Advanced Multireference Quantum Chemistry Methods

    Authors: Francesco A. Evangelista, Chenyang Li, Prakash Verma, Kevin P. Hannon, Jeffrey B. Schriber, Tianyuan Zhang, Chenxi Cai, Shuhe Wang, Nan He, Nicholas H. Stair, Meng Huang, Renke Huang, Jonathon P. Misiewicz, Shuhang Li, Kevin Marin, Zijun Zhao, Lori A. Burns

    Abstract: Forte is an open-source library specialized in multireference electronic structure theories for molecular systems and the rapid prototyping of new methods. This paper gives an overview of the capabilities of Forte, its software architecture, and examples of applications enabled by the methods it implements.

    Submitted 16 May, 2024; originally announced May 2024.

  28. arXiv:2405.09274  [pdf, other

    cs.LG

    Dynamic Activation Pitfalls in LLaMA Models: An Empirical Study

    Authors: Chi Ma, Mincong Huang, Chao Wang, Yujie Wang, Lei Yu

    Abstract: In this work, we systematically investigate the efficacy of dynamic activation mechanisms within the LLaMA family of language models. Despite the potential of dynamic activation methods to reduce computation and increase speed in models using the ReLU activation function, our empirical findings have uncovered several inherent pitfalls in the current dynamic activation schemes. Through extensive ex… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  29. arXiv:2405.08748  [pdf, other

    cs.CV

    Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

    Authors: Zhimin Li, Jianwei Zhang, Qin Lin, Jiangfeng Xiong, Yanxin Long, Xinchi Deng, Yingfang Zhang, Xingchao Liu, Minbin Huang, Zedong Xiao, Dayou Chen, Jiajun He, Jiahao Li, Wenyue Li, Chen Zhang, Rongwei Quan, Jianxiang Lu, Jiabin Huang, Xiaoyan Yuan, Xiaoxiao Zheng, Yixuan Li, Jihong Zhang, Chao Zhang, Meng Chen, Jie Liu , et al. (20 additional authors not shown)

    Abstract: We present Hunyuan-DiT, a text-to-image diffusion transformer with fine-grained understanding of both English and Chinese. To construct Hunyuan-DiT, we carefully design the transformer structure, text encoder, and positional encoding. We also build from scratch a whole data pipeline to update and evaluate data for iterative model optimization. For fine-grained language understanding, we train a Mu… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: Project Page: https://dit.hunyuan.tencent.com/

  30. arXiv:2405.06915  [pdf

    cs.AI

    Automating Creativity

    Authors: Ming-Hui Huang, Roland T. Rust

    Abstract: Generative AI (GenAI) has spurred the expectation of being creative, due to its ability to generate content, yet so far, its creativity has somewhat disappointed, because it is trained using existing data following human intentions to generate outputs. The purpose of this paper is to explore what is required to evolve AI from generative to creative. Based on a reinforcement learning approach and b… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: 46 pages, 2 tables, 4 figures

  31. arXiv:2405.06470  [pdf, other

    astro-ph.SR astro-ph.HE nucl-ex nucl-th

    Solar fusion III: New data and theory for hydrogen-burning stars

    Authors: B. Acharya, M. Aliotta, A. B. Balantekin, D. Bemmerer, C. A. Bertulani, A. Best, C. R. Brune, R. Buompane, F. Cavanna, J. W. Chen, J. Colgan, A. Czarnecki, B. Davids, R. J. deBoer, F. Delahaye, R. Depalo, A. García, M. Gatu Johnson, D. Gazit, L. Gialanella, U. Greife, D. Guffanti, A. Guglielmetti, K. Hambleton, W. C. Haxton , et al. (25 additional authors not shown)

    Abstract: In stars that lie on the main sequence in the Hertzsprung Russel diagram, like our sun, hydrogen is fused to helium in a number of nuclear reaction chains and series, such as the proton-proton chain and the carbon-nitrogen-oxygen cycles. Precisely determined thermonuclear rates of these reactions lie at the foundation of the standard solar model. This review, the third decadal evaluation of the nu… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 85 pages, 15 figures. To be submitted to Reviews of Modern Physics

    Report number: N3AS-24-016

  32. arXiv:2405.06386  [pdf, other

    hep-ph

    Deconfinement and chiral restoration phase transition under rotation from holography in an anisotropic gravitational background

    Authors: Yidian Chen, Xun Chen, Danning Li, Mei Huang

    Abstract: We investigate the effects of rotation on deconfinement and chiral phase transitions in the framework of dynamical holographic QCD model. Instead of transforming to the rotating system by Lorentz boost, we construct an anisotropic gravitational background by incorporating the rotating boundary current. We firstly investigate the pure gluon system under rotation to extract deconfinement phase trans… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  33. arXiv:2405.06179  [pdf, ps, other

    hep-ph

    Flavor dependent Critical endpoint from holographic QCD through machine learning

    Authors: Xun Chen, Mei Huang

    Abstract: QCD phase diagram in the $T - μ$ plane and the equation of state for pure gluon, 2-flavor, 2+1-flavor systems, and 2+1+1-flavor systems have been investigated using the Einstein-Maxwell-Dilaton (EMD) framework at finite temperature and chemical potential. By inputting lattice QCD data for the equation of state and baryon susceptibility at zero chemical potential into holographic model, all the par… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  34. arXiv:2405.04059  [pdf

    cond-mat.str-el cond-mat.mes-hall

    Three-dimensional hidden phase probed by in-plane magnetotransport in kagome metal CsV$_3$Sb$_5$ thin flakes

    Authors: Xinjian Wei, Congkuan Tian, Hang Cui, Yuxin Zhai, Yongkai Li, Shaobo Liu, Yuanjun Song, Ya Feng, Miaoling Huang, Zhiwei Wang, Yi Liu, Qihua Xiong, Yugui Yao, X. C. Xie, Jian-Hao Chen

    Abstract: Transition metal compounds with kagome structure have been found to exhibit a variety of exotic structural, electronic, and magnetic orders. These orders are competing with energies very close to each other, resulting in complex phase transitions. Some of the phases are easily observable, such as the charge density wave (CDW) and the superconducting phase, while others are more challenging to iden… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  35. arXiv:2405.02313  [pdf, ps, other

    physics.flu-dyn

    Physics-informed Data-driven Cavitation Model for a Specific MG EOS

    Authors: Minsheng Huang, Chengbao Yao, Pan Wang, Lidong Cheng, Wenjun Ying

    Abstract: We present a novel one-fluid cavitation model of a specific Mie-Grüneisen equation of state(EOS), named polynomial EOS, based on an artificial neural network. Not only the physics-informed equation but also the experimental data are embedded into the proposed model by an optimization problem. The physics-informed data-driven model provides the concerned pressure within the cavitation region, where… ▽ More

    Submitted 5 April, 2024; originally announced May 2024.

    Comments: 29 pages, 18 figures

  36. arXiv:2405.01949  [pdf, other

    gr-qc astro-ph.CO hep-ph

    Bubble wall velocity and gravitational wave in the minimal left-right symmetric model

    Authors: Dian-Wei Wang, Qi-Shu Yan, Mei Huang

    Abstract: The bubble wall velocity in the first order phase transition plays an important role in determining both the amplitude and the pivot frequency of stochastic gravitational wave background. In the framework of the minimal left-right symmetric model, we study the wall velocity when the first order phase transition can occur. The wall velocity can be determined by matching the distribution functions i… ▽ More

    Submitted 6 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: 22 pages, 12 figures

  37. arXiv:2405.00802  [pdf

    cond-mat.mtrl-sci

    Sensing Spin Wave Excitations by Spin Defects in Few-Layer Thick Hexagonal Boron Nitride

    Authors: Jingcheng Zhou, Hanyi Lu, Di Chen, Mengqi Huang, Gerald Q. Yan, Faris Al-matouq, Jiu Chang, Dziga Djugba, Zhigang Jiang, Hailong Wang, Chunhui Rita Du

    Abstract: Optically active spin defects in wide band-gap semiconductors serve as a local sensor of multiple degrees of freedom in a variety of "hard" and "soft" condensed matter systems. Taking advantage of the recent progress on quantum sensing using van der Waals (vdW) quantum materials, here we report direct measurements of spin waves excited in magnetic insulator Y3Fe5O12 (YIG) by boron vacancy $V_B^-$… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  38. arXiv:2404.19652  [pdf, other

    cs.CV cs.AI

    VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization

    Authors: Yuliang Liu, Mingxin Huang, Hao Yan, Linger Deng, Weijia Wu, Hao Lu, Chunhua Shen, Lianwen Jin, Xiang Bai

    Abstract: Text spotting, a task involving the extraction of textual information from image or video sequences, faces challenges in cross-domain adaption, such as image-to-image and image-to-video generalization. In this paper, we introduce a new method, termed VimTS, which enhances the generalization ability of the model by achieving better synergy among different tasks. Typically, we propose a Prompt Queri… ▽ More

    Submitted 14 May, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

  39. arXiv:2404.18919  [pdf, other

    cs.CV

    TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation

    Authors: Junhao Cheng, Baiqiao Yin, Kaixin Cai, Minbin Huang, Hanhui Li, Yuxin He, Xi Lu, Yue Li, Yifei Li, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang

    Abstract: Recent advances in diffusion models can generate high-quality and stunning images from text. However, multi-turn image generation, which is of high demand in real-world scenarios, still faces challenges in maintaining semantic consistency between images and texts, as well as contextual consistency of the same subject across multiple interactive turns. To address this issue, we introduce TheaterGen… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  40. arXiv:2404.18619  [pdf

    cond-mat.soft

    Patterning of 2D second harmonic generation active arrays in ferroelectric nematic fluids

    Authors: M. Lovšin, A. Petelin, B. Berteloot, N. Osterman, S. Aya, M. Huang, I. Drevenšek-Olenik, R. J. Mandle, K. Neyts, A. Mertelj, N. Sebastian

    Abstract: Ferroelectric nematic liquid crystals exhibit unique non-linear optical properties, with the potential to become transformative materials for photonic applications. A promising direction relies on the fabrication of tailored polar orientational patterns via photoalignment, thus shaping the non-linear optical susceptibility through thin slabs of the ferroelectric fluid. Here, we explore the fabrica… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 24 pages, 5 Images main, 15 supplementary images

  41. arXiv:2404.18033  [pdf, other

    cs.CV

    Exposing Text-Image Inconsistency Using Diffusion Models

    Authors: Mingzhen Huang, Shan Jia, Zhou Zhou, Yan Ju, Jialing Cai, Siwei Lyu

    Abstract: In the battle against widespread online misinformation, a growing problem is text-image inconsistency, where images are misleadingly paired with texts with different intent or meaning. Existing classification-based methods for text-image inconsistency can identify contextual inconsistencies but fail to provide explainable justifications for their decisions that humans can understand. Although more… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

  42. arXiv:2404.17607  [pdf, other

    cs.IR cs.AI cs.CL cs.LG cs.SI

    Utilizing Large Language Models to Identify Reddit Users Considering Vaping Cessation for Digital Interventions

    Authors: Sai Krishna Revanth Vuruma, Dezhi Wu, Saborny Sen Gupta, Lucas Aust, Valerie Lookingbill, Caleb Henry, Yang Ren, Erin Kasson, Li-Shiun Chen, Patricia Cavazos-Rehg, Dian Hu, Ming Huang

    Abstract: The widespread adoption of social media platforms globally not only enhances users' connectivity and communication but also emerges as a vital channel for the dissemination of health-related information, thereby establishing social media data as an invaluable organic data resource for public health research. The surge in popularity of vaping or e-cigarette use in the United States and other countr… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  43. arXiv:2404.16792  [pdf, other

    cs.LG cs.AI cs.CL

    Weak-to-Strong Extrapolation Expedites Alignment

    Authors: Chujie Zheng, Ziqi Wang, Heng Ji, Minlie Huang, Nanyun Peng

    Abstract: The open-source community is experiencing a surge in the release of large language models (LLMs) that are trained to follow instructions and align with human preference. However, further training to improve them still requires expensive computational resources and data annotations. Is it possible to bypass additional training and cost-effectively acquire better-aligned models? Inspired by the lite… ▽ More

    Submitted 22 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Add theoretical explanation and more evaluation results

  44. arXiv:2404.16425  [pdf, other

    astro-ph.HE

    Soft X-ray prompt emission from a high-redshift gamma-ray burst EP240315a

    Authors: Y. Liu, H. Sun, D. Xu, D. S. Svinkin, J. Delaunay, N. R. Tanvir, H. Gao, C. Zhang, Y. Chen, X. -F. Wu, B. Zhang, W. Yuan, J. An, G. Bruni, D. D. Frederiks, G. Ghirlanda, J. -W. Hu, A. Li, C. -K. Li, J. -D. Li, D. B. Malesani, L. Piro, G. Raman, R. Ricci, E. Troja , et al. (170 additional authors not shown)

    Abstract: Long gamma-ray bursts (GRBs) are believed to originate from core collapse of massive stars. High-redshift GRBs can probe the star formation and reionization history of the early universe, but their detection remains rare. Here we report the detection of a GRB triggered in the 0.5--4 keV band by the Wide-field X-ray Telescope (WXT) on board the Einstein Probe (EP) mission, designated as EP240315a,… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 41 pages, 8 figures, 7 tables

  45. arXiv:2404.15790  [pdf, other

    cs.CV

    Leveraging Large Language Models for Multimodal Search

    Authors: Oriol Barbany, Michael Huang, Xinliang Zhu, Arnab Dhua

    Abstract: Multimodal search has become increasingly important in providing users with a natural and effective way to ex-press their search intentions. Images offer fine-grained details of the desired products, while text allows for easily incorporating search modifications. However, some existing multimodal search systems are unreliable and fail to address simple queries. The problem becomes harder with the… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Published at CVPRW 2024

  46. arXiv:2404.15249  [pdf, ps, other

    math.NA

    A GPU-accelerated Cartesian grid method for PDEs on irregular domain

    Authors: Liwei Tan, Minsheng Huang, Wenjun Ying

    Abstract: The kernel-free boundary integral (KFBI) method has successfully solved partial differential equations (PDEs) on irregular domains. Diverging from traditional boundary integral methods, the computation of boundary integrals in KFBI is executed through the resolution of equivalent simple interface problems on Cartesian grids, utilizing fast algorithms. While existing implementations of KFBI methods… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 24pages 10figures

  47. arXiv:2404.14864  [pdf

    math.NA math-ph

    A GPU-accelerated Cartesian grid method is proposed for solving the heat, wave, and Schrodinger equations on irregular domains

    Authors: Liwei Tan, Minsheng Huang, Wenjun Ying

    Abstract: This paper introduces a second-order method for solving general elliptic partial differential equations (PDEs) on irregular domains using GPU acceleration, based on Ying's kernel-free boundary integral (KFBI) method. The method addresses limitations imposed by CFL conditions in explicit schemes and accuracy issues in fully implicit schemes for the Laplacian operator. To overcome these challenges,… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 40 pages,12 figures

  48. arXiv:2404.14228  [pdf, other

    cs.NE

    A Survey of Decomposition-Based Evolutionary Multi-Objective Optimization: Part II -- A Data Science Perspective

    Authors: Mingyu Huang, Ke Li

    Abstract: This paper presents the second part of the two-part survey series on decomposition-based evolutionary multi-objective optimization where we mainly focus on discussing the literature related to multi-objective evolutionary algorithms based on decomposition (MOEA/D). Complementary to the first part, here we employ a series of advanced data mining approaches to provide a comprehensive anatomy of the… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  49. MathNet: A Data-Centric Approach for Printed Mathematical Expression Recognition

    Authors: Felix M. Schmitt-Koopmann, Elaine M. Huang, Hans-Peter Hutter, Thilo Stadelmann, Alireza Darvishy

    Abstract: Printed mathematical expression recognition (MER) models are usually trained and tested using LaTeX-generated mathematical expressions (MEs) as input and the LaTeX source code as ground truth. As the same ME can be generated by various different LaTeX source codes, this leads to unwanted variations in the ground truth data that bias test performance results and hinder efficient learning. In additi… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 12 pages, 6 figures

    Journal ref: IEEE Access 12 (2024) 76963-76974

  50. arXiv:2404.12602  [pdf

    cs.CV cs.LG

    A visualization method for data domain changes in CNN networks and the optimization method for selecting thresholds in classification tasks

    Authors: Minzhe Huang, Changwei Nie, Weihong Zhong

    Abstract: In recent years, Face Anti-Spoofing (FAS) has played a crucial role in preserving the security of face recognition technology. With the rise of counterfeit face generation techniques, the challenge posed by digitally edited faces to face anti-spoofing is escalating. Existing FAS technologies primarily focus on intercepting physically forged faces and lack a robust solution for cross-domain FAS cha… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.