Showing 1–50 of 528 results for author: Yin, W

  1. arXiv:2407.09478  [pdf, other]

    hep-ph astro-ph.CO

    Induced Domain Walls of QCD Axion, and Gravitational Waves

    Authors: Junseok Lee, Kai Murai, Fuminobu Takahashi, Wen Yin

    Abstract: We show that heavy axion domain walls induce domain walls of the QCD axion through a mixing between the heavy axion and the QCD axion, even when the pre-inflationary initial condition is assumed for the QCD axion. The induced domain walls arise because the effective $θ$ parameter changes across the heavy axion domain walls, shifting the potential minimum of the QCD axion. When the heavy axion doma…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 33 pages, 8 figures

    Report number: TU-1237

  2. arXiv:2407.07924  [pdf, other]

    math.OC cs.AI cs.CL cs.LG

    Solving General Natural-Language-Description Optimization Problems with Large Language Models

    Authors: Jihai Zhang, Wei Wang, Siyan Guo, Li Wang, Fangquan Lin, Cheng Yang, Wotao Yin

    Abstract: Optimization problems seek to find the best solution to an objective under a set of constraints, and have been widely investigated in real-world applications. Modeling and solving optimization problems in a specific domain typically require a combination of domain knowledge, mathematical skills, and programming ability, making it difficult for general users and even domain professionals. In this p…

    Submitted 9 July, 2024; originally announced July 2024.

  3. arXiv:2406.17028  [pdf, other]

    hep-ph

    Cosmic Stability of Dark Matter from Pauli Blocking

    Authors: Brian Batell, Wen Yin

    Abstract: Why does dark matter (DM) live longer than the age of the Universe? Here we study a novel sub-eV scalar DM candidate whose stability is due to the Pauli exclusion of its fermionic decay products. We analyze the stability of the DM condensate against decays, scatterings (i.e., evaporation), and parametric resonance, delineating the viable parameter regions in which DM is cosmologically stable. In a…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 47 pages, 5 figures

    Report number: TU-1168, PITT-PACC-2403

  4. arXiv:2406.16253  [pdf, other]

    cs.CL

    LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing

    Authors: Jiangshu Du, Yibo Wang, Wenting Zhao, Zhongfen Deng, Shuaiqi Liu, Renze Lou, Henry Peng Zou, Pranav Narayanan Venkit, Nan Zhang, Mukund Srinath, Haoran Ranran Zhang, Vipul Gupta, Yinghui Li, Tao Li, Fei Wang, Qin Liu, Tianlin Liu, Pengzhi Gao, Congying Xia, Chen Xing, Jiayang Cheng, Zhaowei Wang, Ying Su, Raj Sanjay Shah, Ruohao Guo, et al. (15 additional authors not shown)

    Abstract: This work is motivated by two key trends. On one hand, large language models (LLMs) have shown remarkable versatility in various generative tasks such as writing, drawing, and question answering, significantly reducing the time required for many routine tasks. On the other hand, researchers, whose work is not only time-consuming but also highly expertise-demanding, face increasing challenges as th…

    Submitted 25 June, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  5. arXiv:2406.16203  [pdf, other]

    cs.CL

    LLMs' Classification Performance is Overclaimed

    Authors: Hanzi Xu, Renze Lou, Jiangshu Du, Vahid Mahzoon, Elmira Talebianaraki, Zhuoan Zhou, Elizabeth Garrison, Slobodan Vucetic, Wenpeng Yin

    Abstract: In many classification tasks designed for AI or human to solve, gold labels are typically included within the label space by default, often posed as "which of the following is correct?" This standard setup has traditionally highlighted the strong performance of advanced AI, particularly top-performing Large Language Models (LLMs), in routine classification tasks. However, when the gold label is in…

    Submitted 3 July, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  6. arXiv:2406.13103  [pdf, other]

    cs.AI cs.LG

    A Generic Method for Fine-grained Category Discovery in Natural Language Texts

    Authors: Chang Tian, Matthew B. Blaschko, Wenpeng Yin, Mingzhe Xing, Yinliang Yue, Marie-Francine Moens

    Abstract: Fine-grained category discovery using only coarse-grained supervision is a cost-effective yet challenging task. Previous training methods focus on aligning query samples with positive samples and distancing them from negatives. They often neglect intra-category and inter-category semantic similarities of fine-grained categories when navigating sample distributions in the embedding space. Furthermo…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: preprint

  7. arXiv:2406.12554  [pdf, other]

    hep-ph astro-ph.CO

    Populating secluded dark sector with ultra-relativistic bubbles

    Authors: Aleksandr Azatov, Xander Nagels, Miguel Vanvlasselaer, Wen Yin

    Abstract: We study Dark Matter production during first order phase transitions from bubble-plasma collisions. We focus on scenarios where the Dark Matter sector is secluded and its interaction with the visible sector (including the Standard Model) originates from dimension-five and dimension-six operators. We find that such DM is generally heavy and has a large initial velocity, leading to the possibility o…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 7 figures, 26 pages + appendices

    Report number: SISSA 11/2024/FISI

  8. arXiv:2406.05938  [pdf, other]

    cs.LG math.OC

    Expressive Power of Graph Neural Networks for (Mixed-Integer) Quadratic Programs

    Authors: Ziang Chen, Xiaohan Chen, Jialin Liu, Xinshang Wang, Wotao Yin

    Abstract: Quadratic programming (QP) is the most widely applied category of problems in nonlinear programming. Many applications require real-time/fast solutions, though not necessarily with high precision. Existing methods either involve matrix decomposition or use the preconditioned conjugate gradient method. For relatively large instances, these methods cannot achieve the real-time requirement unless the…

    Submitted 9 June, 2024; originally announced June 2024.

  9. arXiv:2406.05602  [pdf, other]

    cs.CV cs.CL

    Can Prompt Modifiers Control Bias? A Comparative Analysis of Text-to-Image Generative Models

    Authors: Philip Wootaek Shin, Jihyun Janice Ahn, Wenpeng Yin, Jack Sampson, Vijaykrishnan Narayanan

    Abstract: It has been shown that many generative models inherit and amplify societal biases. To date, there is no uniform/systematic agreed standard to control/adjust for these biases. This study examines the presence and manipulation of societal biases in leading text-to-image models: Stable Diffusion, DALL-E 3, and Adobe Firefly. Through a comprehensive analysis combining base prompts with modifiers and t…

    Submitted 8 June, 2024; originally announced June 2024.

  10. arXiv:2406.05460  [pdf, other]

    cs.CL cs.AI

    Fighting Against the Repetitive Training and Sample Dependency Problem in Few-shot Named Entity Recognition

    Authors: Chang Tian, Wenpeng Yin, Dan Li, Marie-Francine Moens

    Abstract: Few-shot named entity recognition (NER) systems recognize entities using a few labeled training examples. The general pipeline consists of a span detector to identify entity spans in text and an entity-type classifier to assign types to entities. Current span detectors rely on extensive manual labeling to guide training. Almost every span detector requires initial training on basic span features f…

    Submitted 18 June, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

    Comments: ieee access: https://doi.org/10.1109/ACCESS.2024.3374727

  11. arXiv:2406.02006  [pdf, other]

    math.OC cs.AI

    ODE-based Learning to Optimize

    Authors: Zhonglin Xie, Wotao Yin, Zaiwen Wen

    Abstract: Recent years have seen a growing interest in understanding acceleration methods through the lens of ordinary differential equations (ODEs). Despite the theoretical advancements, translating the rapid convergence observed in continuous-time models to discrete-time iterative methods poses significant challenges. In this paper, we present a comprehensive framework integrating the inertial systems wit…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 55 pages, 28 figures

  12. arXiv:2405.19978  [pdf, other]

    cs.LG stat.ML

    Domain Adaptation with Cauchy-Schwarz Divergence

    Authors: Wenzhe Yin, Shujian Yu, Yicong Lin, Jie Liu, Jan-Jakob Sonke, Efstratios Gavves

    Abstract: Domain adaptation aims to use training data from one or multiple source domains to learn a hypothesis that can be generalized to a different, but related, target domain. As such, having a reliable measure for evaluating the discrepancy of both marginal and conditional distributions is crucial. We introduce Cauchy-Schwarz (CS) divergence to the problem of unsupervised domain adaptation (UDA). The C…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted by UAI-24

  13. arXiv:2405.17705  [pdf, other]

    cs.CV

    DC-Gaussian: Improving 3D Gaussian Splatting for Reflective Dash Cam Videos

    Authors: Linhan Wang, Kai Cheng, Shuo Lei, Shengkun Wang, Wei Yin, Chenyang Lei, Xiaoxiao Long, Chang-Tien Lu

    Abstract: We present DC-Gaussian, a new method for generating novel views from in-vehicle dash cam videos. While neural rendering techniques have made significant strides in driving scenarios, existing methods are primarily designed for videos collected by autonomous vehicles. However, these videos are limited in both quantity and diversity compared to dash cam videos, which are more widely used across vari…

    Submitted 29 May, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: 9 pages,7 figures;project page: https://linhanwang.github.io/dcgaussian/

  14. arXiv:2405.16020  [pdf, ps, other]

    math.OC math.NA

    Block Acceleration Without Momentum: On Optimal Stepsizes of Block Gradient Descent for Least-Squares

    Authors: Liangzu Peng, Wotao Yin

    Abstract: Block coordinate descent is a powerful algorithmic template suitable for big data optimization. This template admits a lot of variants including block gradient descent (BGD), which performs gradient descent on a selected block of variables, while keeping other variables fixed. For a very long time, the stepsize for each block has tacitly been set to one divided by the block-wise Lipschitz smoothne…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 36 pages, accepted to ICML 2024

  15. arXiv:2405.15251  [pdf, other]

    math.OC cs.LG stat.ML

    Learning to optimize: A tutorial for continuous and mixed-integer optimization

    Authors: Xiaohan Chen, Jialin Liu, Wotao Yin

    Abstract: Learning to Optimize (L2O) stands at the intersection of traditional optimization and machine learning, utilizing the capabilities of machine learning to enhance conventional optimization techniques. As real-world optimization problems frequently share common structures, L2O provides a tool to exploit these structures for better or faster solutions. This tutorial dives deep into L2O techniques, in…

    Submitted 24 May, 2024; originally announced May 2024.

  16. arXiv:2405.14741  [pdf, other]

    math.OC cs.LG stat.ML

    Bagging Improves Generalization Exponentially

    Authors: Huajie Qian, Donghao Ying, Henry Lam, Wotao Yin

    Abstract: Bagging is a popular ensemble technique to improve the accuracy of machine learning models. It hinges on the well-established rationale that, by repeatedly retraining on resampled data, the aggregated model exhibits lower variance and hence higher stability, especially for discontinuous base learners. In this paper, we provide a new perspective on bagging: By suitably aggregating the base learners…

    Submitted 29 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: Correct author list typo

  17. arXiv:2405.10303  [pdf, other]

    hep-ph astro-ph.CO astro-ph.GA astro-ph.IM physics.bio-ph

    Asymmetric Warm Dark Matter: from Cosmological Asymmetry to Chirality of Life

    Authors: Wen Yin, Shota Nakagawa, Tamaki Murokoshi, Makoto Hattori

    Abstract: We investigate a novel scenario involving asymmetric keV-range dark matter (DM) in the form of right-handed (sterile) neutrinos. Based on the Fermi-Dirac distribution, we demonstrate that asymmetric fermionic DM forms a Fermi degenerate gas, making it potentially colder than symmetric fermionic DM. This setup simultaneously accounts for the Universe's baryon asymmetry through tiny Yukawa interacti…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 22pages, 3figures, comments are welcome

  18. arXiv:2405.10205  [pdf, other]

    cs.HC

    Exploring the Impact of ChatGPT on Wikipedia Engagement

    Authors: Neal Reeves, Wenjie Yin, Elena Simperl

    Abstract: Wikipedia is one of the most popular websites in the world, serving as a major source of information and learning resource for millions of users worldwide. While motivations for its usage vary, prior research suggests shallow information gathering -- looking up facts and information or answering questions -- dominates over more in-depth usage. On the 22nd of November 2022, ChatGPT was released to…

    Submitted 29 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: 12 pages, 4 figures, submitted to ACM Collective Intelligence

  19. arXiv:2404.19417  [pdf, other]

    cs.CV

    Physical Backdoor: Towards Temperature-based Backdoor Attacks in the Physical World

    Authors: Wen Yin, Jian Lou, Pan Zhou, Yulai Xie, Dan Feng, Yuhua Sun, Tailai Zhang, Lichao Sun

    Abstract: Backdoor attacks have been well-studied in visible light object detection (VLOD) in recent years. However, VLOD can not effectively work in dark and temperature-sensitive scenarios. Instead, thermal infrared object detection (TIOD) is the most accessible and practical in such environments. In this paper, our team is the first to investigate the security vulnerabilities associated with TIOD in the…

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: To appear in CVPR 2024.11pages, 8 figures and 4 tables

  20. arXiv:2404.18372  [pdf, other]

    nlin.SI math-ph

    Integrable semi-discretization for a modified Camassa-Holm equation with cubic nonlinearity

    Authors: Bao-Feng Feng, Heng-Chun Hu, Han-Han Sheng, Wei Yin, Guo-Fu Yu

    Abstract: In the present paper, an integrable semi-discretization of the modified Camassa-Holm (mCH) equation with cubic nonlinearity is presented. The key points of the construction are based on the discrete Kadomtsev-Petviashvili (KP) equation and appropriate definition of discrete reciprocal transformations. First, we demonstrate that these bilinear equations and their determinant solutions can be derive…

    Submitted 28 April, 2024; originally announced April 2024.

  21. arXiv:2404.15506  [pdf, other]

    cs.CV

    Metric3D v2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation

    Authors: Mu Hu, Wei Yin, Chi Zhang, Zhipeng Cai, Xiaoxiao Long, Hao Chen, Kaixuan Wang, Gang Yu, Chunhua Shen, Shaojie Shen

    Abstract: We introduce Metric3D v2, a geometric foundation model for zero-shot metric depth and surface normal estimation from a single image, which is crucial for metric 3D recovery. While depth and normal are geometrically related and highly complimentary, they present distinct challenges. SoTA monocular depth methods achieve zero-shot generalization by learning affine-invariant depths, which cannot recov…

    Submitted 21 March, 2024; originally announced April 2024.

    Comments: Our project page is at https://JUGGHM.github.io/Metric3Dv2. arXiv admin note: substantial text overlap with arXiv:2307.10984

  22. arXiv:2404.06444  [pdf, other]

    hep-ph astro-ph.GA gr-qc

    Cosmic Clues: DESI, Dark Energy, and the Cosmological Constant Problem

    Authors: Wen Yin

    Abstract: Several attempts to solve the cosmological constant problem, which concerns the value of the cosmological constant being extremely smaller than the Standard Model mass scales, have introduced a scalar field with a very flat potential that can be approximated as linear around any given position. The scalar field scans the cosmological constant in such a way that the current small value is explained…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 9 pages, 1 figure

  23. arXiv:2404.03602  [pdf, other]

    cs.CL

    Evaluating LLMs at Detecting Errors in LLM Responses

    Authors: Ryo Kamoi, Sarkar Snigdha Sarathi Das, Renze Lou, Jihyun Janice Ahn, Yilun Zhao, Xiaoxin Lu, Nan Zhang, Yusen Zhang, Ranran Haoran Zhang, Sujeeth Reddy Vummanthala, Salika Dave, Shaobo Qin, Arman Cohan, Wenpeng Yin, Rui Zhang

    Abstract: With Large Language Models (LLMs) being widely used across various tasks, detecting errors in their responses is increasingly crucial. However, little research has been conducted on error detection of LLM responses. Collecting error annotations on LLM responses is challenging due to the subjective nature of many NLP tasks, and thus previous research focuses on tasks of little practical value (e.g.…

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Benchmark and code: https://github.com/psunlpgroup/ReaLMistake

  24. arXiv:2404.01600  [pdf, other]

    cond-mat.str-el cond-mat.mtrl-sci

    C-type antiferromagnetic structure of topological semimetal CaMnSb$_2$

    Authors: Bo Li, Xu-Tao Zeng, Qianhui Xu, Fan Yang, Junsen Xiang, Hengyang Zhong, Sihao Deng, Lunhua He, Juping Xu, Wen Yin, Xingye Lu, Huiying Liu, Xian-Lei Sheng, Wentao Jin

    Abstract: Determination of the magnetic structure and confirmation of the presence or absence of inversion ($\mathcal{P}$) and time reversal ($\mathcal{T}$) symmetry is imperative for correctly understanding the topological magnetic materials. Here high-quality single crystals of the layered manganese pnictide CaMnSb$_2$ are synthesized using the self-flux method. De Haas-van Alphen oscillations indicate a…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 7 Pages, 6 figures

    Journal ref: Chinese Physics Letters 41, 037104 (2024)

  25. arXiv:2404.01592  [pdf, other]

    cond-mat.str-el cond-mat.mtrl-sci

    Structural, magnetic and magnetocaloric properties of triangular-lattice transition-metal phosphates

    Authors: Chuandi Zhang, Junsen Xiang, Quanliang Zhu, Longfei Wu, Shanfeng Zhang, Juping Xu, Wen Yin, Peijie Sun, Wei Li, Gang Su, Wentao Jin

    Abstract: The recent discovery of the spin supersolid candidate Na$_2$BaCo(PO$_4$)$_2$ stimulates numerous research interest on the triangular-lattice transition-metal phosphates. Here we report a comprehensive study on the structural, magnetic and magnetocaloric properties of polycrystalline Na$_2$$A$$T$(PO$_4$)$_2$ ($A$ = Ba, Sr; $T$ = Co, Ni, Mn). X-ray and neutron diffraction measurements confirm that N…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 10 Pages, 6 figures, accepted for publication in Physical Review Materials

    Journal ref: Physical Review Materials 8, 044409 (2024)

  26. arXiv:2403.17934  [pdf, other]

    cs.CV

    AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation

    Authors: Qingping Sun, Yanjun Wang, Ailing Zeng, Wanqi Yin, Chen Wei, Wenjia Wang, Haiyi Mei, Chi Sing Leung, Ziwei Liu, Lei Yang, Zhongang Cai

    Abstract: Expressive human pose and shape estimation (a.k.a. 3D whole-body mesh recovery) involves the human body, hand, and expression estimation. Most existing methods have tackled this task in a two-stage manner, first detecting the human body part with an off-the-shelf detection model and inferring the different human body parts individually. Despite the impressive results achieved, these methods suffer…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Homepage: https://ttxskk.github.io/AiOS/

  27. arXiv:2403.13307  [pdf, other]

    cs.CV

    LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment

    Authors: Peishan Cong, Ziyi Wang, Zhiyang Dou, Yiming Ren, Wei Yin, Kai Cheng, Yujing Sun, Xiaoxiao Long, Xinge Zhu, Yuexin Ma

    Abstract: Language-guided scene-aware human motion generation has great significance for entertainment and robotics. In response to the limitations of existing datasets, we introduce LaserHuman, a pioneering dataset engineered to revolutionize Scene-Text-to-Motion research. LaserHuman stands out with its inclusion of genuine human motions within 3D environments, unbounded free-form natural language descript…

    Submitted 21 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  28. arXiv:2403.12959  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG cs.RO

    WHAC: World-grounded Humans and Cameras

    Authors: Wanqi Yin, Zhongang Cai, Ruisi Wang, Fanzhou Wang, Chen Wei, Haiyi Mei, Weiye Xiao, Zhitao Yang, Qingping Sun, Atsushi Yamashita, Ziwei Liu, Lei Yang

    Abstract: Estimating human and camera trajectories with accurate scale in the world coordinate system from a monocular video is a highly desirable yet challenging and ill-posed problem. In this study, we aim to recover expressive parametric human models (i.e., SMPL-X) and corresponding camera poses jointly, by leveraging the synergy between three critical players: the world, the human, and the camera. Our a…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Homepage: https://wqyin.github.io/projects/WHAC/

  29. arXiv:2403.12013  [pdf, other]

    cs.CV

    GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image

    Authors: Xiao Fu, Wei Yin, Mu Hu, Kaixuan Wang, Yuexin Ma, Ping Tan, Shaojie Shen, Dahua Lin, Xiaoxiao Long

    Abstract: We introduce GeoWizard, a new generative foundation model designed for estimating geometric attributes, e.g., depth and normals, from single images. While significant research has already been conducted in this area, the progress has been substantially limited by the low diversity and poor quality of publicly available datasets. As a result, the prior works either are constrained to limited scenar…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Project page: https://fuxiao0719.github.io/projects/geowizard/

  30. arXiv:2403.11805  [pdf, other]

    cs.OS

    LLM as a System Service on Mobile Devices

    Authors: Wangsong Yin, Mengwei Xu, Yuanchun Li, Xuanzhe Liu

    Abstract: Being more powerful and intrusive into user-device interactions, LLMs are eager for on-device execution to better preserve user privacy. In this work, we propose a new paradigm of mobile AI: LLM as a system service on mobile devices (LLMaaS). Unlike traditional DNNs that execute in a stateless manner, such a system service is stateful: LLMs execution often needs to maintain persistent states (main…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Technical Report

  31. arXiv:2403.10287  [pdf, other]

    cs.CV

    Few-Shot Image Classification and Segmentation as Visual Question Answering Using Vision-Language Models

    Authors: Tian Meng, Yang Tao, Ruilin Lyu, Wuliang Yin

    Abstract: The task of few-shot image classification and segmentation (FS-CS) involves classifying and segmenting target objects in a query image, given only a few examples of the target classes. We introduce the Vision-Instructed Segmentation and Evaluation (VISE) method that transforms the FS-CS problem into the Visual Question Answering (VQA) problem, utilising Vision-Language Models (VLMs), and addresses…

    Submitted 15 March, 2024; originally announced March 2024.

  32. arXiv:2403.09407  [pdf, other]

    cs.SD cs.AI cs.LG cs.MM eess.AS

    LM2D: Lyrics- and Music-Driven Dance Synthesis

    Authors: Wenjie Yin, Xuejiao Zhao, Yi Yu, Hang Yin, Danica Kragic, Mårten Björkman

    Abstract: Dance typically involves professional choreography with complex movements that follow a musical rhythm and can also be influenced by lyrical content. The integration of lyrics in addition to the auditory dimension, enriches the foundational tone and makes motion generation more amenable to its semantic meanings. However, existing dance synthesis methods tend to model motions only conditioned on au…

    Submitted 14 March, 2024; originally announced March 2024.

  33. arXiv:2403.08881  [pdf, other]

    cond-mat.mtrl-sci

    Origin of light-induced metastability in ZrTe$_5$

    Authors: D. Nevola, N. Aryal, G. D. Gu, P. D. Johnson, W. -G. Yin, Q. Li

    Abstract: We study the non-equilibrium electronic structure of a model Dirac semimetal ZrTe$_5$ by using time-and-angle resolved photoemission spectroscopy and density functional theory-based electron and phonon calculations. By measuring the electronic dispersion near the $Γ$ point at time delays up to 10 picoseconds, we discovered that the band spectral weight does not recover during the measured temporal…

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 6 pages, 4 figures

  34. arXiv:2403.07535  [pdf, other]

    cs.CV

    Adaptive Fusion of Single-View and Multi-View Depth for Autonomous Driving

    Authors: JunDa Cheng, Wei Yin, Kaixuan Wang, Xiaozhi Chen, Shijie Wang, Xin Yang

    Abstract: Multi-view depth estimation has achieved impressive performance over various benchmarks. However, almost all current multi-view systems rely on given ideal camera poses, which are unavailable in many real-world scenarios, such as autonomous driving. In this work, we propose a new robustness benchmark to evaluate the depth estimation system under various noisy pose settings. Surprisingly, we find c…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  35. arXiv:2403.03863  [pdf, other]

    cs.CL

    X-Shot: A Unified System to Handle Frequent, Few-shot and Zero-shot Learning Simultaneously in Classification

    Authors: Hanzi Xu, Muhao Chen, Lifu Huang, Slobodan Vucetic, Wenpeng Yin

    Abstract: In recent years, few-shot and zero-shot learning, which learn to predict labels with limited annotated instances, have garnered significant attention. Traditional approaches often treat frequent-shot (freq-shot; labels with abundant instances), few-shot, and zero-shot learning as distinct challenges, optimizing systems for just one of these scenarios. Yet, in real-world settings, label occurrences…

    Submitted 6 March, 2024; originally announced March 2024.

  36. arXiv:2402.18667  [pdf, other]

    cs.CL

    FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability

    Authors: Congying Xia, Chen Xing, Jiangshu Du, Xinyi Yang, Yihao Feng, Ran Xu, Wenpeng Yin, Caiming Xiong

    Abstract: This paper presents FoFo, a pioneering benchmark for evaluating large language models' (LLMs) ability to follow complex, domain-specific formats, a crucial yet underexamined capability for their application as AI agents. Despite LLMs' advancements, existing benchmarks fail to assess their format-following proficiency adequately. FoFo fills this gap with a diverse range of real-world formats and in…

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: The first two authors contributed equally

  37. arXiv:2402.18568  [pdf, other]

    astro-ph.CO astro-ph.GA hep-ph

    A New Probe of Cosmic Birefringence Using Galaxy Polarization and Shapes

    Authors: Weichen Winston Yin, Liang Dai, Junwu Huang, Lingyuan Ji, Simone Ferraro

    Abstract: We propose a new method to search for parity-violating new physics via measurements of cosmic birefringence and demonstrate its power in detecting the topological effect originating from an axion string network with an axion-photon coupling as a motivated source of cosmic birefringence. The method, using large galaxy samples, exploits an empirical correlation between the polarization direction of…

    Submitted 28 February, 2024; originally announced February 2024.

  38. arXiv:2402.15896  [pdf, other]

    cs.CV

    Multimodal Instruction Tuning with Conditional Mixture of LoRA

    Authors: Ying Shen, Zhiyang Xu, Qifan Wang, Yu Cheng, Wenpeng Yin, Lifu Huang

    Abstract: Multimodal Large Language Models (MLLMs) have demonstrated remarkable proficiency in diverse tasks across different domains, with an increasing focus on improving their zero-shot generalization capabilities for unseen multimodal tasks. Multimodal instruction tuning has emerged as a successful strategy for achieving zero-shot generalization by fine-tuning pre-trained models on diverse multimodal ta…

    Submitted 24 February, 2024; originally announced February 2024.

    Comments: 8 pages, multimodal instruction tuning

  39. arXiv:2402.14650  [pdf, other]

    cs.CV

    GaussianPro: 3D Gaussian Splatting with Progressive Propagation

    Authors: Kai Cheng, Xiaoxiao Long, Kaizhi Yang, Yao Yao, Wei Yin, Yuexin Ma, Wenping Wang, Xuejin Chen

    Abstract: The advent of 3D Gaussian Splatting (3DGS) has recently brought about a revolution in the field of neural rendering, facilitating high-quality renderings at real-time speed. However, 3DGS heavily depends on the initialized point cloud produced by Structure-from-Motion (SfM) techniques. When tackling with large-scale scenes that unavoidably contain texture-less surfaces, the SfM techniques always f…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: See the project page for code, data: https://kcheng1021.github.io/gaussianpro.github.io

  40. arXiv:2402.11791  [pdf, other]

    cs.CV

    SDGE: Stereo Guided Depth Estimation for 360$^\circ$ Camera Sets

    Authors: Jialei Xu, Wei Yin, Dong Gong, Junjun Jiang, Xianming Liu

    Abstract: Depth estimation is a critical technology in autonomous driving, and multi-camera systems are often used to achieve a 360$^\circ$ perception. These 360$^\circ$ camera sets often have limited or low-quality overlap regions, making multi-view stereo methods infeasible for the entire image. Alternatively, monocular methods may not produce consistent cross-view predictions. To address these issues, we…

    Submitted 2 April, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  41. arXiv:2402.11592  [pdf, other]

    cs.LG cs.CL

    Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark

    Authors: Yihua Zhang, Pingzhi Li, Junyuan Hong, Jiaxiang Li, Yimeng Zhang, Wenqing Zheng, Pin-Yu Chen, Jason D. Lee, Wotao Yin, Mingyi Hong, Zhangyang Wang, Sijia Liu, Tianlong Chen

    Abstract: In the evolving landscape of natural language processing (NLP), fine-tuning pre-trained Large Language Models (LLMs) with first-order (FO) optimizers like SGD and Adam has become standard. Yet, as LLMs grow in size, the substantial memory overhead from back-propagation (BP) for FO gradient computation presents a significant challenge. Addressing this issue is crucial, especially for applications…

    Submitted 27 May, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  42. arXiv:2402.11138  [pdf, other]

    cs.CL cs.AI cs.LG

    Contrastive Instruction Tuning

    Authors: Tianyi Lorena Yan, Fei Wang, James Y. Huang, Wenxuan Zhou, Fan Yin, Aram Galstyan, Wenpeng Yin, Muhao Chen

    Abstract: Instruction tuning has been used as a promising approach to improve the performance of large language models (LLMs) on unseen tasks. However, current LLMs exhibit limited robustness to unseen instructions, generating inconsistent outputs when the same instruction is phrased with slightly varied forms or language styles. This behavior indicates LLMs' lack of robustness to textual variations and gen…

    Submitted 6 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: ACL 2024 Findings

  43. arXiv:2402.11122  [pdf, other]

    cs.CL cs.AI

    Navigating the Dual Facets: A Comprehensive Evaluation of Sequential Memory Editing in Large Language Models

    Authors: Zihao Lin, Mohammad Beigi, Hongxuan Li, Yufan Zhou, Yuxiang Zhang, Qifan Wang, Wenpeng Yin, Lifu Huang

    Abstract: Memory Editing (ME) has emerged as an efficient method to modify erroneous facts or inject new facts into Large Language Models (LLMs). Two mainstream ME methods exist: parameter-modifying ME and parameter-preserving ME (integrating extra modules while preserving original parameters). Regrettably, previous studies on ME evaluation have two critical limitations: (i) evaluating LLMs with single edit…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: preprint, 15 pages

  44. arXiv:2402.11095  [pdf, other]

    cs.CV

    GIM: Learning Generalizable Image Matcher From Internet Videos

    Authors: Xuelun Shen, Zhipeng Cai, Wei Yin, Matthias Müller, Zijun Li, Kaixuan Wang, Xiaozhi Chen, Cheng Wang

    Abstract: Image matching is a fundamental computer vision problem. While learning-based methods achieve state-of-the-art performance on existing benchmarks, they generalize poorly to in-the-wild images. Such methods typically need to train separate models for different scene types and are impractical when the scene type is unknown in advance. One of the underlying problems is the limited scalability of exis…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted to ICLR 2024 for spotlight presentation

  45. arXiv:2402.10874  [pdf, other]

    cond-mat.mtrl-sci cs.LG physics.comp-ph

    Design of 2D Skyrmionic Metamaterial Through Controlled Assembly

    Authors: Qichen Xu, Zhuanglin Shen, Alexander Edström, I. P. Miranda, Zhiwei Lu, Anders Bergman, Danny Thonig, Wanjian Yin, Olle Eriksson, Anna Delin

    Abstract: Despite extensive research on magnetic skyrmions and antiskyrmions, a significant challenge remains in crafting nontrivial high-order skyrmionic textures with varying, or even tailor-made, topologies. We address this challenge by focusing on a construction pathway of skyrmionic metamaterial within a monolayer thin film and suggest several promising lattice-like, flakes-like, and cell-like skyrmi…

    Submitted 16 February, 2024; originally announced February 2024.

  46. arXiv:2402.09501  [pdf, other]

    hep-ph astro-ph.CO

    Bubble Misalignment Mechanism for Axions

    Authors: Junseok Lee, Kai Murai, Fuminobu Takahashi, Wen Yin

    Abstract: We study the dynamics of axions at first-order phase transitions in non-Abelian gauge theories. When the duration of the phase transition is short compared to the timescale of the axion oscillations, the axion dynamics is similar to the trapped misalignment mechanism. On the other hand, if this is not the case, the axions are initially expelled from the inside of the bubbles, generating axion wave…

    Submitted 18 March, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: 15 pages, 14 figures, 1 table, v2: added references, corrected an error in axion number at transmission, conclusions unchanged

    Report number: TU-1221

  47. arXiv:2402.07976  [pdf, other]

    astro-ph.CO astro-ph.GA astro-ph.IM hep-ph

    First Result for Dark Matter Search by WINERED

    Authors: Wen Yin, Taiki Bessho, Yuji Ikeda, Hitomi Kobayashi, Daisuke Taniguchi, Hiroaki Sameshima, Noriyuki Matsunaga, Shogo Otsubo, Yuki Sarugaku, Tomomi Takeuchi, Haruki Kato, Satoshi Hamano, Hideyo Kawakita

    Abstract: The identity of dark matter has been a mystery in astronomy, cosmology, and particle theory for about a century. Bessho, Ikeda, and Yin (2022), three of the current authors, proposed using the state-of-the-art infrared spectrographs, including WINERED at $6.5$m Magellan Clay telescope and NIRSpec at James Webb Space Telescope, as efficient detectors for the indirect detection of dark matter with t…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: 15 pages, 4 figures, 1 table, 6 data files attached

    Report number: TU-1220

  48. arXiv:2402.07099  [pdf, other]

    cs.LG math.OC

    Rethinking the Capacity of Graph Neural Networks for Branching Strategy

    Authors: Ziang Chen, Jialin Liu, Xiaohan Chen, Xinshang Wang, Wotao Yin

    Abstract: Graph neural networks (GNNs) have been widely used to predict properties and heuristics of mixed-integer linear programs (MILPs) and hence accelerate MILP solvers. This paper investigates the capacity of GNNs to represent strong branching (SB), the most effective yet computationally expensive heuristic employed in the branch-and-bound algorithm. In the literature, message-passing GNN (MP-GNN), as…

    Submitted 8 June, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

  49. arXiv:2402.07070  [pdf, ps, other]

    math.OC

    Efficient Algorithms for Sum-of-Minimum Optimization

    Authors: Lisang Ding, Ziang Chen, Xinshang Wang, Wotao Yin

    Abstract: In this work, we propose a novel optimization model termed "sum-of-minimum" optimization. This model seeks to minimize the sum or average of $N$ objective functions over $k$ parameters, where each objective takes the minimum value of a predefined sub-function with respect to the $k$ parameters. This universal framework encompasses numerous clustering applications in machine learning and related fi…

    Submitted 9 June, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

  50. arXiv:2402.00157  [pdf, other]

    cs.CL

    Large Language Models for Mathematical Reasoning: Progresses and Challenges

    Authors: Janice Ahn, Rishu Verma, Renze Lou, Di Liu, Rui Zhang, Wenpeng Yin

    Abstract: Mathematical reasoning serves as a cornerstone for assessing the fundamental cognitive capabilities of human intelligence. In recent times, there has been a notable surge in the development of Large Language Models (LLMs) geared towards the automated resolution of mathematical problems. However, the landscape of mathematical problem types is vast and varied, with LLM-oriented techniques undergoing…

    Submitted 5 April, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

    Comments: EACL 2024 Student Research Workshop, 8 pages