Showing 1–50 of 320 results for author: Yao, C

  1. arXiv:2406.17255  [pdf, other]

    cs.CL

    MPCODER: Multi-user Personalized Code Generator with Explicit and Implicit Style Representation Learning

    Authors: Zhenlong Dai, Chang Yao, WenKang Han, Ying Yuan, Zhipeng Gao, Jingyuan Chen

    Abstract: Large Language Models (LLMs) have demonstrated great potential for assisting developers in their daily development. However, most research focuses on generating correct code; how to use LLMs to generate personalized code has seldom been investigated. To bridge this gap, we proposed MPCoder (Multi-user Personalized Code Generator) to generate personalized code for multiple users. To better learn co…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL 2024, Main Conference

  2. arXiv:2406.11382  [pdf, other]

    hep-ph hep-ex

    Baryon-number-violating nucleon decays in ALP effective field theories

    Authors: Tong Li, Michael A. Schmidt, Chang-Yuan Yao

    Abstract: The search for baryon-number-violating (BNV) nucleon decay is an intriguing probe of new physics beyond the SM in future neutrino experiments with enhanced sensitivity. The dark sector states such as an axion or axion-like particle (ALP) can induce nucleon decays with distinct signature and kinematics from the conventional nucleon decays. In this work, we study the ALP effective field theories (EF…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 28 pages, 4 figures, 7 tables

    Report number: CPPC-2024-05, DESY-24-082

  3. arXiv:2406.06062  [pdf, other]

    cs.CV cs.AI

    ProcessPainter: Learn Painting Process from Sequence Data

    Authors: Yiren Song, Shijie Huang, Chen Yao, Xiaojun Ye, Hai Ci, Jiaming Liu, Yuxuan Zhang, Mike Zheng Shou

    Abstract: The painting process of artists is inherently stepwise and varies significantly among different painters and styles. Generating detailed, step-by-step painting processes is essential for art education and research, yet remains largely underexplored. Traditional stroke-based rendering methods break down images into sequences of brushstrokes, yet they fall short of replicating the authentic processe…

    Submitted 10 June, 2024; originally announced June 2024.

  4. arXiv:2406.03639  [pdf, ps, other]

    math.DG math-ph math.SG

    Gravitating vortices and Symplectic Reduction by Stages

    Authors: L. Álvarez-Cónsul, M. Garcia-Fernandez, O. García-Prada, V. P. Pingali, C. -J. Yao

    Abstract: We undertake a novel approach to the existence problem for gravitating vortices on a Riemann surface based on symplectic reduction by stages, which seems to be new in the PDE as well as the gauge theory literature. The main technical tool for our study is the reduced $α$-K-energy, for which we establish convexity properties by means of finite-energy pluripotential theory, as recently applied to th…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 48 pages, no figures, comments are welcome

    MSC Class: Primary 53C07; Secondary 53D20; 53C25

  5. arXiv:2406.02430  [pdf, other]

    eess.AS cs.SD

    Seed-TTS: A Family of High-Quality Versatile Speech Generation Models

    Authors: Philip Anastassiou, Jiawei Chen, Jitong Chen, Yuanzhe Chen, Zhuo Chen, Ziyi Chen, Jian Cong, Lelai Deng, Chuang Ding, Lu Gao, Mingqing Gong, Peisong Huang, Qingqing Huang, Zhiying Huang, Yuanyuan Huo, Dongya Jia, Chumin Li, Feiya Li, Hui Li, Jiaxin Li, Xiaoyang Li, Xingxing Li, Lin Liu, Shouda Liu, Sichao Liu, et al. (21 additional authors not shown)

    Abstract: We introduce Seed-TTS, a family of large-scale autoregressive text-to-speech (TTS) models capable of generating speech that is virtually indistinguishable from human speech. Seed-TTS serves as a foundation model for speech generation and excels in speech in-context learning, achieving performance in speaker similarity and naturalness that matches ground truth human speech in both objective and sub…

    Submitted 4 June, 2024; originally announced June 2024.

  6. arXiv:2406.00094  [pdf, other]

    hep-ph

    The flavor invariants of the $ν$SM

    Authors: Christophe Grojean, Jonathan Kley, Damien Leflot, Chang-Yuan Yao

    Abstract: Sixty years after the experimental discovery of CP violation in the quark sector, the existence of a similar CP violation in the lepton sector is still to be established. Actually, the structure of such a violation depends crucially on the origin of the neutrino masses. In an attempt at categorizing the leptonic sources of CP violation, we studied the $ν$SM, the Standard Model extended with three…

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: 27 pages + appendices, 3 figures

    Report number: CERN-TH-2024-076, DESY-24-021, HU-EP-24/14

  7. arXiv:2405.18458  [pdf]

    cs.LG physics.optics

    Asymmetrical estimator for training grey-box deep photonic neural networks

    Authors: Yizhi Wang, Minjia Chen, Chunhui Yao, Jie Ma, Ting Yan, Richard Penty, Qixiang Cheng

    Abstract: Physical neural networks (PNNs) are emerging paradigms for neural network acceleration due to their high-bandwidth, in-propagation analogue processing. Despite the advantages of PNN for inference, training remains a challenge. The imperfect information of the physical transformation means the failure of conventional gradient-based updates from backpropagation (BP). Here, we present the asymmetrica…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 17 pages, 5 figures

    MSC Class: 78-05

  8. arXiv:2405.14336  [pdf, other]

    eess.IV

    I$^2$VC: A Unified Framework for Intra- & Inter-frame Video Compression

    Authors: Meiqin Liu, Chenming Xu, Yukai Gu, Chao Yao, Yao Zhao

    Abstract: Video compression aims to reconstruct seamless frames by encoding the motion and residual information from existing frames. Previous neural video compression methods necessitate distinct codecs for three types of frames (I-frame, P-frame and B-frame), which hinders a unified approach and generalization across different video contexts. Intra-codec techniques lack the advanced Motion Estimation and…

    Submitted 1 June, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 19 pages, 10 figures

  9. arXiv:2405.02313  [pdf, ps, other]

    physics.flu-dyn

    Physics-informed Data-driven Cavitation Model for a Specific MG EOS

    Authors: Minsheng Huang, Chengbao Yao, Pan Wang, Lidong Cheng, Wenjun Ying

    Abstract: We present a novel one-fluid cavitation model of a specific Mie-Grüneisen equation of state (EOS), named polynomial EOS, based on an artificial neural network. Not only the physics-informed equation but also the experimental data are embedded into the proposed model by an optimization problem. The physics-informed data-driven model provides the concerned pressure within the cavitation region, where…

    Submitted 5 April, 2024; originally announced May 2024.

    Comments: 29 pages, 18 figures

  10. arXiv:2405.00277  [pdf, other]

    quant-ph

    The strong-coupling quantum thermodynamics of quantum Brownian motion based on the exact solution of its reduced density matrix

    Authors: Chuan-Zhe Yao, Wei-Min Zhang

    Abstract: We derive the quantum thermodynamics of quantum Brownian motion from the exact solution of its reduced density matrix. We start from the total equilibrium thermal state between the Brownian particle and its reservoir, and solve analytically and exactly the reduced density matrix of the system by taking the partial trace over all the reservoir states. We find that the reduced Hamiltonian and the re…

    Submitted 5 July, 2024; v1 submitted 30 April, 2024; originally announced May 2024.

    Comments: 21 pages, 5 figures

  11. arXiv:2404.15016  [pdf, ps, other]

    math.DG math.SG

    Convergence of the hypersymplectic flow on $T^4$ with $T^3$-symmetry

    Authors: Joel Fine, Weiyong He, Chengjian Yao

    Abstract: A hypersymplectic structure on a 4-manifold is a triple $ω_1, ω_2, ω_3$ of 2-forms for which every non-trivial linear combination $a^1ω_1 + a^2 ω_2 + a^3 ω_3$ is a symplectic form. Donaldson has conjectured that when the underlying manifold is compact, any such structure is isotopic in its cohomology class to a hyperkähler triple. We prove this conjecture for a hypersymplectic structure on $T^4$ wh…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 25 pages

    MSC Class: 58J35; 53C26; 53D05

  12. arXiv:2404.13600  [pdf, other]

    cs.RO

    Are We Ready for Planetary Exploration Robots? The TAIL-Plus Dataset for SLAM in Granular Environments

    Authors: Zirui Wang, Chen Yao, Yangtao Ge, Guowei Shi, Ningbo Yang, Zheng Zhu, Kewei Dong, Hexiang Wei, Zhenzhong Jia, Jing Wu

    Abstract: So far, planetary surface exploration depends on various mobile robot platforms. The autonomous navigation and decision-making of these mobile robots in complex terrains largely rely on their terrain-aware perception, localization and mapping capabilities. In this paper we release the TAIL-Plus dataset, a new challenging dataset in deformable granular environments for planetary exploration robots,…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: Accepted to the IEEE ICRA Workshop on Field Robotics 2024

  13. arXiv:2404.09986  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Thermal conversion of ultrathin nickel hydroxide for wide bandgap 2D nickel oxides

    Authors: Lu Ping, Nicholas Russo, Zifan Wang, Ching-Hsiang Yao, Kevin E. Smith, Xi Ling

    Abstract: Wide bandgap (WBG) semiconductors (Eg >2.0 eV) are integral to the advancement of next generation electronics, optoelectronics, and power industries, owing to their capability for high temperature operation, high breakdown voltage and efficient light emission. Enhanced power efficiency and functional performance can be attained through miniaturization, specifically via the integration of device fa…

    Submitted 15 April, 2024; originally announced April 2024.

  14. arXiv:2404.06853  [pdf, other]

    cond-mat.mtrl-sci

    Revealing mechanism of pore defect formation in laser directed energy deposition of aluminum alloy via in-situ synchrotron X-ray imaging

    Authors: Wei Liu, Yuxiao Li, Chunxia Yao, Dongsheng Zhang, Darui Sun, Sen Chen, Yu Wu, Jun Wang, Lei Lud, Sheng-Nian Luo, Ye Tao, Bingbing Zhang

    Abstract: Laser metal additive manufacturing technology is capable of producing components with complex geometries and compositions that cannot be realized by conventional manufacturing methods. However, a large number of pores generated during the additive manufacturing process greatly affect the mechanical properties of the additively manufactured parts, and the mechanism of such pore generation has not b…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 7 figures

  15. arXiv:2404.05225  [pdf, other]

    cs.CV cs.CL

    LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding

    Authors: Chuwei Luo, Yufan Shen, Zhaoqing Zhu, Qi Zheng, Zhi Yu, Cong Yao

    Abstract: Recently, leveraging large language models (LLMs) or multimodal large language models (MLLMs) for document understanding has been proven very promising. However, previous works that employ LLMs/MLLMs for document understanding have not fully explored and utilized the document layout information, which is vital for precise document understanding. In this paper, we propose LayoutLLM, an LLM/MLLM bas…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: CVPR 2024

  16. arXiv:2403.19128  [pdf, other]

    cs.CV

    OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition

    Authors: Jianqiang Wan, Sibo Song, Wenwen Yu, Yuliang Liu, Wenqing Cheng, Fei Huang, Xiang Bai, Cong Yao, Zhibo Yang

    Abstract: Recently, visually-situated text parsing (VsTP) has experienced notable advancements, driven by the increasing demand for automated document understanding and the emergence of Generative Large Language Models (LLMs) capable of processing document-based questions. Various methods have been proposed to address the challenging problem of VsTP. However, due to the diversified targets and heterogeneous…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  17. arXiv:2403.17842  [pdf, other]

    quant-ph cond-mat.str-el

    Experimental Realization of Discrete Time Quasi-Crystals

    Authors: Guanghui He, Bingtian Ye, Ruotian Gong, Changyu Yao, Zhongyuan Liu, Kater W. Murch, Norman Y. Yao, Chong Zu

    Abstract: Floquet (periodically driven) systems can give rise to unique non-equilibrium phases of matter without equilibrium analogs. The most prominent example is the realization of discrete time crystals. An intriguing question emerges: what other novel phases can manifest when the constraint of time periodicity is relaxed? In this study, we explore quantum systems subjected to a quasi-periodic drive. Lev…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 7+5 pages, 4+5 figures

  18. arXiv:2403.16875  [pdf, other]

    cs.RO

    TAIL: A Terrain-Aware Multi-Modal SLAM Dataset for Robot Locomotion in Deformable Granular Environments

    Authors: Chen Yao, Yangtao Ge, Guowei Shi, Zirui Wang, Ningbo Yang, Zheng Zhu, Hexiang Wei, Yuntian Zhao, Jing Wu, Zhenzhong Jia

    Abstract: Terrain-aware perception holds the potential to improve the robustness and accuracy of autonomous robot navigation in the wild, thereby facilitating effective off-road traversals. However, the lack of multi-modal perception across various motion patterns hinders the solutions of Simultaneous Localization And Mapping (SLAM), especially when confronting non-geometric hazards in demanding landscapes…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Submitted to IEEE Robotics and Automation Letters

  19. arXiv:2403.16662  [pdf, other]

    cs.CL

    RU22Fact: Optimizing Evidence for Multilingual Explainable Fact-Checking on Russia-Ukraine Conflict

    Authors: Yirong Zeng, Xiao Ding, Yi Zhao, Xiangyu Li, Jie Zhang, Chao Yao, Ting Liu, Bing Qin

    Abstract: Fact-checking is the task of verifying the factuality of a given claim by examining the available evidence. High-quality evidence plays a vital role in enhancing fact-checking systems and facilitating the generation of explanations that are understandable to humans. However, the provision of both sufficient and relevant evidence for explainable fact-checking systems poses a challenge. To tackle th…

    Submitted 26 March, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: 12 pages, 3 figures, accepted by lrec-coling2024

  20. arXiv:2403.14023  [pdf]

    cs.CR

    A system capable of verifiably and privately screening global DNA synthesis

    Authors: Carsten Baum, Jens Berlips, Walther Chen, Hongrui Cui, Ivan Damgard, Jiangbin Dong, Kevin M. Esvelt, Mingyu Gao, Dana Gretton, Leonard Foner, Martin Kysel, Kaiyi Zhang, Juanru Li, Xiang Li, Omer Paneth, Ronald L. Rivest, Francesca Sage-Ling, Adi Shamir, Yue Shen, Meicen Sun, Vinod Vaikuntanathan, Lynn Van Hauwe, Theia Vogel, Benjamin Weinstein-Raun, Yun Wang, et al. (5 additional authors not shown)

    Abstract: Printing custom DNA sequences is essential to scientific and biomedical research, but the technology can be used to manufacture plagues as well as cures. Just as ink printers recognize and reject attempts to counterfeit money, DNA synthesizers and assemblers should deny unauthorized requests to make viral DNA that could be used to ignite a pandemic. There are three complications. First, we don't n…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Main text 10 pages, 4 figures. 5 supplementary figures. Total 21 pages. Direct correspondence to: Ivan B. Damgard (ivan@cs.au.dk), Andrew C. Yao (andrewcyao@mail.tsinghua.edu.cn), Kevin M. Esvelt (esvelt@mit.edu)

  21. arXiv:2403.13761  [pdf, other]

    cs.CV

    HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition

    Authors: Yuyi Zhang, Yuanzhi Zhu, Dezhi Peng, Peirong Zhang, Zhenhua Yang, Zhibo Yang, Cong Yao, Lianwen Jin

    Abstract: Text recognition, especially for complex scripts like Chinese, faces unique challenges due to its intricate character structures and vast vocabulary. Traditional one-hot encoding methods struggle with the representation of hierarchical radicals, recognition of Out-Of-Vocabulary (OOV) characters, and on-device deployment due to their computational intensity. To address these challenges, we propose…

    Submitted 20 March, 2024; originally announced March 2024.

  22. arXiv:2403.13065  [pdf, other]

    hep-ph hep-th

    Aligned Yet Large Dipoles: a SMEFT Study

    Authors: Quentin Bonnefoy, Jonathan Kley, Di Liu, Alejo N. Rossia, Chang-Yuan Yao

    Abstract: We study a non-universal flavor scenario at the level of the Standard Model Effective Field Theory, according to which the matrix of Wilson coefficients $c_{uW}$ of an up-type electroweak quark dipole operator is aligned with the up-type Yukawa coupling. Such an alignment usually follows from the assumption of Minimal Flavor Violation (MFV), away from which we step by allowing the entries of…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 35 pages, 6 figures, 7 tables. Comments are welcomed

    Report number: DESY-24-033, HU-EP-24/09, LAPTH-011/24, COMETA-2024-004

  23. arXiv:2403.12008  [pdf, other]

    cs.CV

    SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion

    Authors: Vikram Voleti, Chun-Han Yao, Mark Boss, Adam Letts, David Pankratz, Dmitry Tochilkin, Christian Laforte, Robin Rombach, Varun Jampani

    Abstract: We present Stable Video 3D (SV3D) -- a latent video diffusion model for high-resolution, image-to-multi-view generation of orbital videos around a 3D object. Recent work on 3D generation proposes techniques to adapt 2D generative models for novel view synthesis (NVS) and 3D optimization. However, these methods have several disadvantages due to either limited views or inconsistent NVS, thereby affec…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Project page: https://sv3d.github.io/

  24. arXiv:2403.11221  [pdf, other]

    cs.DC cs.DB

    Lion: Minimizing Distributed Transactions through Adaptive Replica Provision (Extended Version)

    Authors: Qiushi Zheng, Zhanhao Zhao, Wei Lu, Chang Yao, Yuxing Chen, Anqun Pan, Xiaoyong Du

    Abstract: Distributed transaction processing often involves multiple rounds of cross-node communications, and therefore tends to be slow. To improve performance, existing approaches convert distributed transactions into single-node transactions by either migrating co-accessed partitions onto the same nodes or establishing a super node housing replicas of the entire database. However, migration-based methods…

    Submitted 17 March, 2024; originally announced March 2024.

  25. arXiv:2403.10357  [pdf, other]

    cs.CV cs.GR

    ANIM: Accurate Neural Implicit Model for Human Reconstruction from a single RGB-D image

    Authors: Marco Pesavento, Yuanlu Xu, Nikolaos Sarafianos, Robert Maier, Ziyan Wang, Chun-Han Yao, Marco Volino, Edmond Boyer, Adrian Hilton, Tony Tung

    Abstract: Recent progress in human shape learning shows that neural implicit models are effective in generating 3D human surfaces from a limited number of views, and even from a single RGB image. However, existing monocular approaches still struggle to recover fine geometric details such as face, hands or cloth wrinkles. They are also easily prone to depth ambiguities that result in distorted geometries alon…

    Submitted 18 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR24; Project page: https://marcopesavento.github.io/ANIM/

  26. arXiv:2403.00496  [pdf]

    physics.optics physics.app-ph

    Benchmarking reconstructive spectrometer with multi-resonant cavities

    Authors: Chunhui Yao, Kangning Xu, Tianhua Lin, Jie Ma, Chumeng Yao, Peng Bao, Zhitian Shi, Richard Penty, Qixiang Cheng

    Abstract: Recent years have seen the rapid development of miniaturized reconstructive spectrometers (RSs), yet they still confront a range of technical challenges, such as bandwidth/resolution ratio, sensing speed, and/or power efficiency. Reported RS designs often suffer from insufficient decorrelation between sampling channels, which results in limited compressive sampling efficiency, in essence, due to i…

    Submitted 1 March, 2024; originally announced March 2024.

  27. arXiv:2402.17232  [pdf, other]

    math.NA cs.LG physics.comp-ph

    Two-scale Neural Networks for Partial Differential Equations with Small Parameters

    Authors: Qiao Zhuang, Chris Ziyi Yao, Zhongqiang Zhang, George Em Karniadakis

    Abstract: We propose a two-scale neural network method for solving partial differential equations (PDEs) with small parameters using physics-informed neural networks (PINNs). We directly incorporate the small parameters into the architecture of neural networks. The proposed method enables solving PDEs with small parameters in a simple fashion, without adding Fourier features or other computationally taxing…

    Submitted 27 February, 2024; originally announced February 2024.

    MSC Class: 65N35; 35B25

    ACM Class: I.2.6

  28. arXiv:2402.09152  [pdf, other]

    cs.LG

    Improved Regret for Bandit Convex Optimization with Delayed Feedback

    Authors: Yuanyu Wan, Chang Yao, Mingli Song, Lijun Zhang

    Abstract: We investigate bandit convex optimization (BCO) with delayed feedback, where only the loss value of the action is revealed under an arbitrary delay. Let $n,T,\bar{d}$ denote the dimensionality, time horizon, and average delay, respectively. Previous studies have achieved an $O(\sqrt{n}T^{3/4}+(n\bar{d})^{1/3}T^{2/3})$ regret bound for this problem, whose delay-independent part matches the regret o…

    Submitted 23 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  29. arXiv:2402.07625  [pdf, other]

    cs.CL cs.AI cs.LG

    Autonomous Data Selection with Language Models for Mathematical Texts

    Authors: Yifan Zhang, Yifan Luo, Yang Yuan, Andrew Chi-Chih Yao

    Abstract: To improve language models' proficiency in mathematical reasoning via continual pretraining, we introduce a novel strategy that leverages base language models for autonomous data selection. Departing from conventional supervised fine-tuning or trained classifiers with human-annotated data, our approach Autonomous Data Selection (AutoDS) utilizes meta-prompted language models as zero-shot verifiers…

    Submitted 2 April, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  30. arXiv:2402.06641  [pdf, other]

    math.NT

    A Survey of a Random Matrix Model for a Family of Cusp Forms

    Authors: Owen Barrett, Zoë X. Batterman, Aditya Jambhale, Steven J. Miller, Akash L. Narayanan, Kishan Sharma, Chris Yao

    Abstract: The Katz-Sarnak philosophy states that statistics of zeros of $L$-function families near the central point as the conductors tend to infinity agree with those of eigenvalues of random matrix ensembles as the matrix size tends to infinity. While numerous results support this conjecture, S. J. Miller observed that for finite conductors, very different behavior can occur for zeros near the central po…

    Submitted 17 April, 2024; v1 submitted 28 January, 2024; originally announced February 2024.

    Comments: 28 pages, 7 figures

    MSC Class: 11M26; 11M50

  31. arXiv:2401.17372  [pdf, other]

    quant-ph physics.bio-ph

    Optically-Trapped Nanodiamond-Relaxometry Detection of Nanomolar Paramagnetic Spins in Aqueous Environments

    Authors: Shiva Iyer, Changyu Yao, Olivia Lazorik, Pengyun Wang, Gianna Glenn, Michael Mohs, Yinyao Shi, Michael Mansour, Erik Henriksen, Kater Murch, Shankar Mukherji, Chong Zu

    Abstract: Probing electrical and magnetic properties in aqueous environments remains a frontier challenge in nanoscale sensing. Our inability to do so with quantitative accuracy imposes severe limitations, for example, on our understanding of the ionic environments in a diverse array of systems, ranging from novel materials to the living cell. The Nitrogen-Vacancy (NV) center in fluorescent nanodiamonds (FN…

    Submitted 20 February, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: 6+7 pages, 3+8 figures

  32. arXiv:2401.17254  [pdf, other]

    math.NT

    Limiting Behavior in Missing Sums of Sumsets

    Authors: Aditya Jambhale, Rauan Kaldybayev, Steven J. Miller, Chris Yao

    Abstract: We study $|A + A|$ as a random variable, where $A \subseteq \{0, \dots, N\}$ is a random subset such that each $0 \le n \le N$ is included with probability $0 < p < 1$, and where $A + A$ is the set of sums $a + b$ for $a,b$ in $A$. Lazarev, Miller, and O'Bryant studied the distribution of $2N + 1 - |A + A|$, the number of summands not represented in $A + A$ when $p = 1/2$. A recent paper by Chu, K…

    Submitted 1 February, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: 25 pages, 6 figures

    MSC Class: 11P99; 11B13

  33. arXiv:2401.09003  [pdf, other]

    cs.CL cs.AI cs.LG

    Augmenting Math Word Problems via Iterative Question Composing

    Authors: Haoxiong Liu, Yifan Zhang, Yifan Luo, Andrew Chi-Chih Yao

    Abstract: Despite the advancements in large language models (LLMs) for mathematical reasoning, solving competition-level math problems remains a significant challenge, especially for open-source LLMs without external tools. We introduce the MMIQC dataset, comprising a mixture of processed web data and synthetic question-response pairs, aimed at enhancing the mathematical reasoning capabilities of base langu…

    Submitted 10 February, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

  34. arXiv:2401.07030  [pdf, ps, other]

    math.AP

    Subsonic Euler flows in a three-dimensional finitely long cylinder with arbitrary cross section

    Authors: Shangkun Weng, Changkui Yao

    Abstract: This paper concerns the well-posedness of subsonic flows in a three-dimensional finitely long cylinder with arbitrary cross section. We establish the existence and uniqueness of subsonic flows in the Sobolev space by prescribing the normal component of the momentum, the vorticity, the entropy, and Bernoulli's quantity at the entrance and the normal component of the momentum at the exit. One of the…

    Submitted 13 January, 2024; originally announced January 2024.

    MSC Class: 35M12; 76G25; 76N10

  35. arXiv:2401.05638  [pdf, other]

    cs.CV

    MatSAM: Efficient Extraction of Microstructures of Materials via Visual Large Model

    Authors: Changtai Li, Xu Han, Chao Yao, Xiaojuan Ban

    Abstract: Efficient and accurate extraction of microstructures in micrographs of materials is essential in process optimization and the exploration of structure-property relationships. Deep learning-based image segmentation techniques that rely on manual annotation are laborious and time-consuming and hardly meet the demand for model transferability and generalization on various source images. Segment Anyth…

    Submitted 2 March, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: 18 pages, 8 figures, and 5 tables. Updated with revision and code repository

  36. arXiv:2401.05412  [pdf, other]

    cs.CV cs.AI eess.SP

    Spatial-Related Sensors Matters: 3D Human Motion Reconstruction Assisted with Textual Semantics

    Authors: Xueyuan Yang, Chao Yao, Xiaojuan Ban

    Abstract: Leveraging wearable devices for motion reconstruction has emerged as an economical and viable technique. Certain methodologies employ sparse Inertial Measurement Units (IMUs) on the human body and harness data-driven strategies to model human poses. However, the reconstruction of motion based solely on sparse IMU data is inherently fraught with ambiguity, a consequence of numerous identical IMU r…

    Submitted 26 December, 2023; originally announced January 2024.

    Comments: Accepted by AAAI 2024

  37. arXiv:2401.01522  [pdf, other]

    cs.CV

    LORE++: Logical Location Regression Network for Table Structure Recognition with Pre-training

    Authors: Rujiao Long, Hangdi Xing, Zhibo Yang, Qi Zheng, Zhi Yu, Cong Yao, Fei Huang

    Abstract: Table structure recognition (TSR) aims at extracting tables in images into machine-understandable formats. Recent methods solve this problem by predicting the adjacency relations of detected cell boxes or learning to directly generate the corresponding markup sequences from the table images. However, existing approaches either count on additional heuristic rules to recover the table structures, or…

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2303.03730

  38. arXiv:2312.12142  [pdf, other]

    cs.CV cs.AI

    FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

    Authors: Zhenhua Yang, Dezhi Peng, Yuxin Kong, Yuyi Zhang, Cong Yao, Lianwen Jin

    Abstract: Automatic font generation is an imitation task, which aims to create a font library that mimics the style of reference images while preserving the content from source images. Although existing font generation methods have achieved satisfactory performance, they still struggle with complex characters and large style variations. To address these issues, we propose FontDiffuser, a diffusion-based ima…

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024; Github Page: https://github.com/yeungchenwa/FontDiffuser

    Journal ref: 38th AAAI Conference on Artificial Intelligence (AAAI2024), Vancouver, BC, Canada, 2024

  39. arXiv:2312.09613  [pdf, other]

    cs.LG cs.AI stat.ML

    Rethinking Causal Relationships Learning in Graph Neural Networks

    Authors: Hang Gao, Chengyu Yao, Jiangmeng Li, Lingyu Si, Yifan Jin, Fengge Wu, Changwen Zheng, Huaping Liu

    Abstract: Graph Neural Networks (GNNs) demonstrate their significance by effectively modeling complex interrelationships within graph-structured data. To enhance the credibility and robustness of GNNs, it becomes exceptionally crucial to bolster their ability to capture causal relationships. However, despite recent advancements that have indeed strengthened GNNs from a causal learning perspective, conductin…

    Submitted 15 December, 2023; originally announced December 2023.

  40. arXiv:2312.07823  [pdf, other]

    cs.CV

    Semantic Lens: Instance-Centric Semantic Alignment for Video Super-Resolution

    Authors: Qi Tang, Yao Zhao, Meiqin Liu, Jian Jin, Chao Yao

    Abstract: As a critical clue of video super-resolution (VSR), inter-frame alignment significantly impacts overall performance. However, accurate pixel-level alignment is a challenging task due to the intricate motion interweaving in the video. In response to this issue, we introduce a novel paradigm for VSR named Semantic Lens, predicated on semantic priors drawn from degraded videos. Specifically, video is…

    Submitted 19 January, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024

  41. arXiv:2311.17215  [pdf, other]

    math.NT math.NA

    Applications of Moments of Dirichlet Coefficients in Elliptic Curve Families

    Authors: Zoë Batterman, Aditya Jambhale, Steven J. Miller, Akash L. Narayanan, Kishan Sharma, Andrew Yang, Chris Yao

    Abstract: The moments of the coefficients of elliptic curve L-functions are related to numerous arithmetic problems. Rosen and Silverman proved a conjecture of Nagao relating the first moment of one-parameter families satisfying Tate's conjecture to the rank of the corresponding elliptic surface over Q(T); one can also construct families of moderate rank by finding families with large first moments. Michel…

    Submitted 17 June, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    MSC Class: 11G05; 11G40

  42. arXiv:2311.14956  [pdf, other]

    physics.plasm-ph

    Anomalous hot electron generation from two-plasmon decay instability driven by broadband laser pulses with intensity modulations

    Authors: C. Yao, J. Li, L. Hao, R. Yan, C. Wang, A. Lei, Y-K. Ding, J. Zheng

    Abstract: We investigate the hot electrons generated from two-plasmon decay (TPD) instability driven by laser pulses with intensity modulated by a frequency $Δω_m$. Our primary focus lies on scenarios where $Δω_m$ is on the same order of the TPD growth rate $γ_0$ ($Δω_m \sim γ_0$), corresponding to moderate laser frequency bandwidths for TPD mitigation. With $Δω_m$ conveniently modeled by a basic two-colo…

    Submitted 25 November, 2023; originally announced November 2023.

  43. arXiv:2311.11482  [pdf, other]

    cs.AI cs.CL

    Meta Prompting for AI Systems

    Authors: Yifan Zhang, Yang Yuan, Andrew Chi-Chih Yao

    Abstract: In this work, we present a comprehensive study of Meta Prompting (MP), an innovative technique reshaping the utilization of language models (LMs) and AI systems in problem-solving and data interaction. Grounded in type theory and category theory, Meta Prompting emphasizes the structure and syntax of information over traditional content-centric methods. The paper explores the formal definitions of…

    Submitted 15 June, 2024; v1 submitted 19 November, 2023; originally announced November 2023.

  44. arXiv:2310.18090  [pdf, ps, other]

    eess.SP

    Probabilistic Constellation Shaping for OFDM-Based ISAC Signaling

    Authors: Zhen Du, Fan Liu, Yifeng Xiong, Tony Xiao Han, Weijie Yuan, Yuanhao Cui, Changhua Yao, Yonina C. Eldar

    Abstract: Integrated Sensing and Communications (ISAC) has garnered significant attention as a promising technology for the upcoming sixth-generation wireless communication systems (6G). In pursuit of this goal, a common strategy is that a unified waveform, such as Orthogonal Frequency Division Multiplexing (OFDM), should serve dual-functional roles by enabling simultaneous sensing and communications (S&C)…

    Submitted 27 October, 2023; originally announced October 2023.

  45. arXiv:2310.16070  [pdf, other]

    cs.LG

    Spatial-Temporal Hypergraph Neural Network for Traffic Forecasting

    Authors: Chengzhi Yao, Zhi Li, Junbo Wang

    Abstract: Traffic forecasting, which benefits from mobile Internet development and position technologies, plays a critical role in Intelligent Transportation Systems. It helps to implement rich and varied transportation applications and bring convenient transportation services to people based on collected traffic data. Most existing methods usually leverage graph-based deep learning networks to model the co…

    Submitted 24 October, 2023; originally announced October 2023.

  46. arXiv:2310.12430  [pdf, other]

    cs.CV cs.CL

    DocXChain: A Powerful Open-Source Toolchain for Document Parsing and Beyond

    Authors: Cong Yao

    Abstract: In this report, we introduce DocXChain, a powerful open-source toolchain for document parsing, which is designed and developed to automatically convert the rich information embodied in unstructured documents, such as text, tables and charts, into structured representations that are readable and manipulable by machines. Specifically, basic capabilities, including text detection, text recognition, t…

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 4 pages, 4 figures, 2 tables

  47. arXiv:2310.10362  [pdf, other]

    cs.LG cs.AI

    Self-Pro: A Self-Prompt and Tuning Framework for Graph Neural Networks

    Authors: Chenghua Gong, Xiang Li, Jianxiang Yu, Cheng Yao, Jiaqi Tan, Chengcheng Yu

    Abstract: Graphs have become an important modeling tool for web applications, and Graph Neural Networks (GNNs) have achieved great success in graph representation learning. However, the performance of traditional GNNs heavily relies on a large amount of supervision. Recently, "pre-train, fine-tune" has become the paradigm to address the issues of label dependency and poor generalization. However, the pre-…

    Submitted 4 June, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted at ECML-PKDD 2024

  48. arXiv:2310.08064  [pdf]

    cs.CV

    Age Estimation Based on Graph Convolutional Networks and Multi-head Attention Mechanisms

    Authors: Miaomiao Yang, Changwei Yao, Shijin Yan

    Abstract: Age estimation technology is a part of facial recognition and has been applied to identity authentication. This technology achieves the development and application of a juvenile anti-addiction system by authenticating users in the game. Convolutional Neural Network (CNN) and Transformer algorithms are widely used in this application scenario. However, these two models cannot flexibly extract and m…

    Submitted 12 October, 2023; originally announced October 2023.

  49. arXiv:2310.04975  [pdf, ps, other]

    cs.CR cs.DC

    A Trustworthy and Consistent Blockchain Oracle Scheme for Industrial Internet of Things

    Authors: Peng Liu, Youquan Xian, Chuanjian Yao, Peng Wang, Li-e Wang, Xianxian Li

    Abstract: Blockchain provides decentralization and trustlessness features for the Industrial Internet of Things (IIoT), which expands the application scenarios of IIoT. To address the problem that the blockchain cannot actively obtain off-chain data, the blockchain oracle is proposed as a bridge between the blockchain and external data. However, the existing oracle schemes are difficult to solve the problem…

    Submitted 7 October, 2023; originally announced October 2023.

    Comments: Rejected after the third round of review of IEEE Internet of Things Journal

  50. arXiv:2310.00890  [pdf]

    cond-mat.mtrl-sci

    Femtosecond electron diffraction reveals local disorder and local anharmonicity in thermoelectric SnSe

    Authors: Jingjun Li, Yingpeng Qi, Qing Yang, Luye Yue, Changyuan Yao, Zijing Chen, Sheng Meng, Dao Xiang, Jianming Cao

    Abstract: The microscopic arrangement of atoms and molecules is the determining factor in how materials behave and perform. Beyond the long-range periodicity, the local disorder with local structures deviating from the average lattice structure plays a vital role in determining the physical properties of the phonon, electron and spin subsystems in crystalline functional materials. Experimentally characteriz…

    Submitted 2 October, 2023; originally announced October 2023.

    Report number: 2313742

    Journal ref: Adv. Mater. 2313742 (2024)