Showing 1–50 of 1,417 results for author: Ye, J

  1. arXiv:2407.09120  [pdf, other]

    cs.LG cs.CL cs.CV

    URRL-IMVC: Unified and Robust Representation Learning for Incomplete Multi-View Clustering

    Authors: Ge Teng, Ting Mao, Chen Shen, Xiang Tian, Xuesong Liu, Yaowu Chen, Jieping Ye

    Abstract: Incomplete multi-view clustering (IMVC) aims to cluster multi-view data that are only partially available. This poses two main challenges: effectively leveraging multi-view information and mitigating the impact of missing views. Prevailing solutions employ cross-view contrastive learning and missing view recovery techniques. However, they either neglect valuable complementary information by focusi…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Accepted by ACM SIGKDD 2024

  2. arXiv:2407.08156  [pdf, other]

    cs.CV

    AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization

    Authors: Shixiong Xu, Chenghao Zhang, Lubin Fan, Gaofeng Meng, Shiming Xiang, Jieping Ye

    Abstract: In this study, we introduce a new problem raised by social media and photojournalism, named Image Address Localization (IAL), which aims to predict the readable textual address where an image was taken. Existing two-stage approaches involve predicting geographical coordinates and converting them into human-readable addresses, which can lead to ambiguity and be resource-intensive. In contrast, we p…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted at ECCV 2024

  3. arXiv:2407.06109  [pdf, other]

    cs.CV

    PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models

    Authors: Jinhua Zhang, Hualian Sheng, Sijia Cai, Bing Deng, Qiao Liang, Wen Li, Ying Fu, Jieping Ye, Shuhang Gu

    Abstract: Controllable generation is considered a potentially vital approach to address the challenge of annotating 3D data, and the precision of such controllable generation becomes particularly imperative in the context of data production for autonomous driving. Existing methods focus on the integration of diverse generative information into controlling inputs, utilizing frameworks such as GLIGEN or Contr…

    Submitted 8 July, 2024; originally announced July 2024.

  4. arXiv:2407.05578  [pdf, other]

    cs.CV

    FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance

    Authors: Jiedong Zhuang, Jiaqi Hu, Lianrui Mu, Rui Hu, Xiaoyu Liang, Jiangnan Ye, Haoji Hu

    Abstract: CLIP has achieved impressive zero-shot performance after pre-training on a large-scale dataset consisting of paired image-text data. Previous works have utilized CLIP by incorporating manually designed visual prompts like colored circles and blur masks into the images to guide the model's attention, showing enhanced zero-shot performance in downstream tasks. Although these methods have achieved pr…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV 2024

  5. arXiv:2407.05106  [pdf, other]

    cs.CV

    DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition

    Authors: Qi Wang, Zhou Xu, Yuming Lin, Jingtao Ye, Hongsheng Li, Guangming Zhu, Syed Afaq Ali Shah, Mohammed Bennamoun, Liang Zhang

    Abstract: Neuromorphic sensors, specifically event cameras, revolutionize visual data acquisition by capturing pixel intensity changes with exceptional dynamic range, minimal latency, and energy efficiency, setting them apart from conventional frame-based cameras. The distinctive capabilities of event cameras have ignited significant interest in the domain of event-based action recognition, recognizing thei…

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  6. arXiv:2407.04938  [pdf, other]

    cs.CV

    SAM-Med3D-MoE: Towards a Non-Forgetting Segment Anything Model via Mixture of Experts for 3D Medical Image Segmentation

    Authors: Guoan Wang, Jin Ye, Junlong Cheng, Tianbin Li, Zhaolin Chen, Jianfei Cai, Junjun He, Bohan Zhuang

    Abstract: Volumetric medical image segmentation is pivotal in enhancing disease diagnosis, treatment planning, and advancing medical research. While existing volumetric foundation models for medical image segmentation, such as SAM-Med3D and SegVol, have shown remarkable performance on general organs and tumors, their ability to segment certain categories in clinical downstream tasks remains limited. Supervi…

    Submitted 5 July, 2024; originally announced July 2024.

    Journal ref: MICCAI 2024

  7. arXiv:2407.04689  [pdf, other]

    cs.RO cs.CV

    RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation

    Authors: Yuxuan Kuang, Junjie Ye, Haoran Geng, Jiageng Mao, Congyue Deng, Leonidas Guibas, He Wang, Yue Wang

    Abstract: This work proposes a retrieve-and-transfer framework for zero-shot robotic manipulation, dubbed RAM, featuring generalizability across various objects, environments, and embodiments. Unlike existing approaches that learn manipulation from expensive in-domain demonstrations, RAM capitalizes on a retrieval-based affordance transfer paradigm to acquire versatile manipulation capabilities from abundan…

    Submitted 5 July, 2024; originally announced July 2024.

  8. arXiv:2407.03639  [pdf, other]

    astro-ph.SR

    Quasi-periodic oscillations of flare loops and slipping motion of ribbon substructures during a C-class flare

    Authors: Yining Zhang, Ting Li, Jing Ye

    Abstract: Quasi-periodic oscillations in solar flaring emission have been observed over the past few decades. To date, the underpinning processes resulting in the quasi-periodic oscillations remain unknown. In this paper, we report a unique event that exhibits both the long-duration quasi-periodic intensity oscillations of flare loops and the quasi-periodic slipping motion of ribbon substructures during a C…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 13 pages, 10 figures, accepted for publication in ApJ

  9. arXiv:2407.00942  [pdf, other]

    cs.IR cs.AI cs.CL

    ProductAgent: Benchmarking Conversational Product Search Agent with Asking Clarification Questions

    Authors: Jingheng Ye, Yong Jiang, Xiaobin Wang, Yinghui Li, Yangning Li, Hai-Tao Zheng, Pengjun Xie, Fei Huang

    Abstract: This paper introduces the task of product demand clarification within an e-commerce scenario, where the user commences the conversation with ambiguous queries and the task-oriented agent is designed to achieve more accurate and tailored product searching by asking clarification questions. To address this task, we propose ProductAgent, a conversational information seeking agent equipped with abil…

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 17 pages, 13 tables, 6 figures. Under review

  10. arXiv:2407.00934  [pdf, other]

    cs.CL

    CLEME2.0: Towards More Interpretable Evaluation by Disentangling Edits for Grammatical Error Correction

    Authors: Jingheng Ye, Zishan Xu, Yinghui Li, Xuxin Cheng, Linlin Song, Qingyu Zhou, Hai-Tao Zheng, Ying Shen, Xin Su

    Abstract: The paper focuses on improving the interpretability of Grammatical Error Correction (GEC) metrics, which has received little attention in previous studies. To bridge the gap, we propose CLEME2.0, a reference-based evaluation strategy that can describe four elementary dimensions of GEC systems, namely hit-correction, error-correction, under-correction, and over-correction. They collectively contribute…

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 16 pages, 8 tables, 2 figures. Under review

  11. arXiv:2407.00924  [pdf, other]

    cs.CL

    EXCGEC: A Benchmark of Edit-wise Explainable Chinese Grammatical Error Correction

    Authors: Jingheng Ye, Shang Qin, Yinghui Li, Xuxin Cheng, Libo Qin, Hai-Tao Zheng, Peng Xing, Zishan Xu, Guo Cheng, Zhao Wei

    Abstract: Existing studies explore the explainability of Grammatical Error Correction (GEC) in a limited scenario, where they ignore the interaction between corrections and explanations. To bridge the gap, this paper introduces the task of EXplainable GEC (EXGEC), which focuses on the integral role of both correction and explanation tasks. To facilitate the task, we propose EXCGEC, a tailored benchmark for…

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 22 pages, 10 tables, 9 figures. Under review

  12. arXiv:2406.18719  [pdf]

    physics.atom-ph nucl-ex physics.optics quant-ph

    Dawn of a nuclear clock: frequency ratio of the $^{229m}$Th isomeric transition and the $^{87}$Sr atomic clock

    Authors: Chuankun Zhang, Tian Ooi, Jacob S. Higgins, Jack F. Doyle, Lars von der Wense, Kjeld Beeks, Adrian Leitner, Georgy Kazakov, Peng Li, Peter G. Thirolf, Thorsten Schumm, Jun Ye

    Abstract: Optical atomic clocks$^{1,2}$ use electronic energy levels to precisely keep track of time. A clock based on nuclear energy levels promises a next-generation platform for precision metrology and fundamental physics studies. Thorium-229 nuclei exhibit a uniquely low energy nuclear transition within reach of state-of-the-art vacuum ultraviolet (VUV) laser light sources and have therefore been propos…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 22 pages, 5 figures, 1 extended data figure

  13. arXiv:2406.18118  [pdf, other]

    cs.CR cs.CL

    SafeAligner: Safety Alignment against Jailbreak Attacks via Response Disparity Guidance

    Authors: Caishuang Huang, Wanxu Zhao, Rui Zheng, Huijie Lv, Shihan Dou, Sixian Li, Xiao Wang, Enyu Zhou, Junjie Ye, Yuming Yang, Tao Gui, Qi Zhang, Xuanjing Huang

    Abstract: As the development of large language models (LLMs) rapidly advances, securing these models effectively without compromising their utility has become a pivotal area of research. However, current defense strategies against jailbreak attacks (i.e., efforts to bypass security protocols) often suffer from limited adaptability, restricted general capability, and high cost. To address these challenges, w…

    Submitted 28 June, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

  14. arXiv:2406.16119  [pdf, other]

    hep-ph hep-ex

    Strong Coupling from Inclusive Semi-leptonic Decay of Charmed Mesons

    Authors: Jinfei Wu, Xinchou Lou, Yanping Huang, Gang Li, Jingbo Ye, Yuzhi Che, Manqi Ruan

    Abstract: Employing the heavy quark expansion with the kinetic scheme, we study the strong interaction coupling $\alpha_s$ at an energy corresponding to the charm quark mass $m_c$ using the data on the semi-leptonic decays of charmed mesons from BESIII and CLEO-c. The values of $\alpha_s(m_c^2)$ from the $D^0$, $D^+$, and $D_s^+$ are extracted and found to be consistent. Combining the semileptonic decay width…

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 7 pages, 6 figures

  15. arXiv:2406.13999  [pdf, other]

    quant-ph

    Individually Addressed Entangling Gates in a Two-Dimensional Ion Crystal

    Authors: Y.-H. Hou, Y.-J. Yi, Y.-K. Wu, Y.-Y. Chen, L. Zhang, Y. Wang, Y.-L. Xu, C. Zhang, Q.-X. Mei, H.-X. Yang, J.-Y. Ma, S.-A. Guo, J. Ye, B.-X. Qi, Z.-C. Zhou, P.-Y. Hou, L.-M. Duan

    Abstract: Two-dimensional (2D) ion crystals have become a promising way to scale up qubit numbers for ion trap quantum information processing. However, to realize universal quantum computing in this system, individually addressed high-fidelity two-qubit entangling gates still remain challenging due to the inevitable micromotion of ions in a 2D crystal as well as the technical difficulty in 2D addressing. He…

    Submitted 20 June, 2024; originally announced June 2024.

  16. arXiv:2406.13907  [pdf, other]

    physics.atom-ph physics.chem-ph quant-ph

    Observation of full contrast icosahedral Bose-Einstein statistics in laser desorbed, buffer gas cooled C$_{60}$

    Authors: Ya-Chu Chan, Lee R. Liu, Andrew Scheck, David J. Nesbitt, Jun Ye, Dina Rosenberg

    Abstract: The quantum mechanical nature of spherical top molecules is particularly evident at low angular momentum quantum number J. Using infrared spectroscopy on the 8.4 $\mu$m rovibrational band of buffer gas cooled $^{12}$C$_{60}$, we observe the hitherto unseen R(J = 0 - 29) rotational progression, including the complete disappearance of certain transitions due to the molecule's perfect icosahedral symmet…

    Submitted 23 June, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

  17. arXiv:2406.13638  [pdf, other]

    physics.data-an astro-ph.IM hep-ex physics.ins-det

    XENONnT WIMP Search: Signal & Background Modeling and Statistical Inference

    Authors: XENON Collaboration, E. Aprile, J. Aalbers, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, D. Antón Martin, F. Arneodo, L. Baudis, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, K. Boese, A. Brown, G. Bruno, R. Budnik, J. M. R. Cardoso, A. P. Cimental Chávez, A. P. Colijn, J. Conrad, J. J. Cuenca-García, V. D'Andrea, et al. (139 additional authors not shown)

    Abstract: The XENONnT experiment searches for weakly-interacting massive particle (WIMP) dark matter scattering off a xenon nucleus. In particular, XENONnT uses a dual-phase time projection chamber with a 5.9-tonne liquid xenon target, detecting both scintillation and ionization signals to reconstruct the energy, position, and type of recoil. A blind search for nuclear recoil WIMPs with an exposure of 1.1 t…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 20 pages, 10 figures

  18. arXiv:2406.11553  [pdf, other]

    cs.SI

    The Susceptibility Paradox in Online Social Influence

    Authors: Luca Luceri, Jinyi Ye, Julie Jiang, Emilio Ferrara

    Abstract: Understanding susceptibility to online influence is crucial for mitigating the spread of misinformation and protecting vulnerable audiences. This paper investigates susceptibility to influence within social networks, focusing on the differential effects of influence-driven versus spontaneous behaviors on user content adoption. Our analysis reveals that influence-driven adoption exhibits high homop…

    Submitted 17 June, 2024; originally announced June 2024.

  19. arXiv:2406.11192  [pdf, other]

    cs.CL

    Beyond Boundaries: Learning a Universal Entity Taxonomy across Datasets and Languages for Open Named Entity Recognition

    Authors: Yuming Yang, Wantong Zhao, Caishuang Huang, Junjie Ye, Xiao Wang, Huiyuan Zheng, Yang Nan, Yuran Wang, Xueying Xu, Kaixin Huang, Yunke Zhang, Tao Gui, Qi Zhang, Xuanjing Huang

    Abstract: Open Named Entity Recognition (NER), which involves identifying arbitrary types of entities from arbitrary domains, remains challenging for Large Language Models (LLMs). Recent studies suggest that fine-tuning LLMs on extensive NER data can boost their performance. However, training directly on existing datasets faces issues due to inconsistent entity definitions and redundant data, limiting LLMs…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 20 pages. Project page: https://github.com/UmeanNever/B2NER

  20. Breaking the Memory Wall: A Study of I/O Patterns and GPU Memory Utilization for Hybrid CPU-GPU Offloaded Optimizers

    Authors: Avinash Maurya, Jie Ye, M. Mustafa Rafique, Franck Cappello, Bogdan Nicolae

    Abstract: Transformers and LLMs have seen rapid adoption in all domains. Their sizes have exploded to hundreds of billions of parameters and keep increasing. Under these circumstances, the training of transformers is slow and often takes on the order of weeks or months. Thanks to 3D model parallelism (data, pipeline, and tensor-level parallelism), the training can scale to a large number of GPUs, which redu…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: Accepted at FlexScience'24 Workshop on AI and Scientific Computing at Scale using Flexible Computing Infrastructures (co-located with HPDC'24)

  21. arXiv:2406.10475  [pdf, other]

    cs.CV

    Discrete Latent Perspective Learning for Segmentation and Detection

    Authors: Deyi Ji, Feng Zhao, Lanyun Zhu, Wenwei Jin, Hongtao Lu, Jieping Ye

    Abstract: In this paper, we address the challenge of Perspective-Invariant Learning in machine learning and computer vision, which involves enabling a network to understand images from varying perspectives to achieve consistent semantic interpretation. While standard approaches rely on the labor-intensive collection of multi-view images or limited data augmentation techniques, we propose a novel framework,…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: ICML 2024 Spotlight

  22. arXiv:2406.08829  [pdf, other]

    cs.CV cs.CR

    Improving Adversarial Robustness via Feature Pattern Consistency Constraint

    Authors: Jiacong Hu, Jingwen Ye, Zunlei Feng, Jiazhen Yang, Shunyu Liu, Xiaotian Yu, Lingxiang Jia, Mingli Song

    Abstract: Convolutional Neural Networks (CNNs) are well-known for their vulnerability to adversarial attacks, posing significant security concerns. In response to these threats, various defense methods have emerged to bolster the model's robustness. However, most existing methods either focus on learning from adversarial perturbations, leading to overfitting to the adversarial examples, or aim to eliminate…

    Submitted 13 June, 2024; originally announced June 2024.

  23. arXiv:2406.08152  [pdf, other]

    cs.CV

    CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer

    Authors: Hualian Sheng, Sijia Cai, Na Zhao, Bing Deng, Qiao Liang, Min-Jian Zhao, Jieping Ye

    Abstract: The field of 3D object detection from point clouds is rapidly advancing in computer vision, aiming to accurately and efficiently detect and localize objects in three-dimensional space. Current 3D detectors commonly fall short in terms of flexibility and scalability, with ample room for advancements in performance. In this paper, our objective is to address these limitations by introducing two fram…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 19 pages, 8 figures

  24. arXiv:2406.08070  [pdf, ps, other]

    cs.CV cs.AI cs.LG

    CFG++: Manifold-constrained Classifier Free Guidance for Diffusion Models

    Authors: Hyungjin Chung, Jeongsol Kim, Geon Yeong Park, Hyelin Nam, Jong Chul Ye

    Abstract: Classifier-free guidance (CFG) is a fundamental tool in modern diffusion models for text-guided generation. Although effective, CFG has notable drawbacks. For instance, DDIM with CFG lacks invertibility, complicating image editing; furthermore, high guidance scales, essential for high-quality outputs, frequently result in issues like mode collapse. Contrary to the widespread belief that these are…

    Submitted 12 June, 2024; originally announced June 2024.

  25. arXiv:2406.07756  [pdf, other]

    stat.ME

    The Exchangeability Assumption for Permutation Tests of Multiple Regression Models: Implications for Statistics and Data Science

    Authors: Johanna Hardin, Lauren Quesada, Julie Ye, Nicholas J. Horton

    Abstract: Permutation tests are a powerful and flexible approach to inference via resampling. As computational methods become more ubiquitous in the statistics curriculum, use of permutation tests has become more tractable. At the heart of the permutation approach is the exchangeability assumption, which determines the appropriate null sampling distribution. We explore the exchangeability assumption in the…

    Submitted 11 June, 2024; originally announced June 2024.

  26. arXiv:2406.07162  [pdf, other]

    cs.SD cs.AI cs.CL cs.MM eess.AS

    EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark

    Authors: Ziyang Ma, Mingjie Chen, Hezhao Zhang, Zhisheng Zheng, Wenxi Chen, Xiquan Li, Jiaxin Ye, Xie Chen, Thomas Hain

    Abstract: Speech emotion recognition (SER) is an important part of human-computer interaction, receiving extensive attention from both industry and academia. However, the current research field of SER has long suffered from the following problems: 1) There are few reasonable and universal splits of the datasets, making comparing different models and methods difficult. 2) No commonly used benchmark covers nu…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted by INTERSPEECH 2024. GitHub Repository: https://github.com/emo-box/EmoBox

  27. arXiv:2406.06807  [pdf]

    cond-mat.mtrl-sci

    Additive engineering for Sb$_2$S$_3$ indoor photovoltaics with efficiency exceeding 17%

    Authors: Xiao Chen, Xiaoxuan Shu, Jiangcheng Zhou, Lei Wan, Peng Xiao, Yuchen Fu, Junzhi Ye, Yi-Teng Huang, Bin Yan, Dingjiang Xue, Tao Chen, Jiejie Chen, Robert L. Z. Hoye, Ru Zhou

    Abstract: Indoor photovoltaics (IPVs) have attracted increasing attention for sustainably powering Internet of Things (IoT) electronics. Sb$_2$S$_3$ is a promising IPV candidate material with a bandgap of ~1.75 eV, which is near the optimal value for indoor energy harvesting. However, the performance of Sb$_2$S$_3$ solar cells is limited by nonradiative recombination, closely associated with the poor-qualit…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 28 pages, 6 figures

  28. arXiv:2406.06620  [pdf, other]

    cs.LG cs.AI cs.CL

    DualTime: A Dual-Adapter Multimodal Language Model for Time Series Representation

    Authors: Weiqi Zhang, Jiexia Ye, Ziyue Li, Jia Li, Fugee Tsung

    Abstract: The recent rapid development of language models (LMs) has attracted attention in the field of time series, including multimodal time series modeling. However, we note that current time series multimodal methods are biased, often assigning a primary role to one modality while the other assumes a secondary role. They overlook the mutual benefits and complementarity of different modalities. For example…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 15 pages, 12 figures, 5 tables

  29. arXiv:2406.06579  [pdf, other]

    cs.CL cs.AI cs.CV

    From Redundancy to Relevance: Enhancing Explainability in Multimodal Large Language Models

    Authors: Xiaofeng Zhang, Chen Shen, Xiaosong Yuan, Shaotian Yan, Liang Xie, Wenxiao Wang, Chaochen Gu, Hao Tang, Jieping Ye

    Abstract: Recently, multimodal large language models have exploded in variety. Most of the popular Large Vision Language Models (LVLMs) depend on sequential visual representation, where images are converted into hundreds or thousands of tokens before being input into the Large Language Model (LLM) along with language prompts. The black-box design hinders the interpretability of visual-language…

    Submitted 13 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  30. arXiv:2406.03804  [pdf, other]

    quant-ph cond-mat.quant-gas gr-qc physics.atom-ph

    Exploring the interplay between mass-energy equivalence, interactions and entanglement in an optical lattice clock

    Authors: Anjun Chu, Victor J. Martínez-Lahuerta, Maya Miklos, Kyungtae Kim, Peter Zoller, Klemens Hammerer, Jun Ye, Ana Maria Rey

    Abstract: We propose protocols that probe manifestations of the mass-energy equivalence in an optical lattice clock (OLC) interrogated with spin coherent and entangled quantum states. To tune and uniquely distinguish the mass-energy equivalence effects (gravitational redshift and second order Doppler shift) in such a setting, we devise a dressing protocol using an additional nuclear spin state. We then analyz…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 7+17 pages, 4+6 figures

  31. arXiv:2406.03609  [pdf]

    physics.optics physics.chem-ph quant-ph

    Modulated Ringdown Comb Interferometry for next-generation high complexity trace gas sensing

    Authors: Qizhong Liang, Apoorva Bisht, Andrew Scheck, Peter G. Schunemann, Jun Ye

    Abstract: Gas samples relevant to health and environment typically contain a plethora of molecular species that span a huge concentration dynamic range. High-concentration molecules impose a strong absorption background that hinders robust identification of low-concentration species. While mid-infrared frequency comb spectroscopy with high-finesse cavity enhancement has realized many of the most sensitive m…

    Submitted 5 June, 2024; originally announced June 2024.

  32. arXiv:2406.00880  [pdf, ps, other]

    math.NT math.AG math.LO

    Lang-Weil Type Estimates in Finite Difference Fields

    Authors: Martin Hils, Ehud Hrushovski, Jinhe Ye, Tingxiang Zou

    Abstract: We prove a uniform estimate of the number of points for difference algebraic varieties in finite difference fields in the spirit of Lang-Weil. More precisely, we give uniform lower and upper bounds for the number of rational points of a difference variety in terms of its transformal dimension. As a main technical ingredient, we prove an equidimensionality result for Frobenius reductions of differe…

    Submitted 2 June, 2024; originally announced June 2024.

    MSC Class: Primary 12H10; 11U09 Secondary 03C60; 03C20

  33. arXiv:2406.00779  [pdf, other]

    cs.LG

    Differentiation of Multi-objective Data-driven Decision Pipeline

    Authors: Peng Li, Lixia Wu, Chaoqun Feng, Haoyuan Hu, Lei Fu, Jieping Ye

    Abstract: Real-world scenarios frequently involve multi-objective data-driven optimization problems, characterized by unknown problem coefficients and multiple conflicting objectives. Traditional two-stage methods independently apply a machine learning model to estimate problem coefficients, followed by invoking a solver to tackle the predicted optimization problem. The independent use of optimization solve…

    Submitted 2 June, 2024; originally announced June 2024.

  34. arXiv:2406.00275  [pdf, other]

    cs.CV cs.LG

    StyDeSty: Min-Max Stylization and Destylization for Single Domain Generalization

    Authors: Songhua Liu, Xin Jin, Xingyi Yang, Jingwen Ye, Xinchao Wang

    Abstract: Single domain generalization (single DG) aims at learning a robust model generalizable to unseen domains from only one training domain, making it a highly ambitious and challenging task. State-of-the-art approaches have mostly relied on data augmentations, such as adversarial perturbation and style enhancement, to synthesize new data and thus increase robustness. Nevertheless, they have largely ov…

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024; Work in 2022 spring

  35. arXiv:2405.20984  [pdf, other]

    cs.LG

    Bayesian Design Principles for Offline-to-Online Reinforcement Learning

    Authors: Hao Hu, Yiqin Yang, Jianing Ye, Chengjie Wu, Ziqing Mai, Yujing Hu, Tangjie Lv, Changjie Fan, Qianchuan Zhao, Chongjie Zhang

    Abstract: Offline reinforcement learning (RL) is crucial for real-world applications where exploration can be costly or unsafe. However, offline learned policies are often suboptimal, and further online fine-tuning is required. In this paper, we tackle the fundamental dilemma of offline-to-online fine-tuning: if the agent remains pessimistic, it may fail to learn a better policy, while if it becomes optimis…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Forty-first International Conference on Machine Learning (ICML), 2024

  36. arXiv:2405.19609  [pdf, other]

    cs.CV cs.GR

    SMPLX-Lite: A Realistic and Drivable Avatar Benchmark with Rich Geometry and Texture Annotations

    Authors: Yujiao Jiang, Qingmin Liao, Zhaolong Wang, Xiangru Lin, Zongqing Lu, Yuxi Zhao, Hanqing Wei, Jingrui Ye, Yu Zhang, Zhijing Shao

    Abstract: Recovering photorealistic and drivable full-body avatars is crucial for numerous applications, including virtual reality, 3D games, and tele-presence. Most methods, whether reconstruction or generation, require large numbers of human motion sequences and corresponding textured meshes. To easily learn a drivable avatar, a reasonable parametric body model with unified topology is paramount. However,…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: ICME 2024; Project page: https://alex-jyj.github.io/SMPLX-Lite/

  37. arXiv:2405.17829  [pdf, other]

    cs.LG cs.AI

    LDMol: Text-Conditioned Molecule Diffusion Model Leveraging Chemically Informative Latent Space

    Authors: Jinho Chang, Jong Chul Ye

    Abstract: With the emergence of diffusion models as the frontline of generative models, many researchers have proposed molecule generation techniques using conditional diffusion models. However, due to the fundamental nature of a molecule, which carries highly entangled correlations within a small number of atoms and bonds, it becomes difficult for a model to connect raw data with the conditions when the co…

    Submitted 28 May, 2024; originally announced May 2024.

  38. arXiv:2405.17792  [pdf, other]

    hep-ex hep-ph

    JUNO Sensitivity to Invisible Decay Modes of Neutrons

    Authors: JUNO Collaboration, Angel Abusleme, Thomas Adam, Kai Adamowicz, Shakeel Ahmad, Rizwan Ahmed, Sebastiano Aiello, Fengpeng An, Qi An, Giuseppe Andronico, Nikolay Anfimov, Vito Antonelli, Tatiana Antoshkina, João Pedro Athayde Marcondes de André, Didier Auguste, Weidong Bai, Nikita Balashov, Wander Baldini, Andrea Barresi, Davide Basilico, Eric Baussan, Marco Bellato, Marco Beretta, Antonio Bergnoli, Daniel Bick, et al. (635 additional authors not shown)

    Abstract: We explore the decay of bound neutrons into invisible particles (e.g., $n \rightarrow 3\nu$ or $nn \rightarrow 2\nu$) in the JUNO liquid scintillator detector. The invisible decay includes two decay modes: $n \rightarrow {\rm inv}$ and $nn \rightarrow {\rm inv}$. The invisible decays of $s$-shell neutrons in $^{12}{\rm C}$ will leave a highly excited residual nucleus. Subsequently, some de-excitation mode…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 28 pages, 7 figures, 4 tables

  39. arXiv:2405.17720  [pdf, other]

    cs.CV cs.AI cs.LG

    MindFormer: A Transformer Architecture for Multi-Subject Brain Decoding via fMRI

    Authors: Inhwa Han, Jaayeon Lee, Jong Chul Ye

    Abstract: Research efforts to understand neural signals have been ongoing for many years, with visual decoding from fMRI signals attracting considerable attention. Particularly, the advent of image diffusion models has advanced the reconstruction of images from fMRI data significantly. However, existing approaches often introduce inter- and intra- subject variations in the reconstructed images, which can co…

    Submitted 27 May, 2024; originally announced May 2024.

  40. arXiv:2405.17473  [pdf, other]

    cs.LG cs.AI cs.SI

    Repeat-Aware Neighbor Sampling for Dynamic Graph Learning

    Authors: Tao Zou, Yuhao Mao, Junchen Ye, Bowen Du

    Abstract: Dynamic graph learning equips the edges with time attributes and allows multiple links between two nodes, which is a crucial technology for understanding evolving data scenarios like traffic prediction and recommendation systems. Existing works obtain the evolving patterns mainly depending on the most recent neighbor sequences. However, we argue that whether two nodes will have interaction with ea…

    Submitted 20 June, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: Accepted by KDD 2024, Research Track

  41. arXiv:2405.17031  [pdf, other]

    cs.LG

    Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning

    Authors: Haoxin Lin, Yu-Yan Xu, Yihao Sun, Zhilong Zhang, Yi-Chen Li, Chengxing Jia, Junyin Ye, Jiaji Zhang, Yang Yu

    Abstract: Model-based methods in reinforcement learning offer a promising approach to enhance data efficiency by facilitating policy exploration within a dynamics model. However, accurately predicting sequential steps in the dynamics model remains a challenge due to the bootstrapping prediction, which attributes the next state to the prediction of the current state. This leads to accumulated errors during m…

    Submitted 27 May, 2024; originally announced May 2024.

  42. arXiv:2405.16823  [pdf, other]

    cs.CV cs.AI

    Unified Editing of Panorama, 3D Scenes, and Videos Through Disentangled Self-Attention Injection

    Authors: Gihyun Kwon, Jangho Park, Jong Chul Ye

    Abstract: While text-to-image models have achieved impressive capabilities in image generation and editing, their application across various modalities often necessitates training separate models. Inspired by existing methods for single-image editing with self-attention injection and video editing with shared attention, we propose a novel unified editing framework that combines the strengths of both approache…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Project Page: https://unifyediting.github.io/

  43. arXiv:2405.16011  [pdf, ps, other]

    eess.SP

    Semantic Importance-Aware Communications with Semantic Correction Using Large Language Models

    Authors: Shuaishuai Guo, Yanhu Wang, Jia Ye, Anbang Zhang, Kun Xu

    Abstract: Semantic communications, a promising approach for agent-human and agent-agent interactions, typically operate at a feature level, lacking true semantic understanding. This paper explores understanding-level semantic communications (ULSC), transforming visual data into human-intelligible semantic content. We employ an image caption neural network (ICNN) to derive semantic representations from visua…

    Submitted 24 May, 2024; originally announced May 2024.

  44. arXiv:2405.15698  [pdf, other]

    cond-mat.mtrl-sci cond-mat.mes-hall

    First-principles studies of fermiology in topological phases of bulk ZrTe$_5$

    Authors: Chao Chen Ye, Yuliia Kreminska, Jianting Ye, Jagoda Sławińska

    Abstract: Topological insulators have been studied intensively over the last decades. Among these materials, three-dimensional (3D) zirconium pentatelluride (ZrTe$_5$) stands out as one of the most intriguing for both theoretical and experimental studies because of its diverse range of distinct topological phases. In this work, we employ density functional theory to study the electronic structure and quantu…

    Submitted 27 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

  45. arXiv:2405.14312  [pdf, other]

    cs.CV cs.CL cs.MM

    Improving Gloss-free Sign Language Translation by Reducing Representation Density

    Authors: Jinhui Ye, Xing Wang, Wenxiang Jiao, Junwei Liang, Hui Xiong

    Abstract: Gloss-free sign language translation (SLT) aims to develop well-performing SLT systems with no requirement for the costly gloss annotations, but currently still lags behind gloss-based approaches significantly. In this paper, we identify a representation density problem that could be a bottleneck in restricting the performance of gloss-free SLT. Specifically, the representation density problem des…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Representation Density and Performance Drop

  46. arXiv:2405.11778  [pdf, other]

    cs.LG cs.AI cs.MA

    Efficient Multi-agent Reinforcement Learning by Planning

    Authors: Qihan Liu, Jianing Ye, Xiaoteng Ma, Jun Yang, Bin Liang, Chongjie Zhang

    Abstract: Multi-agent reinforcement learning (MARL) algorithms have accomplished remarkable breakthroughs in solving large-scale decision-making tasks. Nonetheless, most existing MARL algorithms are model-free, limiting sample efficiency and hindering their applicability in more challenging scenarios. In contrast, model-based reinforcement learning (MBRL), particularly algorithms integrating planning, such…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: ICLR2024

  47. arXiv:2405.09883  [pdf, other]

    cs.CV

    RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception

    Authors: Xiaosu Zhu, Hualian Sheng, Sijia Cai, Bing Deng, Shaopeng Yang, Qiao Liang, Ken Chen, Lianli Gao, Jingkuan Song, Jieping Ye

    Abstract: We introduce RoScenes, the largest multi-view roadside perception dataset, which aims to shed light on the development of vision-centric Bird's Eye View (BEV) approaches for more challenging traffic scenes. The highlights of RoScenes include a significantly large perception area, full scene coverage, and crowded traffic. More specifically, our dataset achieves a surprising 21.13M 3D annotations within…

    Submitted 4 July, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: ECCV 2024. Extended version. 33 pages, 21 figures, 13 tables. https://github.com/xiaosu-zhu/RoScenes

  48. arXiv:2405.06326  [pdf, other]

    physics.ins-det nucl-ex

    A practical approach of measuring $^{238}$U and $^{232}$Th in liquid scintillator to sub-ppq level using ICP-MS

    Authors: Yuanxia Li, Jie Zhao, Yayun Ding, Tao Hu, Jiaxuan Ye, Jian Fang, Liangjian Wen

    Abstract: Liquid scintillator (LS) is commonly utilized in experiments seeking rare events due to its high light yield, transparency, and radiopurity. The concentration of $^{238}$U and $^{232}$Th in LS consistently remains below 1 ppq (10$^{-15}$ g/g), and the current screening result is based on a minimum 20-ton detector. Inductively coupled plasma mass (ICP-MS) spectroscopy is well-regarded for its high…

    Submitted 10 May, 2024; originally announced May 2024.

  49. arXiv:2405.05957  [pdf, other]

    cs.CL

    OpenBA-V2: Reaching 77.3% High Compression Ratio with Fast Multi-Stage Pruning

    Authors: Dan Qiao, Yi Su, Pinzheng Wang, Jing Ye, Wenjing Xie, Yuechi Zhou, Yuyang Ding, Zecheng Tang, Jikai Wang, Yixin Ji, Yue Wang, Pei Guo, Zechen Sun, Zikang Zhang, Juntao Li, Pingfu Chao, Wenliang Chen, Guohong Fu, Guodong Zhou, Qiaoming Zhu, Min Zhang

    Abstract: Large Language Models (LLMs) have played an important role in many fields due to their powerful capabilities. However, their massive number of parameters leads to high deployment requirements and incurs significant inference costs, which impedes their practical applications. Training smaller models is an effective way to address this problem. Therefore, we introduce OpenBA-V2, a 3.4B model derived…

    Submitted 9 May, 2024; originally announced May 2024.

  50. arXiv:2405.05554  [pdf, other]

    hep-ex

    RELICS: a REactor neutrino LIquid xenon Coherent elastic Scattering experiment

    Authors: Chang Cai, Guocai Chen, Jiangyu Chen, Rundong Fang, Fei Gao, Xiaoran Guo, Jiheng Guo, Tingyi He, Chengjie Jia, Gaojun Jin, Yipin Jing, Gaojun Ju, Yang Lei, Jiayi Li, Kaihang Li, Meng Li, Minhua Li, Shengchao Li, Siyin Li, Tao Li, Qing Lin, Jiajun Liu, Minghao Liu, Sheng Lv, Guang Luo, et al. (24 additional authors not shown)

    Abstract: Coherent elastic neutrino-nucleus scattering (CEvNS) provides a unique probe for neutrino properties Beyond the Standard Model (BSM) physics. REactor neutrino LIquid xenon Coherent Scattering experiment (RELICS), a proposed reactor neutrino program using liquid xenon time projection chamber (LXeTPC) technology, aims to investigate the CEvNS process of antineutrinos off xenon atomic nuclei. In this…

    Submitted 12 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.