Showing 1–50 of 1,430 results for author: Cheng, H

  1. arXiv:2407.07545  [pdf]

    physics.optics

    Narrow Linewidth Laser Based on Extended Topological Interface States in One-Dimensional Photonic Crystals

    Authors: Xiao Sun, Zhibo Li, Yiming Sun, Yupei Wang, Jue Wang, Huihua Cheng, Cong Fu, John H. Marsh, Anthony E. Kelly, Lianping Hou

    Abstract: Recent advances in topological one-dimensional photonic crystal concepts have enabled the development of robust light-emitting devices by incorporating a topological interface state (TIS) at the cavity center. In this study, we theoretically and experimentally demonstrate a one-dimensional TIS-extended photonic crystal (1D-TISE-PC) structure. By integrating a linearly dispersive zero-index one-dim…

    Submitted 10 July, 2024; originally announced July 2024.

  2. Collectively induced transparency and absorption in waveguide QED with Bragg atom arrays

    Authors: Haolei Cheng, Wei Nie

    Abstract: Collective quantum states, such as subradiant and superradiant states, are useful for controlling optical responses in many-body quantum systems. In this work, we study novel collective quantum phenomena in waveguide-coupled Bragg atom arrays with inhomogeneous frequencies. For atoms without free-space dissipation, collectively induced transparency is produced by destructive quantum interference b…

    Submitted 5 July, 2024; originally announced July 2024.

    Journal ref: Commun. Theor. Phys. 76, 085101 (2024)

  3. arXiv:2407.03568  [pdf, other]

    cs.SI cs.IR

    When LLM Meets Hypergraph: A Sociological Analysis on Personality via Online Social Networks

    Authors: Zhiyao Shu, Xiangguo Sun, Hong Cheng

    Abstract: Individual personalities significantly influence our perceptions, decisions, and social interactions, which is particularly crucial for gaining insights into human behavior patterns in online social network analysis. Many psychological studies have observed that personalities are strongly reflected in their social behaviors and social environments. In light of these problems, this paper proposes a…

    Submitted 3 July, 2024; originally announced July 2024.

  4. arXiv:2407.03197  [pdf, other]

    cs.CV

    DyFADet: Dynamic Feature Aggregation for Temporal Action Detection

    Authors: Le Yang, Ziwei Zheng, Yizeng Han, Hao Cheng, Shiji Song, Gao Huang, Fan Li

    Abstract: Recently proposed neural network-based Temporal Action Detection (TAD) models are inherently limited to extracting the discriminative representations and modeling action instances with various lengths from complex scenes by shared-weights detection heads. Inspired by the successes in dynamic neural networks, in this paper, we build a novel dynamic feature aggregation (DFA) module that can simultaneo…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  5. arXiv:2407.00946  [pdf]

    cond-mat.mtrl-sci

    Atomic cluster expansion interatomic potential for defects and thermodynamics of Cu-W system

    Authors: Jiahao Pan, Huiqun Cheng, Gaosheng Yan, Lei Zhang, Wenshan Yu, Shengping Shen

    Abstract: The unique properties exhibited in immiscible metals, such as excellent strength, hardness, and radiation-damage tolerance, have stimulated the interest of many researchers. As a typical immiscible metal system, the Cu-W nano-multilayers combine the plasticity of copper and the strength of tungsten, making it a suitable candidate for applications in aerospace, nuclear fusion engineering, and elect…

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 26 pages, 14 figures

  6. arXiv:2406.18658  [pdf, ps, other]

    quant-ph cs.IT

    Sample Complexity of Locally Differentially Private Quantum Hypothesis Testing

    Authors: Hao-Chung Cheng, Christoph Hirche, Cambyse Rouzé

    Abstract: Quantum state discrimination is an important problem in many information processing tasks. In this work we are concerned with finding its best possible sample complexity when the states are preprocessed by a quantum channel that is required to be locally differentially private. To that end we provide achievability and converse bounds for different settings. This includes symmetric state discrimina…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 24 pages. Short version accepted at ISIT 2024. This work is independent and concurrent to "Contraction of Private Quantum Channels and Private Quantum Hypothesis Testing" by Theshani Nuradha and Mark M. Wilde

  7. arXiv:2406.14927  [pdf, other]

    cs.CV cs.RO

    Gaussian-Informed Continuum for Physical Property Identification and Simulation

    Authors: Junhao Cai, Yuji Yang, Weihao Yuan, Yisheng He, Zilong Dong, Liefeng Bo, Hui Cheng, Qifeng Chen

    Abstract: This paper studies the problem of estimating physical properties (system identification) through visual observations. To facilitate geometry-aware guidance in physical property estimation, we introduce a novel hybrid framework that leverages 3D Gaussian representation to not only capture explicit shapes but also enable the simulated continuum to deduce implicit shapes during training. We propose a…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 19 pages, 8 figures

  8. arXiv:2406.13159  [pdf, other]

    physics.optics physics.ins-det

    Ultrastable vacuum-gap Fabry-Pérot cavities operated in air

    Authors: Yifan Liu, Naijun Jin, Dahyeon Lee, Charles McLemore, Takuma Nakamura, Megan Kelleher, Haotian Cheng, Susan Schima, Nazanin Hoghooghi, Scott Diddams, Peter Rakich, Franklyn Quinlan

    Abstract: We demonstrate a vacuum-gap ultrastable optical reference cavity that does not require a vacuum enclosure. Our simple method of optical contact bonding in a vacuum environment allows for cavity operation in air while maintaining vacuum between the cavity mirrors. Vacuum is maintained long term, with no observed degradation in cavity stability for over 1 year after bonding. For a 1550 nm laser stab…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 10 pages, 6 figures

  9. arXiv:2406.11472  [pdf, other]

    cs.CV

    Learning from Exemplars for Interactive Image Segmentation

    Authors: Kun Li, Hao Cheng, George Vosselman, Michael Ying Yang

    Abstract: Interactive image segmentation enables users to interact minimally with a machine, facilitating the gradual refinement of the segmentation mask for a target of interest. Previous studies have demonstrated impressive performance in extracting a single target mask through interactive segmentation. However, the information cues of previously interacted objects have been overlooked in the existing met…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Under review

  10. arXiv:2406.10810  [pdf, other]

    cs.RO

    RGBlimp-Q: Robotic Gliding Blimp With Moving Mass Control Based on a Bird-Inspired Continuum Arm

    Authors: Hao Cheng, Feitian Zhang

    Abstract: Robotic blimps, as lighter-than-air aerial systems, offer prolonged duration and enhanced safety in human-robot interactions due to their buoyant lift. However, robust flight against environmental airflow disturbances remains a significant challenge, limiting the broader application of these robots. Drawing inspiration from the flight mechanics of birds and their ability to perch against natural w…

    Submitted 16 June, 2024; originally announced June 2024.

  11. arXiv:2406.05346  [pdf, other]

    cs.LG

    ProG: A Graph Prompt Learning Benchmark

    Authors: Chenyi Zi, Haihong Zhao, Xiangguo Sun, Yiqing Lin, Hong Cheng, Jia Li

    Abstract: Artificial general intelligence on graphs has shown significant advancements across various applications, yet the traditional 'Pre-train & Fine-tune' paradigm faces inefficiencies and negative transfer issues, particularly in complex and few-shot settings. Graph prompt learning emerges as a promising alternative, leveraging lightweight prompts to manipulate data and fill the task gap by reformulat…

    Submitted 19 June, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

  12. arXiv:2406.04520  [pdf, other]

    cs.CL cs.AI

    NATURAL PLAN: Benchmarking LLMs on Natural Language Planning

    Authors: Huaixiu Steven Zheng, Swaroop Mishra, Hugh Zhang, Xinyun Chen, Minmin Chen, Azade Nova, Le Hou, Heng-Tze Cheng, Quoc V. Le, Ed H. Chi, Denny Zhou

    Abstract: We introduce NATURAL PLAN, a realistic planning benchmark in natural language containing 3 key tasks: Trip Planning, Meeting Planning, and Calendar Scheduling. We focus our evaluation on the planning capabilities of LLMs with full information on the task, by providing outputs from tools such as Google Flights, Google Maps, and Google Calendar as contexts to the models. This eliminates the need for…

    Submitted 6 June, 2024; originally announced June 2024.

  13. arXiv:2406.03240  [pdf, other]

    cs.SD cs.AI eess.AS

    Generalized Source Tracing: Detecting Novel Audio Deepfake Algorithm with Real Emphasis and Fake Dispersion Strategy

    Authors: Yuankun Xie, Ruibo Fu, Zhengqi Wen, Zhiyong Wang, Xiaopeng Wang, Haonnan Cheng, Long Ye, Jianhua Tao

    Abstract: With the proliferation of deepfake audio, there is an urgent need to investigate their attribution. Current source tracing methods can effectively distinguish in-distribution (ID) categories. However, the rapid evolution of deepfake algorithms poses a critical challenge in the accurate identification of out-of-distribution (OOD) novel deepfake algorithms. In this paper, we propose Real Emphasis an…

    Submitted 8 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted by INTERSPEECH 2024

  14. arXiv:2406.03215  [pdf, other]

    cs.CV

    Searching Priors Makes Text-to-Video Synthesis Better

    Authors: Haoran Cheng, Liang Peng, Linxuan Xia, Yuepeng Hu, Hengjia Li, Qinglin Lu, Xiaofei He, Boxi Wu

    Abstract: Significant advancements in video diffusion models have brought substantial progress to the field of text-to-video (T2V) synthesis. However, existing T2V synthesis models struggle to accurately generate complex motion dynamics, leading to a reduction in video realism. One possible solution is to collect massive data and train the model on it, but this would be extremely expensive. To alleviate this…

    Submitted 5 June, 2024; originally announced June 2024.

  15. arXiv:2406.02983  [pdf, other]

    cs.RO cs.AI

    FREA: Feasibility-Guided Generation of Safety-Critical Scenarios with Reasonable Adversariality

    Authors: Keyu Chen, Yuheng Lei, Hao Cheng, Haoran Wu, Wenchao Sun, Sifa Zheng

    Abstract: Generating safety-critical scenarios, which are essential yet difficult to collect at scale, offers an effective method to evaluate the robustness of autonomous vehicles (AVs). Existing methods focus on optimizing adversariality while preserving the naturalness of scenarios, aiming to achieve a balance through data-driven approaches. However, without an appropriate upper bound for adversariality,…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 19 pages. Under review

  16. A large population of neutron star low-mass X-ray binaries with long outburst recurrence time ?

    Authors: E. Meyer-Hofmeister, Huaqing Cheng, B. F. Liu

    Abstract: Low-mass X-ray binaries (LMXBs) with neutron stars show quite different features which depend on the rate of mass transfer from the donor star. With a high transfer rate the Z sources are in a persistent soft spectral state, with a moderate rate the transient Atoll sources have outburst cycles like the black hole X-ray binaries. The observations document very long outburst recurrence times for qui…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 7 pages, 4 figures, published in MNRAS

    Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 531, Issue 1, pp.1578-1584, 2024

  17. arXiv:2406.02013  [pdf, other]

    cs.LG

    Mamba as Decision Maker: Exploring Multi-scale Sequence Modeling in Offline Reinforcement Learning

    Authors: Jiahang Cao, Qiang Zhang, Ziqing Wang, Jiaxu Wang, Hao Cheng, Yecheng Shao, Wen Zhao, Gang Han, Yijie Guo, Renjing Xu

    Abstract: Sequential modeling has demonstrated remarkable capabilities in offline reinforcement learning (RL), with Decision Transformer (DT) being one of the most notable representatives, achieving significant success. However, RL trajectories possess unique properties to be distinguished from the conventional sequence (e.g., text or audio): (1) local correlation, where the next states in RL are theoretica…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 16 pages, 5 figures

  18. arXiv:2406.01598  [pdf]

    cs.CV cs.DB cs.RO

    D2E-An Autonomous Decision-making Dataset involving Driver States and Human Evaluation

    Authors: Zehong Ke, Yanbo Jiang, Yuning Wang, Hao Cheng, Jinhao Li, Jianqiang Wang

    Abstract: With the advancement of deep learning technology, data-driven methods are increasingly used in the decision-making of autonomous driving, and the quality of datasets greatly influenced the model performance. Although current datasets have made significant progress in the collection of vehicle and environment data, emphasis on human-end data including the driver states and human evaluation is not s…

    Submitted 12 April, 2024; originally announced June 2024.

    Comments: Submit for ITSC 2024

  19. arXiv:2405.20606  [pdf, other]

    cs.CV cs.AI cs.LG cs.MM

    Vision-Language Meets the Skeleton: Progressively Distillation with Cross-Modal Knowledge for 3D Action Representation Learning

    Authors: Yang Chen, Tian He, Junfeng Fu, Ling Wang, Jingcai Guo, Hong Cheng

    Abstract: Supervised and self-supervised learning are two main training paradigms for skeleton-based human action recognition. However, the former one-hot classification requires labor-intensive predefined action categories annotations, while the latter involves skeleton transformations (e.g., cropping) in the pretext tasks that may impair the skeleton structure. To address these challenges, we introduce a…

    Submitted 30 May, 2024; originally announced May 2024.

  20. arXiv:2405.20090  [pdf, other]

    cs.CV

    Typography Leads Semantic Diversifying: Amplifying Adversarial Transferability across Multimodal Large Language Models

    Authors: Hao Cheng, Erjia Xiao, Jiahang Cao, Le Yang, Kaidi Xu, Jindong Gu, Renjing Xu

    Abstract: Following the advent of the Artificial Intelligence (AI) era of large models, Multimodal Large Language Models (MLLMs) with the ability to understand cross-modal interactions between vision and text have attracted wide attention. Adversarial examples with human-imperceptible perturbation are shown to possess a characteristic known as transferability, which means that a perturbation generated by on…

    Submitted 30 May, 2024; originally announced May 2024.

  21. arXiv:2405.19119  [pdf, other]

    cs.LG

    Can Graph Learning Improve Task Planning?

    Authors: Xixi Wu, Yifei Shen, Caihua Shan, Kaitao Song, Siwei Wang, Bohang Zhang, Jiarui Feng, Hong Cheng, Wei Chen, Yun Xiong, Dongsheng Li

    Abstract: Task planning is emerging as an important research topic alongside the development of large language models (LLMs). It aims to break down complex user requests into solvable sub-tasks, thereby fulfilling the original requests. In this context, the sub-tasks can be naturally viewed as a graph, where the nodes represent the sub-tasks, and the edges denote the dependencies among them. Consequently, t…

    Submitted 29 May, 2024; originally announced May 2024.

  22. arXiv:2405.17678  [pdf, other]

    cs.CV cs.AI

    TIMA: Text-Image Mutual Awareness for Balancing Zero-Shot Adversarial Robustness and Generalization Ability

    Authors: Fengji Ma, Li Liu, Hei Victor Cheng

    Abstract: This work addresses the challenge of achieving zero-shot adversarial robustness while preserving zero-shot generalization in large-scale foundation models, with a focus on the popular Contrastive Language-Image Pre-training (CLIP). Although foundation models were reported to have exceptional zero-shot generalization, they are highly vulnerable to adversarial perturbations. Existing methods achieve…

    Submitted 27 May, 2024; originally announced May 2024.

  23. arXiv:2405.16672  [pdf, other]

    stat.ML cs.LG stat.ME

    Transfer Learning Under High-Dimensional Graph Convolutional Regression Model for Node Classification

    Authors: Jiachen Chen, Danyang Huang, Liyuan Wang, Kathryn L. Lunetta, Debarghya Mukherjee, Huimin Cheng

    Abstract: Node classification is a fundamental task, but obtaining node classification labels can be challenging and expensive in many real-world scenarios. Transfer learning has emerged as a promising solution to address this challenge by leveraging knowledge from source domains to enhance learning in a target domain. Existing transfer learning methods for node classification primarily focus on integrating…

    Submitted 26 May, 2024; originally announced May 2024.

  24. arXiv:2405.13992  [pdf, other]

    math.OC cs.LG

    Learning Cut Generating Functions for Integer Programming

    Authors: Hongyu Cheng, Amitabh Basu

    Abstract: The branch-and-cut algorithm is the method of choice to solve large scale integer programming problems in practice. A key ingredient of branch-and-cut is the use of cutting planes which are derived constraints that reduce the search space for an optimal solution. Selecting effective cutting planes to produce small branch-and-cut trees is a critical challenge in the branch-and-cut algorithm. Recent…

    Submitted 22 May, 2024; originally announced May 2024.

  25. arXiv:2405.12538  [pdf, other]

    cs.CV

    Bridging the Intent Gap: Knowledge-Enhanced Visual Generation

    Authors: Yi Cheng, Ziwei Xu, Dongyun Lin, Harry Cheng, Yongkang Wong, Ying Sun, Joo Hwee Lim, Mohan Kankanhalli

    Abstract: For visual content generation, discrepancies between user intentions and the generated content have been a longstanding problem. This discrepancy arises from two main factors. First, user intentions are inherently complex, with subtle details not fully captured by input prompts. The absence of such details makes it challenging for generative models to accurately reflect the intended meaning, leadi…

    Submitted 21 May, 2024; originally announced May 2024.

  26. arXiv:2405.07148  [pdf, other]

    physics.flu-dyn cs.CE

    Investigate the efficiency of incompressible flow simulations on CPUs and GPUs with BSAMR

    Authors: Dewen Liu, Shuai He, Haoran Cheng, Yadong Zeng

    Abstract: Adaptive mesh refinement (AMR) is a classical technique about local refinement in space where needed, thus effectively reducing computational costs for HPC-based physics simulations. Although AMR has been used for many years, little reproducible research discusses the impact of software-based parameters on block-structured AMR (BSAMR) efficiency and how to choose them. This article primarily does…

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: 22 pages include reference, 9 figures

  27. arXiv:2405.06388  [pdf, other]

    math.NA

    Recovery of transversely-isotropic elastic material parameters in induction motor rotors

    Authors: Hanz Martin Cheng, Tapio Helin, Ville-Petteri Manninen, Timo Holopainen, Juha Jokinen, Samu Sorvari, Andreas Rupp

    Abstract: We propose numerical algorithms for recovering parameters in eigenvalue problems for linear elasticity of transversely isotropic materials. Specifically, the algorithms are used to recover the elastic constants of a rotor core. Numerical tests show that in the noiseless setup, two pairs of bending modes are sufficient for recovering one to four parameters accurately. To recover all five parameters…

    Submitted 10 May, 2024; originally announced May 2024.

    MSC Class: 65Z05; 65C20

  28. arXiv:2405.05363  [pdf, other]

    cs.CV cs.RO

    LOC-ZSON: Language-driven Object-Centric Zero-Shot Object Retrieval and Navigation

    Authors: Tianrui Guan, Yurou Yang, Harry Cheng, Muyuan Lin, Richard Kim, Rajasimman Madhivanan, Arnie Sen, Dinesh Manocha

    Abstract: In this paper, we present LOC-ZSON, a novel Language-driven Object-Centric image representation for object navigation task within complex scenes. We propose an object-centric image representation and corresponding losses for visual-language model (VLM) fine-tuning, which can handle complex object-level queries. In addition, we design a novel LLM-based augmentation and prompt templates for stabilit…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Accepted to ICRA 2024

  29. arXiv:2405.04880  [pdf, other]

    cs.SD cs.AI eess.AS

    The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio

    Authors: Yuankun Xie, Yi Lu, Ruibo Fu, Zhengqi Wen, Zhiyong Wang, Jianhua Tao, Xin Qi, Xiaopeng Wang, Yukun Liu, Haonan Cheng, Long Ye, Yi Sun

    Abstract: With the proliferation of Audio Language Model (ALM) based deepfake audio, there is an urgent need for generalized detection methods. ALM-based deepfake audio currently exhibits widespread, high deception, and type versatility, posing a significant challenge to current audio deepfake detection (ADD) models trained solely on vocoded data. To effectively detect ALM-based deepfake audio, we focus on…

    Submitted 15 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  30. arXiv:2405.03194  [pdf, other]

    cs.CV

    CityLLaVA: Efficient Fine-Tuning for VLMs in City Scenario

    Authors: Zhizhao Duan, Hao Cheng, Duo Xu, Xi Wu, Xiangxie Zhang, Xi Ye, Zhen Xie

    Abstract: In the vast and dynamic landscape of urban settings, Traffic Safety Description and Analysis plays a pivotal role in applications ranging from insurance inspection to accident prevention. This paper introduces CityLLaVA, a novel fine-tuning framework for Visual Language Models (VLMs) designed for urban scenarios. CityLLaVA enhances model comprehension and prediction accuracy through (1) employing…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted by AICITY2024 Workshop Track2 at CVPR2024

  31. arXiv:2405.00079  [pdf]

    physics.soc-ph

    A global evidence map of human well-being and biodiversity co-benefits and trade-offs of natural climate solutions

    Authors: Charlotte H. Chang, James T. Erbaugh, Paola Fajardo, Luci Lu, István Molnár, Dávid Papp, Brian E. Robinson, Kemen Austin, Susan Cook-Patton, Timm Kroeger, Lindsey Smart, Miguel Castro, Samantha H. Cheng, Peter W. Ellis, Rob I. McDonald, Teevrat Garg, Erin E. Poor, Preston Welker, Andrew R. Tilman, Stephen A. Wood, Yuta J. Masuda

    Abstract: Natural climate solutions (NCS) are critical for mitigating climate change through ecosystem-based carbon removal and emissions reductions. NCS implementation can also generate biodiversity and human well-being co-benefits and trade-offs ("NCS co-impacts"), but the volume of evidence on NCS co-impacts has grown rapidly across disciplines, is poorly understood, and remains to be systematically coll…

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: 28 pages, 5 figures

  32. arXiv:2404.18580  [pdf, other]

    cs.RO eess.SY

    Data-Driven Dynamics Modeling of Miniature Robotic Blimps Using Neural ODEs With Parameter Auto-Tuning

    Authors: Yongjian Zhu, Hao Cheng, Feitian Zhang

    Abstract: Miniature robotic blimps, as one type of lighter-than-air aerial vehicles, have attracted increasing attention in the science and engineering community for their enhanced safety, extended endurance, and quieter operation compared to quadrotors. Accurately modeling the dynamics of these robotic blimps poses a significant challenge due to the complex aerodynamics stemming from their large lifting bo…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 8 pages, 8 figures

  33. arXiv:2404.17317  [pdf, other]

    cs.NI eess.SY

    Colosseum: The Open RAN Digital Twin

    Authors: Michele Polese, Leonardo Bonati, Salvatore D'Oro, Pedram Johari, Davide Villa, Sakthivel Velumani, Rajeev Gangula, Maria Tsampazi, Clifton Paul Robinson, Gabriele Gemmi, Andrea Lacava, Stefano Maxenti, Hai Cheng, Tommaso Melodia

    Abstract: Recent years have witnessed the Open Radio Access Network (RAN) paradigm transforming the fundamental ways cellular systems are deployed, managed, and optimized. This shift is led by concepts such as openness, softwarization, programmability, interoperability, and intelligence of the network, all of which had never been applied to the cellular ecosystem before. The realization of the Open RAN visi…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 13 pages, 8 figures, 1 table, submitted to IEEE for publication

  34. arXiv:2404.17152  [pdf, other]

    cs.CV

    CSCO: Connectivity Search of Convolutional Operators

    Authors: Tunhou Zhang, Shiyu Li, Hsin-Pai Cheng, Feng Yan, Hai Li, Yiran Chen

    Abstract: Exploring dense connectivity of convolutional operators establishes critical "synapses" to communicate feature vectors from different levels and enriches the set of transformations on Computer Vision applications. Yet, even with heavy-machinery approaches such as Neural Architecture Search (NAS), discovering effective connectivity patterns requires tremendous efforts due to either constrained conn…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: To appear on Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops (2024)

  35. arXiv:2404.16425  [pdf, other]

    astro-ph.HE

    Soft X-ray prompt emission from a high-redshift gamma-ray burst EP240315a

    Authors: Y. Liu, H. Sun, D. Xu, D. S. Svinkin, J. Delaunay, N. R. Tanvir, H. Gao, C. Zhang, Y. Chen, X. -F. Wu, B. Zhang, W. Yuan, J. An, G. Bruni, D. D. Frederiks, G. Ghirlanda, J. -W. Hu, A. Li, C. -K. Li, J. -D. Li, D. B. Malesani, L. Piro, G. Raman, R. Ricci, E. Troja, et al. (170 additional authors not shown)

    Abstract: Long gamma-ray bursts (GRBs) are believed to originate from core collapse of massive stars. High-redshift GRBs can probe the star formation and reionization history of the early universe, but their detection remains rare. Here we report the detection of a GRB triggered in the 0.5--4 keV band by the Wide-field X-ray Telescope (WXT) on board the Einstein Probe (EP) mission, designated as EP240315a,…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 41 pages, 8 figures, 7 tables

  36. arXiv:2404.14890  [pdf, other]

    cs.CV

    DENOISER: Rethinking the Robustness for Open-Vocabulary Action Recognition

    Authors: Haozhe Cheng, Cheng Ju, Haicheng Wang, Jinxiang Liu, Mengting Chen, Qiang Hu, Xiaoyun Zhang, Yanfeng Wang

    Abstract: As one of the fundamental video tasks in computer vision, Open-Vocabulary Action Recognition (OVAR) recently gains increasing attention, with the development of vision-language pre-trainings. To enable generalization of arbitrary classes, existing methods treat class labels as text descriptions, then formulate OVAR as evaluating embedding similarity between visual samples and textual classes. Howe…

    Submitted 23 April, 2024; originally announced April 2024.

  37. arXiv:2404.14815  [pdf, other]

    cs.LG

    Time-aware Heterogeneous Graph Transformer with Adaptive Attention Merging for Health Event Prediction

    Authors: Shibo Li, Hengliang Cheng, Weihua Li

    Abstract: The widespread application of Electronic Health Records (EHR) data in the medical field has led to early successes in disease risk prediction using deep learning methods. These methods typically require extensive data for training due to their large parameter sets. However, existing works do not exploit the full potential of EHR data. A significant challenge arises from the infrequent occurrence o…

    Submitted 10 May, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: 16 pages, 9 figures, 4 tables

  38. arXiv:2404.03202  [pdf, other]

    cs.CV

    OmniGS: Omnidirectional Gaussian Splatting for Fast Radiance Field Reconstruction using Omnidirectional Images

    Authors: Longwei Li, Huajian Huang, Sai-Kit Yeung, Hui Cheng

    Abstract: Photorealistic reconstruction relying on 3D Gaussian Splatting has shown promising potential in robotics. However, the current 3D Gaussian Splatting system only supports radiance field reconstruction using undistorted perspective images. In this paper, we present OmniGS, a novel omnidirectional Gaussian splatting system, to take advantage of omnidirectional images for fast radiance field reconstru…

    Submitted 7 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: 7 pages, 4 figures

  39. arXiv:2404.01350  [pdf, other]

    hep-ph hep-ex

    Analysis of Hadronic Weak Decays of Charmed Baryons in the Topological Diagrammatic Approach

    Authors: Huiling Zhong, Fanrong Xu, Hai-Yang Cheng

    Abstract: We perform a global fit to the experimental data of two-body charmed baryon decays based on the topological diagrammatic approach (TDA) and take into account the phase shifts between $S$- and $P$-wave amplitudes as inspired by the recent BESIII measurement of the decay asymmetry in the decay $Λ_c^+\to Ξ^0K^+$. The TDA has the advantage that it is more intuitive, graphic and easier to implement mod…

    Submitted 20 May, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: 24 pages, 1 figure. Tables I, II, V and VI revised, accepted by PRD. arXiv admin note: text overlap with arXiv:2401.15926

  40. arXiv:2403.17868  [pdf, other]

    quant-ph cs.IT cs.LG math.ST

    An invitation to the sample complexity of quantum hypothesis testing

    Authors: Hao-Chung Cheng, Nilanjana Datta, Nana Liu, Theshani Nuradha, Robert Salzmann, Mark M. Wilde

    Abstract: Quantum hypothesis testing (QHT) has been traditionally studied from the information-theoretic perspective, wherein one is interested in the optimal decay rate of error probabilities as a function of the number of samples of an unknown state. In this paper, we study the sample complexity of QHT, wherein the goal is to determine the minimum number of samples needed to reach a desired error probabil…

    Submitted 16 May, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: v3: 58 pages, 1 figure, correction to Corollary 10; see independent and concurrent work of Pensia, Jog, Loh at arXiv:2403.16981

  41. arXiv:2403.17807  [pdf, other]

    cs.HC

    Towards Inclusive Video Commenting: Introducing Signmaku for the Deaf and Hard-of-Hearing

    Authors: Si Chen, Haocong Cheng, Jason Situ, Desirée Kirst, Suzy Su, Saumya Malhotra, Lawrence Angrave, Qi Wang, Yun Huang

    Abstract: Previous research underscored the potential of danmaku--a text-based commenting feature on videos--in engaging hearing audiences. Yet, for many Deaf and hard-of-hearing (DHH) individuals, American Sign Language (ASL) takes precedence over English. To improve inclusivity, we introduce "Signmaku," a new commenting mechanism that uses ASL, serving as a sign language counterpart to danmaku. Through a…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 14 pages, CHI 2024

    ACM Class: F.2.2; I.2.7

  42. arXiv:2403.14450  [pdf, ps, other]

    quant-ph cs.CR cs.IT

    Maximal $α$-Leakage for Quantum Privacy Mechanisms

    Authors: Bo-Yu Yang, Hsuan Yu, Hao-Chung Cheng

    Abstract: In this work, maximal $α$-leakage is introduced to quantify how much a quantum adversary can learn about any sensitive information of data upon observing its disturbed version via a quantum privacy mechanism. We first show that an adversary's maximal expected $α$-gain using optimal measurement is characterized by measured conditional Rényi entropy. This can be viewed as a parametric generalization…

    Submitted 21 March, 2024; originally announced March 2024.

  43. arXiv:2403.14338  [pdf, ps, other]

    quant-ph cs.IT math-ph

    Optimal Second-Order Rates for Quantum Information Decoupling

    Authors: Yu-Chen Shen, Li Gao, Hao-Chung Cheng

    Abstract: In this paper, we consider the standard quantum information decoupling, in which Alice aims to decouple her system from the environment by local operations and discarding some of her systems. To achieve an $\varepsilon$-decoupling with trace distance as the error criterion, we establish a near-optimal one-shot characterization for the largest dimension of the remainder system in terms of the condi…

    Submitted 21 March, 2024; originally announced March 2024.

  44. arXiv:2403.13584  [pdf, ps, other]

    quant-ph math-ph

    On Strong Converse Theorems for Quantum Hypothesis Testing and Channel Coding

    Authors: Hao-Chung Cheng, Li Gao

    Abstract: Strong converse theorems refer to the study of impossibility results in information theory. In particular, Mosonyi and Ogawa established a one-shot strong converse bound for quantum hypothesis testing [Comm. Math. Phys., 334(3), 2014], which serves as a primitive tool for establishing a variety of tight strong converse theorems in quantum information theory. In this short note, we demonstrate an a…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: one-shot strong converse bound by Mosonyi and Ogawa [arXiv:1309.3228], variational expression by Berta, Fawzi, and Tomamichel [arXiv:1512.02615]

  45. arXiv:2403.13112  [pdf, other]

    cs.CL

    Efficient Encoder-Decoder Transformer Decoding for Decomposable Tasks

    Authors: Bo-Ru Lu, Nikita Haduong, Chien-Yu Lin, Hao Cheng, Noah A. Smith, Mari Ostendorf

    Abstract: Transformer-based NLP models are powerful but have high computational costs that limit deployment. Finetuned encoder-decoder models are popular in specialized domains and can outperform larger, more generalized decoder-only models, such as GPT-4. We introduce a new configuration for encoder-decoder models that improves efficiency on structured output and decomposable tasks where multiple outputs ar…

    Submitted 23 May, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 14 pages, 4 figures. https://github.com/boru-roylu/encode-once-and-decode-in-parallel

  46. arXiv:2403.11238  [pdf, other]

    cs.DC cs.CR

    JUMBO: Fully Asynchronous BFT Consensus Made Truly Scalable

    Authors: Hao Cheng, Yuan Lu, Zhenliang Lu, Qiang Tang, Yuxuan Zhang, Zhenfeng Zhang

    Abstract: Recent progress in asynchronous Byzantine fault-tolerant (BFT) consensus, e.g., Dumbo-NG (CCS '22) and Tusk (EuroSys '22), shows promising performance through decoupling transaction dissemination and block agreement. However, when executed with a larger number $n$ of nodes, such as several hundred, these protocols suffer from significant degradation in performance. Their dominating scalability bottleneck…

    Submitted 17 March, 2024; originally announced March 2024.

  47. arXiv:2403.10628  [pdf, other]

    physics.optics

    A Terahertz Bandwidth Nonmagnetic Isolator

    Authors: Haotian Cheng, Yishu Zhou, Freek Ruesink, Margaret Pavlovich, Shai Gertler, Andrew L. Starbuck, Andrew J. Leenheer, Andrew T. Pomerene, Douglas C. Trotter, Christina Dallo, Matthew Boady, Katherine M. Musick, Michael Gehl, Ashok Kodigala, Matt Eichenfield, Anthony L. Lentine, Nils T. Otterstrom, Peter T. Rakich

    Abstract: Integrated photonics could bring transformative breakthroughs in computing, networking, imaging, sensing, and quantum information processing, enabled by increasingly sophisticated optical functionalities on a photonic chip. However, wideband optical isolators, which are essential for the robust operation of practically all optical systems, have been challenging to realize in integrated form due to…

    Submitted 15 March, 2024; originally announced March 2024.

  48. arXiv:2403.10064  [pdf, other]

    eess.IV cs.CV

    Progressive Divide-and-Conquer via Subsampling Decomposition for Accelerated MRI

    Authors: Chong Wang, Lanqing Guo, Yufei Wang, Hao Cheng, Yi Yu, Bihan Wen

    Abstract: Deep unfolding networks (DUN) have emerged as a popular iterative framework for accelerated magnetic resonance imaging (MRI) reconstruction. However, conventional DUN aims to reconstruct all the missing information within the entire null space in each iteration. Thus it could be challenging when dealing with highly ill-posed degradation, usually leading to unsatisfactory reconstruction. In this wo…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  49. arXiv:2403.09707  [pdf]

    q-bio.NC

    Understanding data analysis aspects of TMS-EEG in clinical study: a mini review and a case study with open dataset

    Authors: Hua Cheng

    Abstract: Concurrent transcranial magnetic stimulation with electroencephalography (TMS-EEG) is a powerful and challenging methodology for basic research and clinical applications. Effective TMS-EEG recording and analysis depend on several aspects of the experiment, including artifact management, data analysis and interpretation, and protocols. This mini review offers an extensive insight of TMS-EEG meth…

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 39 pages, 36 figures, TMS-EEG data analysis

  50. arXiv:2403.08857  [pdf, other]

    cs.CV

    DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation

    Authors: Minbin Huang, Yanxin Long, Xinchi Deng, Ruihang Chu, Jiangfeng Xiong, Xiaodan Liang, Hong Cheng, Qinglin Lu, Wei Liu

    Abstract: Text-to-image (T2I) generation models have significantly advanced in recent years. However, effective interaction with these models is challenging for average users due to the need for specialized prompt engineering knowledge and the inability to perform multi-turn image generation, hindering a dynamic and iterative creation process. Recent attempts have tried to equip Multi-modal Large Language M…

    Submitted 3 July, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: Project page: https://hunyuan-dialoggen.github.io/