Skip to main content

Showing 1–50 of 704 results for author: Lin, K

  1. arXiv:2407.06516  [pdf, other

    cs.CV

    VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving

    Authors: Yibo Liu, Zheyuan Yang, Guile Wu, Yuan Ren, Kejian Lin, Bingbing Liu, Yang Liu, Jinjun Shan

    Abstract: Generating 3D vehicle assets from in-the-wild observations is crucial to autonomous driving. Existing image-to-3D methods cannot well address this problem because they learn generation merely from image RGB information without a deeper understanding of in-the-wild vehicles (such as car models, manufacturers, etc.). This leads to their poor zero-shot prediction capability to handle real-world obser… ▽ More

    Submitted 10 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

  2. arXiv:2407.05285  [pdf, other

    cs.LG cs.AI cs.CR

    Gradient Diffusion: A Perturbation-Resilient Gradient Leakage Attack

    Authors: Xuan Liu, Siqi Cai, Qihua Zhou, Song Guo, Ruibin Li, Kaiwei Lin

    Abstract: Recent years have witnessed the vulnerability of Federated Learning (FL) against gradient leakage attacks, where the private training data can be recovered from the exchanged gradients, making gradient protection a critical issue for the FL training process. Existing solutions often resort to perturbation-based mechanisms, such as differential privacy, where each participating client injects a spe… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  3. arXiv:2406.16544  [pdf, other

    cs.CV

    Hierarchical B-frame Video Coding for Long Group of Pictures

    Authors: Ivan Kirillov, Denis Parkhomenko, Kirill Chernyshev, Alexander Pletnev, Yibo Shi, Kai Lin, Dmitry Babin

    Abstract: Learned video compression methods already outperform VVC in the low-delay (LD) case, but the random-access (RA) scenario remains challenging. Most works on learned RA video compression either use HEVC as an anchor or compare it to VVC in specific test conditions, using RGB-PSNR metric instead of Y-PSNR and avoiding comprehensive evaluation. Here, we present an end-to-end learned video codec for ra… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  4. arXiv:2406.14235  [pdf, other

    cs.CV cs.RO

    Mitigating the Human-Robot Domain Discrepancy in Visual Pre-training for Robotic Manipulation

    Authors: Jiaming Zhou, Teli Ma, Kun-Yu Lin, Ronghe Qiu, Zifan Wang, Junwei Liang

    Abstract: Learning generalizable visual dynamic representation across different embodied environments is crucial for real-world robotic manipulation. As the scale and diversity of robot demonstration data are limited, recent works have turned to large-scale pre-training using human data. However, the morphological differences between humans and robots introduce a significant human-robot domain discrepancy,… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  5. arXiv:2406.13719  [pdf, other

    cs.CV

    GUI Action Narrator: Where and When Did That Action Take Place?

    Authors: Qinchen Wu, Difei Gao, Kevin Qinghong Lin, Zhuoyu Wu, Xiangwu Guo, Peiran Li, Weichen Zhang, Hengxu Wang, Mike Zheng Shou

    Abstract: The advent of Multimodal LLMs has significantly enhanced image OCR recognition capabilities, making GUI automation a viable reality for increasing efficiency in digital tasks. One fundamental aspect of developing a GUI automation system is understanding primitive GUI actions. This comprehension is crucial as it enables agents to learn from user demonstrations, an essential element of automation. T… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  6. arXiv:2406.12466  [pdf, other

    gr-qc

    Rastall gravity: accretion disk image in radiation fields context and visual transformations compared to Reissner-Nordstrom black holes

    Authors: Yu-Xiang Huang, Sen Guo, Yu Liang, Yu-Hao Cui, Qing-Quan Jiang, Kai Lin

    Abstract: Our study investigates the astronomical implications of Rastall gravity, particularly its behavior amidst a radiation field compared to Reissner-Nordstrom (RN) black holes. Our research delineates a crucial correlation between the dynamics of the accretion disk and the parameters Q and N_{\rm r}, which aptly reflect the influence of spacetime metrics on the disk's appearance. Elevated electric cha… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  7. arXiv:2406.11937  [pdf, other

    physics.ins-det hep-ex physics.data-an

    Using graph neural networks to reconstruct charged pion showers in the CMS High Granularity Calorimeter

    Authors: M. Aamir, B. Acar, G. Adamov, T. Adams, C. Adloff, S. Afanasiev, C. Agrawal, C. Agrawal, A. Ahmad, H. A. Ahmed, S. Akbar, N. Akchurin, B. Akgul, B. Akgun, R. O. Akpinar, E. Aktas, A. AlKadhim, V. Alexakhin, J. Alimena, J. Alison, A. Alpana, W. Alshehri, P. Alvarez Dominguez, M. Alyari, C. Amendola , et al. (550 additional authors not shown)

    Abstract: A novel method to reconstruct the energy of hadronic showers in the CMS High Granularity Calorimeter (HGCAL) is presented. The HGCAL is a sampling calorimeter with very fine transverse and longitudinal granularity. The active media are silicon sensors and scintillator tiles readout by SiPMs and the absorbers are a combination of lead and Cu/CuW in the electromagnetic section, and steel in the hadr… ▽ More

    Submitted 30 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Prepared for submission to JINST

  8. arXiv:2406.11816  [pdf, other

    cs.CV

    VideoLLM-online: Online Video Large Language Model for Streaming Video

    Authors: Joya Chen, Zhaoyang Lv, Shiwei Wu, Kevin Qinghong Lin, Chenan Song, Difei Gao, Jia-Wei Liu, Ziteng Gao, Dongxing Mao, Mike Zheng Shou

    Abstract: Recent Large Language Models have been enhanced with vision capabilities, enabling them to comprehend images, videos, and interleaved vision-language content. However, the learning methods of these large multimodal models typically treat videos as predetermined clips, making them less effective and efficient at handling streaming video inputs. In this paper, we propose a novel Learning-In-Video-St… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: CVPR 2024. This arxiv version is upgraded with Llama-3

  9. arXiv:2406.11781  [pdf, other

    cs.IR

    DiffMM: Multi-Modal Diffusion Model for Recommendation

    Authors: Yangqin Jiang, Lianghao Xia, Wei Wei, Da Luo, Kangyi Lin, Chao Huang

    Abstract: The rise of online multi-modal sharing platforms like TikTok and YouTube has enabled personalized recommender systems to incorporate multiple modalities (such as visual, textual, and acoustic) into user representations. However, addressing the challenge of data sparsity in these systems remains a key issue. To address this limitation, recent research has introduced self-supervised learning techniq… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  10. arXiv:2406.10583  [pdf, other

    hep-ex

    Demonstration of neutron identification in neutrino interactions in the MicroBooNE liquid argon time projection chamber

    Authors: MicroBooNE collaboration, P. Abratenko, O. Alterkait, D. Andrade Aldana, L. Arellano, J. Asaadi, A. Ashkenazi, S. Balasubramanian, B. Baller, A. Barnard, G. Barr, D. Barrow, J. Barrow, V. Basque, J. Bateman, O. Benevides Rodrigues, S. Berkman, A. Bhanderi, A. Bhat, M. Bhattacharya, M. Bishai, A. Blake, B. Bogart, T. Bolton, J. Y. Book , et al. (165 additional authors not shown)

    Abstract: A significant challenge in measurements of neutrino oscillations is reconstructing the incoming neutrino energies. While modern fully-active tracking calorimeters such as liquid argon time projection chambers in principle allow the measurement of all final state particles above some detection threshold, undetected neutrons remain a considerable source of missing energy with little to no data const… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Report number: FERMILAB-PUB-24-0301

  11. arXiv:2406.10227  [pdf, other

    cs.CV cs.AI

    VideoGUI: A Benchmark for GUI Automation from Instructional Videos

    Authors: Kevin Qinghong Lin, Linjie Li, Difei Gao, Qinchen WU, Mingyi Yan, Zhengyuan Yang, Lijuan Wang, Mike Zheng Shou

    Abstract: Graphical User Interface (GUI) automation holds significant promise for enhancing human productivity by assisting with computer tasks. Existing task formulations primarily focus on simple tasks that can be specified by a single, language-only instruction, such as "Insert a new slide." In this work, we introduce VideoGUI, a novel multi-modal benchmark designed to evaluate GUI assistants on visual-c… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 24 pages, 16 tables, 17 figures

  12. arXiv:2406.10123  [pdf, other

    hep-ex physics.ins-det

    Improving neutrino energy estimation of charged-current interaction events with recurrent neural networks in MicroBooNE

    Authors: MicroBooNE collaboration, P. Abratenko, O. Alterkait, D. Andrade Aldana, L. Arellano, J. Asaadi, A. Ashkenazi, S. Balasubramanian, B. Baller, A. Barnard, G. Barr, D. Barrow, J. Barrow, V. Basque, J. Bateman, O. Benevides Rodrigues, S. Berkman, A. Bhanderi, A. Bhat, M. Bhattacharya, M. Bishai, A. Blake, B. Bogart, T. Bolton, J. Y. Book , et al. (164 additional authors not shown)

    Abstract: We present a deep learning-based method for estimating the neutrino energy of charged-current neutrino-argon interactions. We employ a recurrent neural network (RNN) architecture for neutrino energy estimation in the MicroBooNE experiment, utilizing liquid argon time projection chamber (LArTPC) detector technology. Traditional energy estimation approaches in LArTPCs, which largely rely on reconstr… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Report number: FERMILAB-PUB-24-0287

  13. arXiv:2406.09767  [pdf, other

    cs.RO

    Language-Guided Manipulation with Diffusion Policies and Constrained Inpainting

    Authors: Ce Hao, Kelvin Lin, Siyuan Luo, Harold Soh

    Abstract: Diffusion policies have demonstrated robust performance in generative modeling, prompting their application in robotic manipulation controlled via language descriptions. In this paper, we introduce a zero-shot, open-vocabulary diffusion policy method for robot manipulation. Using Vision-Language Models (VLMs), our method transforms linguistic task descriptions into actionable keyframes in 3D space… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  14. arXiv:2406.08407  [pdf, other

    cs.CV cs.AI cs.CL

    MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos

    Authors: Xuehai He, Weixi Feng, Kaizhi Zheng, Yujie Lu, Wanrong Zhu, Jiachen Li, Yue Fan, Jianfeng Wang, Linjie Li, Zhengyuan Yang, Kevin Lin, William Yang Wang, Lijuan Wang, Xin Eric Wang

    Abstract: Multimodal Language Language Models (MLLMs) demonstrate the emerging abilities of "world models" -- interpreting and reasoning about complex real-world dynamics. To assess these abilities, we posit videos are the ideal medium, as they encapsulate rich representations of real-world dynamics and causalities. To this end, we introduce MMWorld, a new benchmark for multi-discipline, multi-faceted multi… ▽ More

    Submitted 13 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  15. arXiv:2406.07540  [pdf, other

    cs.CV cs.LG

    Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance

    Authors: Kuan Heng Lin, Sicheng Mo, Ben Klingher, Fangzhou Mu, Bolei Zhou

    Abstract: Recent controllable generation approaches such as FreeControl and Diffusion Self-guidance bring fine-grained spatial and appearance control to text-to-image (T2I) diffusion models without training auxiliary modules. However, these methods optimize the latent embedding for each type of score function with longer diffusion steps, making the generation process time-consuming and limiting their flexib… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 18 pages, 11 figures, see project page at https://genforce.github.io/ctrl-x

  16. arXiv:2406.07514  [pdf, other

    physics.ins-det hep-ex

    Scintillation Light in SBND: Simulation, Reconstruction, and Expected Performance of the Photon Detection System

    Authors: SBND Collaboration, P. Abratenko, R. Acciarri, C. Adams, L. Aliaga-Soplin, O. Alterkait, R. Alvarez-Garrote, C. Andreopoulos, A. Antonakis, L. Arellano, J. Asaadi, W. Badgett, S. Balasubramanian, V. Basque, A. Beever, B. Behera, E. Belchior, M. Betancourt, A. Bhat, M. Bishai, A. Blake, B. Bogart, J. Bogenschuetz, D. Brailsford, A. Brandt , et al. (158 additional authors not shown)

    Abstract: SBND is the near detector of the Short-Baseline Neutrino program at Fermilab. Its location near to the Booster Neutrino Beam source and relatively large mass will allow the study of neutrino interactions on argon with unprecedented statistics. This paper describes the expected performance of the SBND photon detection system, using a simulated sample of beam neutrinos and cosmogenic particles. Its… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 21 pages, 17 figures

    Report number: FERMILAB-PUB-24-0303-PPD

  17. arXiv:2406.07122  [pdf, other

    quant-ph

    Compact Polarization-Entangled Photon Source Based on Coexisting Noncritically Birefringent and Quasi Phase Matching in a Nonlinear Crystal

    Authors: C. -Y. Yang, C. -Y. Wang, K. -H. Lin, T. -Y. Tsai, C. -C. Lin, C. Canalias, L. -B. Wang, A. Yabushita, C. -S. Chuu

    Abstract: Polarization-entangled photons are indispensable to numerous quantum technologies and fundamental studies. In this paper, we propose and demonstrate a novel source that generates collinear polarization-entangled photons by simultaneously achieving two distinct types of phase-matching conditions (noncritically birefringent and quasi phase matching) in a periodically poled nonlinear crystal with a l… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  18. arXiv:2406.06890  [pdf, other

    cs.CV

    Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation

    Authors: Yuanhao Zhai, Kevin Lin, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Chung-Ching Lin, David Doermann, Junsong Yuan, Lijuan Wang

    Abstract: Image diffusion distillation achieves high-fidelity generation with very few sampling steps. However, applying these techniques directly to video diffusion often results in unsatisfactory frame quality due to the limited visual quality in public video datasets. This affects the performance of both teacher and student video diffusion models. Our study aims to improve video diffusion distillation wh… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Project page: https://yhzhai.github.io/mcm/

  19. arXiv:2406.06472  [pdf, other

    astro-ph.IM

    Multi-Amplifier Sensing Charge-coupled Devices for Next Generation Spectroscopy

    Authors: Kenneth Lin, Armin Karcher, Julien Guy, Stephen E. Holland, William F. Kolbe, Peter Nugent, Alex Drlica-Wagner

    Abstract: We present characterization results and performance of a prototype Multiple-Amplifier Sensing (MAS) silicon charge-coupled device (CCD) sensor with 16 channels potentially suitable for faint object astronomical spectroscopy and low-signal, photon-limited imaging. The MAS CCD is designed to reach sub-electron readout noise by repeatedly measuring charge through a line of amplifiers during the seria… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 20 pages, 18 figures, submitted to PASP

  20. arXiv:2406.03298  [pdf, other

    cs.CV cs.RO

    L-PR: Exploiting LiDAR Fiducial Marker for Unordered Low Overlap Multiview Point Cloud Registration

    Authors: Yibo Liu, Jinjun Shan, Amaldev Haridevan, Shuo Zhang, Kejian Lin

    Abstract: Point cloud registration is a prerequisite for many applications in computer vision and robotics. Most existing methods focus on pairwise registration of two point clouds with high overlap. Although there have been some methods for low overlap cases, they struggle in degraded scenarios. This paper introduces a novel framework named L-PR, designed to register unordered low overlap multiview point c… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 8 pages

  21. arXiv:2406.03270  [pdf, other

    math.OC eess.SY

    A Successive Gap Constraint Linearization Method for Optimal Control Problems with Equilibrium Constraints

    Authors: Kangyu Lin, Toshiyuki Ohtsuka

    Abstract: In this study, we propose a novel gap-constraint-based reformulation for optimal control problems with equilibrium constraints (OCPECs). We show that the proposed reformulation generates a new constraint system equivalent to the original one but more concise and with favorable differentiability. Moreover, constraint regularity can be recovered by a relaxation strategy. We show that the gap constra… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Forthcoming, (Accepted to the 2024 IFAC Conference on Nonlinear Model Predictive Control (NMPC))

  22. Retrieval-Augmented Conversational Recommendation with Prompt-based Semi-Structured Natural Language State Tracking

    Authors: Sara Kemper, Justin Cui, Kai Dicarlantonio, Kathy Lin, Danjie Tang, Anton Korikov, Scott Sanner

    Abstract: Conversational recommendation (ConvRec) systems must understand rich and diverse natural language (NL) expressions of user preferences and intents, often communicated in an indirect manner (e.g., "I'm watching my weight"). Such complex utterances make retrieving relevant items challenging, especially if only using often incomplete or out-of-date metadata. Fortunately, many domains feature rich ite… ▽ More

    Submitted 25 May, 2024; originally announced June 2024.

  23. arXiv:2405.15784  [pdf, other

    cs.IR cs.AI cs.CL

    CLARINET: Augmenting Language Models to Ask Clarification Questions for Retrieval

    Authors: Yizhou Chi, Jessy Lin, Kevin Lin, Dan Klein

    Abstract: Users often make ambiguous requests that require clarification. We study the problem of asking clarification questions in an information retrieval setting, where systems often face ambiguous search queries and it is challenging to turn the uncertainty in the retrieval model into a natural language question. We present CLARINET, a system that asks informative clarification questions by choosing que… ▽ More

    Submitted 28 April, 2024; originally announced May 2024.

  24. arXiv:2405.13860  [pdf, other

    cs.CV

    MAGIC: Map-Guided Few-Shot Audio-Visual Acoustics Modeling

    Authors: Diwei Huang, Kunyang Lin, Peihao Chen, Qing Du, Mingkui Tan

    Abstract: Few-shot audio-visual acoustics modeling seeks to synthesize the room impulse response in arbitrary locations with few-shot observations. To sufficiently exploit the provided few-shot data for accurate acoustic modeling, we present a *map-guided* framework by constructing acoustic-related visual semantic feature maps of the scenes. Visual features preserve semantic details related to sound and map… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 17 pages, 12 pages for main paper, 5 pages for supplementary

  25. arXiv:2405.12808  [pdf, other

    gr-qc astro-ph.HE

    Influence of quantum correction on the Schwarzschild black hole polarized image

    Authors: Sen Guo, Yu-Xiang Huang, Kuan Liu, En-Wei Liang, Kai Lin

    Abstract: Using a model of an accretion disk around a Schwarzschild black hole, the analytic estimates for image polarization were derived by Narayan $et~al.$. [Astrophys. J, 102, 912 (2021)]. Recently, the EHT team also obtained polarization images of the Sgr A$^{*}$ and measured both linear and circular polarization [Astrophys. J. Lett, 964, L25 (2024)]. We find that quantum correction effects can also in… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 20 pages, 8 figures

    Report number: Accepted European Physical Journal C (EPJC) 2024

  26. arXiv:2405.10925  [pdf

    stat.ME cs.AI cs.LG

    High-dimensional multiple imputation (HDMI) for partially observed confounders including natural language processing-derived auxiliary covariates

    Authors: Janick Weberpals, Pamela A. Shaw, Kueiyu Joshua Lin, Richard Wyss, Joseph M Plasek, Li Zhou, Kerry Ngan, Thomas DeRamus, Sudha R. Raman, Bradley G. Hammill, Hana Lee, Sengwee Toh, John G. Connolly, Kimberly J. Dandreo, Fang Tian, Wei Liu, Jie Li, José J. Hernández-Muñoz, Sebastian Schneeweiss, Rishi J. Desai

    Abstract: Multiple imputation (MI) models can be improved by including auxiliary covariates (AC), but their performance in high-dimensional data is not well understood. We aimed to develop and compare high-dimensional MI (HDMI) approaches using structured and natural language processing (NLP)-derived AC in studies with partially observed confounders. We conducted a plasmode simulation study using data from… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  27. Beyond Static Calibration: The Impact of User Preference Dynamics on Calibrated Recommendation

    Authors: Kun Lin, Masoud Mansoury, Farzad Eskandanian, Milad Sabouri, Bamshad Mobasher

    Abstract: Calibration in recommender systems is an important performance criterion that ensures consistency between the distribution of user preference categories and that of recommendations generated by the system. Standard methods for mitigating miscalibration typically assume that user preference profiles are static, and they measure calibration relative to the full history of user's interactions, includ… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 8 pages, 4 figures, accepted as LBR paper at UMAP '24 -- ACM Conference on User Modeling, Adaptation and Personalization 2024

    MSC Class: 68-06 ACM Class: H.3.4

  28. arXiv:2405.07503  [pdf, other

    cs.RO cs.AI

    Consistency Policy: Accelerated Visuomotor Policies via Consistency Distillation

    Authors: Aaditya Prasad, Kevin Lin, Jimmy Wu, Linqi Zhou, Jeannette Bohg

    Abstract: Many robotic systems, such as mobile manipulators or quadrotors, cannot be equipped with high-end GPUs due to space, weight, and power constraints. These constraints prevent these systems from leveraging recent developments in visuomotor policy architectures that require high-end GPUs to achieve fast policy inference. In this paper, we propose Consistency Policy, a faster and similarly powerful al… ▽ More

    Submitted 28 June, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: https://consistency-policy.github.io/

  29. arXiv:2405.07303  [pdf, other

    hep-ex hep-ph physics.ins-det

    Search for solar axions by Primakoff effect with the full dataset of the CDEX-1B Experiment

    Authors: L. T. Yang, S. K. Liu, Q. Yue, K. J. Kang, Y. J. Li, H. P. An, Greeshma C., J. P. Chang, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, T. Guo, X. Y. Guo, L. He, J. R. He, J. W. Hu, H. X. Huang, T. C. Huang, L. Jiang, S. Karmakar , et al. (61 additional authors not shown)

    Abstract: We present the first limit on $g_{Aγ}$ coupling constant using the Bragg-Primakoff conversion based on an exposure of 1107.5 kg days of data from the CDEX-1B experiment at the China Jinping Underground Laboratory. The data are consistent with the null signal hypothesis, and no excess signals are observed. Limits of the coupling $g_{Aγ}<2.08\times10^{-9}$ GeV$^{-1}$ (95\% C.L.) are derived for axio… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 7 pages, 5 figures

  30. arXiv:2405.05962  [pdf, other

    cs.LG cs.CR cs.DC

    Age Aware Scheduling for Differentially-Private Federated Learning

    Authors: Kuan-Yu Lin, Hsuan-Yin Lin, Yu-Pin Hsu, Yu-Chih Huang

    Abstract: This paper explores differentially-private federated learning (FL) across time-varying databases, delving into a nuanced three-way tradeoff involving age, accuracy, and differential privacy (DP). Emphasizing the potential advantages of scheduling, we propose an optimization problem aimed at meeting DP requirements while minimizing the loss difference between the aggregated model and the model obta… ▽ More

    Submitted 5 July, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: Simulation parameters updated. Paper accepted for presentation at the 2024 IEEE International Symposium on Information Theory (ISIT 2024)

  31. arXiv:2405.03648  [pdf, ps, other

    math.AG

    Proof of the geometric Langlands conjecture II: Kac-Moody localization and the FLE

    Authors: D. Arinkin, D. Beraldo, J. Campbell, L. Chen, J. Faergeman, D. Gaitsgory, K. Lin, S. Raskin, N. Rozenblyum

    Abstract: This paper is the second in a series of five that together prove the geometric Langlands conjecture. Our goals are two-fold: (1) Formulate and prove the Fundamental Local Equivalence (FLE) at the critical level; (2) Study the interaction between Kac-Moody localization and the global geometric Langlands functor of ref. [GLC1]. This paper contains an extensive Appendix, whose primary goals are… ▽ More

    Submitted 23 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  32. arXiv:2405.02794  [pdf, other

    cs.RO

    Octopi: Object Property Reasoning with Large Tactile-Language Models

    Authors: Samson Yu, Kelvin Lin, Anxing Xiao, Jiafei Duan, Harold Soh

    Abstract: Physical reasoning is important for effective robot manipulation. Recent work has investigated both vision and language modalities for physical reasoning; vision can reveal information about objects in the environment and language serves as an abstraction and communication medium for additional context. Although these works have demonstrated success on a variety of physical reasoning tasks, they a… ▽ More

    Submitted 4 June, 2024; v1 submitted 4 May, 2024; originally announced May 2024.

    Comments: Accepted at Robotics: Science and Systems (R:SS 2024)

  33. arXiv:2404.19074  [pdf, other

    cond-mat.mes-hall quant-ph

    Chaos-Assisted Dynamical Tunneling in Flat Band Superwires

    Authors: Anton Marius Graf, Ke Lin, MyeongSeo Kim, Joonas Keski-Rahkonen, Alvar Daza, Eric Heller

    Abstract: Recent theoretical investigations have revealed unconventional transport mechanisms within high Brilliouin zones of two-dimensional superlattices. Electrons can navigate along channels we call superwires, gently guided without brute force confinement. Such dynamical confinement is caused by weak superlattice deflections, markedly different from the static or energetic confinement observed in tradi… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 12 pages, 6 Figures

  34. arXiv:2404.18716  [pdf, other

    cond-mat.mes-hall

    Electrically tunable layer-hybridized trions in doped WSe$_2$ bilayers

    Authors: Raul Perea-Causin, Samuel Brem, Fabian Buchner, Kenji Watanabe, Takashi Taniguchi, John M. Lupton, Kai-Qiang Lin, Ermin Malic

    Abstract: Doped van der Waals heterostructures host layer-hybridized trions, i.e. charged excitons with layer-delocalized constituents holding promise for highly controllable optoelectronics. Combining a microscopic theory with photoluminescence (PL) experiments, we demonstrate the electrical tunability of the trion energy landscape in naturally stacked WSe$_2$ bilayers. We show that an out-of-plane electri… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  35. arXiv:2404.17343  [pdf, other

    cs.CL cs.FL

    A Bionic Natural Language Parser Equivalent to a Pushdown Automaton

    Authors: Zhenghao Wei, Kehua Lin, Jianlin Feng

    Abstract: Assembly Calculus (AC), proposed by Papadimitriou et al., aims to reproduce advanced cognitive functions through simulating neural activities, with several applications based on AC having been developed, including a natural language parser proposed by Mitropolsky et al. However, this parser lacks the ability to handle Kleene closures, preventing it from parsing all regular languages and rendering… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: to be published in IJCNN 2024

  36. arXiv:2404.16375  [pdf, other

    cs.CV cs.AI cs.CL

    List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

    Authors: An Yan, Zhengyuan Yang, Junda Wu, Wanrong Zhu, Jianwei Yang, Linjie Li, Kevin Lin, Jianfeng Wang, Julian McAuley, Jianfeng Gao, Lijuan Wang

    Abstract: Set-of-Mark (SoM) Prompting unleashes the visual grounding capability of GPT-4V, by enabling the model to associate visual objects with tags inserted on the image. These tags, marked with alphanumerics, can be indexed via text tokens for easy reference. Despite the extraordinary performance from GPT-4V, we observe that other Multimodal Large Language Models (MLLMs) struggle to understand these vis… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: Preprint

  37. arXiv:2404.16254  [pdf, ps, other

    gr-qc hep-ph

    Standing wave in perturbed anti-de Sitter spacetimes with a naked singularity

    Authors: Kai Lin, Wei-Liang Qian

    Abstract: In the framework of black hole perturbation theory, this work investigates the standing wave solutions in Reissner-Nordtsröm (RN) anti-de Sitter (AdS) spacetimes with a naked singularity. These solutions can be viewed as a specific class of quasinormal modes exhibiting distinct characteristics. The imaginary parts of their frequencies are numerically vanishing, allowing them to persist over an ext… ▽ More

    Submitted 10 May, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: 15 pages and 8 figures

  38. arXiv:2404.15909  [pdf, other

    cs.CV

    Learning Long-form Video Prior via Generative Pre-Training

    Authors: Jinheng Xie, Jiajun Feng, Zhaoxu Tian, Kevin Qinghong Lin, Yawen Huang, Xi Xia, Nanxu Gong, Xu Zuo, Jiaqi Yang, Yefeng Zheng, Mike Zheng Shou

    Abstract: Concepts involved in long-form videos such as people, objects, and their interactions, can be viewed as following an implicit prior. They are notably complex and continue to pose challenges to be comprehensively learned. In recent years, generative pre-training (GPT) has exhibited versatile capacities in modeling any kind of text content even visual locations. Can this manner work for learning lon… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  39. arXiv:2404.15450  [pdf, other

    gr-qc

    A possible origin of the $α$-vacuum as the initial state of the Universe

    Authors: Pisin Chen, Kuan-Nan Lin, Wei-Chen Lin, Dong-han Yeom

    Abstract: We investigate the cosmological observables using the Euclidean path integral approach. Specifically, we study both the no-boundary compact instantons scenario and the Euclidean wormholes scenario that can induce the creation of two universes from nothing. It is known that perturbations associated with the no-boundary scenario can only be consistent with the Bunch-Davies vacuum. Here we demonstrat… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 25 pages, 8 figures

  40. arXiv:2404.15402  [pdf, other

    astro-ph.CO

    KiDS-SBI: Simulation-Based Inference Analysis of KiDS-1000 Cosmic Shear

    Authors: Maximilian von Wietersheim-Kramsta, Kiyam Lin, Nicolas Tessore, Benjamin Joachimi, Arthur Loureiro, Robert Reischke, Angus H. Wright

    Abstract: We present a simulation-based inference (SBI) cosmological analysis of cosmic shear two-point statistics from the fourth weak gravitational lensing data release of the ESO Kilo-Degree Survey (KiDS-1000). KiDS-SBI efficiently performs non-Limber projection of the matter power spectrum via Levin's method, and constructs log-normal random matter fields on the curved sky for arbitrary cosmologies, inc… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 44 pages, 30 figures. Submitted to Astronomy & Astrophysics

  41. arXiv:2404.14705  [pdf, other

    cs.CV

    Think-Program-reCtify: 3D Situated Reasoning with Large Language Models

    Authors: Qingrong He, Kejun Lin, Shizhe Chen, Anwen Hu, Qin Jin

    Abstract: This work addresses the 3D situated reasoning task which aims to answer questions given egocentric observations in a 3D environment. The task remains challenging as it requires comprehensive 3D perception and complex reasoning skills. End-to-end models trained on supervised data for 3D situated reasoning suffer from data scarcity and generalization ability. Inspired by the recent success of levera… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  42. arXiv:2404.10948  [pdf, other

    hep-ex

    First double-differential cross section measurement of neutral-current $π^0$ production in neutrino-argon scattering in the MicroBooNE detector

    Authors: MicroBooNE collaboration, P. Abratenko, O. Alterkait, D. Andrade Aldana, L. Arellano, J. Asaadi, A. Ashkenazi, S. Balasubramanian, B. Baller, A. Barnard, G. Barr, D. Barrow, J. Barrow, V. Basque, J. Bateman, O. Benevides Rodrigues, S. Berkman, A. Bhanderi, A. Bhat, M. Bhattacharya, M. Bishai, A. Blake, B. Bogart, T. Bolton, J. Y. Book , et al. (166 additional authors not shown)

    Abstract: We report the first double-differential cross section measurement of neutral-current neutral pion (NC$π^0$) production in neutrino-argon scattering, as well as single-differential measurements of the same channel in terms of final states with and without protons. The kinematic variables of interest for these measurements are the $π^0$ momentum and the $π^0$ scattering angle with respect to the neu… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Report number: FERMILAB-PUB-24-0125

  43. arXiv:2404.09949  [pdf, other

    hep-ex physics.ins-det

    Measurement of the differential cross section for neutral pion production in charged-current muon neutrino interactions on argon with the MicroBooNE detector

    Authors: MicroBooNE collaboration, P. Abratenko, O. Alterkait, D. Andrade Aldana, L. Arellano, J. Asaadi, A. Ashkenazi, S. Balasubramanian, B. Baller, G. Barr, D. Barrow, J. Barrow, V. Basque, O. Benevides Rodrigues, S. Berkman, A. Bhanderi, A. Bhat, M. Bhattacharya, M. Bishai, A. Blake, B. Bogart, T. Bolton, J. Y. Book, M. B. Brunetti, L. Camilleri , et al. (163 additional authors not shown)

    Abstract: We present a measurement of neutral pion production in charged-current interactions using data recorded with the MicroBooNE detector exposed to Fermilab's booster neutrino beam. The signal comprises one muon, one neutral pion, any number of nucleons, and no charged pions. Studying neutral pion production in the MicroBooNE detector provides an opportunity to better understand neutrino-argon interac… ▽ More

    Submitted 6 May, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Report number: FERMILAB-PUB-24-0142-CSAID-PPD

  44. arXiv:2404.09793  [pdf, other

    hep-ex hep-ph physics.ins-det

    First Search for Light Fermionic Dark Matter Absorption on Electrons Using Germanium Detector in CDEX-10 Experiment

    Authors: J. X. Liu, L. T. Yang, Q. Yue, K. J. Kang, Y. J. Li, H. P. An, Greeshma C., J. P. Chang, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, T. Guo, X. Y. Guo, L. He, J. R. He, J. W. Hu, H. X. Huang, T. C. Huang, L. Jiang, S. Karmakar , et al. (61 additional authors not shown)

    Abstract: We present the first results of the search for sub-MeV fermionic dark matter absorbed by electron targets of Germanium using the 205.4~kg$\cdot$day data collected by the CDEX-10 experiment, with the analysis threshold of 160~eVee. No significant dark matter (DM) signals over the background are observed. Results are presented as limits on the cross section of DM--electron interaction. We present ne… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 6 pages, 4 figures

  45. arXiv:2404.09446  [pdf, other

    gr-qc

    The final burst of the moving mirror is unrelated to the partner mode of analog Hawking radiation

    Authors: Yuki Osawa, Kuan-Nan Lin, Yasusada Nambu, Masahiro Hotta, Pisin Chen

    Abstract: Flying mirrors with appropriate trajectories have been recognized as an analog system that mimics black hole Hawking evaporation and have been widely investigated. It has recently been suggested that the partner mode of the analog Hawking radiation emitted from a moving mirror would manifest itself through a final burst when the mirror executes a sudden stop. Here we argue the opposite via the par… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 18 pages, 6 figures

  46. arXiv:2404.06780  [pdf, other

    cs.CV

    Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior

    Authors: Fan Lu, Kwan-Yee Lin, Yan Xu, Hongsheng Li, Guang Chen, Changjun Jiang

    Abstract: Text-to-3D generation has achieved remarkable success via large-scale text-to-image diffusion models. Nevertheless, there is no paradigm for scaling up the methodology to urban scale. Urban scenes, characterized by numerous elements, intricate arrangement relationships, and vast scale, present a formidable barrier to the interpretability of ambiguous textual descriptions for effective model optimi… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Project page: https://urbanarchitect.github.io/

  47. arXiv:2404.02927  [pdf

    cond-mat.mes-hall

    Probing the band splitting near the $Γ$ point in the van der Waals magnetic semiconductor CrSBr

    Authors: Kaiman Lin, Yi Li, Mahdi Ghorbani-Asl, Zdenek Sofer, Stephan Winnerl, Artur Erbe, Arkady V. Krasheninnikov, Manfred Helm, Shengqiang Zhou, Yaping Dan, Slawomir Prucnal

    Abstract: This study investigates the electronic band structure of Chromium Sulfur Bromide (CrSBr) through comprehensive photoluminescence (PL) characterization. We clearly identify low-temperature optical transitions between two closely adjacent conduction-band states and two different valence-band states. The analysis of the PL data robustly unveils energy splittings, bandgaps and excitonic transitions ac… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  48. arXiv:2404.01294  [pdf, other

    cs.CV

    CosmicMan: A Text-to-Image Foundation Model for Humans

    Authors: Shikai Li, Jianglin Fu, Kaiyuan Liu, Wentao Wang, Kwan-Yee Lin, Wayne Wu

    Abstract: We present CosmicMan, a text-to-image foundation model specialized for generating high-fidelity human images. Unlike current general-purpose foundation models that are stuck in the dilemma of inferior quality and text-image misalignment for humans, CosmicMan enables generating photo-realistic human images with meticulous appearance, reasonable structure, and precise text-image alignment with detai… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR 2024. The supplementary material is included. Project Page: https://cosmicman-cvpr2024.github.io

  49. arXiv:2404.00304  [pdf

    physics.optics physics.atom-ph

    Ultrafast Kapitza-Dirac effect

    Authors: Kang Lin, Sebastian Eckart, Hao Liang, Alexander Hartung, Sina Jacob, Qinying Ji, Lothar Ph. H. Schmidt, Markus S. Schöffler, Till Jahnke, Maksim Kunitski, Reinhard Dörner

    Abstract: Similar to the optical diffraction of light passing through a material grating, the Kapitza-Dirac effect occurs when an electron is diffracted by a standing light wave. In its original description the effect is time-independent. In the present work, we extend the Kapitza-Dirac concept to the time domain. By tracking the spatiotemporal evolution of a pulsed electron wave packet diffracted by a femt… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Journal ref: Science 2024

  50. arXiv:2403.20276  [pdf, other

    hep-ex hep-ph physics.ins-det

    Constraints on the Blazar-Boosted Dark Matter from the CDEX-10 Experiment

    Authors: R. Xu, L. T. Yang, Q. Yue, K. J. Kang, Y. J. Li, H. P. An, Greeshma C., J. P. Chang, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, T. Guo, X. Y. Guo, L. He, S. M. He, J. W. Hu, H. X. Huang, T. C. Huang, L. Jiang, S. Karmakar , et al. (59 additional authors not shown)

    Abstract: We report new constraints on light dark matter (DM) boosted by blazars using the 205.4 kg day data from the CDEX-10 experiment located at the China Jinping Underground Laboratory. Two representative blazars, TXS 0506+56 and BL Lacertae are studied. The results derived from TXS 0506+56 exclude DM-nucleon elastic scattering cross sections from $4.6\times 10^{-33}\ \rm cm^2$ to… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 7 pages, 4 figures