
Showing 1–50 of 2,621 results for author: Lee, D

  1. arXiv:2407.09303

    cs.CV

    ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion

    Authors: Sungmin Woo, Wonjoon Lee, Woo Jin Kim, Dogyoon Lee, Sangyoun Lee

    Abstract: Self-supervised multi-frame monocular depth estimation relies on the geometric consistency between successive frames under the assumption of a static scene. However, the presence of moving objects in dynamic scenes introduces inevitable inconsistencies, causing misaligned multi-frame feature matching and misleading self-supervision during training. In this paper, we propose a novel framework calle…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV 2024. Project Page: https://sungmin-woo.github.io/prodepth/

  2. arXiv:2407.09105

    cs.LG cs.AI

    Enhancing Training Efficiency Using Packing with Flash Attention

    Authors: Achintya Kundu, Rhui Dih Lee, Laura Wynter, Raghu Kiran Ganti

    Abstract: Padding is often used in tuning LLM models by adding special tokens to shorter training examples to match the length of the longest sequence in each batch. While this ensures uniformity for batch processing, it introduces inefficiencies by including irrelevant padding tokens in the computation and wastes GPU resources. On the other hand, the Hugging Face SFT trainer offers the option to use packin…

    Submitted 12 July, 2024; originally announced July 2024.

  3. arXiv:2407.08901

    physics.comp-ph math.NA

    Accelerating Eigenvalue Computation for Nuclear Structure Calculations via Perturbative Corrections

    Authors: Dong Min Roh, Esmond Ng, Chao Yang, Dean Lee, Pieter Maris, James P. Vary

    Abstract: We present a new method for computing the lowest few eigenvalues and the corresponding eigenvectors of a nuclear many-body Hamiltonian represented in a truncated configuration interaction subspace, i.e., the no-core shell model (NCSM). The method uses the hierarchical structure of the NCSM Hamiltonian to partition the Hamiltonian as the sum of two matrices. The first matrix corresponds to the Hami…

    Submitted 11 July, 2024; originally announced July 2024.

  4. arXiv:2407.08586

    nucl-ex

    Centrality dependence of Lévy-stable two-pion Bose-Einstein correlations in $\sqrt{s_{_{NN}}}=200$ GeV Au$+$Au collisions

    Authors: PHENIX Collaboration, N. J. Abdulameer, U. Acharya, A. Adare, C. Aidala, N. N. Ajitanand, Y. Akiba, R. Akimoto, H. Al-Ta'ani, J. Alexander, A. Angerami, K. Aoki, N. Apadula, Y. Aramaki, H. Asano, E. C. Aschenauer, E. T. Atomssa, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, B. Bannier, K. N. Barish, B. Bassalleck, S. Bathe , et al. (377 additional authors not shown)

    Abstract: The PHENIX experiment measured the centrality dependence of two-pion Bose-Einstein correlation functions in $\sqrt{s_{_{NN}}}=200$~GeV Au$+$Au collisions at the Relativistic Heavy Ion Collider at Brookhaven National Laboratory. The data are well represented by Lévy-stable source distributions. The extracted source parameters are the correlation-strength parameter $λ$, the Lévy index of stability…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 401 authors from 75 institutions, 20 pages, 15 figures, 2 tables. v1 is version submitted to Physical Review C. HEPdata tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

  5. arXiv:2407.06613

    cs.CV

    Sparse-DeRF: Deblurred Neural Radiance Fields from Sparse View

    Authors: Dogyoon Lee, Donghyeong Kim, Jungho Lee, Minhyeok Lee, Seunghoon Lee, Sangyoun Lee

    Abstract: Recent studies construct deblurred neural radiance fields (DeRF) using dozens of blurry images, which are not practical scenarios if only a limited number of blurry images are available. This paper focuses on constructing DeRF from sparse-view for more pragmatic real-world scenarios. As observed in our experiments, establishing DeRF from sparse views proves to be a more challenging problem due to…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Project page: https://dogyoonlee.github.io/sparsederf/

  6. arXiv:2407.06333

    cs.LG cs.NE math.NA

    A third-order finite difference weighted essentially non-oscillatory scheme with shallow neural network

    Authors: Kwanghyuk Park, Xinjuan Chen, Dongjin Lee, Jiaxi Gu, Jae-Hun Jung

    Abstract: In this paper, we introduce the finite difference weighted essentially non-oscillatory (WENO) scheme based on the neural network for hyperbolic conservation laws. We employ the supervised learning and design two loss functions, one with the mean squared error and the other with the mean squared logarithmic error, where the WENO3-JS weights are computed as the labels. Each loss function consists of…

    Submitted 10 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

  7. arXiv:2407.05781

    cs.LG eess.SY

    Regret Analysis of Multi-task Representation Learning for Linear-Quadratic Adaptive Control

    Authors: Bruce D. Lee, Leonardo F. Toso, Thomas T. Zhang, James Anderson, Nikolai Matni

    Abstract: Representation learning is a powerful tool that enables learning over large multitudes of agents or domains by enforcing that all agents operate on a shared set of learned features. However, many robotics or controls applications that would benefit from collaboration operate in settings with changing environments and goals, whereas most guarantees for representation learning are stated for static…

    Submitted 8 July, 2024; originally announced July 2024.

  8. arXiv:2407.05618

    nucl-ex hep-ex

    Improved limit on neutrinoless double beta decay of $^{100}$Mo from AMoRE-I

    Authors: A. Agrawal, V. V. Alenkov, P. Aryal, J. Beyer, B. Bhandari, R. S. Boiko, K. Boonin, O. Buzanov, C. R. Byeon, N. Chanthima, M. K. Cheoun, J. S. Choe, Seonho Choi, S. Choudhury, J. S. Chung, F. A. Danevich, M. Djamal, D. Drung, C. Enss, A. Fleischmann, A. M. Gangapshev, L. Gastaldo, Y. M. Gavrilyuk, A. M. Gezhaev, O. Gileva , et al. (83 additional authors not shown)

    Abstract: AMoRE searches for the signature of neutrinoless double beta decay of $^{100}$Mo with a 100 kg sample of enriched $^{100}$Mo. Scintillating molybdate crystals coupled with a metallic magnetic calorimeter operate at milli-Kelvin temperatures to measure the energy of electrons emitted in the decay. As a demonstration of the full-scale AMoRE, we conducted AMoRE-I, a pre-experiment with 18 molybdate c…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 7 pages, 4 figures

  9. arXiv:2407.04443

    gr-qc

    Primordial perturbations in Type III hilltop inflation models

    Authors: Chia-Min Lin, Harish Dhananjay Nalla, Chen-Pin Yeh, Da-Shin Lee

    Abstract: We analytically compute the power spectrum of primordial curvature perturbations in Type III hilltop inflation models under the slow-roll approximation. The model parameters are constrained using current Cosmic Microwave Background (CMB) data. The curvature perturbations that exit the horizon at small scales show sufficiently large amplitudes to produce primordial black holes (PBHs). We then consi…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 13 pages, 6 figures. arXiv admin note: text overlap with arXiv:2311.13746

  10. arXiv:2407.03958

    cs.CL cs.CV

    Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge

    Authors: Young-Jun Lee, Dokyong Lee, Junyoung Youn, Kyeongjin Oh, Byungsoo Ko, Jonghwan Hyeon, Ho-Jin Choi

    Abstract: Humans share a wide variety of images related to their personal experiences within conversations via instant messaging tools. However, existing works focus on (1) image-sharing behavior in singular sessions, leading to limited long-term social interaction, and (2) a lack of personalized image-sharing behavior. In this work, we introduce Stark, a large-scale long-term multi-modal conversation datas…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Project website: https://stark-dataset.github.io

  11. arXiv:2407.03923

    cs.CV cs.AI

    CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion Blur Images

    Authors: Junghe Lee, Donghyeong Kim, Dogyoon Lee, Suhwan Cho, Sangyoun Lee

    Abstract: Neural radiance fields (NeRFs) have received significant attention due to their high-quality novel view rendering ability, prompting research to address various real-world cases. One critical challenge is the camera motion blur caused by camera movement during exposure time, which prevents accurate 3D scene reconstruction. In this study, we propose continuous rigid motion-aware gaussian splatting…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Project Page : https://jho-yonsei.github.io/CRiM-Gaussian/

  12. arXiv:2407.03684

    cs.RO

    ConPR: Ongoing Construction Site Dataset for Place Recognition

    Authors: Dongjae Lee, Minwoo Jung, Ayoung Kim

    Abstract: Place recognition, an essential challenge in computer vision and robotics, involves identifying previously visited locations. Despite algorithmic progress, challenges related to appearance change persist, with existing datasets often focusing on seasonal and weather variations but overlooking terrain changes. Understanding terrain alterations becomes critical for effective place recognition, given…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 3 pages, 2 figures, IROS 2023 Workshop on Closing the Loop on Localization: What Are We Localizing For, and How Does That Shape Everything We Should Do?

  13. arXiv:2407.03103

    cs.CL

    Cactus: Towards Psychological Counseling Conversations using Cognitive Behavioral Theory

    Authors: Suyeon Lee, Sunghwan Kim, Minju Kim, Dongjin Kang, Dongil Yang, Harim Kim, Minseok Kang, Dayi Jung, Min Hee Kim, Seungbeen Lee, Kyoung-Mee Chung, Youngjae Yu, Dongha Lee, Jinyoung Yeo

    Abstract: Recently, the demand for psychological counseling has significantly increased as more individuals express concerns about their mental health. This surge has accelerated efforts to improve the accessibility of counseling by using large language models (LLMs) as counselors. To ensure client privacy, training open-source LLMs faces a key challenge: the absence of realistic counseling datasets. To add…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Under Review

  14. arXiv:2406.19617

    cs.LG cs.IT math.OC

    Stochastic Zeroth-Order Optimization under Strongly Convexity and Lipschitz Hessian: Minimax Sample Complexity

    Authors: Qian Yu, Yining Wang, Baihe Huang, Qi Lei, Jason D. Lee

    Abstract: Optimization of convex functions under stochastic zeroth-order feedback has been a major and challenging question in online learning. In this work, we consider the problem of optimizing second-order smooth and strongly convex functions where the algorithm is only accessible to noisy evaluations of the objective function it queries. We provide the first tight characterization for the rate of the mi…

    Submitted 27 June, 2024; originally announced June 2024.

  15. arXiv:2406.18391

    eess.SP

    CmWave and Sub-THz: Key Radio Enablers and Complementary Spectrum for 6G

    Authors: Mayur V. Katwe, Aryan Kaushik, Keshav Singh, Marco Di Renzo, Shu Sun, Doohwan Lee, Ana G. Armada, Yonina C. Eldar, Octavia A. Dobre, Theodore S. Rappaport

    Abstract: Sixth-generation (6G) networks are poised to revolutionize communication by exploring alternative spectrum options, aiming to capitalize on strengths while mitigating limitations in current fifth-generation (5G) spectrum. This paper explores the potential opportunities and emerging trends for cmWave and sub-THz spectra as key radio enablers. This paper poses and answers three key questions regardi…

    Submitted 26 June, 2024; originally announced June 2024.

  16. arXiv:2406.18138

    cs.RO

    B-TMS: Bayesian Traversable Terrain Modeling and Segmentation Across 3D LiDAR Scans and Maps for Enhanced Off-Road Navigation

    Authors: Minho Oh, Gunhee Shin, Seoyeon Jang, Seungjae Lee, Dongkyu Lee, Wonho Song, Byeongho Yu, Hyungtae Lim, Jaeyoung Lee, Hyun Myung

    Abstract: Recognizing traversable terrain from 3D point cloud data is critical, as it directly impacts the performance of autonomous navigation in off-road environments. However, existing segmentation algorithms often struggle with challenges related to changes in data distribution, environmental specificity, and sensor variations. Moreover, when encountering sunken areas, their performance is frequently co…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted by IEEE IV'24 workshop on Off-road autonomy

  17. arXiv:2406.16896

    eess.SP cs.LG

    f-GAN: A frequency-domain-constrained generative adversarial network for PPG to ECG synthesis

    Authors: Nathan C. L. Kong, Dae Lee, Huyen Do, Dae Hoon Park, Cong Xu, Hongda Mao, Jonathan Chung

    Abstract: Electrocardiograms (ECGs) and photoplethysmograms (PPGs) are generally used to monitor an individual's cardiovascular health. In clinical settings, ECGs and fingertip PPGs are the main signals used for assessing cardiovascular health, but the equipment necessary for their collection precludes their use in daily monitoring. Although PPGs obtained from wrist-worn devices are susceptible to noise due…

    Submitted 15 May, 2024; originally announced June 2024.

  18. arXiv:2406.16857

    math.FA math.AT math.CT math.DG math.PR

    The Surface Signature and Rough Surfaces

    Authors: Darrick Lee

    Abstract: Parallel transport, or path development, provides a rich characterization of paths which preserves the underlying algebraic structure of concatenation. The path signature is universal among such maps: any (translation-invariant) parallel transport factors uniquely through the path signature. Furthermore, the path signature is a central object in the theory of rough paths, which provides an integra…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 78 pages

  19. arXiv:2406.16288

    cs.CL

    PlagBench: Exploring the Duality of Large Language Models in Plagiarism Generation and Detection

    Authors: Jooyoung Lee, Toshini Agrawal, Adaku Uchendu, Thai Le, Jinghui Chen, Dongwon Lee

    Abstract: Recent literature has highlighted potential risks to academic integrity associated with large language models (LLMs), as they can memorize parts of training instances and reproduce them in the generated texts without proper attribution. In addition, given their capabilities in generating high-quality texts, plagiarists can exploit LLMs to generate realistic paraphrases or summaries indistinguishab…

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 9 pages

  20. arXiv:2406.15077

    math.AP

    Quantitative pointwise estimates of the cooling process for inelastic Boltzmann equation

    Authors: Gayoung An, Jin Woo Jang, Donghyun Lee

    Abstract: In this paper, we study the homogeneous inelastic Boltzmann equation for hard spheres. We first prove that the solution $f(t,v)$ is pointwisely bounded from above by $C_{f_0}\langle t \rangle^3$ and establish that the cooling time is infinite $T_c = +\infty$ under the condition $f_0 \in L^1_2 \cap L^{\infty}_{s} $ for $s > 2 $. Away from the zero velocity, we further prove that…

    Submitted 23 June, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

    Comments: 29 pages, 4 figures

  21. arXiv:2406.14876

    cs.LG cs.AI

    Training Greedy Policy for Proposal Batch Selection in Expensive Multi-Objective Combinatorial Optimization

    Authors: Deokjae Lee, Hyun Oh Song, Kyunghyun Cho

    Abstract: Active learning is increasingly adopted for expensive multi-objective combinatorial optimization problems, but it involves a challenging subset selection problem, optimizing the batch acquisition score that quantifies the goodness of a batch for evaluation. Due to the excessively large search space of the subset selection problem, prior methods optimize the batch acquisition on the latent space, w…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: ICML 2024; Codes at https://github.com/snu-mllab/GreedyPolicyForMOCO

  22. arXiv:2406.14703

    cs.CL cs.AI

    Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics

    Authors: Seungbeen Lee, Seungwon Lim, Seungju Han, Giyeong Oh, Hyungjoo Chae, Jiwan Chung, Minju Kim, Beong-woo Kwak, Yeonsoo Lee, Dongha Lee, Jinyoung Yeo, Youngjae Yu

    Abstract: The idea of personality in descriptive psychology, traditionally defined through observable behavior, has now been extended to Large Language Models (LLMs) to better understand their behavior. This raises a question: do LLMs exhibit distinct and consistent personality traits, similar to humans? Existing self-assessment personality tests, while applicable, lack the necessary validity and reliabilit…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Preprint; Under review

  23. arXiv:2406.14538

    math.GT math.GR

    On the Burau representation of $B_3$ modulo $p$

    Authors: Donsung Lee

    Abstract: We present an algorithm that, given a prime $p$ as input, determines whether or not the Burau representation of the 3-strand braid group modulo $p$ is faithful. We also prove that the representation is indeed faithful when $p\le 13$. Additionally, we re-pose Salter's question on the Burau representation of $B_3$ over finite fields $\mathbb{F}_p$, and solve it for every $p$.

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 67 pages

    MSC Class: 20F36; 20H05; 20E05; 20E06; 14H52

  24. arXiv:2406.14091

    cs.CL

    Protecting Privacy Through Approximating Optimal Parameters for Sequence Unlearning in Language Models

    Authors: Dohyun Lee, Daniel Rim, Minseok Choi, Jaegul Choo

    Abstract: Although language models (LMs) demonstrate exceptional capabilities on various tasks, they are potentially vulnerable to extraction attacks, which represent a significant privacy risk. To mitigate the privacy concerns of LMs, machine unlearning has emerged as an important research area, which is utilized to induce the LM to selectively forget about some of its training data. While completely retra…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL2024 findings

  25. arXiv:2406.13633

    cs.LG math.OC

    Reinforcement Learning for Infinite-Horizon Average-Reward MDPs with Multinomial Logistic Function Approximation

    Authors: Jaehyun Park, Dabeen Lee

    Abstract: We study model-based reinforcement learning with non-linear function approximation where the transition function of the underlying Markov decision process (MDP) is given by a multinomial logistic (MNL) model. In this paper, we develop two algorithms for the infinite-horizon average reward setting. Our first algorithm \texttt{UCRL2-MNL} applies to the class of communicating MDPs and achieves an…

    Submitted 19 June, 2024; originally announced June 2024.

  26. arXiv:2406.13159

    physics.optics physics.ins-det

    Ultrastable vacuum-gap Fabry-Pérot cavities operated in air

    Authors: Yifan Liu, Naijun Jin, Dahyeon Lee, Charles McLemore, Takuma Nakamura, Megan Kelleher, Haotian Cheng, Susan Schima, Nazanin Hoghooghi, Scott Diddams, Peter Rakich, Franklyn Quinlan

    Abstract: We demonstrate a vacuum-gap ultrastable optical reference cavity that does not require a vacuum enclosure. Our simple method of optical contact bonding in a vacuum environment allows for cavity operation in air while maintaining vacuum between the cavity mirrors. Vacuum is maintained long term, with no observed degradation in cavity stability for over 1 year after bonding. For a 1550 nm laser stab…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 10 pages, 6 figures

  27. arXiv:2406.12665

    cs.CL cs.AI

    CollabStory: Multi-LLM Collaborative Story Generation and Authorship Analysis

    Authors: Saranya Venkatraman, Nafis Irtiza Tripto, Dongwon Lee

    Abstract: The rise of unifying frameworks that enable seamless interoperability of Large Language Models (LLMs) has made LLM-LLM collaboration for open-ended tasks a possibility. Despite this, there have not been efforts to explore such collaborative writing. We take the next step beyond human-LLM collaboration to explore this multi-LLM scenario by generating the first exclusively LLM-generated collaborativ…

    Submitted 18 June, 2024; originally announced June 2024.

  28. arXiv:2406.12329

    cs.CL

    SNAP: Unlearning Selective Knowledge in Large Language Models with Negative Instructions

    Authors: Minseok Choi, Daniel Rim, Dohyun Lee, Jaegul Choo

    Abstract: Instruction-following large language models (LLMs), such as ChatGPT, have become increasingly popular with the general audience, many of whom are incorporating them into their daily routines. However, these LLMs inadvertently disclose personal or copyrighted information, which calls for a machine unlearning method to remove selective knowledge. Previous attempts sought to forget the link between t…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 16 pages, 5 figures

  29. arXiv:2406.12269

    cs.CL

    Unveiling Implicit Table Knowledge with Question-Then-Pinpoint Reasoner for Insightful Table Summarization

    Authors: Kwangwook Seo, Jinyoung Yeo, Dongha Lee

    Abstract: Implicit knowledge hidden within the explicit table cells, such as data insights, is the key to generating a high-quality table summary. However, unveiling such implicit knowledge is a non-trivial task. Due to the complex nature of structured tables, it is challenging even for large language models (LLMs) to mine the implicit knowledge in an insightful and faithful manner. To address this challeng…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: work in progress

  30. arXiv:2406.12231

    cond-mat.mes-hall

    Moiré flat bands and antiferroelectric domains in lattice relaxed twisted bilayer hexagonal boron nitride under perpendicular electric fields

    Authors: Fengping Li, Dongkyu Lee, Nicolas Leconte, Srivani Javvaji, Jeil Jung

    Abstract: Local interlayer charge polarization of twisted bilayer hexagonal boron nitride (t2BN) is calculated and parametrized as a function of twist angle and perpendicular electric fields through tight-binding calculations on lattice relaxed geometries. Lattice relaxations tend to increase the bandwidth of the nearly flat bands, where widths smaller than 1 meV are expected for angle less than 1.08 degree…

    Submitted 17 June, 2024; originally announced June 2024.

  31. arXiv:2406.11886

    cs.LG cs.AI cs.CE q-fin.CP

    Financial Assets Dependency Prediction Utilizing Spatiotemporal Patterns

    Authors: Haoren Zhu, Pengfei Zhao, Wilfred Siu Hung NG, Dik Lun Lee

    Abstract: Financial assets exhibit complex dependency structures, which are crucial for investors to create diversified portfolios to mitigate risk in volatile financial markets. To explore the financial asset dependencies dynamics, we propose a novel approach that models the dependencies of assets as an Asset Dependency Matrix (ADM) and treats the ADM sequences as image sequences. This allows us to leverag…

    Submitted 13 June, 2024; originally announced June 2024.

  32. arXiv:2406.11767

    cs.RO

    Stein Variational Ergodic Search

    Authors: Darrick Lee, Cameron Lerch, Fabio Ramos, Ian Abraham

    Abstract: Exploration requires that robots reason about numerous ways to cover a space in response to dynamically changing conditions. However, in continuous domains there are potentially infinitely many options for robots to explore which can prove computationally challenging. How then should a robot efficiently optimize and choose exploration strategies to adopt? In this work, we explore this question thr…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 16 pages, 12 figures, accepted to Robotics: Science and Systems 2024

  33. arXiv:2406.11248

    eess.AS cs.AI cs.SD

    Performance Improvement of Language-Queried Audio Source Separation Based on Caption Augmentation From Large Language Models for DCASE Challenge 2024 Task 9

    Authors: Do Hyun Lee, Yoonah Song, Hong Kook Kim

    Abstract: We present a prompt-engineering-based text-augmentation approach applied to a language-queried audio source separation (LASS) task. To enhance the performance of LASS, the proposed approach utilizes large language models (LLMs) to generate multiple captions corresponding to each sentence of the training dataset. To this end, we first perform experiments to identify the most effective prompts for c…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: DCASE 2024 Challenge Task 9, 4 pages

  34. arXiv:2406.11106

    cs.CL cs.AI

    From Intentions to Techniques: A Comprehensive Taxonomy and Challenges in Text Watermarking for Large Language Models

    Authors: Harsh Nishant Lalai, Aashish Anantha Ramakrishnan, Raj Sanjay Shah, Dongwon Lee

    Abstract: With the rapid growth of Large Language Models (LLMs), safeguarding textual content against unauthorized use is crucial. Text watermarking offers a vital solution, protecting both - LLM-generated and plain text sources. This paper presents a unified overview of different perspectives behind designing watermarking techniques, through a comprehensive survey of the research literature. Our work has t…

    Submitted 16 June, 2024; originally announced June 2024.

  35. arXiv:2406.10996

    cs.CL

    THEANINE: Revisiting Memory Management in Long-term Conversations with Timeline-augmented Response Generation

    Authors: Seo Hyun Kim, Kai Tzu-iunn Ong, Taeyoon Kwon, Namyoung Kim, Keummin Ka, SeongHyeon Bae, Yohan Jo, Seung-won Hwang, Dongha Lee, Jinyoung Yeo

    Abstract: Large language models (LLMs) are capable of processing lengthy dialogue histories during prolonged interaction with users without additional memory modules; however, their responses tend to overlook or incorrectly recall information from the past. In this paper, we revisit memory-augmented response generation in the era of LLMs. While prior work focuses on getting rid of outdated memories, we argu…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Under Review

  36. arXiv:2406.10547

    astro-ph.EP astro-ph.GA astro-ph.IM astro-ph.SR

    Four microlensing giant planets detected through signals produced by minor-image perturbations

    Authors: Cheongho Han, Ian A. Bond, Chung-Uk Lee, Andrew Gould, Michael D. Albrow, Sun-Ju Chung, Kyu-Ha Hwang, Youn Kil Jung, Yoon-Hyun Ryu, Yossi Shvartzvald, In-Gu Shin, Jennifer C. Yee, Hongjing Yang, Weicheng Zang, Sang-Mok Cha, Doeon Kim, Dong-Jin Kim, Seung-Lee Kim, Dong-Joo Lee, Yongseok Lee, Byeong-Gon Park, Richard W. Pogge, Fumio Abe, Ken Bando, Richard Barry , et al. (41 additional authors not shown)

    Abstract: We investigated the nature of the anomalies appearing in four microlensing events KMT-2020-BLG-0757, KMT-2022-BLG-0732, KMT-2022-BLG-1787, and KMT-2022-BLG-1852. The light curves of these events commonly exhibit initial bumps followed by subsequent troughs that extend across a substantial portion of the light curves. We performed thorough modeling of the anomalies to elucidate their characteristic…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 10 pages, 12 figures, 7 tables

  37. arXiv:2406.09946

    cs.LG eess.SY

    Finite-Time Analysis of Simultaneous Double Q-learning

    Authors: Hyunjun Na, Donghwan Lee

    Abstract: $Q$-learning is one of the most fundamental reinforcement learning (RL) algorithms. Despite its widespread success in various applications, it is prone to overestimation bias in the $Q$-learning update. To address this issue, double $Q$-learning employs two independent $Q$-estimators which are randomly selected and updated during the learning process. This paper proposes a modified double $Q…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 25 pages, 3 figures

  38. arXiv:2406.09698

    physics.ins-det hep-ex

    Projected background and sensitivity of AMoRE-II

    Authors: A. Agrawal, V. V. Alenkov, P. Aryal, J. Beyer, B. Bhandari, R. S. Boiko, K. Boonin, O. Buzanov, C. R. Byeon, N. Chanthima, M. K. Cheoun, J. S. Choe, Seonho Choi, S. Choudhury, J. S. Chung, F. A. Danevich, M. Djamal, D. Drung, C. Enss, A. Fleischmann, A. M. Gangapshev, L. Gastaldo, Y. M. Gavrilyuk, A. M. Gezhaev, O. Gileva , et al. (81 additional authors not shown)

    Abstract: AMoRE-II aims to search for neutrinoless double beta decay with an array of 423 Li$_2$$^{100}$MoO$_4$ crystals operating in the cryogenic system as the main phase of the Advanced Molybdenum-based Rare process Experiment (AMoRE). AMoRE has been planned to operate in three phases: AMoRE-pilot, AMoRE-I, and AMoRE-II. AMoRE-II is currently being installed at the Yemi Underground Laboratory, located ap…

    Submitted 13 June, 2024; originally announced June 2024.

  39. arXiv:2406.08702

    cs.AI cs.CL cs.CV

    VLind-Bench: Measuring Language Priors in Large Vision-Language Models

    Authors: Kang-il Lee, Minbeom Kim, Seunghyun Yoon, Minsung Kim, Dongryeol Lee, Hyukhun Koh, Kyomin Jung

    Abstract: Large Vision-Language Models (LVLMs) have demonstrated outstanding performance across various multimodal tasks. However, they suffer from a problem known as language prior, where responses are generated based solely on textual patterns while disregarding image information. Addressing the issue of language prior is crucial, as it can lead to undesirable biases or hallucinations when dealing with im…

    Submitted 10 July, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  40. arXiv:2406.08466

    cs.LG cs.AI math.ST stat.ML

    Scaling Laws in Linear Regression: Compute, Parameters, and Data

    Authors: Licong Lin, Jingfeng Wu, Sham M. Kakade, Peter L. Bartlett, Jason D. Lee

    Abstract: Empirically, large-scale deep learning models often satisfy a neural scaling law: the test error of the trained model improves polynomially as the model size and data size grow. However, conventional wisdom suggests the test error consists of approximation, bias, and variance errors, where the variance error increases with model size. This disagrees with the general form of neural scaling laws, wh…

    Submitted 12 June, 2024; originally announced June 2024.

  41. arXiv:2406.08301

    nucl-ex

    Jet modification via $π^0$-hadron correlations in Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV

    Authors: PHENIX Collaboration, N. J. Abdulameer, U. Acharya, A. Adare, S. Afanasiev, C. Aidala, N. N. Ajitanand, Y. Akiba, H. Al-Bataineh, J. Alexander, M. Alfred, K. Aoki, N. Apadula, L. Aphecetche, J. Asai, H. Asano, E. T. Atomssa, R. Averbeck, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, G. Baksay, L. Baksay, A. Baldisseri, et al. (510 additional authors not shown)

    Abstract: High-momentum two-particle correlations are a useful tool for studying jet-quenching effects in the quark-gluon plasma. Angular correlations between neutral-pion triggers and charged hadrons with transverse momenta in the range 4--12~GeV/$c$ and 0.5--7~GeV/$c$, respectively, have been measured by the PHENIX experiment in 2014 for Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV. Suppression is obs…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 534 authors from 83 institutions, 12 pages, 7 figures. v1 is version submitted to Physical Review C. HEPdata tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

  42. arXiv:2406.08140  [pdf]

    q-bio.NC

    Functional voxel hierarchy and afferent capacity revealed mental state transition on dynamic correlation resting-state fMRI

    Authors: Dong Soo Lee, Hyun Joo Kim, Youngmin Huh, Yeon Koo Kang, Wonseok Whi, Hyekyoung Lee, Hyejin Kang

    Abstract: Voxel hierarchy on dynamic brain graphs is produced by k core percolation on functional dynamic amplitude correlation of resting-state fMRI. Directed graphs and their afferent/efferent capacities are produced by Markov modeling of the universal cover of undirected graphs simultaneously with the calculation of volume entropy. Positive and unsigned negative brain graphs were analyzed separately on s…

    Submitted 12 June, 2024; originally announced June 2024.

  43. Optimal Qubit Mapping Search for Encoding Classical Data into Matrix Product State Representation with Minimal Loss

    Authors: Hyeongjun Jeon, Kyungmin Lee, Dongkyu Lee, Bongsang Kim, Taehyun Kim

    Abstract: Matrix product state (MPS) offers a framework for encoding classical data into quantum states, enabling the efficient utilization of quantum resources for data representation and processing. This research paper investigates techniques to enhance the efficiency and accuracy of MPS representations specifically designed for encoding classical data. Based on the observations that MPS truncation error…

    Submitted 12 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: 11 pages, 15 figures. The current version represents the initial submission and differs significantly from the final published version. Please check the official publication in Physics Letters A for the most up-to-date and comprehensive content.

  44. arXiv:2406.06897  [pdf, ps, other]

    q-bio.PE math.DS

    Bifurcations and multistability in empirical mutualistic networks

    Authors: Andrus Giraldo, Deok-Sun Lee

    Abstract: Individual species may experience diverse outcomes, from prosperity to extinction, in an ecological community subject to external and internal variations. Despite the wealth of theoretical results derived from random matrix ensembles, a theoretical framework still remains to be developed to understand species-level dynamical heterogeneity within a given community, hampering real-world ecosystems'…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 16 Pages, 8 Figures

  45. arXiv:2406.06893  [pdf, other]

    stat.ML cs.IT cs.LG

    Transformers Provably Learn Sparse Token Selection While Fully-Connected Nets Cannot

    Authors: Zixuan Wang, Stanley Wei, Daniel Hsu, Jason D. Lee

    Abstract: The transformer architecture has prevailed in various deep learning settings due to its exceptional capabilities to select and compose structural information. Motivated by these capabilities, Sanford et al. proposed the sparse token selection task, in which transformers excel while fully-connected networks (FCNs) fail in the worst case. Building upon that, we strengthen the FCN lower bound to an a…

    Submitted 10 June, 2024; originally announced June 2024.

  46. arXiv:2406.06648  [pdf, other]

    cs.CL cs.AI cs.LG

    SignBLEU: Automatic Evaluation of Multi-channel Sign Language Translation

    Authors: Jung-Ho Kim, Mathew Huerta-Enochian, Changyong Ko, Du Hui Lee

    Abstract: Sign languages are multi-channel languages that communicate information through not just the hands (manual signals) but also facial expressions and upper body movements (non-manual signals). However, since automatic sign language translation is usually performed by generating a single sequence of glosses, researchers eschew non-manual and co-occurring manual signals in favor of a simplified list o…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Published in LREC-Coling 2024

  47. arXiv:2406.06149  [pdf, other]

    cs.LG stat.ML

    Decoupled Marked Temporal Point Process using Neural Ordinary Differential Equations

    Authors: Yujee Song, Donghyun Lee, Rui Meng, Won Hwa Kim

    Abstract: A Marked Temporal Point Process (MTPP) is a stochastic process whose realization is a set of event-time data. MTPP is often used to understand complex dynamics of asynchronous temporal events such as money transaction, social media, healthcare, etc. Recent studies have utilized deep neural networks to capture complex temporal dependencies of events and generate embedding that aptly represent the o…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 18 pages, 8 figures, The Twelfth International Conference on Learning Representations (ICLR 2024)

  48. arXiv:2406.06134  [pdf, other]

    cs.CV cs.AI cs.LG

    DiffInject: Revisiting Debias via Synthetic Data Generation using Diffusion-based Style Injection

    Authors: Donggeun Ko, Sangwoo Jo, Dongjun Lee, Namjun Park, Jaekwang Kim

    Abstract: Dataset bias is a significant challenge in machine learning, where specific attributes, such as texture or color of the images are unintentionally learned resulting in detrimental performance. To address this, previous efforts have focused on debiasing models either by developing novel debiasing algorithms or by generating synthetic data to mitigate the prevalent dataset biases. However, generativ…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 10 pages (including supplementary), 3 figures, SynData4CV@CVPR 24 (Workshop)

  49. A Deep Learning-Augmented Stand-off Radar Scheme for Rapidly Detecting Tree Defects

    Authors: Jiwei Qian, Yee Hui Lee, Kaixuan Cheng, Qiqi Dai, Mohamed Lokman Mohd Yusof, Daryl Lee, Abdulkadir C. Yucel

    Abstract: Tree defect detection is crucial for the structural health screening of trees. Existing nondestructive testing (NDT) techniques for tree defect detection require time-consuming and labor-intensive measurement campaigns. This discourages their application for the routine structural health screening of whole populations of managed urban trees. To address this issue, this study proposes a deep-learni…

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: Accepted and to be published in IEEE Transactions on Geoscience and Remote Sensing

  50. arXiv:2406.05200  [pdf]

    physics.ins-det nucl-ex

    Decay Energy Spectrometry for Improved Nuclear Material Analysis at the IAEA NML

    Authors: G. B. Kim, A. R. L. Kavner, T. Parsons-Davis, S. Friedrich, O. B. Drury, D. Lee, X. Zhang, N. Hines, S. T. P. Boyd, S. Weidenbenner, K. Schreiber, S. Martinson, C. Smith, D. McNeel, S. Salazar, K. Koehler, M. Carpenter, M. Croce, D. Schmidt, J. Ullom

    Abstract: Decay energy spectrometry (DES) is a novel radiometric technique for high-precision analysis of nuclear materials. DES employs the unique thermal detection physics of cryogenic microcalorimeters with ultra-high energy resolution and 100$\%$ detection efficiency to accomplish high precision decay energy measurements. Low-activity nuclear samples of 1 Bq or less, and without chemical separation, are…

    Submitted 11 July, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: This was submitted to 2022 IAEA symposium on nuclear safeguards (https://www.iaea.org/events/sg-2022), and posted at https://media.superevent.com/documents/20221027/668fdac0ee8d895ec6bcf293b1c42e6a/id-145.pdf