Skip to main content

Showing 1–50 of 7,494 results for author: Kim, S

  1. arXiv:2407.09374  [pdf

    cond-mat.mtrl-sci physics.chem-ph

    Grain boundaries control lithiation of solid solution substrates in lithium metal batteries

    Authors: Leonardo Shoji Aota, Chanwon Jung, Siyuan Zhang, Ömer K. Büyükuslu, Poonam Yadav, Mahander Pratap Singh, Xinren Chen, Eric Woods, Christina Scheu, Se-Ho Kim, Dierk Raabe, Baptiste Gault

    Abstract: The development of sustainable transportation and communication systems requires an increase in both energy density and capacity retention of Li-batteries. Using substrates forming a solid solution with body centered cubic Li enhances the cycle stability of anode-less batteries. However, it remains unclear how the substrate microstructure affects the lithiation behavior. Here, we deploy a correlat… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  2. arXiv:2407.09153  [pdf

    cond-mat.mtrl-sci

    Topological Fermi-arc surface state covered by floating electrons on a two-dimensional electride

    Authors: Chan-young Lim, Min-Seok Kim, Dong Cheol Lim, Sunghun Kim, Yeonghoon Lee, Jaehoon Cha, Gyubin Lee, Sang Yong Song, Dinesh Thapa, Jonathan D. Denlinger, Seong-Gon Kim, Sung Wng Kim, Jungpil Seo, Yeongkwan Kim

    Abstract: Two-dimensional electrides can acquire topologically non-trivial phases due to intriguing interplay between the cationic atomic layers and anionic electron layers. However, experimental evidence of topological surface states has yet to be verified. Here, via angle-resolved photoemission spectroscopy (ARPES) and scanning tunnelling microscopy (STM), we probe the magnetic Weyl states of the ferromag… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 22 pages, 6 figures

    Journal ref: Nat. Commun. 15 (2024) 5615

  3. arXiv:2407.09033  [pdf, other

    cs.CV

    Textual Query-Driven Mask Transformer for Domain Generalized Segmentation

    Authors: Byeonghyun Pak, Byeongju Woo, Sunghwan Kim, Dae-hwan Kim, Hoseong Kim

    Abstract: In this paper, we introduce a method to tackle Domain Generalized Semantic Segmentation (DGSS) by utilizing domain-invariant semantic knowledge from text embeddings of vision-language models. We employ the text embeddings as object queries within a transformer-based segmentation framework (textual object queries). These queries are regarded as a domain-invariant basis for pixel grouping in DGSS. T… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  4. arXiv:2407.08892  [pdf, other

    cs.CL cs.LG

    Characterizing Prompt Compression Methods for Long Context Inference

    Authors: Siddharth Jha, Lutfi Eren Erdogan, Sehoon Kim, Kurt Keutzer, Amir Gholami

    Abstract: Long context inference presents challenges at the system level with increased compute and memory requirements, as well as from an accuracy perspective in being able to reason over long contexts. Recently, several methods have been proposed to compress the prompt to reduce the context length. However, there has been little work on comparing the different proposed methods across different tasks thro… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Es-FoMo @ ICML 2024

  5. arXiv:2407.07438  [pdf, ps, other

    math.FA

    Near-order relation of power means

    Authors: Jinmi Hwang, Sejong Kim

    Abstract: On the setting of positive definite operators we study the near-order properties of power means such as the quasi-arithmetic mean (Hölder mean) and Rényi power mean. We see the monotonicity of spectral geometric mean and Wasserstein mean on parameters with respect to the near-order and the near-order relationship between the spectral geometric mean and Wasserstein mean. Furthermore, the monotonici… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  6. arXiv:2407.07024  [pdf, other

    cs.CV cs.AI

    Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization

    Authors: Jeongseok Hyun, Su Ho Han, Hyolim Kang, Joon-Young Lee, Seon Joo Kim

    Abstract: The vocabulary size in temporal action localization (TAL) is constrained by the scarcity of large-scale annotated datasets. To address this, recent works incorporate powerful pre-trained vision-language models (VLMs), such as CLIP, to perform open-vocabulary TAL (OV-TAL). However, unlike VLMs trained on extensive image/video-text pairs, existing OV-TAL methods still rely on small, fully labeled TA… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  7. arXiv:2407.06851  [pdf, other

    cs.CL

    Safe-Embed: Unveiling the Safety-Critical Knowledge of Sentence Encoders

    Authors: Jinseok Kim, Jaewon Jung, Sangyeop Kim, Sohyung Park, Sungzoon Cho

    Abstract: Despite the impressive capabilities of Large Language Models (LLMs) in various tasks, their vulnerability to unsafe prompts remains a critical issue. These prompts can lead LLMs to generate responses on illegal or sensitive topics, posing a significant threat to their safe and ethical use. Existing approaches attempt to address this issue using classification models, but they have several drawback… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: ACL 2024 KnowledgeableLMs workshop paper

  8. arXiv:2407.06204  [pdf, other

    cs.LG cs.CL

    A Survey on Mixture of Experts

    Authors: Weilin Cai, Juyong Jiang, Fan Wang, Jing Tang, Sunghun Kim, Jiayi Huang

    Abstract: Large language models (LLMs) have garnered unprecedented advancements across diverse fields, ranging from natural language processing to computer vision and beyond. The prowess of LLMs is underpinned by their substantial model size, extensive and diverse datasets, and the vast computational power harnessed during training, all of which contribute to the emergent abilities of LLMs (e.g., in-context… ▽ More

    Submitted 26 June, 2024; originally announced July 2024.

  9. Is GPT-4 Alone Sufficient for Automated Essay Scoring?: A Comparative Judgment Approach Based on Rater Cognition

    Authors: Seungju Kim, Meounggun Jo

    Abstract: Large Language Models (LLMs) have shown promise in Automated Essay Scoring (AES), but their zero-shot and few-shot performance often falls short compared to state-of-the-art models and human raters. However, fine-tuning LLMs for each specific task is impractical due to the variety of essay prompts and rubrics used in real-world educational contexts. This study proposes a novel approach combining L… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 16 pages, 3 figures, Learning @ Scale 2024

  10. arXiv:2407.05618  [pdf, other

    nucl-ex hep-ex

    Improved limit on neutrinoless double beta decay of \mohundred~from AMoRE-I

    Authors: A. Agrawal, V. V. Alenkov, P. Aryal, J. Beyer, B. Bhandari, R. S. Boiko, K. Boonin, O. Buzanov, C. R. Byeon, N. Chanthima, M. K. Cheoun, J. S. Choe, Seonho Choi, S. Choudhury, J. S. Chung, F. A. Danevich, M. Djamal, D. Drung, C. Enss, A. Fleischmann, A. M. Gangapshev, L. Gastaldo, Y. M. Gavrilyuk, A. M. Gezhaev, O. Gileva , et al. (83 additional authors not shown)

    Abstract: AMoRE searches for the signature of neutrinoless double beta decay of $^{100}$Mo with a 100 kg sample of enriched $^{100}$Mo. Scintillating molybdate crystals coupled with a metallic magnetic calorimeter operate at milli-Kelvin temperatures to measure the energy of electrons emitted in the decay. As a demonstration of the full-scale AMoRE, we conducted AMoRE-I, a pre-experiment with 18 molybdate c… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 7 pages, 4 figures

  11. arXiv:2407.05565  [pdf

    cond-mat.str-el

    Exploring the role of nonlocal Coulomb interactions in perovskite transition metal oxides

    Authors: Indukuru Ramesh Reddy, Chang-Jong Kang, Sooran Kim, Bongjae Kim

    Abstract: Employing the density functional theory incorporating on-site and inter-site Coulomb interactions (DFT+U+V), we have investigated the role of the nonlocal interactions on the electronic structures of the transition metal oxide perovskites. Using constrained random phase approximation calculations, we derived screened Coulomb interaction parameters and revealed a competition between localization an… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  12. arXiv:2407.05186  [pdf, other

    cs.CY cs.SI

    Understanding Political Communication and Political Communicators on Twitch

    Authors: Sangyeon Kim

    Abstract: As new technologies rapidly reshape patterns of political communication, platforms like Twitch are transforming how people consume political information. This entertainment-oriented live streaming platform allows us to observe the impact of technologies such as ``live-streaming'' and ``streaming-chat'' on political communication. Despite its entertainment focus, Twitch hosts a variety of political… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  13. arXiv:2407.04597  [pdf, other

    cs.CV cs.AI

    Feature Attenuation of Defective Representation Can Resolve Incomplete Masking on Anomaly Detection

    Authors: YeongHyeon Park, Sungho Kang, Myung Jin Kim, Hyeong Seok Kim, Juneho Yi

    Abstract: In unsupervised anomaly detection (UAD) research, while state-of-the-art models have reached a saturation point with extensive studies on public benchmark datasets, they adopt large-scale tailor-made neural networks (NN) for detection performance or pursued unified models for various tasks. Towards edge computing, it is necessary to develop a computationally efficient and scalable solution that av… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 11 pages, 6 figures, 5 tables

  14. arXiv:2407.04280  [pdf, other

    cs.CL cs.SD eess.AS

    LearnerVoice: A Dataset of Non-Native English Learners' Spontaneous Speech

    Authors: Haechan Kim, Junho Myung, Seoyoung Kim, Sungpah Lee, Dongyeop Kang, Juho Kim

    Abstract: Prevalent ungrammatical expressions and disfluencies in spontaneous speech from second language (L2) learners pose unique challenges to Automatic Speech Recognition (ASR) systems. However, few datasets are tailored to L2 learner speech. We publicly release LearnerVoice, a dataset consisting of 50.04 hours of audio and transcriptions of L2 learners' spontaneous speech. Our linguistic analysis revea… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Accepted for INTERSPEECH 2024

  15. arXiv:2407.04192  [pdf, other

    cs.LG

    KAN-ODEs: Kolmogorov-Arnold Network Ordinary Differential Equations for Learning Dynamical Systems and Hidden Physics

    Authors: Benjamin C. Koenig, Suyong Kim, Sili Deng

    Abstract: Kolmogorov-Arnold Networks (KANs) as an alternative to Multi-layer perceptrons (MLPs) are a recent development demonstrating strong potential for data-driven modeling. This work applies KANs as the backbone of a Neural Ordinary Differential Equation framework, generalizing their use to the time-dependent and grid-sensitive cases often seen in scientific machine learning applications. The proposed… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 12 pages, 5 figures plus 1 appendix figure, 1 table plus 1 appendix table. B.C.K. and S.K. contributed equally to this work

    ACM Class: I.6.5; G.1.7

  16. arXiv:2407.04007  [pdf, other

    cond-mat.mes-hall hep-th

    Domain Wall Networks as Skyrmion Crystals in Chiral Magnets

    Authors: Seungho Lee, Toshiaki Fujimori, Muneto Nitta, Se Kwon Kim

    Abstract: We theoretically investigate the ground states of a chiral magnet with a square anisotropy and show that it supports domain wall networks as stable ground states. A domain wall junction in the domain wall network turns out to be a skyrmion with half topological charge and, therefore, the found domain wall network has a second topological nature, a skyrmion crystal. More specifically, we present a… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 8 pages, 6 figures, 2 pages of supplemental material

  17. arXiv:2407.03828  [pdf, other

    astro-ph.CO astro-ph.HE astro-ph.IM astro-ph.SR hep-ex hep-ph

    NuSTAR as an Axion Helioscope

    Authors: J. Ruz, E. Todarello, J. K. Vogel, M. Giannotti, B. Grefenstette, H. S. Hudson, I. G. Hannah, I. G. Irastorza, C. S. Kim, T. O'Shea, M. Regis, D. M. Smith, M. Taoso, J. Trujillo Bueno

    Abstract: The nature of dark matter in the Universe is still an open question in astrophysics and cosmology. Axions and axion-like particles (ALPs) offer a compelling solution, and traditionally ground-based experiments have eagerly, but to date unsuccessfully, searched for these hypothetical low-mass particles that are expected to be produced in large quantities in the strong electromagnetic fields in the… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 23 pages, 12 figures

  18. arXiv:2407.03563  [pdf, other

    eess.AS cs.CL cs.LG eess.IV

    Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition

    Authors: Sungnyun Kim, Kangwook Jang, Sangmin Bae, Hoirin Kim, Se-Young Yun

    Abstract: Audio-visual speech recognition (AVSR) aims to transcribe human speech using both audio and video modalities. In practical environments with noise-corrupted audio, the role of video information becomes crucial. However, prior works have primarily focused on enhancing audio features in AVSR, overlooking the importance of video features. In this study, we strengthen the video features by learning th… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  19. arXiv:2407.03103  [pdf, other

    cs.CL

    Cactus: Towards Psychological Counseling Conversations using Cognitive Behavioral Theory

    Authors: Suyeon Lee, Sunghwan Kim, Minju Kim, Dongjin Kang, Dongil Yang, Harim Kim, Minseok Kang, Dayi Jung, Min Hee Kim, Seungbeen Lee, Kyoung-Mee Chung, Youngjae Yu, Dongha Lee, Jinyoung Yeo

    Abstract: Recently, the demand for psychological counseling has significantly increased as more individuals express concerns about their mental health. This surge has accelerated efforts to improve the accessibility of counseling by using large language models (LLMs) as counselors. To ensure client privacy, training open-source LLMs faces a key challenge: the absence of realistic counseling datasets. To add… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Under Review

  20. arXiv:2407.02750  [pdf, other

    cs.CL

    Learning to Reduce: Towards Improving Performance of Large Language Models on Structured Data

    Authors: Younghun Lee, Sungchul Kim, Ryan A. Rossi, Tong Yu, Xiang Chen

    Abstract: Large Language Models (LLMs) have been achieving competent performance on a wide range of downstream tasks, yet existing work shows that inference on structured data is challenging for LLMs. This is because LLMs need to either understand long structured data or select the most relevant evidence before inference, and both approaches are not trivial. This paper proposes a framework, Learning to Redu… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: ICML 2024 Workshop on Long-Context Foundation Models, Vienna, Austria 2024. arXiv admin note: substantial text overlap with arXiv:2402.14195

  21. Addressing Prediction Delays in Time Series Forecasting: A Continuous GRU Approach with Derivative Regularization

    Authors: Sheo Yon Jhin, Seojin Kim, Noseong Park

    Abstract: Time series forecasting has been an essential field in many different application areas, including economic analysis, meteorology, and so forth. The majority of time series forecasting models are trained using the mean squared error (MSE). However, this training based on MSE causes a limitation known as prediction delay. The prediction delay, which implies the ground-truth precedes the prediction,… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: KDD 2024 accepted paper

  22. arXiv:2407.01073  [pdf, other

    cs.RO

    No More Potentially Dynamic Objects: Static Point Cloud Map Generation based on 3D Object Detection and Ground Projection

    Authors: Soojin Woo, Donghwi Jung, Seong-Woo Kim

    Abstract: In this paper, we propose an algorithm to generate a static point cloud map based on LiDAR point cloud data. Our proposed pipeline detects dynamic objects using 3D object detectors and projects points of dynamic objects onto the ground. Typically, point cloud data acquired in real-time serves as a snapshot of the surrounding areas containing both static objects and dynamic objects. The static obje… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  23. arXiv:2407.00859  [pdf, other

    stat.ME

    Statistical inference on partially shape-constrained function-on-scalar linear regression models

    Authors: Kyunghee Han, Yeonjoo Park, Soo-Young Kim

    Abstract: We consider functional linear regression models where functional outcomes are associated with scalar predictors by coefficient functions with shape constraints, such as monotonicity and convexity, that apply to sub-domains of interest. To validate the partial shape constraints, we propose testing a composite hypothesis of linear functional constraints on regression coefficients. Our approach emplo… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 30 pages, 7 figures

  24. arXiv:2407.00693  [pdf, other

    cs.AI cs.CL cs.LG

    BAPO: Base-Anchored Preference Optimization for Personalized Alignment in Large Language Models

    Authors: Gihun Lee, Minchan Jeong, Yujin Kim, Hojung Jung, Jaehoon Oh, Sangmook Kim, Se-Young Yun

    Abstract: While learning to align Large Language Models (LLMs) with human preferences has shown remarkable success, aligning these models to meet the diverse user preferences presents further challenges in preserving previous knowledge. This paper examines the impact of personalized preference optimization on LLMs, revealing that the extent of knowledge loss varies significantly with preference heterogeneit… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: under review

  25. arXiv:2407.00061  [pdf, ps, other

    math.NT

    Probabilistic multi-Stirling numbers of the second kind and probabilistic multi-Lah numbers

    Authors: Taekyun Kim, Dae san Kim

    Abstract: Assume that the moment generating function of the random vari able Y exists in a neighborhood of the origin. We introduce the probabilistic multi-Stirling numbers of the second kind associated with Y and the proba bilistic multi-Lah numbers associated with Y, both of indices (k1,k2,...,kr), by means of the multiple logarithm. Those numbers are respectively probabilistic extensions of the mul… ▽ More

    Submitted 17 June, 2024; originally announced July 2024.

    Comments: 11 pages

    MSC Class: 11B68; 11B73; 11B83

  26. arXiv:2407.00006  [pdf, other

    cs.DC cs.CE math.NA

    Adaptive and Parallel Multiscale Framework for Modeling Cohesive Failure in Engineering Scale Systems

    Authors: Sion Kim, Ezra Kissel, Karel Matous

    Abstract: The high computational demands of multiscale modeling necessitate advanced parallel and adaptive strategies. To address this challenge, we introduce an adaptive method that utilizes two microscale models based on an offline database for multiscale modeling of curved interfaces (e.g., adhesive layers). This database employs nonlinear classifiers, developed using Support Vector Machines from microsc… ▽ More

    Submitted 18 April, 2024; originally announced July 2024.

  27. arXiv:2406.19848  [pdf, other

    cs.RO

    3D Operation of Autonomous Excavator based on Reinforcement Learning through Independent Reward for Individual Joints

    Authors: Yoonkyu Yoo, Donghwi Jung, Seong-Woo Kim

    Abstract: In this paper, we propose a control algorithm based on reinforcement learning, employing independent rewards for each joint to control excavators in a 3D space. The aim of this research is to address the challenges associated with achieving precise control of excavators, which are extensively utilized in construction sites but prove challenging to control with precision due to their hydraulic stru… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  28. arXiv:2406.19618  [pdf

    cond-mat.mtrl-sci

    Unstable Retention Behavior in MIFIS FEFET: Accurate Analysis of the Origin by Absolute Polarization Measurement

    Authors: Song-Hyeon Kuk, Kyul Ko, Bong Ho Kim, Jae-Hoon Han, Sang-Hyeon Kim

    Abstract: Ferroelectric field-effect-transistor (FEFET) has emerged as a scalable solution for 3D NAND and embedded flash (eFlash), with recent progress in achieving large memory window (MW) using metal-insulator-ferroelectric-insulator-semiconductor (MIFIS) gate stacks. Although the physical origin of the large MW in the MIFIS stack has already been discussed, its retention characteristics have not been ex… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: We are submitting this to an IEEE journal but because of delays, we would like to share the information

  29. arXiv:2406.19575  [pdf

    cs.HC cs.DB cs.PF

    AR-PPF: Advanced Resolution-Based Pixel Preemption Data Filtering for Efficient Time-Series Data Analysis

    Authors: Taewoong Kim, Kukjin Choi, Sungjun Kim

    Abstract: With the advent of automation, many manufacturing industries have transitioned to data-centric methodologies, giving rise to an unprecedented influx of data during the manufacturing process. This data has become instrumental in analyzing the quality of manufacturing process and equipment. Engineers and data analysts, in particular, require extensive time-series data for seasonal cycle analysis. Ho… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 7pages, preprint, '24 Samsung Best Paper Awards

  30. arXiv:2406.19328  [pdf, other

    cs.SD cs.LG eess.AS

    Subtractive Training for Music Stem Insertion using Latent Diffusion Models

    Authors: Ivan Villa-Renteria, Mason L. Wang, Zachary Shah, Zhe Li, Soohyun Kim, Neelesh Ramachandran, Mert Pilanci

    Abstract: We present Subtractive Training, a simple and novel method for synthesizing individual musical instrument stems given other instruments as context. This method pairs a dataset of complete music mixes with 1) a variant of the dataset lacking a specific stem, and 2) LLM-generated instructions describing how the missing stem should be reintroduced. We then fine-tune a pretrained text-to-audio diffusi… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  31. arXiv:2406.19287  [pdf, other

    astro-ph.HE

    Isotropy of cosmic rays beyond $10^{20}$ eV favors their heavy mass composition

    Authors: Telescope Array Collaboration, R. U. Abbasi, Y. Abe, T. Abu-Zayyad, M. Allen, Y. Arai, R. Arimura, E. Barcikowski, J. W. Belz, D. R. Bergman, S. A. Blake, I. Buckland, B. G. Cheon, M. Chikawa, T. Fujii, K. Fujisue, K. Fujita, R. Fujiwara, M. Fukushima, G. Furlich, N. Globus, R. Gonzalez, W. Hanlon, N. Hayashida, H. He , et al. (118 additional authors not shown)

    Abstract: We report an estimation of the injected mass composition of ultra-high energy cosmic rays (UHECRs) at energies higher than 10 EeV. The composition is inferred from an energy-dependent sky distribution of UHECR events observed by the Telescope Array surface detector by comparing it to the Large Scale Structure of the local Universe. In the case of negligible extra-galactic magnetic fields the resul… ▽ More

    Submitted 3 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: 8 pages, 3 figures, accepted for publication in PRL

  32. arXiv:2406.19286  [pdf, other

    astro-ph.HE

    Mass composition of ultra-high energy cosmic rays from distribution of their arrival directions with the Telescope Array

    Authors: Telescope Array Collaboration, R. U. Abbasi, Y. Abe, T. Abu-Zayyad, M. Allen, Y. Arai, R. Arimura, E. Barcikowski, J. W. Belz, D. R. Bergman, S. A. Blake, I. Buckland, B. G. Cheon, M. Chikawa, T. Fujii, K. Fujisue, K. Fujita, R. Fujiwara, M. Fukushima, G. Furlich, N. Globus, R. Gonzalez, W. Hanlon, N. Hayashida, H. He , et al. (118 additional authors not shown)

    Abstract: We use a new method to estimate the injected mass composition of ultrahigh cosmic rays (UHECRs) at energies higher than 10 EeV. The method is based on comparison of the energy-dependent distribution of cosmic ray arrival directions as measured by the Telescope Array experiment (TA) with that calculated in a given putative model of UHECR under the assumption that sources trace the large-scale struc… ▽ More

    Submitted 3 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: 18 pages, 11 figures, accepted for publication in PRD

  33. arXiv:2406.19135  [pdf, other

    eess.AS cs.AI

    DEX-TTS: Diffusion-based EXpressive Text-to-Speech with Style Modeling on Time Variability

    Authors: Hyun Joon Park, Jin Sob Kim, Wooseok Shin, Sung Won Han

    Abstract: Expressive Text-to-Speech (TTS) using reference speech has been studied extensively to synthesize natural speech, but there are limitations to obtaining well-represented styles and improving model generalization ability. In this study, we present Diffusion-based EXpressive TTS (DEX-TTS), an acoustic model designed for reference-based speech synthesis with enhanced style representations. Based on a… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Preprint

  34. arXiv:2406.19132  [pdf, other

    astro-ph.SR astro-ph.GA

    Origin of extended Main Sequence Turn Off in open cluster NGC 2355

    Authors: Jayanand Maurya, M. R. Samal, Louis Amard, Yu Zhang, Hubiao Niu, Sang Chul Kim, Y. C. Joshi, B. Kumar

    Abstract: The presence of extended Main Sequence Turn-Off (eMSTO) in the open clusters has been attributed to various factors, such as spread in rotation rates, binary stars, and dust-like extinction from stellar excretion discs. We present a comprehensive analysis of the eMSTO in the open cluster NGC 2355. Using spectra from the Gaia-ESO archives, we find that the stars in the red part of the eMSTO have a… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 11 pages, 12 figures, accepted for publication in MNRAS

  35. arXiv:2406.18858  [pdf

    quant-ph

    On-off switchable nonreciprocal negative refraction in non-Hermitian photon-magnon hybrid systems

    Authors: Junyoung Kim, Bosung Kim, Bo-Jong Kim, Haechan Jeon, Sang-Koog Kim

    Abstract: Photon-magnon coupling, where electromagnetic waves interact with spin waves, and negative refraction, which bends the direction of electromagnetic waves unnaturally, constitute critical foundations and advancements in the realms of optics, spintronics, and quantum information technology. Here, we explore a magnetic-field-controlled, on-off switchable, nonreciprocal negative refraction within a no… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 27 pages, 4 figures

  36. arXiv:2406.18551  [pdf, other

    cs.CV cs.GR

    GFFE: G-buffer Free Frame Extrapolation for Low-latency Real-time Rendering

    Authors: Songyin Wu, Deepak Vembar, Anton Sochenov, Selvakumar Panneer, Sungye Kim, Anton Kaplanyan, Ling-Qi Yan

    Abstract: Real-time rendering has been embracing ever-demanding effects, such as ray tracing. However, rendering such effects in high resolution and high frame rate remains challenging. Frame extrapolation methods, which don't introduce additional latency as opposed to frame interpolation methods such as DLSS 3 and FSR 3, boost the frame rate by generating future frames based on previous frames. However, it… ▽ More

    Submitted 23 May, 2024; originally announced June 2024.

  37. arXiv:2406.17869  [pdf, other

    cs.CV

    Burst Image Super-Resolution with Base Frame Selection

    Authors: Sanghyun Kim, Min Jung Lee, Woohyeok Kim, Deunsol Jung, Jaesung Rim, Sunghyun Cho, Minsu Cho

    Abstract: Burst image super-resolution has been a topic of active research in recent years due to its ability to obtain a high-resolution image by using complementary information between multiple frames in the burst. In this work, we explore using burst shots with non-uniform exposures to confront real-world practical scenarios by introducing a new benchmark dataset, dubbed Non-uniformly Exposed Burst Image… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: CVPR2024W NTIRE accepted

  38. arXiv:2406.17310  [pdf, other

    eess.AS

    High Fidelity Text-to-Speech Via Discrete Tokens Using Token Transducer and Group Masked Language Model

    Authors: Joun Yeop Lee, Myeonghun Jeong, Minchan Kim, Ji-Hyun Lee, Hoon-Young Cho, Nam Soo Kim

    Abstract: We propose a novel two-stage text-to-speech (TTS) framework with two types of discrete tokens, i.e., semantic and acoustic tokens, for high-fidelity speech synthesis. It features two core components: the Interpreting module, which processes text and a speech prompt into semantic tokens focusing on linguistic contents and alignment, and the Speaking module, which captures the timbre of the target v… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech2024

  39. arXiv:2406.17254  [pdf, other

    cs.CV

    Scalp Diagnostic System With Label-Free Segmentation and Training-Free Image Translation

    Authors: Youngmin Kim, Saejin Kim, Hoyeon Moon, Youngjae Yu, Junhyug Noh

    Abstract: Scalp diseases and alopecia affect millions of people around the world, underscoring the urgent need for early diagnosis and management of the disease. However, the development of a comprehensive AI-based diagnosis system encompassing these conditions remains an underexplored domain due to the challenges associated with data imbalance and the costly nature of labeling. To address these issues, we… ▽ More

    Submitted 25 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: IEEE Transactions on Medical Imaging (Under Review)

  40. arXiv:2406.17145  [pdf, other

    cs.DC cs.AI cs.LG

    GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism

    Authors: Byungsoo Jeon, Mengdi Wu, Shiyi Cao, Sunghyun Kim, Sunghyun Park, Neeraj Aggarwal, Colin Unger, Daiyaan Arfeen, Peiyuan Liao, Xupeng Miao, Mohammad Alizadeh, Gregory R. Ganger, Tianqi Chen, Zhihao Jia

    Abstract: Deep neural networks (DNNs) continue to grow rapidly in size, making them infeasible to train on a single device. Pipeline parallelism is commonly used in existing DNN systems to support large-scale DNN training by partitioning a DNN into multiple stages, which concurrently perform DNN training for different micro-batches in a pipeline fashion. However, existing pipeline-parallel approaches only c… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  41. arXiv:2406.16994  [pdf, other

    eess.SP cs.AI

    Quantum Multi-Agent Reinforcement Learning for Cooperative Mobile Access in Space-Air-Ground Integrated Networks

    Authors: Gyu Seon Kim, Yeryeong Cho, Jaehyun Chung, Soohyun Park, Soyi Jung, Zhu Han, Joongheon Kim

    Abstract: Achieving global space-air-ground integrated network (SAGIN) access only with CubeSats presents significant challenges such as the access sustainability limitations in specific regions (e.g., polar regions) and the energy efficiency limitations in CubeSats. To tackle these problems, high-altitude long-endurance unmanned aerial vehicles (HALE-UAVs) can complement these CubeSat shortcomings for prov… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 17 pages, 22 figures

  42. arXiv:2406.16702  [pdf, other

    astro-ph.SR astro-ph.EP astro-ph.GA

    North-PHASE: Studying Periodicity, Hot Spots, Accretion Stability and Early Evolution in young stars in the northern hemisphere

    Authors: A. Sicilia-Aguilar, R. S. Kahar, M. E. Pelayo-Baldárrago, V. Roccatagliata, D. Froebrich, F. J. Galindo-Guil, J. Campbell-White, J. S. Kim, I. Mendigutía, L. Schlueter, P. S. Teixeira, S. Matsumura, M. Fang, A. Scholz, P. Ábrahám, A. Frasca, A. Garufi, C. Herbert, Á. Kóspál, C. F. Manara

    Abstract: We present the overview and first results from the North-PHASE Legacy Survey, which follows six young clusters for five years, using the 2 deg$^2$ FoV of the JAST80 telescope from the Javalambre Observatory (Spain). North-PHASE investigates stellar variability on timescales from days to years for thousands of young stars distributed over entire clusters. This allows us to find new YSO, characteris… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted by MNRAS

  43. arXiv:2406.16695  [pdf, other

    cs.CV

    Geometry-Aware Score Distillation via 3D Consistent Noising and Gradient Consistency Modeling

    Authors: Min-Seop Kwak, Donghoon Ahn, Ines Hyeonsu Kim, Jin-Hwa Kim, Seungryong Kim

    Abstract: Score distillation sampling (SDS), the methodology in which the score from pretrained 2D diffusion models is distilled into 3D representation, has recently brought significant advancements in text-to-3D generation task. However, this approach is still confronted with critical geometric inconsistency problems such as the Janus problem. Starting from a hypothesis that such inconsistency problems may… ▽ More

    Submitted 30 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  44. arXiv:2406.16175  [pdf, other

    cs.SI cs.CY physics.soc-ph

    The Persistence of Contrarianism on Twitter: Mapping users' sharing habits for the Ukraine war, COVID-19 vaccination, and the 2022 Midterm Elections

    Authors: David Axelrod, Sangyeon Kim, John Paolillo

    Abstract: Empirical studies of online disinformation emphasize matters of public concern such as the COVID-19 pandemic, foreign election interference, and the Russo-Ukraine war, largely in studies that treat the topics separately. Comparatively fewer studies attempt to relate such disparate topics and address the extent to which they share behaviors. In this study, we compare three samples of Twitter data o… ▽ More

    Submitted 28 June, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  45. arXiv:2406.16136  [pdf, other

    stat.ME

    Distribution-Free Online Change Detection for Low-Rank Images

    Authors: Tingnan Gong, Seong-Hee Kim, Yao Xie

    Abstract: We present a distribution-free CUSUM procedure designed for online change detection in a time series of low-rank images, particularly when the change causes a mean shift. We represent images as matrix data and allow for temporal dependence, in addition to inherent spatial dependence, before and after the change. The marginal distributions are assumed to be general, not limited to any specific para… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 29 pages, 7 figures

  46. arXiv:2406.16042  [pdf, other

    cs.CV

    Pose-Diversified Augmentation with Diffusion Model for Person Re-Identification

    Authors: Inès Hyeonsu Kim, JoungBin Lee, Soowon Son, Woojeong Jin, Kyusun Cho, Junyoung Seo, Min-Seop Kwak, Seokju Cho, JeongYeol Baek, Byeongwon Lee, Seungryong Kim

    Abstract: Person re-identification (Re-ID) often faces challenges due to variations in human poses and camera viewpoints, which significantly affect the appearance of individuals across images. Existing datasets frequently lack diversity and scalability in these aspects, hindering the generalization of Re-ID models to new camera systems. Previous methods have attempted to address these issues through data a… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: The project page is available at https://ku-cvlab.github.io/Diff-ID/

  47. arXiv:2406.15664  [pdf, other

    stat.ML cs.LG

    Flat Posterior Does Matter For Bayesian Transfer Learning

    Authors: Sungjun Lim, Jeyoon Yeom, Sooyon Kim, Hoyoon Byun, Jinho Kang, Yohan Jung, Jiyoung Jung, Kyungwoo Song

    Abstract: The large-scale pre-trained neural network has achieved notable success in enhancing performance for downstream tasks. Another promising approach for generalization is Bayesian Neural Network (BNN), which integrates Bayesian methods into neural network architectures, offering advantages such as Bayesian Model averaging (BMA) and uncertainty quantification. Despite these benefits, transfer learning… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  48. arXiv:2406.15102  [pdf, other

    cs.CV cs.LG

    HLQ: Fast and Efficient Backpropagation via Hadamard Low-rank Quantization

    Authors: Seonggon Kim, Eunhyeok Park

    Abstract: With the rapid increase in model size and the growing importance of various fine-tuning applications, lightweight training has become crucial. Since the backward pass is twice as expensive as the forward pass, optimizing backpropagation is particularly important. However, modifications to this process can lead to suboptimal convergence, so training optimization should minimize perturbations, which… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  49. arXiv:2406.14888  [pdf, other

    astro-ph.GA astro-ph.CO

    Finding dusty AGNs from the JWST CEERS survey with mid-infrared photometry

    Authors: Tom C. -C. Chien, Chih-Teng Ling, Tomotsugu Goto, Cossas K. -W. Wu, Seong Jin Kim, Tetsuya Hashimoto, Yu-Wei Lin, Ece Kilerci, Simon C. -C. Ho, Po-Ya Wang, Bjorn Jasper R. Raquel

    Abstract: The nature of the interaction between active galactic nuclei (AGNs) and their host galaxies remains an unsolved question. Therefore, conducting an AGN census is valuable to AGN research. Nevertheless, a significant fraction of AGNs are obscured by their environment, which blocks UV and optical emissions due to the dusty torus surrounding the central supermassive black hole (SMBH). To overcome this… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 15 pages, 20 figures, 4 tables. Accepted for publication in MNRAS. The 3 min summary: https://www.youtube.com/watch?v=mWUebbgUOh8

  50. arXiv:2406.13214  [pdf, other

    cs.LG

    Self-Explainable Temporal Graph Networks based on Graph Information Bottleneck

    Authors: Sangwoo Seo, Sungwon Kim, Jihyeong Jung, Yoonho Lee, Chanyoung Park

    Abstract: Temporal Graph Neural Networks (TGNN) have the ability to capture both the graph topology and dynamic dependencies of interactions within a graph over time. There has been a growing need to explain the predictions of TGNN models due to the difficulty in identifying how past events influence their predictions. Since the explanation model for a static graph cannot be readily applied to temporal grap… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: KDD 2024