Skip to main content

Showing 1–50 of 191 results for author: Chun, S

  1. arXiv:2407.05713  [pdf, other

    cs.CV cs.AI

    Short-term Object Interaction Anticipation with Disentangled Object Detection @ Ego4D Short Term Object Interaction Anticipation Challenge

    Authors: Hyunjin Cho, Dong Un Kang, Se Young Chun

    Abstract: Short-term object interaction anticipation is an important task in egocentric video analysis, including precise predictions of future interactions and their timings as well as the categories and positions of the involved active objects. To alleviate the complexity of this task, our proposed method, SOIA-DOD, effectively decompose it into 1) detecting active object and 2) classifying interaction an… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 4 pages

  2. arXiv:2407.05551  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    Read, Watch and Scream! Sound Generation from Text and Video

    Authors: Yujin Jeong, Yunji Kim, Sanghyuk Chun, Jiyoung Lee

    Abstract: Multimodal generative models have shown impressive advances with the help of powerful diffusion models. Despite the progress, generating sound solely from text poses challenges in ensuring comprehensive scene depiction and temporal alignment. Meanwhile, video-to-sound generation limits the flexibility to prioritize sound synthesis for specific objects within the scene. To tackle these challenges,… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Project page: https://naver-ai.github.io/rewas

  3. arXiv:2406.12014  [pdf, other

    astro-ph.HE

    An IXPE-Led X-ray Spectro-Polarimetric Campaign on the Soft State of Cygnus X-1: X-ray Polarimetric Evidence for Strong Gravitational Lensing

    Authors: James F. Steiner, Edward Nathan, Kun Hu, Henric Krawczynski, Michal Dovciak, Alexandra Veledina, Fabio Muleri, Jiri Svoboda, Kevin Alabarta, Maxime Parra, Yash Bhargava, Giorgio Matt, Juri Poutanen, Pierre-Olivier Petrucci, Allyn F. Tennant, M. Cristina Baglio, Luca Baldini, Samuel Barnier, Sudip Bhattacharyya, Stefano Bianchi, Maimouna Brigitte, Mauricio Cabezas, Floriane Cangemi, Fiamma Capitanio, Jacob Casey , et al. (112 additional authors not shown)

    Abstract: We present the first X-ray spectropolarimetric results for Cygnus X-1 in its soft state from a campaign of five IXPE observations conducted during 2023 May-June. Companion multiwavelength data during the campaign are likewise shown. The 2-8 keV X-rays exhibit a net polarization degree PD=1.99%+/-0.13% (68% confidence). The polarization signal is found to increase with energy across IXPE's 2-8 keV… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 20 pages, accepted for publication in ApJL

  4. arXiv:2406.09188  [pdf, ps, other

    cs.CV cs.IR

    Reducing Task Discrepancy of Text Encoders for Zero-Shot Composed Image Retrieval

    Authors: Jaeseok Byun, Seokhyeon Jeong, Wonjae Kim, Sanghyuk Chun, Taesup Moon

    Abstract: Composed Image Retrieval (CIR) aims to retrieve a target image based on a reference image and conditioning text, enabling controllable searches. Due to the expensive dataset construction cost for CIR triplets, a zero-shot (ZS) CIR setting has been actively studied to eliminate the need for human-collected triplet datasets. The mainstream of ZS-CIR employs an efficient projection module that projec… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 17 pages

  5. arXiv:2404.17507  [pdf, other

    cs.CV

    HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts

    Authors: Wonjae Kim, Sanghyuk Chun, Taekyung Kim, Dongyoon Han, Sangdoo Yun

    Abstract: In an era where the volume of data drives the effectiveness of self-supervised learning, the specificity and clarity of data semantics play a crucial role in model training. Addressing this, we introduce HYPerbolic Entailment filtering (HYPE), a novel methodology designed to meticulously extract modality-wise meaningful and well-aligned data from extensive, noisy image-text pair datasets. Our appr… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 28pages, 4.5MB

  6. arXiv:2404.04544  [pdf, other

    cs.CV cs.AI

    BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion

    Authors: Gwanghyun Kim, Hayeon Kim, Hoigi Seo, Dong Un Kang, Se Young Chun

    Abstract: Generating higher-resolution human-centric scenes with details and controls remains a challenge for existing text-to-image diffusion models. This challenge stems from limited training image size, text encoder capacity (limited tokens), and the inherent difficulty of generating complex scenes involving multiple humans. While current methods attempted to address training size limit only, they often… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: Project page: https://janeyeon.github.io/beyond-scene

  7. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  8. arXiv:2403.18260  [pdf, other

    cs.CV cs.CL

    Toward Interactive Regional Understanding in Vision-Large Language Models

    Authors: Jungbeom Lee, Sanghyuk Chun, Sangdoo Yun

    Abstract: Recent Vision-Language Pre-training (VLP) models have demonstrated significant advancements. Nevertheless, these models heavily rely on image-text pairs that capture only coarse and global information of an image, leading to a limitation in their regional understanding ability. In this work, we introduce \textbf{RegionVLM}, equipped with explicit regional modeling capabilities, allowing them to un… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: NAACL 2024 Main Conference

  9. arXiv:2403.13993  [pdf, ps, other

    astro-ph.GA

    Is the RSGC4 (Alicante 8) cluster a real star cluster?: Peculiar radial velocities of red supergiant stars

    Authors: Sang-Hyun Chun, GyuChul Myeong, Jae-Joon Lee, Heeyoung Oh

    Abstract: Young massive star clusters, like the six red supergiant clusters in the Scutum complex, provide valuable insights into star-formation and galaxy structures. We investigated the high-resolution near-infrared spectra of 60 RSG candidates in these clusters using the Immersion Grating Infrared Spectrograph. Among the candidates in RSGC4, we found significant scattering in radial velocity ($-64$ km/s… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 22 pages, 9 figures, 2 tables, accepted for publication in AJ

  10. arXiv:2403.04460  [pdf, other

    cs.CL

    Pearl: A Review-driven Persona-Knowledge Grounded Conversational Recommendation Dataset

    Authors: Minjin Kim, Minju Kim, Hana Kim, Beong-woo Kwak, Soyeon Chun, Hyunseo Kim, SeongKu Kang, Youngjae Yu, Jinyoung Yeo, Dongha Lee

    Abstract: Conversational recommender system is an emerging area that has garnered an increasing interest in the community, especially with the advancements in large language models (LLMs) that enable diverse reasoning over conversational input. Despite the progress, the field has many aspects left to explore. The currently available public datasets for conversational recommendation lack specific user prefer… ▽ More

    Submitted 8 June, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: Published at ACL 2024 Findings

  11. Systematic effects on a Compton polarimeter at the focus of an X-ray mirror

    Authors: M. Aoyagi, R. G. Bose, S. Chun, E. Gau, K. Hu, K. Ishiwata, N. K. Iyer, F. Kislat, M. Kiss, K. Klepper, H. Krawczynski, L. Lisalda, Y. Maeda, F. af Malmborg, H. Matsumoto, A. Miyamoto, T. Miyazawa, M. Pearce, B. F. Rauch, N. Rodriguez Cavero, S. Spooner, H. Takahashi, Y. Uchida, A. T. West, K. Wimalasena , et al. (1 additional authors not shown)

    Abstract: XL-Calibur is a balloon-borne Compton polarimeter for X-rays in the $\sim$15-80 keV range. Using an X-ray mirror with a 12 m focal length for collecting photons onto a beryllium scattering rod surrounded by CZT detectors, a minimum-detectable polarization as low as $\sim$3% is expected during a 24-hour on-target observation of a 1 Crab source at 45$^{\circ}$ elevation. Systematic effects alter the… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: Submitted to Astroparticle Physics

    Journal ref: Astropart. Phys. 158 (2024) 102944

  12. arXiv:2401.05513  [pdf, other

    physics.flu-dyn cond-mat.soft

    Flow rate-pressure drop relations for shear-thinning fluids in deformable configurations: theory and experiments

    Authors: SungGyu Chun, Evgeniy Boyko, Ivan C. Christov, Jie Feng

    Abstract: We provide an experimental framework to measure the flow rate--pressure drop relation for Newtonian and shear-thinning fluids in two common deformable configurations: (\textit{i}) a rectangular channel and (\textit{ii}) an axisymmetric tube. Using the Carreau model to describe the shear-dependent viscosity, we identify the key dimensionless rheological number, $Cu$, which characterizes shear thinn… ▽ More

    Submitted 25 April, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: 10 pages, 4 figures; v2 accepted for publication in Physical Review Fluids

    Journal ref: Phys. Rev. Fluids 9 (2024) 043302

  13. arXiv:2312.13027  [pdf, other

    cs.LG cs.CV

    Doubly Perturbed Task Free Continual Learning

    Authors: Byung Hyun Lee, Min-hwan Oh, Se Young Chun

    Abstract: Task Free online continual learning (TF-CL) is a challenging problem where the model incrementally learns tasks without explicit task information. Although training with entire data from the past, present as well as future is considered as the gold standard, naive approaches in TF-CL with the current samples may be conflicted with learning with samples in the future, leading to catastrophic forget… ▽ More

    Submitted 18 February, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024 (Oral)

  14. arXiv:2312.07425  [pdf, other

    cs.LG cs.CV eess.IV eess.SP

    Deep Internal Learning: Deep Learning from a Single Input

    Authors: Tom Tirer, Raja Giryes, Se Young Chun, Yonina C. Eldar

    Abstract: Deep learning, in general, focuses on training a neural network from large labeled datasets. Yet, in many cases there is value in training a network just from the input at hand. This is particularly relevant in many signal and image processing problems where training data is scarce and diversity is large on the one hand, and on the other, there is a lot of structure in the data that can be exploit… ▽ More

    Submitted 8 April, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Accepted to IEEE Signal Processing Magazine

  15. arXiv:2312.01998  [pdf, other

    cs.CV cs.IR

    Language-only Efficient Training of Zero-shot Composed Image Retrieval

    Authors: Geonmo Gu, Sanghyuk Chun, Wonjae Kim, Yoohoon Kang, Sangdoo Yun

    Abstract: Composed image retrieval (CIR) task takes a composed query of image and text, aiming to search relative images for both conditions. Conventional CIR approaches need a training dataset composed of triplets of query image, query text, and target image, which is very expensive to collect. Several recent works have worked on the zero-shot (ZS) CIR paradigm to tackle the issue without using pre-collect… ▽ More

    Submitted 31 March, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: CVPR 2024 camera-ready; First two authors contributed equally; 17 pages, 3.1MB

  16. arXiv:2312.01689  [pdf, other

    eess.IV cs.CV

    Fast and accurate sparse-view CBCT reconstruction using meta-learned neural attenuation field and hash-encoding regularization

    Authors: Heejun Shin, Taehee Kim, Jongho Lee, Se Young Chun, Seungryung Cho, Dongmyung Shin

    Abstract: Cone beam computed tomography (CBCT) is an emerging medical imaging technique to visualize the internal anatomical structures of patients. During a CBCT scan, several projection images of different angles or views are collectively utilized to reconstruct a tomographic image. However, reducing the number of projections in a CBCT scan while preserving the quality of a reconstructed image is challeng… ▽ More

    Submitted 16 January, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

  17. arXiv:2311.18654  [pdf, other

    cs.CV cs.AI

    Detailed Human-Centric Text Description-Driven Large Scene Synthesis

    Authors: Gwanghyun Kim, Dong Un Kang, Hoigi Seo, Hayeon Kim, Se Young Chun

    Abstract: Text-driven large scene image synthesis has made significant progress with diffusion models, but controlling it is challenging. While using additional spatial controls with corresponding texts has improved the controllability of large scene synthesis, it is still challenging to faithfully reflect detailed text descriptions without user-provided controls. Here, we propose DetText2Scene, a novel tex… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  18. arXiv:2311.18387  [pdf, other

    cs.CV cs.LG

    On Exact Inversion of DPM-Solvers

    Authors: Seongmin Hong, Kyeonghyun Lee, Suh Yoon Jeon, Hyewon Bae, Se Young Chun

    Abstract: Diffusion probabilistic models (DPMs) are a key component in modern generative models. DPM-solvers have achieved reduced latency and enhanced quality significantly, but have posed challenges to find the exact inverse (i.e., finding the initial noise from the given image). Here we investigate the exact inversions for DPM-solvers and propose algorithms to perform them when samples are generated by t… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: 16 pages

  19. arXiv:2311.08461  [pdf, other

    astro-ph.GA astro-ph.SR

    Chemical homogeneity of wide binary system: An approach from Near-Infrared spectroscopy

    Authors: Dongwook Lim, Andreas J. Koch-Hansen, Seungsoo Hong, Sang-Hyun Chun, Young-Wook Lee

    Abstract: Wide binaries, with separations between two stars from a few AU to more than several thousand AU, are valuable objects for various research topics in Galactic astronomy. As the number of newly reported wide binaries continues to increase, studying the chemical abundances of their component stars becomes more important. We conducted high-resolution near-infrared (NIR) spectroscopy for six pairs of… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 16 pages, 9 figures, accepted for publication in AJ

  20. arXiv:2311.01001  [pdf, other

    cs.CV cs.AI

    Fully Quantized Always-on Face Detector Considering Mobile Image Sensors

    Authors: Haechang Lee, Wongi Jeong, Dongil Ryu, Hyunwoo Je, Albert No, Kijeong Kim, Se Young Chun

    Abstract: Despite significant research on lightweight deep neural networks (DNNs) designed for edge devices, the current face detectors do not fully meet the requirements for "intelligent" CMOS image sensors (iCISs) integrated with embedded DNNs. These sensors are essential in various practical applications, such as energy-efficient mobile phones and surveillance systems with always-on capabilities. One not… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted to ICCV 2023 Workshop on Low-Bit Quantized Neural Networks (LBQNN), Oral

  21. arXiv:2310.13593  [pdf, other

    cs.CV

    Learning with Unmasked Tokens Drives Stronger Vision Learners

    Authors: Taekyung Kim, Sanghyuk Chun, Byeongho Heo, Dongyoon Han

    Abstract: Masked image modeling (MIM) has become a leading self-supervised learning strategy. MIMs such as Masked Autoencoder (MAE) learn strong representations by randomly masking input tokens for the encoder to process, with the decoder reconstructing the masked tokens to the input. However, MIM pre-trained encoders often exhibit a limited attention span, attributed to MIM's sole focus on regressing maske… ▽ More

    Submitted 23 April, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

  22. arXiv:2310.11125  [pdf, other

    astro-ph.HE

    IXPE observation confirms a high spin in the accreting black hole 4U 1957+115

    Authors: L. Marra, M. Brigitte, N. Rodriguez Cavero, S. Chun, J. F. Steiner, M. Dovčiak, M. Nowak, S. Bianchi, F. Capitanio, A. Ingram, G. Matt, F. Muleri, J. Podgorný, J. Poutanen, J. Svoboda, R. Taverna, F. Ursini, A. Veledina, A. De Rosa, J. A. Garcia, A. A. Lutovinov, I. A. Mereminskiy, R. Farinelli, S. Gunji, P. Kaaret , et al. (91 additional authors not shown)

    Abstract: We present the results of the first X-ray polarimetric observation of the low-mass X-ray binary 4U 1957+115, performed with the Imaging X-ray Polarimetry Explorer in May 2023. The binary system has been in a high-soft spectral state since its discovery and is thought to host a black hole. The $\sim$571 ks observation reveals a linear polarisation degree of $1.9\% \pm 0.6\%$ and a polarisation angl… ▽ More

    Submitted 8 February, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: 12 pages, 10 figures, 2 tables, accepted for publication in A&A

  23. arXiv:2310.10847  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci

    Non-local features of the spin-orbit exciton in Kitaev materials

    Authors: Blair W. Lebert, Subin Kim, Beom Hyun Kim, Sae Hwan Chun, Diego Casa, Jaewon Choi, Stefano Agrestini, Kejin Zhou, Mirian Garcia-Fernandez, Young-June Kim

    Abstract: A comparative resonant inelastic x-ray scattering (RIXS) study of three well-known Kitaev materials is presented: $α$-Li$_2$IrO$_3$, Na$_2$IrO$_3$, and $α$-RuCl$_3$. Despite similar low-energy physics, these materials show distinct electronic properties, such as the large difference in the size of the charge gap. The RIXS spectra of the spin-orbit exciton for these materials show remarkably simila… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Journal ref: Phys. Rev. B 108, 155122, 2023

  24. First X-ray polarization measurement confirms the low black-hole spin in LMC X-3

    Authors: Jiří Svoboda, Michal Dovčiak, James F. Steiner, Fabio Muleri, Adam Ingram, Anastasiya Yilmaz, Nicole Rodriguez Cavero, Lorenzo Marra, Juri Poutanen, Alexandra Veledina, Mehrnoosh Rahbardar Mojaver, Stefano Bianchi, Javier Garcia, Philip Kaaret, Henric Krawczynski, Giorgio Matt, Jakub Podgorný, Martin C. Weisskopf, Fabian Kislat, Pierre-Olivier Petrucci, Maimouna Brigitte, Michal Bursa, Sergio Fabiani, Kun Hu, Sohee Chun , et al. (87 additional authors not shown)

    Abstract: X-ray polarization is a powerful tool to investigate the geometry of accreting material around black holes, allowing independent measurements of the black hole spin and orientation of the innermost parts of the accretion disk. We perform the X-ray spectro-polarimetric analysis of an X-ray binary system in the Large Magellanic Cloud, LMC X-3, that hosts a stellar-mass black hole, known to be persis… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: 14 pages, 8 figures, submitted to ApJ

    Journal ref: The Astrophysical Journal, 2024, 960, 3

  25. arXiv:2308.14844  [pdf, other

    astro-ph.SR astro-ph.GA astro-ph.HE

    The SN 2023ixf Progenitor in M101: II. Properties

    Authors: Schuyler D. Van Dyk, Sundar Srinivasan, Jennifer E. Andrews, Monika Soraisam, Tamas Szalai, Steve B. Howell, Howard Isaacson, Thomas Matheson, Erik Petigura, Peter Scicluna, Andrew W. Stephens, Judah Van Zandt, WeiKang Zheng, Sang-Hyun Chun, Alexei V. Filippenko

    Abstract: We follow our first paper with an analysis of the ensemble of the extensive pre-explosion ground- and space-based infrared observations of the red supergiant (RSG) progenitor candidate for the nearby core-collapse supernova SN 2023ixf in Messier 101, together with optical data prior to explosion obtained with the Hubble Space Telescope (HST). We have confirmed the association of the progenitor can… ▽ More

    Submitted 23 April, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: 40 pages, substantive modifications relative to the previous, although the overall conclusions remain the same; to appear in AAS Journals

  26. arXiv:2308.14374  [pdf, other

    cs.LG

    Online Continual Learning on Hierarchical Label Expansion

    Authors: Byung Hyun Lee, Okchul Jung, Jonghyun Choi, Se Young Chun

    Abstract: Continual learning (CL) enables models to adapt to new tasks and environments without forgetting previously learned knowledge. While current CL setups have ignored the relationship between labels in the past task and the new task with or without small task overlaps, real-world scenarios often involve hierarchical relationships between old and new tasks, posing another challenge for traditional CL… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023

  27. arXiv:2308.13449  [pdf, other

    cs.CL

    The Poison of Alignment

    Authors: Aibek Bekbayev, Sungbae Chun, Yerzat Dulat, James Yamazaki

    Abstract: From the perspective of content safety issues, alignment has shown to limit large language models' (LLMs) harmful content generation. This intentional method of reinforcing models to not respond to certain user inputs seem to be present in many modern open-source instruction tuning datasets such as OpenAssistant or Guanaco. We introduce a novel insight to an instruction-tuned model's performance a… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

  28. arXiv:2307.10667  [pdf, other

    eess.IV cs.CV

    Efficient Unified Demosaicing for Bayer and Non-Bayer Patterned Image Sensors

    Authors: Haechang Lee, Dongwon Park, Wongi Jeong, Kijeong Kim, Hyunwoo Je, Dongil Ryu, Se Young Chun

    Abstract: As the physical size of recent CMOS image sensors (CIS) gets smaller, the latest mobile cameras are adopting unique non-Bayer color filter array (CFA) patterns (e.g., Quad, Nona, QxQ), which consist of homogeneous color units with adjacent pixels. These non-Bayer sensors are superior to conventional Bayer CFA thanks to their changeable pixel-bin sizes for different light conditions but may introdu… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

  29. arXiv:2306.16615  [pdf, other

    cs.CV

    Representation learning of vertex heatmaps for 3D human mesh reconstruction from multi-view images

    Authors: Sungho Chun, Sungbum Park, Ju Yong Chang

    Abstract: This study addresses the problem of 3D human mesh reconstruction from multi-view images. Recently, approaches that directly estimate the skinned multi-person linear model (SMPL)-based human mesh vertices based on volumetric heatmap representation from input images have shown good performance. We show that representation learning of vertex heatmaps using an autoencoder helps improve the performance… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: ICIP 2023

  30. arXiv:2306.10783  [pdf, other

    astro-ph.SR astro-ph.GA astro-ph.HE

    The SN 2023ixf Progenitor in M101: I. Infrared Variability

    Authors: Monika D. Soraisam, Tamás Szalai, Schuyler D. Van Dyk, Jennifer E. Andrews, Sundar Srinivasan, Sang-Hyun Chun, Thomas Matheson, Peter Scicluna, Diego A. Vasquez-Torres

    Abstract: Observational evidence points to a red supergiant (RSG) progenitor for SN 2023ixf. The progenitor candidate has been detected in archival images at wavelengths (>0.6 micron) where RSGs typically emit profusely. This object is distinctly variable in the infrared (IR). We characterize the variability using pre-explosion mid-IR (3.6 and 4.5 micron) Spitzer and ground-based near-IR (JHKs) archival dat… ▽ More

    Submitted 22 August, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 20 pages, 2 tables, accepted to ApJ

  31. arXiv:2306.09766  [pdf, other

    cond-mat.mtrl-sci

    Ultrafast switching of topological invariants by light-driven strain

    Authors: Tae Gwan Park, Seungil Baek, Junho Park, Eui-Cheol Shin, Hong Ryeol Na, Eon-Taek Oh, Seung-Hyun Chun, Yong-Hyun Kim, Sunghun Lee, Fabian Rotermund

    Abstract: Reversible control of the topological invariants from nontrivial to trivial states has fundamental implications for quantum information processors and spintronics, by realizing of an on/off switch for robust and dissipationless spin-current. Although mechanical strain has typically advantageous for such control of topological invariants, it is often accompanied by in-plane fractures and is not sui… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: 14 pages, 4 figure, and Supplementary Material

  32. arXiv:2305.18171  [pdf, other

    cs.CV cs.LG

    Improved Probabilistic Image-Text Representations

    Authors: Sanghyuk Chun

    Abstract: Image-Text Matching (ITM) task, a fundamental vision-language (VL) task, suffers from the inherent ambiguity arising from multiplicity and imperfect annotations. Deterministic functions are not sufficiently powerful to capture ambiguity, prompting the exploration of probabilistic embeddings to tackle the challenge. However, the existing probabilistic ITM approach encounters two key shortcomings; t… ▽ More

    Submitted 9 April, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: ICLR 2024 camera-ready; Code: https://github.com/naver-ai/pcmepp. Project page: https://naver-ai.github.io/pcmepp/. 30 pages, 2.2 MB

  33. arXiv:2305.16057  [pdf

    cs.LG cs.CL

    Fake News Detection and Behavioral Analysis: Case of COVID-19

    Authors: Chih-Yuan Li, Navya Martin Kollapally, Soon Ae Chun, James Geller

    Abstract: While the world has been combating COVID-19 for over three years, an ongoing "Infodemic" due to the spread of fake news regarding the pandemic has also been a global issue. The existence of the fake news impact different aspect of our daily lives, including politics, public health, economic activities, etc. Readers could mistake fake news for real news, and consequently have less access to authent… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: 27 pages, 11 figures, 13 tables

    MSC Class: 68

  34. arXiv:2305.10630  [pdf, other

    astro-ph.HE

    The First X-ray Polarization Observation of the Black Hole X-ray Binary 4U 1630-47 in the Steep Power Law State

    Authors: Nicole Rodriguez Cavero, Lorenzo Marra, Henric Krawczynski, Michal Dovčiak, Stefano Bianchi, James F. Steiner, Jiri Svoboda, Fiamma Capitanio, Giorgio Matt, Michela Negro, Adam Ingram, Alexandra Veledina, Roberto Taverna, Vladimir Karas, Francesco Ursini, Jakub Podgorný, Ajay Ratheesh, Valery Suleimanov, Romana Mikušincová, Silvia Zane, Philip Kaaret, Fabio Muleri, Juri Poutanen, Christian Malacaria, Pierre-Olivier Petrucci , et al. (85 additional authors not shown)

    Abstract: The Imaging X-ray Polarimetry Explorer (IXPE) observed the black hole X-ray binary 4U 1630-47 in the steep power law (or very high) state. The observations reveal a linear polarization degree of the 2-8 keV X-rays of 6.8 +/- 0.2 % at a position angle of 21°.3 +/- 0°.9 East of North (all errors at 1σ confidence level). Whereas the polarization degree increases with energy, the polarization angle st… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: 14 pages, 2 tables, 6 figures

  35. X-ray Polarization of the Black Hole X-ray Binary 4U 1630-47 Challenges Standard Thin Accretion Disk Scenario

    Authors: Ajay Ratheesh, Michal Dovčiak, Henric Krawczynski, Jakub Podgorný, Lorenzo Marra, Alexandra Veledina, Valery Suleimanov, Nicole Rodriguez Cavero, James Steiner, Jiri Svoboda, Andrea Marinucci, Stefano Bianchi, Michela Negro, Giorgio Matt, Francesco Tombesi, Juri Poutanen, Adam Ingram, Roberto Taverna, Andrew West, Vladimir Karas, Francesco Ursini, Paolo Soffitta, Fiamma Capitanio, Domenico Viscolo, Alberto Manfreda , et al. (90 additional authors not shown)

    Abstract: Large energy-dependent X-ray polarization degree is detected by the Imaging X-ray Polarimetry Explorer ({IXPE}) in the high-soft emission state of the black hole X-ray binary 4U 1630--47. The highly significant detection (at $\approx50σ$ confidence level) of an unexpectedly high polarization, rising from $\sim6\%$ at $2$ keV to $\sim10\%$ at $8$ keV, cannot be easily reconciled with standard model… ▽ More

    Submitted 19 March, 2024; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: Published in ApJ (https://doi.org/10.3847/1538-4357/ad226e)

    Journal ref: 2024 ApJ 964 77

  36. arXiv:2304.10727  [pdf, other

    cs.CV cs.AI

    RoCOCO: Robustness Benchmark of MS-COCO to Stress-test Image-Text Matching Models

    Authors: Seulki Park, Daeho Um, Hajung Yoon, Sanghyuk Chun, Sangdoo Yun, Jin Young Choi

    Abstract: In this paper, we propose a robustness benchmark for image-text matching models to assess their vulnerabilities. To this end, we insert adversarial texts and images into the search pool (i.e., gallery set) and evaluate models with the adversarial data. Specifically, we replace a word in the text to change the meaning of the text and mix images with different images to create perceptible changes in… ▽ More

    Submitted 14 July, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

  37. arXiv:2304.04875  [pdf, other

    cs.CV

    Three Recipes for Better 3D Pseudo-GTs of 3D Human Mesh Estimation in the Wild

    Authors: Gyeongsik Moon, Hongsuk Choi, Sanghyuk Chun, Jiyoung Lee, Sangdoo Yun

    Abstract: Recovering 3D human mesh in the wild is greatly challenging as in-the-wild (ITW) datasets provide only 2D pose ground truths (GTs). Recently, 3D pseudo-GTs have been widely used to train 3D human mesh estimation networks as the 3D pseudo-GTs enable 3D mesh supervision when training the networks on ITW datasets. However, despite the great potential of the 3D pseudo-GTs, there has been no extensive… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: Published at CVPRW 2023

  38. arXiv:2304.04555  [pdf, other

    cs.LG cs.AI

    Neural Diffeomorphic Non-uniform B-spline Flows

    Authors: Seongmin Hong, Se Young Chun

    Abstract: Normalizing flows have been successfully modeling a complex probability distribution as an invertible transformation of a simple base distribution. However, there are often applications that require more than invertibility. For instance, the computation of energies and forces in physics requires the second derivatives of the transformation to be well-defined and continuous. Smooth normalizing flow… ▽ More

    Submitted 11 April, 2023; v1 submitted 7 April, 2023; originally announced April 2023.

    Comments: Accepted to AAAI 2023

  39. arXiv:2304.02827  [pdf, other

    cs.CV cs.AI

    DITTO-NeRF: Diffusion-based Iterative Text To Omni-directional 3D Model

    Authors: Hoigi Seo, Hayeon Kim, Gwanghyun Kim, Se Young Chun

    Abstract: The increasing demand for high-quality 3D content creation has motivated the development of automated methods for creating 3D object models from a single image and/or from a text prompt. However, the reconstructed 3D objects using state-of-the-art image-to-3D methods still exhibit low correspondence to the given image and low multi-view consistency. Recent state-of-the-art text-to-3D methods are a… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: Project page: https://janeyeon.github.io/ditto-nerf/

  40. arXiv:2304.01900  [pdf, other

    cs.CV cs.AI

    PODIA-3D: Domain Adaptation of 3D Generative Model Across Large Domain Gap Using Pose-Preserved Text-to-Image Diffusion

    Authors: Gwanghyun Kim, Ji Ha Jang, Se Young Chun

    Abstract: Recently, significant advancements have been made in 3D generative models, however training these models across diverse domains is challenging and requires an huge amount of training data and knowledge of pose distribution. Text-guided domain adaptation methods have allowed the generator to be adapted to the target domains using text prompts, thereby obviating the need for assembling numerous data… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: Project page: https://gwang-kim.github.io/podia_3d/

  41. arXiv:2303.17595  [pdf, other

    cs.CV cs.LG

    Neglected Free Lunch -- Learning Image Classifiers Using Annotation Byproducts

    Authors: Dongyoon Han, Junsuk Choe, Seonghyeok Chun, John Joon Young Chung, Minsuk Chang, Sangdoo Yun, Jean Y. Song, Seong Joon Oh

    Abstract: Supervised learning of image classifiers distills human knowledge into a parametric model through pairs of images and corresponding labels (X,Y). We argue that this simple and widely used representation of human knowledge neglects rich auxiliary information from the annotation procedure, such as the time-series of mouse traces and clicks left after image selection. Our insight is that such annotat… ▽ More

    Submitted 26 July, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: Code & data at https://github.com/naver-ai/NeglectedFreeLunch. To be presented at ICCV'23

  42. arXiv:2303.12034  [pdf, other

    astro-ph.HE

    The first X-ray polarimetric observation of the black hole binary LMC X-1

    Authors: Jakub Podgorny, Lorenzo Marra, Fabio Muleri, Nicole Rodriguez Cavero, Ajay Ratheesh, Michal Dovciak, Romana Mikusincova, Maimouna Brigitte, James F. Steiner, Alexandra Veledina, Stefano Bianchi, Henric Krawczynski, Jiri Svoboda, Philip Kaaret, Giorgio Matt, Javier A. Garcia, Pierre-Olivier Petrucci, Alexander A. Lutovinov, Andrey N. Semena, Alessandro Di Marco, Michela Negro, Martin C. Weisskopf, Adam Ingram, Juri Poutanen, Banfsheh Beheshtipour , et al. (86 additional authors not shown)

    Abstract: We report on an X-ray polarimetric observation of the high-mass X-ray binary LMC X-1 in the high/soft state, obtained by the Imaging X-ray Polarimetry Explorer (IXPE) in October 2022. The measured polarization is below the minimum detectable polarization of 1.1 per cent (at the 99 per cent confidence level). Simultaneously, the source was observed with the NICER, NuSTAR and SRG/ART-XC instruments,… ▽ More

    Submitted 9 October, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: 12 pages, 9 figures, 4 tables. Accepted for publication in MNRAS

  43. arXiv:2303.11916  [pdf, other

    cs.CV cs.IR

    CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion

    Authors: Geonmo Gu, Sanghyuk Chun, Wonjae Kim, HeeJae Jun, Yoohoon Kang, Sangdoo Yun

    Abstract: This paper proposes a novel diffusion-based model, CompoDiff, for solving zero-shot Composed Image Retrieval (ZS-CIR) with latent diffusion. This paper also introduces a new synthetic dataset, named SynthTriplets18M, with 18.8 million reference images, conditions, and corresponding target image triplets to train CIR models. CompoDiff and SynthTriplets18M tackle the shortages of the previous CIR ap… ▽ More

    Submitted 25 February, 2024; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: First two authors contributed equally; 28 pages, 6.2MB

  44. arXiv:2303.11114  [pdf, other

    cs.CV

    SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage

    Authors: Song Park, Sanghyuk Chun, Byeongho Heo, Wonjae Kim, Sangdoo Yun

    Abstract: We need billion-scale images to achieve more generalizable and ground-breaking vision models, as well as massive dataset storage to ship the images (e.g., the LAION-4B dataset needs 240TB storage space). However, it has become challenging to deal with unlimited dataset storage with limited storage infrastructure. A number of storage-efficient training methods have been proposed to tackle the probl… ▽ More

    Submitted 11 September, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: ICCV 2023; First two authors contributed equally; code url: https://github.com/naver-ai/seit; 17 pages, 1.2MB

  45. arXiv:2303.00442  [pdf, other

    cs.LG cs.AI cs.CY

    Re-weighting Based Group Fairness Regularization via Classwise Robust Optimization

    Authors: Sangwon Jung, Taeeon Park, Sanghyuk Chun, Taesup Moon

    Abstract: Many existing group fairness-aware training methods aim to achieve the group fairness by either re-weighting underrepresented groups based on certain rules or using weakly approximated surrogates for the fairness metrics in the objective as regularization terms. Although each of the learning schemes has its own strength in terms of applicability or performance, respectively, it is difficult for an… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

  46. arXiv:2212.04319  [pdf, other

    cs.CV cs.AI

    On the Robustness of Normalizing Flows for Inverse Problems in Imaging

    Authors: Seongmin Hong, Inbum Park, Se Young Chun

    Abstract: Conditional normalizing flows can generate diverse image samples for solving inverse problems. Most normalizing flows for inverse problems in imaging employ the conditional affine coupling layer that can generate diverse images quickly. However, unintended severe artifacts are occasionally observed in the output of them. In this work, we address this critical issue by investigating the origins of… ▽ More

    Submitted 16 March, 2023; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: 16 pages

  47. arXiv:2212.04114  [pdf, other

    cs.CV

    Group Generalized Mean Pooling for Vision Transformer

    Authors: Byungsoo Ko, Han-Gyu Kim, Byeongho Heo, Sangdoo Yun, Sanghyuk Chun, Geonmo Gu, Wonjae Kim

    Abstract: Vision Transformer (ViT) extracts the final representation from either class token or an average of all patch tokens, following the architecture of Transformer in Natural Language Processing (NLP) or Convolutional Neural Networks (CNNs) in computer vision. However, studies for the best way of aggregating the patch tokens are still limited to average pooling, while widely-used pooling strategies, s… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

  48. arXiv:2211.16374  [pdf, other

    cs.CV cs.AI

    DATID-3D: Diversity-Preserved Domain Adaptation Using Text-to-Image Diffusion for 3D Generative Model

    Authors: Gwanghyun Kim, Se Young Chun

    Abstract: Recent 3D generative models have achieved remarkable performance in synthesizing high resolution photorealistic images with view consistency and detailed 3D shapes, but training them for diverse domains is challenging since it requires massive training images and their camera distribution information. Text-guided domain adaptation methods have shown impressive performance on converting the 2D gene… ▽ More

    Submitted 30 March, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: Accepted to CVPR 2023, Project page: https://gwang-kim.github.io/datid_3d/

  49. arXiv:2211.08670  [pdf, ps, other

    physics.flu-dyn

    Experimental observation of a confined bubble moving in shear-thinning fluids

    Authors: SungGyu Chun, Bingqiang Ji, Zhengyu Yang, Vinit Kumar Malik, Jie Feng

    Abstract: The motion of a long gas bubble in a confined capillary tube is ubiquitous in a wide range of engineering and biological applications. While the understanding of the deposited thin viscous film near the tube wall in Newtonian fluids is well developed, the deposition dynamics in commonly encountered non-Newtonian fluids remains much less studied. Here, we investigate the dynamics of a confined bubb… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  50. arXiv:2211.05910  [pdf, other

    eess.IV cs.CV

    Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Maurizio Denna, Abdel Younes, Ganzorig Gankhuyag, Jingang Huh, Myeong Kyun Kim, Kihwan Yoon, Hyeon-Cheol Moon, Seungho Lee, Yoonsik Choe, Jinwoo Jeong, Sungjei Kim, Maciej Smyl, Tomasz Latkowski, Pawel Kubik, Michal Sokolski, Yujie Ma, Jiahao Chao, Zhou Zhou, Hongfan Gao, Zhengfeng Yang, Zhenbing Zeng, Zhengyang Zhuge, Chenghua Li , et al. (71 additional authors not shown)

    Abstract: Image super-resolution is a common task on mobile and IoT devices, where one often needs to upscale and enhance low-resolution images and video frames. While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints. In this Mobile AI challenge, we address this problem and propose… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2105.07825, arXiv:2105.08826, arXiv:2211.04470, arXiv:2211.03885, arXiv:2211.05256