Skip to main content

Showing 1–50 of 106 results for author: Min, K

  1. arXiv:2406.12354  [pdf, other

    cs.CL

    Cross-Lingual Unlearning of Selective Knowledge in Multilingual Language Models

    Authors: Minseok Choi, Kyunghyun Min, Jaegul Choo

    Abstract: Pretrained language models memorize vast amounts of information, including private and copyrighted data, raising significant safety concerns. Retraining these models after excluding sensitive data is prohibitively expensive, making machine unlearning a viable, cost-effective alternative. Previous research has focused on machine unlearning for monolingual models, but we find that unlearning in one… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 15 pages, 5 figures

  2. arXiv:2406.09462  [pdf, other

    cs.CV cs.AI

    SViTT-Ego: A Sparse Video-Text Transformer for Egocentric Video

    Authors: Hector A. Valdez, Kyle Min, Subarna Tripathi

    Abstract: Pretraining egocentric vision-language models has become essential to improving downstream egocentric video-text tasks. These egocentric foundation models commonly use the transformer architecture. The memory footprint of these models during pretraining can be substantial. Therefore, we pretrain SViTT-Ego, the first sparse egocentric video-text transformer model integrating edge and node sparsific… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2406.02631  [pdf, other

    cs.CV

    Contrastive Language Video Time Pre-training

    Authors: Hengyue Liu, Kyle Min, Hector A. Valdez, Subarna Tripathi

    Abstract: We introduce LAVITI, a novel approach to learning language, video, and temporal representations in long-form videos via contrastive learning. Different from pre-training on video-text pairs like EgoVLP, LAVITI aims to align language, video, and temporal features by extracting meaningful moments in untrimmed videos. Our model employs a set of learnable moment queries to decode clip-level visual, la… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: CVPR EgoVis Workshop 2024 extended abstract

  4. arXiv:2405.16341  [pdf, other

    cs.CV

    R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model

    Authors: Changhoon Kim, Kyle Min, Yezhou Yang

    Abstract: In the evolving landscape of text-to-image (T2I) diffusion models, the remarkable capability to generate high-quality images from textual descriptions faces challenges with the potential misuse of reproducing sensitive content. To address this critical issue, we introduce Robust Adversarial Concept Erase (RACE), a novel approach designed to mitigate these risks by enhancing the robustness of conce… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  5. arXiv:2404.15650  [pdf, other

    cs.CL

    Return of EM: Entity-driven Answer Set Expansion for QA Evaluation

    Authors: Dongryeol Lee, Minwoo Lee, Kyungmin Min, Joonsuk Park, Kyomin Jung

    Abstract: Recently, directly using large language models (LLMs) has been shown to be the most reliable method to evaluate QA models. However, it suffers from limited interpretability, high cost, and environmental harm. To address these, we propose to use soft EM with entity-driven answer set expansion. Our approach expands the gold answer set to include diverse surface forms, based on the observation that t… ▽ More

    Submitted 11 June, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: Under Review (9 pages, 4 figures)

  6. arXiv:2401.13206  [pdf, ps, other

    cs.LG

    Self-Improving Interference Management Based on Deep Learning With Uncertainty Quantification

    Authors: Hyun-Suk Lee, Do-Yup Kim, Kyungsik Min

    Abstract: This paper presents a groundbreaking self-improving interference management framework tailored for wireless communications, integrating deep learning with uncertainty quantification to enhance overall system performance. Our approach addresses the computational challenges inherent in traditional optimization-based algorithms by harnessing deep learning models to predict optimal interference manage… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

  7. arXiv:2312.03391  [pdf, other

    cs.CV

    Action Scene Graphs for Long-Form Understanding of Egocentric Videos

    Authors: Ivan Rodin, Antonino Furnari, Kyle Min, Subarna Tripathi, Giovanni Maria Farinella

    Abstract: We present Egocentric Action Scene Graphs (EASGs), a new representation for long-form understanding of egocentric videos. EASGs extend standard manually-annotated representations of egocentric videos, such as verb-noun action labels, by providing a temporally evolving graph-based description of the actions performed by the camera wearer, including interacted objects, their relationships, and how a… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  8. arXiv:2311.04468  [pdf

    eess.IV q-bio.NC

    A human brain atlas of chi-separation for normative iron and myelin distributions

    Authors: Kyeongseon Min, Beomseok Sohn, Woo Jung Kim, Chae Jung Park, Soohwa Song, Dong Hoon Shin, Kyung Won Chang, Na-Young Shin, Minjun Kim, Hyeong-Geol Shin, Phil Hyu Lee, Jongho Lee

    Abstract: Iron and myelin are primary susceptibility sources in the human brain. These substances are essential for healthy brain, and their abnormalities are often related to various neurological disorders. Recently, an advanced susceptibility mapping technique, which is referred to as chi-separation, has been proposed, successfully disentangling paramagnetic iron from diamagnetic myelin. This method opene… ▽ More

    Submitted 2 April, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: 19 pages, 9 figures

  9. arXiv:2311.03574  [pdf, ps, other

    cs.DB

    Fuzzy Relational Databases via Associative Arrays

    Authors: Kevin Min, Hayden Jananthan, Jeremy Kepner

    Abstract: The increasing rise in artificial intelligence has made the use of imprecise language in computer programs like ChatGPT more prominent. Fuzzy logic addresses this form of imprecise language by introducing the concept of fuzzy sets, where elements belong to the set with a certain membership value (called the fuzzy value). This paper combines fuzzy data with relational algebra to provide the mathema… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 5 pages, accepted to IEEE URTC 2023

  10. arXiv:2310.20178  [pdf, other

    cs.LG cs.AI

    Learning to Discover Skills through Guidance

    Authors: Hyunseung Kim, Byungkun Lee, Hojoon Lee, Dongyoon Hwang, Sejik Park, Kyushik Min, Jaegul Choo

    Abstract: In the field of unsupervised skill discovery (USD), a major challenge is limited exploration, primarily due to substantial penalties when skills deviate from their initial trajectories. To enhance exploration, recent methodologies employ auxiliary rewards to maximize the epistemic uncertainty or entropy of states. However, we have identified that the effectiveness of these rewards declines as the… ▽ More

    Submitted 1 November, 2023; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: 29 pages, 18 figures, published at NeurIPS 2023

  11. Towards Validating Long-Term User Feedbacks in Interactive Recommendation Systems

    Authors: Hojoon Lee, Dongyoon Hwang, Kyushik Min, Jaegul Choo

    Abstract: Interactive Recommender Systems (IRSs) have attracted a lot of attention, due to their ability to model interactive processes between users and recommender systems. Numerous approaches have adopted Reinforcement Learning (RL) algorithms, as these can directly maximize users' cumulative rewards. In IRS, researchers commonly utilize publicly available review datasets to compare and evaluate algorith… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted to SIGIR'22

  12. arXiv:2306.10608  [pdf, other

    cs.CV cs.SD eess.AS

    STHG: Spatial-Temporal Heterogeneous Graph Learning for Advanced Audio-Visual Diarization

    Authors: Kyle Min

    Abstract: This report introduces our novel method named STHG for the Audio-Visual Diarization task of the Ego4D Challenge 2023. Our key innovation is that we model all the speakers in a video using a single, unified heterogeneous graph learning framework. Unlike previous approaches that require a separate component solely for the camera wearer, STHG can jointly detect the speech activities of all people inc… ▽ More

    Submitted 31 October, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

    Comments: Validation report for the Ego4D challenge at CVPR 2023

  13. arXiv:2306.04744  [pdf, other

    cs.CV

    WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models

    Authors: Changhoon Kim, Kyle Min, Maitreya Patel, Sheng Cheng, Yezhou Yang

    Abstract: The rapid advancement of generative models, facilitating the creation of hyper-realistic images from textual descriptions, has concurrently escalated critical societal concerns such as misinformation. Although providing some mitigation, traditional fingerprinting mechanisms fall short in attributing responsibility for the malicious use of synthetic images. This paper introduces a novel approach to… ▽ More

    Submitted 24 April, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted at CVPR 2024

  14. arXiv:2305.09691  [pdf, other

    cs.LG cs.AI stat.ME

    Evaluation Strategy of Time-series Anomaly Detection with Decay Function

    Authors: Yongwan Gim, Kyushik Min

    Abstract: Recent algorithms of time-series anomaly detection have been evaluated by applying a Point Adjustment (PA) protocol. However, the PA protocol has a problem of overestimating the performance of the detection algorithms because it only depends on the number of detected abnormal segments and their size. We propose a novel evaluation protocol called the Point-Adjusted protocol with decay function (PAd… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: 20 pages with references and appendix

  15. arXiv:2304.08809  [pdf, other

    cs.CV

    SViTT: Temporal Learning of Sparse Video-Text Transformers

    Authors: Yi Li, Kyle Min, Subarna Tripathi, Nuno Vasconcelos

    Abstract: Do video-text transformers learn to model temporal relationships across frames? Despite their immense capacity and the abundance of multimodal training data, recent work has revealed the strong tendency of video-text models towards frame-based spatial representations, while temporal reasoning remains largely unsolved. In this work, we identify several key challenges in temporal learning of video-t… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: CVPR 2023

  16. arXiv:2304.03293  [pdf

    q-bio.QM physics.bio-ph physics.data-an

    Prediction of Protein Aggregation Propensity via Data-driven Approaches

    Authors: Seungpyo Kang, Minseon Kim, Jiwon Sun, Myeonghun Lee, Kyoungmin Min

    Abstract: Protein aggregation occurs when misfolded or unfolded proteins physically bind together, and can promote the development of various amyloid diseases. This study aimed to construct surrogate models for predicting protein aggregation via data-driven methods using two types of databases. First, an aggregation propensity score database was constructed by calculating the scores for protein structures i… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

  17. arXiv:2304.00733  [pdf, other

    cs.CV

    Unbiased Scene Graph Generation in Videos

    Authors: Sayak Nag, Kyle Min, Subarna Tripathi, Amit K. Roy Chowdhury

    Abstract: The task of dynamic scene graph generation (SGG) from videos is complicated and challenging due to the inherent dynamics of a scene, temporal fluctuation of model predictions, and the long-tailed distribution of the visual relationships in addition to the already existing challenges in image-based SGG. Existing methods for dynamic SGG have primarily focused on capturing spatio-temporal context usi… ▽ More

    Submitted 29 June, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: Published in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023

  18. arXiv:2303.16209  [pdf

    q-bio.QM cs.LG

    AmorProt: Amino Acid Molecular Fingerprints Repurposing based Protein Fingerprint

    Authors: Myeonghun Lee, Kyoungmin Min

    Abstract: As protein therapeutics play an important role in almost all medical fields, numerous studies have been conducted on proteins using artificial intelligence. Artificial intelligence has enabled data driven predictions without the need for expensive experiments. Nevertheless, unlike the various molecular fingerprint algorithms that have been developed, protein fingerprint algorithms have rarely been… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  19. arXiv:2301.04685  [pdf, other

    cs.CV

    SHUNIT: Style Harmonization for Unpaired Image-to-Image Translation

    Authors: Seokbeom Song, Suhyeon Lee, Hongje Seong, Kyoungwon Min, Euntai Kim

    Abstract: We propose a novel solution for unpaired image-to-image (I2I) translation. To translate complex images with a wide range of objects to a different domain, recent approaches often use the object annotations to perform per-class source-to-target style mapping. However, there remains a point for us to exploit in the I2I. An object in each class consists of multiple components, and all the sub-object… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: Accepted to AAAI 2023

  20. arXiv:2210.07764  [pdf, other

    cs.CV cs.SD eess.AS

    Intel Labs at Ego4D Challenge 2022: A Better Baseline for Audio-Visual Diarization

    Authors: Kyle Min

    Abstract: This report describes our approach for the Audio-Visual Diarization (AVD) task of the Ego4D Challenge 2022. Specifically, we present multiple technical improvements over the official baselines. First, we improve the detection performance of the camera wearer's voice activity by modifying the training scheme of its model. Second, we discover that an off-the-shelf voice activity detection model can… ▽ More

    Submitted 29 October, 2023; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: Validation report for the Ego4D challenge at ECCV 2022

  21. arXiv:2207.07783  [pdf, other

    cs.CV

    Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection

    Authors: Kyle Min, Sourya Roy, Subarna Tripathi, Tanaya Guha, Somdeb Majumdar

    Abstract: Active speaker detection (ASD) in videos with multiple speakers is a challenging task as it requires learning effective audiovisual features and spatial-temporal correlations over long temporal windows. In this paper, we present SPELL, a novel spatial-temporal graph learning framework that can solve complex tasks such as ASD. To this end, each person in a video frame is first encoded in a unique n… ▽ More

    Submitted 12 October, 2022; v1 submitted 15 July, 2022; originally announced July 2022.

    Comments: ECCV 2022 camera ready (Supplementary videos: on ECVA soon). This paper supersedes arXiv:2112.01479

  22. arXiv:2207.04624  [pdf, other

    cs.CV

    Hierarchical Latent Structure for Multi-Modal Vehicle Trajectory Forecasting

    Authors: Dooseop Choi, KyoungWook Min

    Abstract: Variational autoencoder (VAE) has widely been utilized for modeling data distributions because it is theoretically elegant, easy to train, and has nice manifold representations. However, when applied to image reconstruction and synthesis tasks, VAE shows the limitation that the generated sample tends to be blurry. We observe that a similar problem, in which the generated trajectory is located betw… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: ECCV2022

  23. arXiv:2204.04892  [pdf, other

    cs.LG cs.AI

    JORLDY: a fully customizable open source framework for reinforcement learning

    Authors: Kyushik Min, Hyunho Lee, Kwansu Shin, Taehak Lee, Hojoon Lee, Jinwon Choi, Sungho Son

    Abstract: Recently, Reinforcement Learning (RL) has been actively researched in both academic and industrial fields. However, there exist only a few RL frameworks which are developed for researchers or students who want to study RL. In response, we propose an open-source RL framework "Join Our Reinforcement Learning framework for Developing Yours" (JORLDY). JORLDY provides more than 20 widely used RL algori… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: 12 pages, 6 figures

  24. arXiv:2203.14565   

    cs.CV

    Style-Guided Domain Adaptation for Face Presentation Attack Detection

    Authors: Young-Eun Kim, Woo-Jeoung Nam, Kyungseo Min, Seong-Whan Lee

    Abstract: Domain adaptation (DA) or domain generalization (DG) for face presentation attack detection (PAD) has attracted attention recently with its robustness against unseen attack scenarios. Existing DA/DG-based PAD methods, however, have not yet fully explored the domain-specific style information that can provide knowledge regarding attack styles (e.g., materials, background, illumination and resolutio… ▽ More

    Submitted 19 June, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

    Comments: With the agreement of all authors, we would like to withdraw the manuscript. For lack of some experiments, a part of important claims cannot stand solidly. We need to further carry out experiments, and reconsider the rationality of these claims

  25. arXiv:2203.14507  [pdf, other

    cs.CL

    ANNA: Enhanced Language Representation for Question Answering

    Authors: Changwook Jun, Hansol Jang, Myoseop Sim, Hyun Kim, Jooyoung Choi, Kyungkoo Min, Kyunghoon Bae

    Abstract: Pre-trained language models have brought significant improvements in performance in a variety of natural language processing tasks. Most existing models performing state-of-the-art results have shown their approaches in the separate perspectives of data processing, pre-training tasks, neural network modeling, or fine-tuning. In this paper, we demonstrate how the approaches affect performance indiv… ▽ More

    Submitted 3 April, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

    Comments: 11 pages, 3 figures

    Journal ref: ACL 2022 Workshop RepL4NLP Submission

  26. arXiv:2202.07476  [pdf

    cs.LG physics.chem-ph

    MGCVAE: Multi-objective Inverse Design via Molecular Graph Conditional Variational Autoencoder

    Authors: Myeonghun Lee, Kyoungmin Min

    Abstract: The ultimate goal of various fields is to directly generate molecules with desired properties, such as finding water-soluble molecules in drug development and finding molecules suitable for organic light-emitting diode (OLED) or photosensitizers in the field of development of new organic materials. In this respect, this study proposes a molecular graph generative model based on the autoencoder for… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: preprint, under review

  27. arXiv:2202.06763  [pdf

    cond-mat.mtrl-sci cs.AI

    Machine Learning-Aided Discovery of Superionic Solid-State Electrolyte for Li-Ion Batteries

    Authors: Seungpyo Kang, Minseon Kim, Kyoungmin Min

    Abstract: Li-Ion Solid-State Electrolytes (Li-SSEs) are a promising solution that resolves the critical issues of conventional Li-Ion Batteries (LIBs) such as poor ionic conductivity, interfacial instability, and dendrites growth. In this study, a platform consisting of a high-throughput screening and a machine-learning surrogate model for discovering superionic Li-SSEs among 20,237 Li-containing materials… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: preprint, under review

  28. arXiv:2201.06223  [pdf, other

    cs.CL

    Korean-Specific Dataset for Table Question Answering

    Authors: Changwook Jun, Jooyoung Choi, Myoseop Sim, Hyun Kim, Hansol Jang, Kyungkoo Min

    Abstract: Existing question answering systems mainly focus on dealing with text data. However, much of the data produced daily is stored in the form of tables that can be found in documents and relational databases, or on the web. To solve the task of question answering over tables, there exist many datasets for table question answering written in English, but few Korean datasets. In this paper, we demonstr… ▽ More

    Submitted 1 May, 2022; v1 submitted 17 January, 2022; originally announced January 2022.

    Comments: 7 pages including references and 4 figures

  29. arXiv:2112.01479  [pdf, other

    cs.CV

    Learning Spatial-Temporal Graphs for Active Speaker Detection

    Authors: Sourya Roy, Kyle Min, Subarna Tripathi, Tanaya Guha, Somdeb Majumdar

    Abstract: We address the problem of active speaker detection through a new framework, called SPELL, that learns long-range multimodal graphs to encode the inter-modal relationship between audio and visual data. We cast active speaker detection as a node classification task that is aware of longer-term dependencies. We first construct a graph from a video so that each node corresponds to one person. Nodes re… ▽ More

    Submitted 3 December, 2021; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: 10 pages

  30. arXiv:2105.14891  [pdf, other

    cs.CV

    ACNet: Mask-Aware Attention with Dynamic Context Enhancement for Robust Acne Detection

    Authors: Kyungseo Min, Gun-Hee Lee, Seong-Whan Lee

    Abstract: Computer-aided diagnosis has recently received attention for its advantage of low cost and time efficiency. Although deep learning played a major role in the recent success of acne detection, there are still several challenges such as color shift by inconsistent illumination, variation in scales, and high density distribution. To address these problems, we propose an acne detection network which c… ▽ More

    Submitted 17 December, 2021; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: 6 pages, 5 figures, SMC 2021

  31. arXiv:2104.10888  [pdf

    cond-mat.mtrl-sci

    Machine learning aided materials design platform for predicting the mechanical properties of Na-ion solid-state electrolytes

    Authors: Junho Jo, Eunseong Choi, Minseon Kim, Kyoungmin Min

    Abstract: Na-ion solid-state electrolytes (Na-SSE) exhibit high potential for electrical energy storage owing to their high energy densities and low manufacturing cost. However, their mechanical properties critical to maintain structural stability at the interface are still insufficiently understood. In this study, a machine learning based regression model was developed for predicting the mechanical propert… ▽ More

    Submitted 22 April, 2021; originally announced April 2021.

    Comments: Submitted and Under Review

    Journal ref: ACS Appl. Energy Mater. 2021

  32. arXiv:2104.04025  [pdf, other

    physics.chem-ph

    A study of the decoherence correction derived from the exact factorization approach for non-adiabatic dynamics

    Authors: Patricia Vindel-Zandbergen, Lea M. Ibele, Jong-Kwon Ha, Seung Kyu Min, Basile F. E. Curchod, Neepa T. Maitra

    Abstract: We present a detailed study of the decoherence correction to surface-hopping that was recently derived from the exact factorization approach. Ab initio multiple spawning calculations that use the same initial conditions and same electronic structure method are used as a reference for three molecules: ethylene, methaniminium cation, and fulvene, for which non-adiabatic dynamics follows a photo-exci… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

  33. Construction of a far ultraviolet all sky map from an incomplete survey: Application of a deep learning algorithm

    Authors: Young-Soo Jo, Yeon-Ju Choi, Min-Gi Kim, Chang-Ho Woo, Kyoung-Wook Min, Kwang-Il Seon

    Abstract: We constructed a far ultraviolet (FUV) all sky map based on observations from the Far Ultraviolet Imaging Spectrograph (FIMS) aboard the Korean microsatellite STSAT-1. For the ~20% of the sky not covered by FIMS observations, predictions from a deep artificial neural network were used. Seven datasets were chosen for input parameters, including five all sky maps of H-alpha, E(B-V), N(HI), and two X… ▽ More

    Submitted 10 January, 2021; originally announced January 2021.

    Comments: 10 pages, 12 figures

  34. arXiv:2011.03920  [pdf, other

    cs.CV

    Integrating Human Gaze into Attention for Egocentric Activity Recognition

    Authors: Kyle Min, Jason J. Corso

    Abstract: It is well known that human gaze carries significant information about visual attention. However, there are three main difficulties in incorporating the gaze data in an attention mechanism of deep neural networks: 1) the gaze fixation points are likely to have measurement errors due to blinking and rapid eye movements; 2) it is unclear when and how much the gaze data is correlated with visual atte… ▽ More

    Submitted 8 November, 2020; originally announced November 2020.

    Comments: WACV 2021 camera ready (Supplementary material: on CVF soon)

  35. Effects of transient non-thermal particles on the big bang nucleosynthesis

    Authors: Tae-Sun Park, Kyung Joo Min, Seung-Woo Hong

    Abstract: The effects of introducing a small amount of non-thermal distribution (NTD) of elements in big bang nucleosynthesis (BBN) are studied by allowing a fraction of the NTD to be time-dependent so that it contributes only during a certain period of the BBN evolution. The fraction is modeled as a Gaussian-shaped function of $\log(T)$, where $T$ is the temperature of the cosmos, and thus the function is… ▽ More

    Submitted 10 September, 2020; originally announced September 2020.

    Journal ref: International Journal of Modern Physics E, 29 (2020) 2050012

  36. arXiv:2008.04574  [pdf, other

    eess.AS cs.LG cs.SD

    Bunched LPCNet : Vocoder for Low-cost Neural Text-To-Speech Systems

    Authors: Ravichander Vipperla, Sangjun Park, Kihyun Choo, Samin Ishtiaq, Kyoungbo Min, Sourav Bhattacharya, Abhinav Mehrotra, Alberto Gil C. P. Ramos, Nicholas D. Lane

    Abstract: LPCNet is an efficient vocoder that combines linear prediction and deep neural network modules to keep the computational complexity low. In this work, we present two techniques to further reduce it's complexity, aiming for a low-cost LPCNet vocoder-based neural Text-to-Speech (TTS) System. These techniques are: 1) Sample-bunching, which allows LPCNet to generate more than one audio sample per infe… ▽ More

    Submitted 11 August, 2020; originally announced August 2020.

    Comments: Interspeech 2020

  37. arXiv:2007.06643  [pdf, other

    cs.CV

    Adversarial Background-Aware Loss for Weakly-supervised Temporal Activity Localization

    Authors: Kyle Min, Jason J. Corso

    Abstract: Temporally localizing activities within untrimmed videos has been extensively studied in recent years. Despite recent advances, existing methods for weakly-supervised temporal activity localization struggle to recognize when an activity is not occurring. To address this issue, we propose a novel method named A2CL-PT. Two triplets of the feature space are considered in our approach: one triplet is… ▽ More

    Submitted 13 July, 2020; originally announced July 2020.

    Comments: ECCV 2020 camera ready (Supplementary material: on ECVA soon)

  38. arXiv:2007.03877  [pdf, other

    cs.CV

    PathGAN: Local Path Planning with Attentive Generative Adversarial Networks

    Authors: Dooseop Choi, Seung-jun Han, Kyoungwook Min, Jeongdan Choi

    Abstract: To achieve autonomous driving without high-definition maps, we present a model capable of generating multiple plausible paths from egocentric images for autonomous vehicles. Our generative model comprises two neural networks: the feature extraction network (FEN) and path generation network (PGN). The FEN extracts meaningful features from an egocentric image, whereas the PGN generates multiple path… ▽ More

    Submitted 2 March, 2021; v1 submitted 7 July, 2020; originally announced July 2020.

  39. arXiv:1908.05786  [pdf, other

    cs.CV

    TASED-Net: Temporally-Aggregating Spatial Encoder-Decoder Network for Video Saliency Detection

    Authors: Kyle Min, Jason J. Corso

    Abstract: TASED-Net is a 3D fully-convolutional network architecture for video saliency detection. It consists of two building blocks: first, the encoder network extracts low-resolution spatiotemporal features from an input clip of several consecutive frames, and then the following prediction network decodes the encoded features spatially while aggregating all the temporal information. As a result, a single… ▽ More

    Submitted 15 August, 2019; originally announced August 2019.

    Comments: ICCV 2019 camera ready (Supplementary material: on CVF soon)

  40. arXiv:1907.04525  [pdf, other

    cs.CV

    Regularizing Neural Networks for Future Trajectory Prediction via Inverse Reinforcement Learning Framework

    Authors: Dooseop Choi, Kyoungwook Min, Jeongdan Choi

    Abstract: Predicting distant future trajectories of agents in a dynamic scene is not an easy problem because the future trajectory of an agent is affected by not only his/her past trajectory but also the scene contexts. To tackle this problem, we propose a model based on recurrent neural networks (RNNs) and a novel method for training the model. The proposed model is based on an encoder-decoder architecture… ▽ More

    Submitted 25 December, 2019; v1 submitted 10 July, 2019; originally announced July 2019.

  41. Global distribution of far-ultraviolet emissions from highly ionized gas in the Milky Way

    Authors: Young-Soo Jo, Kwang-il Seon, Kyoung-Wook Min, Jerry Edelstein, Wonyong Han

    Abstract: We present all-sky maps of two major FUV cooling lines, C IV and O VI, of highly ionized gas to investigate the nature of the transition-temperature gas. From the extinction-corrected line intensities of C IV and O VI, we calculated the gas temperature and the emission measure of the transition-temperature gas assuming isothermal plasma in the collisional ionization equilibrium. The gas temperatur… ▽ More

    Submitted 22 May, 2019; v1 submitted 19 May, 2019; originally announced May 2019.

    Comments: 20 pages, 16 figures, Accepted to the Astrophysical Journal Supplement Series

  42. Comparison of the extraplanar H$α$ and UV emissions in the halos of nearby edge-on spiral galaxies

    Authors: Young-Soo Jo, Kwang-il Seon, Jong-Ho Shinn, Yujin Yang, Dukhang Lee, Kyoung-Wook Min

    Abstract: We compare vertical profiles of the extraplanar H$α$ emission to those of the UV emission for 38 nearby edge-on late-type galaxies. It is found that detection of the "diffuse" extraplanar dust (eDust), traced by the vertically extended, scattered UV starlight, always coincides with the presence of the extraplanar H$α$ emission. A strong correlation between the scale heights of the extraplanar H… ▽ More

    Submitted 18 June, 2018; originally announced June 2018.

    Comments: 25 pages; 6 figures; It was accepted for publication in The Astrophysical Journal at June 7, 2018

  43. arXiv:1806.02872  [pdf

    cond-mat.mtrl-sci

    Idiosyncratic Approach to Visualize Degradation of Black Phosphorus

    Authors: Bilal Abbas Naqvi, Muhammad Arslan Shehzad, Janghwan Cha, Kyung Ah Min, M. Farooq Khan, Sajjad Hussain, Seo Yongho, Suklyun Hong, Eom Jonghwa, Jung Jongwan

    Abstract: Black Phosphorus (BP) is an excellent material for post graphene era due to its layer dependent band gap, high mobility and high Ion/Ioff. However, its poor stability in ambient poses a great challenge in its practical and long-term usage. Optical visualization of oxidized BP is the key and foremost step for its successful passivation from the ambience. Here, we have done a systematic study of the… ▽ More

    Submitted 7 June, 2018; originally announced June 2018.

    Report number: 12966

    Journal ref: https://www.nature.com/articles/s41598-018-31067-4 2018

  44. arXiv:1805.06266  [pdf, other

    cs.CL

    A Unified Model for Extractive and Abstractive Summarization using Inconsistency Loss

    Authors: Wan-Ting Hsu, Chieh-Kai Lin, Ming-Ying Lee, Kerui Min, Jing Tang, Min Sun

    Abstract: We propose a unified model combining the strength of extractive and abstractive summarization. On the one hand, a simple extractive model can obtain sentence-level attention with high ROUGE scores but less readable. On the other hand, a more complicated abstractive model can obtain word-level dynamic attention to generate a more readable paragraph. In our model, sentence-level attention is used to… ▽ More

    Submitted 5 July, 2018; v1 submitted 16 May, 2018; originally announced May 2018.

    Comments: 9 pages, ACL 2018 oral. Project page: https://hsuwanting.github.io/unified_summ/. Code: https://github.com/HsuWanTing/unified-summarization

  45. arXiv:1804.00722  [pdf, other

    cs.CV cs.LG stat.ML

    Hierarchical Novelty Detection for Visual Object Recognition

    Authors: Kibok Lee, Kimin Lee, Kyle Min, Yuting Zhang, Jinwoo Shin, Honglak Lee

    Abstract: Deep neural networks have achieved impressive success in large-scale visual object recognition tasks with a predefined set of classes. However, recognizing objects of novel classes unseen during training still remains challenging. The problem of detecting such novel classes has been addressed in the literature, but most prior works have focused on providing simple binary or regressive decisions, e… ▽ More

    Submitted 15 June, 2018; v1 submitted 2 April, 2018; originally announced April 2018.

    Comments: CVPR 2018

  46. arXiv:1803.06008  [pdf, other

    physics.med-ph

    A unified image reconstruction framework for quantitative dual- and triple-energy CT imaging of material compositions

    Authors: Wei Zhao, Don Vernekohl, Fei Han, Bin Han, Hao Peng, Lei Xing, James K Min

    Abstract: Many clinical applications depend critically on the accurate differentiation and classification of different types of materials in patient anatomy. This work introduces a unified framework for accurate nonlinear material decomposition and applies it, for the first time, in the concept of triple-energy CT (TECT) for enhanced material differentiation and classification as well as dual-energy CT. The… ▽ More

    Submitted 15 March, 2018; originally announced March 2018.

    Comments: 24 pages, 11 figures. Accepted by Medical Physics

  47. arXiv:1710.07877  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci physics.app-ph

    Controlled Electrochemical Intercalation of Graphene/h-BN van der Waals Heterostructures

    Authors: S. Y. Frank Zhao, Giselle A. Elbaz, D. Kwabena Bediako, Cyndia Yu, Dmitri K. Efetov, Yinsheng Guo, Jayakanth Ravichandran, Kyung-Ah Min, Suklyun Hong, Takashi Taniguchi, Kenji Watanabe, Louis E. Brus, Xavier Roy, Philip Kim

    Abstract: Electrochemical intercalation is a powerful method for tuning the electronic properties of layered solids. In this work, we report an electro-chemical strategy to controllably intercalate lithium ions into a series of van der Waals (vdW) heterostructures built by sandwiching graphene between hexagonal boron nitride (h-BN). We demonstrate that encapsulating graphene with h-BN eliminates parasitic s… ▽ More

    Submitted 21 October, 2017; originally announced October 2017.

  48. arXiv:1709.07549  [pdf

    cond-mat.mes-hall

    Asymmetric Electron-Hole Decoherence in Ion-Gated Epitaxial Graphene

    Authors: Kil-Joon Min, Jaesung Park, Wan-Seop Kim, Dong-Hun Chae

    Abstract: We report on asymmetric electron-hole decoherence in epitaxial graphene gated by an ionic liquid. The observed negative magnetoresistance near zero magnetic field for different gate voltages, analyzed in the framework of weak localization, gives rise to distinct electron-hole decoherence. The hole decoherence rate increases prominently with decreasing negative gate voltage while the electron decoh… ▽ More

    Submitted 21 September, 2017; originally announced September 2017.

  49. arXiv:1709.02233  [pdf, other

    cs.NI

    EpiFi: An In-Home Sensor Network Architecture for Epidemiological Studies

    Authors: Philip Lundrigan, Kyeong Min, Neal Patwari, Sneha Kasera, Kerry Kelly, Jimmy Moore, Miriah Meyer, Scott C. Collingwood, Flory Nkoy, Bryan Stone, Katherine Sward

    Abstract: We design and build a system called EpiFi, which allows epidemiologists to easily design and deploy experiments in homes. The focus of EpiFi is reducing the barrier to entry for deploying and using an in-home sensor network. We present a novel architecture for in-home sensor networks configured using a single configuration file and provide: a fast and reliable method for device discovery when inst… ▽ More

    Submitted 7 September, 2017; originally announced September 2017.

    Comments: 13 pages, 12 figures

  50. Far-ultraviolet fluorescent molecular hydrogen emission map of the Milky Way Galaxy

    Authors: Young-Soo Jo, Kwang-Il Seon, Kyoung-Wook Min, Jerry Edelstein, Wonyong Han

    Abstract: We present the far-ultraviolet (FUV) fluorescent molecular hydrogen (H_2) emission map of the Milky Way Galaxy obtained with FIMS/SPEAR covering ~76% of the sky. The extinction-corrected intensity of the fluorescent H_2 emission has a strong linear correlation with the well-known tracers of the cold interstellar medium (ISM), including color excess E(B-V), neutral hydrogen column density N(H I), a… ▽ More

    Submitted 16 July, 2017; originally announced July 2017.

    Comments: 24 pages, 15 figures, This is accepted for publication in ApJS at July 16, 2017