-
Dimensionality Engineering of Magnetic Anisotropy from Anomalous Hall Effect in Synthetic SrRuO3 Crystals
Authors:
Seung Gyo Jeong,
Seong Won Cho,
Sehwan Song,
Jin Young Oh,
Do Gyeom Jeong,
Gyeongtak Han,
Hu Young Jeong,
Ahmed Yousef Mohamed,
Woo-suk Noh,
Sungkyun Park,
Jong Seok Lee,
Suyoun Lee,
Young-Min Kim,
Deok-Yong Cho,
Woo Seok Choi
Abstract:
Magnetic anisotropy in atomically thin correlated heterostructures is essential for exploring quantum magnetic phases for next-generation spintronics. Whereas previous studies have mostly focused on van der Waals systems, here, we investigate the impact of dimensionality of epitaxially-grown correlated oxides down to the monolayer limit on structural, magnetic, and orbital anisotropies. By designi…
▽ More
Magnetic anisotropy in atomically thin correlated heterostructures is essential for exploring quantum magnetic phases for next-generation spintronics. Whereas previous studies have mostly focused on van der Waals systems, here, we investigate the impact of dimensionality of epitaxially-grown correlated oxides down to the monolayer limit on structural, magnetic, and orbital anisotropies. By designing oxide superlattices with a correlated ferromagnetic SrRuO3 and nonmagnetic SrTiO3 layers, we observed modulated ferromagnetic behavior with the change of the SrRuO3 thickness. Especially, for three-unit-cell-thick layers, we observe a significant 1,500% improvement of coercive field in the anomalous Hall effect, which cannot be solely attributed to the dimensional crossover in ferromagnetism. The atomic-scale heterostructures further reveal the systematic modulation of anisotropy for the lattice structure and orbital hybridization, explaining the enhanced magnetic anisotropy. Our findings provide valuable insights into engineering the anisotropic hybridization of synthetic magnetic crystals, offering a tunable spin order for various applications.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
LLM-Select: Feature Selection with Large Language Models
Authors:
Daniel P. Jeong,
Zachary C. Lipton,
Pradeep Ravikumar
Abstract:
In this paper, we demonstrate a surprising capability of large language models (LLMs): given only input feature names and a description of a prediction task, they are capable of selecting the most predictive features, with performance rivaling the standard tools of data science. Remarkably, these models exhibit this capacity across various query mechanisms. For example, we zero-shot prompt an LLM…
▽ More
In this paper, we demonstrate a surprising capability of large language models (LLMs): given only input feature names and a description of a prediction task, they are capable of selecting the most predictive features, with performance rivaling the standard tools of data science. Remarkably, these models exhibit this capacity across various query mechanisms. For example, we zero-shot prompt an LLM to output a numerical importance score for a feature (e.g., "blood pressure") in predicting an outcome of interest (e.g., "heart failure"), with no additional context. In particular, we find that the latest models, such as GPT-4, can consistently identify the most predictive features regardless of the query mechanism and across various prompting strategies. We illustrate these findings through extensive experiments on real-world data, where we show that LLM-based feature selection consistently achieves strong performance competitive with data-driven methods such as the LASSO, despite never having looked at the downstream training data. Our findings suggest that LLMs may be useful not only for selecting the best features for training but also for deciding which features to collect in the first place. This could potentially benefit practitioners in domains like healthcare, where collecting high-quality data comes at a high cost.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
Role of the constriction angle on the clogging by bridging of suspensions of particles
Authors:
Nathan Vani,
Sacha Escudier,
Deok-Hoon Jeong,
Alban Sauret
Abstract:
Confined flows of particles can lead to clogging, and therefore failure, of various fluidic systems across many applications. As a result, design guidelines need to be developed to ensure that clogging is prevented or at least delayed. In this Letter, we investigate the influence of the angle of reduction in the cross-section of the channel on the bridging of semi-dilute and dense non-Brownian sus…
▽ More
Confined flows of particles can lead to clogging, and therefore failure, of various fluidic systems across many applications. As a result, design guidelines need to be developed to ensure that clogging is prevented or at least delayed. In this Letter, we investigate the influence of the angle of reduction in the cross-section of the channel on the bridging of semi-dilute and dense non-Brownian suspensions of spherical particles. We observe a decrease of the clogging probability with the reduction of the constriction angle. This effect is more pronounced for dense suspensions close to the maximum packing fraction where particles are in contact in contrast to semi-dilute suspensions. We rationalize this difference in terms of arch selection. We describe the role of the constriction angle and the flow profile, providing insights into the distinct behavior of semi-dilute and dense suspensions.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
Enhancing Social Media Post Popularity Prediction with Visual Content
Authors:
Dahyun Jeong,
Hyelim Son,
Yunjin Choi,
Keunwoo Kim
Abstract:
Our study presents a framework for predicting image-based social media content popularity that focuses on addressing complex image information and a hierarchical data structure. We utilize the Google Cloud Vision API to effectively extract key image and color information from users' postings, achieving 6.8% higher accuracy compared to using non-image covariates alone. For prediction, we explore a…
▽ More
Our study presents a framework for predicting image-based social media content popularity that focuses on addressing complex image information and a hierarchical data structure. We utilize the Google Cloud Vision API to effectively extract key image and color information from users' postings, achieving 6.8% higher accuracy compared to using non-image covariates alone. For prediction, we explore a wide range of prediction models, including Linear Mixed Model, Support Vector Regression, Multi-layer Perceptron, Random Forest, and XGBoost, with linear regression as the benchmark. Our comparative study demonstrates that models that are capable of capturing the underlying nonlinear interactions between covariates outperform other methods.
△ Less
Submitted 8 May, 2024; v1 submitted 3 May, 2024;
originally announced May 2024.
-
A Parameter-Masked Mock Data Challenge for Beyond-Two-Point Galaxy Clustering Statistics
Authors:
Beyond-2pt Collaboration,
:,
Elisabeth Krause,
Yosuke Kobayashi,
Andrés N. Salcedo,
Mikhail M. Ivanov,
Tom Abel,
Kazuyuki Akitsu,
Raul E. Angulo,
Giovanni Cabass,
Sofia Contarini,
Carolina Cuesta-Lazaro,
ChangHoon Hahn,
Nico Hamaus,
Donghui Jeong,
Chirag Modi,
Nhat-Minh Nguyen,
Takahiro Nishimichi,
Enrique Paillas,
Marcos Pellejero Ibañez,
Oliver H. E. Philcox,
Alice Pisani,
Fabian Schmidt,
Satoshi Tanaka,
Giovanni Verza
, et al. (2 additional authors not shown)
Abstract:
The last few years have seen the emergence of a wide array of novel techniques for analyzing high-precision data from upcoming galaxy surveys, which aim to extend the statistical analysis of galaxy clustering data beyond the linear regime and the canonical two-point (2pt) statistics. We test and benchmark some of these new techniques in a community data challenge "Beyond-2pt", initiated during the…
▽ More
The last few years have seen the emergence of a wide array of novel techniques for analyzing high-precision data from upcoming galaxy surveys, which aim to extend the statistical analysis of galaxy clustering data beyond the linear regime and the canonical two-point (2pt) statistics. We test and benchmark some of these new techniques in a community data challenge "Beyond-2pt", initiated during the Aspen 2022 Summer Program "Large-Scale Structure Cosmology beyond 2-Point Statistics," whose first round of results we present here. The challenge dataset consists of high-precision mock galaxy catalogs for clustering in real space, redshift space, and on a light cone. Participants in the challenge have developed end-to-end pipelines to analyze mock catalogs and extract unknown ("masked") cosmological parameters of the underlying $Λ$CDM models with their methods. The methods represented are density-split clustering, nearest neighbor statistics, BACCO power spectrum emulator, void statistics, LEFTfield field-level inference using effective field theory (EFT), and joint power spectrum and bispectrum analyses using both EFT and simulation-based inference. In this work, we review the results of the challenge, focusing on problems solved, lessons learned, and future research needed to perfect the emerging beyond-2pt approaches. The unbiased parameter recovery demonstrated in this challenge by multiple statistics and the associated modeling and inference frameworks supports the credibility of cosmology constraints from these methods. The challenge data set is publicly available and we welcome future submissions from methods that are not yet represented.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
Multitask Extension of Geometrically Aligned Transfer Encoder
Authors:
Sung Moon Ko,
Sumin Lee,
Dae-Woong Jeong,
Hyunseung Kim,
Chanhui Lee,
Soorin Yim,
Sehui Han
Abstract:
Molecular datasets often suffer from a lack of data. It is well-known that gathering data is difficult due to the complexity of experimentation or simulation involved. Here, we leverage mutual information across different tasks in molecular data to address this issue. We extend an algorithm that utilizes the geometric characteristics of the encoding space, known as the Geometrically Aligned Transf…
▽ More
Molecular datasets often suffer from a lack of data. It is well-known that gathering data is difficult due to the complexity of experimentation or simulation involved. Here, we leverage mutual information across different tasks in molecular data to address this issue. We extend an algorithm that utilizes the geometric characteristics of the encoding space, known as the Geometrically Aligned Transfer Encoder (GATE), to a multi-task setup. Thus, we connect multiple molecular tasks by aligning the curved coordinates onto locally flat coordinates, ensuring the flow of information from source tasks to support performance on target data.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
Winning the Social Media Influence Battle: Uncertainty-Aware Opinions to Understand and Spread True Information via Competitive Influence Maximization
Authors:
Qi Zhang,
Lance M. Kaplan,
Audun Jøsang,
Dong Hyun. Jeong,
Feng Chen,
Jin-Hee Cho
Abstract:
Competitive Influence Maximization (CIM) involves entities competing to maximize influence in online social networks (OSNs). Current Deep Reinforcement Learning (DRL) methods in CIM rely on simplistic binary opinion models (i.e., an opinion is represented by either 0 or 1) and often overlook the complexity of users' behavioral characteristics and their prior knowledge. We propose a novel DRL-based…
▽ More
Competitive Influence Maximization (CIM) involves entities competing to maximize influence in online social networks (OSNs). Current Deep Reinforcement Learning (DRL) methods in CIM rely on simplistic binary opinion models (i.e., an opinion is represented by either 0 or 1) and often overlook the complexity of users' behavioral characteristics and their prior knowledge. We propose a novel DRL-based framework that enhances CIM analysis by integrating Subjective Logic (SL) to accommodate uncertain opinions, users' behaviors, and their preferences. This approach targets the mitigation of false information by effectively propagating true information. By modeling two competitive agents, one spreading true information and the other spreading false information, we capture the strategic interplay essential to CIM. Our framework utilizes an uncertainty-based opinion model (UOM) to assess the impact on information quality in OSNs, emphasizing the importance of user behavior alongside network topology in selecting influential seed nodes. Extensive experiments demonstrate that our approach significantly outperforms state-of-the-art methods, achieving faster and more influential results (i.e., outperforming over 20%) under realistic network conditions. Moreover, our method shows robust performance in partially observable networks, effectively doubling the performance when users are predisposed to disbelieve true information.
△ Less
Submitted 29 April, 2024; v1 submitted 29 April, 2024;
originally announced April 2024.
-
Musical Word Embedding for Music Tagging and Retrieval
Authors:
SeungHeon Doh,
Jongpil Lee,
Dasaem Jeong,
Juhan Nam
Abstract:
Word embedding has become an essential means for text-based information retrieval. Typically, word embeddings are learned from large quantities of general and unstructured text data. However, in the domain of music, the word embedding may have difficulty understanding musical contexts or recognizing music-related entities like artists and tracks. To address this issue, we propose a new approach ca…
▽ More
Word embedding has become an essential means for text-based information retrieval. Typically, word embeddings are learned from large quantities of general and unstructured text data. However, in the domain of music, the word embedding may have difficulty understanding musical contexts or recognizing music-related entities like artists and tracks. To address this issue, we propose a new approach called Musical Word Embedding (MWE), which involves learning from various types of texts, including both everyday and music-related vocabulary. We integrate MWE into an audio-word joint representation framework for tagging and retrieving music, using words like tag, artist, and track that have different levels of musical specificity. Our experiments show that using a more specific musical word like track results in better retrieval performance, while using a less specific term like tag leads to better tagging performance. To balance this compromise, we suggest multi-prototype training that uses words with different levels of musical specificity jointly. We evaluate both word embedding and audio-word joint embedding on four tasks (tag rank prediction, music tagging, query-by-tag, and query-by-track) across two datasets (Million Song Dataset and MTG-Jamendo). Our findings show that the suggested MWE is more efficient and robust than the conventional word embedding.
△ Less
Submitted 22 April, 2024; v1 submitted 21 April, 2024;
originally announced April 2024.
-
Hyper Evidential Deep Learning to Quantify Composite Classification Uncertainty
Authors:
Changbin Li,
Kangshuo Li,
Yuzhe Ou,
Lance M. Kaplan,
Audun Jøsang,
Jin-Hee Cho,
Dong Hyun Jeong,
Feng Chen
Abstract:
Deep neural networks (DNNs) have been shown to perform well on exclusive, multi-class classification tasks. However, when different classes have similar visual features, it becomes challenging for human annotators to differentiate them. This scenario necessitates the use of composite class labels. In this paper, we propose a novel framework called Hyper-Evidential Neural Network (HENN) that explic…
▽ More
Deep neural networks (DNNs) have been shown to perform well on exclusive, multi-class classification tasks. However, when different classes have similar visual features, it becomes challenging for human annotators to differentiate them. This scenario necessitates the use of composite class labels. In this paper, we propose a novel framework called Hyper-Evidential Neural Network (HENN) that explicitly models predictive uncertainty due to composite class labels in training data in the context of the belief theory called Subjective Logic (SL). By placing a grouped Dirichlet distribution on the class probabilities, we treat predictions of a neural network as parameters of hyper-subjective opinions and learn the network that collects both single and composite evidence leading to these hyper-opinions by a deterministic DNN from data. We introduce a new uncertainty type called vagueness originally designed for hyper-opinions in SL to quantify composite classification uncertainty for DNNs. Our results demonstrate that HENN outperforms its state-of-the-art counterparts based on four image datasets. The code and datasets are available at: https://github.com/Hugo101/HyperEvidentialNN.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
Towards Efficient and Real-Time Piano Transcription Using Neural Autoregressive Models
Authors:
Taegyun Kwon,
Dasaem Jeong,
Juhan Nam
Abstract:
In recent years, advancements in neural network designs and the availability of large-scale labeled datasets have led to significant improvements in the accuracy of piano transcription models. However, most previous work focused on high-performance offline transcription, neglecting deliberate consideration of model size. The goal of this work is to implement real-time inference for piano transcrip…
▽ More
In recent years, advancements in neural network designs and the availability of large-scale labeled datasets have led to significant improvements in the accuracy of piano transcription models. However, most previous work focused on high-performance offline transcription, neglecting deliberate consideration of model size. The goal of this work is to implement real-time inference for piano transcription while ensuring both high performance and lightweight. To this end, we propose novel architectures for convolutional recurrent neural networks, redesigning an existing autoregressive piano transcription model. First, we extend the acoustic module by adding a frequency-conditioned FiLM layer to the CNN module to adapt the convolutional filters on the frequency axis. Second, we improve note-state sequence modeling by using a pitchwise LSTM that focuses on note-state transitions within a note. In addition, we augment the autoregressive connection with an enhanced recursive context. Using these components, we propose two types of models; one for high performance and the other for high compactness. Through extensive experiments, we show that the proposed models are comparable to state-of-the-art models in terms of note accuracy on the MAESTRO dataset. We also investigate the effective model size and real-time inference latency by gradually streamlining the architecture. Finally, we conduct cross-data evaluation on unseen piano datasets and in-depth analysis to elucidate the effect of the proposed components in the view of note length and pitch range.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
HyperCLOVA X Technical Report
Authors:
Kang Min Yoo,
Jaegeun Han,
Sookyo In,
Heewon Jeon,
Jisu Jeong,
Jaewook Kang,
Hyunwook Kim,
Kyung-Min Kim,
Munhyong Kim,
Sungju Kim,
Donghyun Kwak,
Hanock Kwak,
Se Jung Kwon,
Bado Lee,
Dongsoo Lee,
Gichang Lee,
Jooho Lee,
Baeseong Park,
Seongjin Shin,
Joonsang Yu,
Seolki Baek,
Sumin Byeon,
Eungsup Cho,
Dooseok Choe,
Jeesung Han
, et al. (371 additional authors not shown)
Abstract:
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…
▽ More
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment to responsible AI. The model is evaluated across various benchmarks, including comprehensive reasoning, knowledge, commonsense, factuality, coding, math, chatting, instruction-following, and harmlessness, in both Korean and English. HyperCLOVA X exhibits strong reasoning capabilities in Korean backed by a deep understanding of the language and cultural nuances. Further analysis of the inherent bilingual nature and its extension to multilingualism highlights the model's cross-lingual proficiency and strong generalization ability to untargeted languages, including machine translation between several language pairs and cross-lingual inference tasks. We believe that HyperCLOVA X can provide helpful guidance for regions or countries in developing their sovereign LLMs.
△ Less
Submitted 13 April, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
Can Audio Reveal Music Performance Difficulty? Insights from the Piano Syllabus Dataset
Authors:
Pedro Ramoneda,
Minhee Lee,
Dasaem Jeong,
J. J. Valero-Mas,
Xavier Serra
Abstract:
Automatically estimating the performance difficulty of a music piece represents a key process in music education to create tailored curricula according to the individual needs of the students. Given its relevance, the Music Information Retrieval (MIR) field depicts some proof-of-concept works addressing this task that mainly focuses on high-level music abstractions such as machine-readable scores…
▽ More
Automatically estimating the performance difficulty of a music piece represents a key process in music education to create tailored curricula according to the individual needs of the students. Given its relevance, the Music Information Retrieval (MIR) field depicts some proof-of-concept works addressing this task that mainly focuses on high-level music abstractions such as machine-readable scores or music sheet images. In this regard, the potential of directly analyzing audio recordings has been generally neglected, which prevents students from exploring diverse music pieces that may not have a formal symbolic-level transcription. This work pioneers in the automatic estimation of performance difficulty of music pieces on audio recordings with two precise contributions: (i) the first audio-based difficulty estimation dataset -- namely, Piano Syllabus (PSyllabus) dataset -- featuring 7,901 piano pieces across 11 difficulty levels from 1,233 composers; and (ii) a recognition framework capable of managing different input representations -- both unimodal and multimodal manners -- directly derived from audio to perform the difficulty estimation task. The comprehensive experimentation comprising different pre-training schemes, input modalities, and multi-task scenarios prove the validity of the proposal and establishes PSyllabus as a reference dataset for audio-based difficulty estimation in the MIR field. The dataset as well as the developed code and trained models are publicly shared to promote further research in the field.
△ Less
Submitted 6 March, 2024;
originally announced March 2024.
-
Bayesian Neural Networks with Domain Knowledge Priors
Authors:
Dylan Sam,
Rattana Pukdee,
Daniel P. Jeong,
Yewon Byun,
J. Zico Kolter
Abstract:
Bayesian neural networks (BNNs) have recently gained popularity due to their ability to quantify model uncertainty. However, specifying a prior for BNNs that captures relevant domain knowledge is often extremely challenging. In this work, we propose a framework for integrating general forms of domain knowledge (i.e., any knowledge that can be represented by a loss function) into a BNN prior throug…
▽ More
Bayesian neural networks (BNNs) have recently gained popularity due to their ability to quantify model uncertainty. However, specifying a prior for BNNs that captures relevant domain knowledge is often extremely challenging. In this work, we propose a framework for integrating general forms of domain knowledge (i.e., any knowledge that can be represented by a loss function) into a BNN prior through variational inference, while enabling computationally efficient posterior inference and sampling. Specifically, our approach results in a prior over neural network weights that assigns high probability mass to models that better align with our domain knowledge, leading to posterior samples that also exhibit this behavior. We show that BNNs using our proposed domain knowledge priors outperform those with standard priors (e.g., isotropic Gaussian, Gaussian process), successfully incorporating diverse types of prior information such as fairness, physics rules, and healthcare knowledge and achieving better predictive performance. We also present techniques for transferring the learned priors across different model architectures, demonstrating their broad utility across various settings.
△ Less
Submitted 20 February, 2024;
originally announced February 2024.
-
Absorption Troughs of Lyman Alpha Emitters in HETDEX
Authors:
Laurel H. Weiss,
Dustin Davis,
Karl Gebhardt,
Simon Gazagnes,
Mahan Mirza Khanlari,
Erin Mentuch Cooper,
John Chisholm,
Danielle Berg,
William P. Bowman,
Chris Byrohl,
Robin Ciardullo,
Maximilian Fabricius,
Daniel Farrow,
Caryl Gronwall,
Gary J. Hill,
Lindsay R. House,
Donghui Jeong,
Hasti Khoraminezhad,
Wolfram Kollatschny,
Eiichiro Komatsu,
Maja Lujan Niemeyer,
Shun Saito,
Donald P. Schneider,
Gregory R. Zeimann
Abstract:
The Hobby-Eberly Telescope Dark Energy Experiment (HETDEX) is designed to detect and measure the redshifts of more than one million Ly$α$ emitting galaxies (LAEs) between $1.88 < z < 3.52$. In addition to its cosmological measurements, these data enable studies of Ly$α$ spectral profiles and the underlying radiative transfer. Using the roughly half a million LAEs in the HETDEX Data Release 3, we s…
▽ More
The Hobby-Eberly Telescope Dark Energy Experiment (HETDEX) is designed to detect and measure the redshifts of more than one million Ly$α$ emitting galaxies (LAEs) between $1.88 < z < 3.52$. In addition to its cosmological measurements, these data enable studies of Ly$α$ spectral profiles and the underlying radiative transfer. Using the roughly half a million LAEs in the HETDEX Data Release 3, we stack various subsets to obtain the typical Ly$α$ profile for the $z \sim 2-3$ epoch and to understand their physical properties. We find clear absorption wings around Ly$α$ emission, which extend $\sim 2000$ km $\mathrm{s}^{-1}$ both redward and blueward of the central line. Using far-UV spectra of nearby ($0.002 < z < 0.182$) LAEs in the CLASSY treasury and optical/near-IR spectra of $2.8 < z < 6.7$ LAEs in the MUSE-Wide survey, we observe absorption profiles in both redshift regimes. Dividing the sample by volume density shows that the troughs increase in higher density regions. This trend suggests that the depth of the absorption is dependent on the local density of objects near the LAE, a geometry that is similar to damped Lyman-$α$ systems. Simple simulations of Ly$α$ radiative transfer can produce similar troughs due to absorption of light from background sources by HI gas surrounding the LAEs.
△ Less
Submitted 4 January, 2024;
originally announced January 2024.
-
Recovering the Missing Large-Scale Density Modes in 21cm Intensity Map from the Scalar-Type Clustering Fossils
Authors:
Zhenyuan Wang,
Donghui Jeong
Abstract:
Revealing the large-scale structure from the 21cm intensity mapping surveys is only possible after the foreground cleaning. However, most current cleaning techniques relying on the smoothness of the foreground spectrum lead to a severe side effect of removing the large-scale structure signal along the line of sight. On the other hand, the clustering fossil, a coherent variation of the small-scale…
▽ More
Revealing the large-scale structure from the 21cm intensity mapping surveys is only possible after the foreground cleaning. However, most current cleaning techniques relying on the smoothness of the foreground spectrum lead to a severe side effect of removing the large-scale structure signal along the line of sight. On the other hand, the clustering fossil, a coherent variation of the small-scale clustering over large scales, allows us to recover the long-wavelength density modes from the off-diagonal correlation between short-wavelength modes. In this paper, we study the requirements for an unbiased and optimal clustering-fossil estimator and show that (A) the estimator is unbiased only when using an accurate bispectrum model for the long-short-short mode coupling and (B) including the connected four-point correlation functions is essential for characterizing the noise power spectrum of the estimated long mode. The clustering fossil estimator based upon the leading-order bispectrum yields an unbiased estimation of the long-wavelength ($k\lesssim 0.01~[h/{\rm Mpc}]$) modes with the cross-correlation coefficient of $0.7$ at redshifts $z=0$ to $3$.
△ Less
Submitted 28 December, 2023;
originally announced December 2023.
-
SoloPose: One-Shot Kinematic 3D Human Pose Estimation with Video Data Augmentation
Authors:
David C. Jeong,
Hongji Liu,
Saunder Salazar,
Jessie Jiang,
Christopher A. Kitts
Abstract:
While recent two-stage many-to-one deep learning models have demonstrated great success in 3D human pose estimation, such models are inefficient ways to detect 3D key points in a sequential video relative to one-shot and many-to-many models. Another key drawback of two-stage and many-to-one models is that errors in the first stage will be passed onto the second stage. In this paper, we introduce S…
▽ More
While recent two-stage many-to-one deep learning models have demonstrated great success in 3D human pose estimation, such models are inefficient ways to detect 3D key points in a sequential video relative to one-shot and many-to-many models. Another key drawback of two-stage and many-to-one models is that errors in the first stage will be passed onto the second stage. In this paper, we introduce SoloPose, a novel one-shot, many-to-many spatio-temporal transformer model for kinematic 3D human pose estimation of video. SoloPose is further fortified by HeatPose, a 3D heatmap based on Gaussian Mixture Model distributions that factors target key points as well as kinematically adjacent key points. Finally, we address data diversity constraints with the 3D AugMotion Toolkit, a methodology to augment existing 3D human pose datasets, specifically by projecting four top public 3D human pose datasets (Humans3.6M, MADS, AIST Dance++, MPI INF 3DHP) into a novel dataset (Humans7.1M) with a universal coordinate system. Extensive experiments are conducted on Human3.6M as well as the augmented Humans7.1M dataset, and SoloPose demonstrates superior results relative to the state-of-the-art approaches.
△ Less
Submitted 15 December, 2023;
originally announced December 2023.
-
A Unified Approach for Comprehensive Analysis of Various Spectral and Tissue Doppler Echocardiography
Authors:
Jaeik Jeon,
Jiyeon Kim,
Yeonggul Jang,
Yeonyee E. Yoon,
Dawun Jeong,
Youngtaek Hong,
Seung-Ah Lee,
Hyuk-Jae Chang
Abstract:
Doppler echocardiography offers critical insights into cardiac function and phases by quantifying blood flow velocities and evaluating myocardial motion. However, previous methods for automating Doppler analysis, ranging from initial signal processing techniques to advanced deep learning approaches, have been constrained by their reliance on electrocardiogram (ECG) data and their inability to proc…
▽ More
Doppler echocardiography offers critical insights into cardiac function and phases by quantifying blood flow velocities and evaluating myocardial motion. However, previous methods for automating Doppler analysis, ranging from initial signal processing techniques to advanced deep learning approaches, have been constrained by their reliance on electrocardiogram (ECG) data and their inability to process Doppler views collectively. We introduce a novel unified framework using a convolutional neural network for comprehensive analysis of spectral and tissue Doppler echocardiography images that combines automatic measurements and end-diastole (ED) detection into a singular method. The network automatically recognizes key features across various Doppler views, with novel Doppler shape embedding and anti-aliasing modules enhancing interpretation and ensuring consistent analysis. Empirical results indicate a consistent outperformance in performance metrics, including dice similarity coefficients (DSC) and intersection over union (IoU). The proposed framework demonstrates strong agreement with clinicians in Doppler automatic measurements and competitive performance in ED detection.
△ Less
Submitted 14 November, 2023;
originally announced November 2023.
-
Pulsar Timing Array Signature from Oscillating Metric Perturbations due to Ultra-light Axion
Authors:
Jai-chan Hwang,
Donghui Jeong,
Hyerim Noh,
Clemente Smarra
Abstract:
A coherently oscillating ultra-light axion can behave as dark matter. In particular, its coherently oscillating pressure perturbations can source an oscillating scalar metric perturbation, with a characteristic oscillation frequency which is twice the axion Compton frequency. A candidate in the mass range $10^{(-24,-21)}{\rm eV}$ can provide a signal in the frequency range tested by current and fu…
▽ More
A coherently oscillating ultra-light axion can behave as dark matter. In particular, its coherently oscillating pressure perturbations can source an oscillating scalar metric perturbation, with a characteristic oscillation frequency which is twice the axion Compton frequency. A candidate in the mass range $10^{(-24,-21)}{\rm eV}$ can provide a signal in the frequency range tested by current and future Pulsar Timing Array (PTA) programs. Involving the pressure perturbations in a highly nonlinear environment, such an analysis demands a relativistic and nonlinear treatment. Here, we provide a rigorous derivation of the effect assuming weak gravity and slow-motion limit of Einstein's gravity in zero-shear gauge and show that dark matter's velocity potential determines the oscillation phase and frequency change. A monochromatic PTA signal correlated with the velocity field would confirm the prediction, for example, by cross-correlating the PTA results with the future local velocity flow measurements.
△ Less
Submitted 31 October, 2023;
originally announced November 2023.
-
MoEmo Vision Transformer: Integrating Cross-Attention and Movement Vectors in 3D Pose Estimation for HRI Emotion Detection
Authors:
David C. Jeong,
Tianma Shen,
Hongji Liu,
Raghav Kapoor,
Casey Nguyen,
Song Liu,
Christopher A. Kitts
Abstract:
Emotion detection presents challenges to intelligent human-robot interaction (HRI). Foundational deep learning techniques used in emotion detection are limited by information-constrained datasets or models that lack the necessary complexity to learn interactions between input data elements, such as the the variance of human emotions across different contexts. In the current effort, we introduce 1)…
▽ More
Emotion detection presents challenges to intelligent human-robot interaction (HRI). Foundational deep learning techniques used in emotion detection are limited by information-constrained datasets or models that lack the necessary complexity to learn interactions between input data elements, such as the the variance of human emotions across different contexts. In the current effort, we introduce 1) MoEmo (Motion to Emotion), a cross-attention vision transformer (ViT) for human emotion detection within robotics systems based on 3D human pose estimations across various contexts, and 2) a data set that offers full-body videos of human movement and corresponding emotion labels based on human gestures and environmental contexts. Compared to existing approaches, our method effectively leverages the subtle connections between movement vectors of gestures and environmental contexts through the use of cross-attention on the extracted movement vectors of full-body human gestures/poses and feature maps of environmental contexts. We implement a cross-attention fusion model to combine movement vectors and environment contexts into a joint representation to derive emotion estimation. Leveraging our Naturalistic Motion Database, we train the MoEmo system to jointly analyze motion and context, yielding emotion detection that outperforms the current state-of-the-art.
△ Less
Submitted 15 October, 2023;
originally announced October 2023.
-
Self supervised convolutional kernel based handcrafted feature harmonization: Enhanced left ventricle hypertension disease phenotyping on echocardiography
Authors:
Jina Lee,
Youngtaek Hong,
Dawun Jeong,
Yeonggul Jang,
Jaeik Jeon,
Sihyeon Jeong,
Taekgeun Jung,
Yeonyee E. Yoon,
Inki Moon,
Seung-Ah Lee,
Hyuk-Jae Chang
Abstract:
Radiomics, a medical imaging technique, extracts quantitative handcrafted features from images to predict diseases. Harmonization in those features ensures consistent feature extraction across various imaging devices and protocols. Methods for harmonization include standardized imaging protocols, statistical adjustments, and evaluating feature robustness. Myocardial diseases such as Left Ventricul…
▽ More
Radiomics, a medical imaging technique, extracts quantitative handcrafted features from images to predict diseases. Harmonization in those features ensures consistent feature extraction across various imaging devices and protocols. Methods for harmonization include standardized imaging protocols, statistical adjustments, and evaluating feature robustness. Myocardial diseases such as Left Ventricular Hypertrophy (LVH) and Hypertensive Heart Disease (HHD) are diagnosed via echocardiography, but variable imaging settings pose challenges. Harmonization techniques are crucial for applying handcrafted features in disease diagnosis in such scenario. Self-supervised learning (SSL) enhances data understanding within limited datasets and adapts to diverse data settings. ConvNeXt-V2 integrates convolutional layers into SSL, displaying superior performance in various tasks. This study focuses on convolutional filters within SSL, using them as preprocessing to convert images into feature maps for handcrafted feature harmonization. Our proposed method excelled in harmonization evaluation and exhibited superior LVH classification performance compared to existing methods.
△ Less
Submitted 22 November, 2023; v1 submitted 13 October, 2023;
originally announced October 2023.
-
Geometrically Aligned Transfer Encoder for Inductive Transfer in Regression Tasks
Authors:
Sung Moon Ko,
Sumin Lee,
Dae-Woong Jeong,
Woohyung Lim,
Sehui Han
Abstract:
Transfer learning is a crucial technique for handling a small amount of data that is potentially related to other abundant data. However, most of the existing methods are focused on classification tasks using images and language datasets. Therefore, in order to expand the transfer learning scheme to regression tasks, we propose a novel transfer technique based on differential geometry, namely the…
▽ More
Transfer learning is a crucial technique for handling a small amount of data that is potentially related to other abundant data. However, most of the existing methods are focused on classification tasks using images and language datasets. Therefore, in order to expand the transfer learning scheme to regression tasks, we propose a novel transfer technique based on differential geometry, namely the Geometrically Aligned Transfer Encoder (GATE). In this method, we interpret the latent vectors from the model to exist on a Riemannian curved manifold. We find a proper diffeomorphism between pairs of tasks to ensure that every arbitrary point maps to a locally flat coordinate in the overlapping region, allowing the transfer of knowledge from the source to the target data. This also serves as an effective regularizer for the model to behave in extrapolation regions. In this article, we demonstrate that GATE outperforms conventional methods and exhibits stable behavior in both the latent space and extrapolation regions for various molecular graph datasets.
△ Less
Submitted 10 October, 2023;
originally announced October 2023.
-
Active Learning on Neural Networks through Interactive Generation of Digit Patterns and Visual Representation
Authors:
Dong H. Jeong,
Jin-Hee Cho,
Feng Chen,
Audun Josang,
Soo-Yeon Ji
Abstract:
Artificial neural networks (ANNs) have been broadly utilized to analyze various data and solve different domain problems. However, neural networks (NNs) have been considered a black box operation for years because their underlying computation and meaning are hidden. Due to this nature, users often face difficulties in interpreting the underlying mechanism of the NNs and the benefits of using them.…
▽ More
Artificial neural networks (ANNs) have been broadly utilized to analyze various data and solve different domain problems. However, neural networks (NNs) have been considered a black box operation for years because their underlying computation and meaning are hidden. Due to this nature, users often face difficulties in interpreting the underlying mechanism of the NNs and the benefits of using them. In this paper, to improve users' learning and understanding of NNs, an interactive learning system is designed to create digit patterns and recognize them in real time. To help users clearly understand the visual differences of digit patterns (i.e., 0 ~ 9) and their results with an NN, integrating visualization is considered to present all digit patterns in a two-dimensional display space with supporting multiple user interactions. An evaluation with multiple datasets is conducted to determine its usability for active learning. In addition, informal user testing is managed during a summer workshop by asking the workshop participants to use the system.
△ Less
Submitted 2 October, 2023;
originally announced October 2023.
-
Optimal Impact Angle Guidance via First-Order Optimization under Nonconvex Constraints
Authors:
Gyubin Park,
Jiwoo Choi,
Da Hoon Jeong,
Jong-Han Kim
Abstract:
Most of the optimal guidance problems can be formulated as nonconvex optimization problems, which can be solved indirectly by relaxation, convexification, or linearization. Although these methods are guaranteed to converge to the global optimum of the modified problems, the obtained solution may not guarantee global optimality or even the feasibility of the original nonconvex problems. In this pap…
▽ More
Most of the optimal guidance problems can be formulated as nonconvex optimization problems, which can be solved indirectly by relaxation, convexification, or linearization. Although these methods are guaranteed to converge to the global optimum of the modified problems, the obtained solution may not guarantee global optimality or even the feasibility of the original nonconvex problems. In this paper, we propose a computational optimal guidance approach that directly handles the nonconvex constraints encountered in formulating the guidance problems. The proposed computational guidance approach alternately solves the least squares problems and projects the solution onto nonconvex feasible sets, which rapidly converges to feasible suboptimal solutions or sometimes to the globally optimal solutions. The proposed algorithm is verified via a series of numerical simulations on impact angle guidance problems under state dependent maneuver vector constraints, and it is demonstrated that the proposed algorithm provides superior guidance performance than conventional techniques.
△ Less
Submitted 17 March, 2024; v1 submitted 30 September, 2023;
originally announced October 2023.
-
Predicting performance difficulty from piano sheet music images
Authors:
Pedro Ramoneda,
Jose J. Valero-Mas,
Dasaem Jeong,
Xavier Serra
Abstract:
Estimating the performance difficulty of a musical score is crucial in music education for adequately designing the learning curriculum of the students. Although the Music Information Retrieval community has recently shown interest in this task, existing approaches mainly use machine-readable scores, leaving the broader case of sheet music images unaddressed. Based on previous works involving shee…
▽ More
Estimating the performance difficulty of a musical score is crucial in music education for adequately designing the learning curriculum of the students. Although the Music Information Retrieval community has recently shown interest in this task, existing approaches mainly use machine-readable scores, leaving the broader case of sheet music images unaddressed. Based on previous works involving sheet music images, we use a mid-level representation, bootleg score, describing notehead positions relative to staff lines coupled with a transformer model. This architecture is adapted to our task by introducing an encoding scheme that reduces the encoded sequence length to one-eighth of the original size. In terms of evaluation, we consider five datasets -- more than 7500 scores with up to 9 difficulty levels -- , two of them particularly compiled for this work. The results obtained when pretraining the scheme on the IMSLP corpus and fine-tuning it on the considered datasets prove the proposal's validity, achieving the best-performing model with a balanced accuracy of 40.34\% and a mean square error of 1.33. Finally, we provide access to our code, data, and models for transparency and reproducibility.
△ Less
Submitted 28 September, 2023;
originally announced September 2023.
-
K-pop Lyric Translation: Dataset, Analysis, and Neural-Modelling
Authors:
Haven Kim,
Jongmin Jung,
Dasaem Jeong,
Juhan Nam
Abstract:
Lyric translation, a field studied for over a century, is now attracting computational linguistics researchers. We identified two limitations in previous studies. Firstly, lyric translation studies have predominantly focused on Western genres and languages, with no previous study centering on K-pop despite its popularity. Second, the field of lyric translation suffers from a lack of publicly avail…
▽ More
Lyric translation, a field studied for over a century, is now attracting computational linguistics researchers. We identified two limitations in previous studies. Firstly, lyric translation studies have predominantly focused on Western genres and languages, with no previous study centering on K-pop despite its popularity. Second, the field of lyric translation suffers from a lack of publicly available datasets; to the best of our knowledge, no such dataset exists. To broaden the scope of genres and languages in lyric translation studies, we introduce a novel singable lyric translation dataset, approximately 89\% of which consists of K-pop song lyrics. This dataset aligns Korean and English lyrics line-by-line and section-by-section. We leveraged this dataset to unveil unique characteristics of K-pop lyric translation, distinguishing it from other extensively studied genres, and to construct a neural lyric translation model, thereby underscoring the importance of a dedicated dataset for singable lyric translations.
△ Less
Submitted 17 May, 2024; v1 submitted 20 September, 2023;
originally announced September 2023.
-
Zero Metallicity with Zero CPU Hours: Masses of the First Stars on the Laptop
Authors:
James Gurian,
Donghui Jeong,
Boyuan Liu
Abstract:
We develop an analytic model for the mass of the first stars forming in the center of primordial gas clouds as a function of host halo mass, redshift, and degree of rotation. The model is based on the estimation of key timescales determining the following three processes: the collapse of the gas cloud, the accretion onto the protostellar core, and the radiative feedback of the protostellar core. T…
▽ More
We develop an analytic model for the mass of the first stars forming in the center of primordial gas clouds as a function of host halo mass, redshift, and degree of rotation. The model is based on the estimation of key timescales determining the following three processes: the collapse of the gas cloud, the accretion onto the protostellar core, and the radiative feedback of the protostellar core. The final stellar mass is determined by the total mass accreted until the radiative feedback halts the accretion. The analytic estimation, motivated by the result of the full numerical simulations, leads to algebraic expressions allowing an extremely fast execution. Despite its simplicity, the model reproduces the stellar mass scale and its parameter dependences observed in state-of-the-art cosmological zoom-in simulations. This work clarifies the basic physical principles undergirding such numerical treatments and provides a path to efficiently calibrating numerical predictions against eventual observations of the first stars.
△ Less
Submitted 8 January, 2024; v1 submitted 11 September, 2023;
originally announced September 2023.
-
3D Denoisers are Good 2D Teachers: Molecular Pretraining via Denoising and Cross-Modal Distillation
Authors:
Sungjun Cho,
Dae-Woong Jeong,
Sung Moon Ko,
Jinwoo Kim,
Sehui Han,
Seunghoon Hong,
Honglak Lee,
Moontae Lee
Abstract:
Pretraining molecular representations from large unlabeled data is essential for molecular property prediction due to the high cost of obtaining ground-truth labels. While there exist various 2D graph-based molecular pretraining approaches, these methods struggle to show statistically significant gains in predictive performance. Recent work have thus instead proposed 3D conformer-based pretraining…
▽ More
Pretraining molecular representations from large unlabeled data is essential for molecular property prediction due to the high cost of obtaining ground-truth labels. While there exist various 2D graph-based molecular pretraining approaches, these methods struggle to show statistically significant gains in predictive performance. Recent work have thus instead proposed 3D conformer-based pretraining under the task of denoising, which led to promising results. During downstream finetuning, however, models trained with 3D conformers require accurate atom-coordinates of previously unseen molecules, which are computationally expensive to acquire at scale. In light of this limitation, we propose D&D, a self-supervised molecular representation learning framework that pretrains a 2D graph encoder by distilling representations from a 3D denoiser. With denoising followed by cross-modal knowledge distillation, our approach enjoys use of knowledge obtained from denoising as well as painless application to downstream tasks with no access to accurate conformers. Experiments on real-world molecular property prediction datasets show that the graph encoder trained via D&D can infer 3D information based on the 2D graph and shows superior performance and label-efficiency against other baselines.
△ Less
Submitted 7 September, 2023;
originally announced September 2023.
-
Disentangling the primordial nature of stochastic gravitational wave backgrounds with CMB spectral distortions
Authors:
Bryce Cyr,
Thomas Kite,
Jens Chluba,
J. Colin Hill,
Donghui Jeong,
Sandeep Kumar Acharya,
Boris Bolliet,
Subodh P. Patil
Abstract:
The recent detection of a stochastic gravitational wave background (SGWB) at nanohertz frequencies by pulsar timing arrays (PTAs) has sparked a flurry of interest. Beyond the standard interpretation that the progenitor is a network of supermassive black hole binaries, many exotic models have also been proposed, some of which can potentially offer a better fit to the data. We explore how the variou…
▽ More
The recent detection of a stochastic gravitational wave background (SGWB) at nanohertz frequencies by pulsar timing arrays (PTAs) has sparked a flurry of interest. Beyond the standard interpretation that the progenitor is a network of supermassive black hole binaries, many exotic models have also been proposed, some of which can potentially offer a better fit to the data. We explore how the various connections between gravitational waves and CMB spectral distortions can be leveraged to help determine whether a SGWB was generated primordially or astrophysically. To this end, we present updated $k$-space window functions which can be used for distortion parameter estimation on enhancements to the primordial scalar power spectrum. These same enhancements can also source gravitational waves (GWs) directly at second order in perturbation theory, so-called scalar-induced GWs (SIGWs), and indirectly through the formation of primordial black holes (PBHs). We perform a mapping of scalar power spectrum constraints into limits on the GW parameter space of SIGWs for $δ$-function features. We highlight that broader features in the scalar spectrum can explain the PTA results while simultaneously producing a spectral distortion (SD) within reach of future experiments. We additionally update PBH constraints from $μ$- and $y$-type spectral distortions. Refined treatments of the distortion window functions widen existing SD constraints, and we find that a future CMB spectrometer could play a pivotal role in unraveling the origin of GWs imprinted at or below CMB anisotropy scales.
△ Less
Submitted 5 September, 2023;
originally announced September 2023.
-
Improving Out-of-Distribution Detection in Echocardiographic View Classication through Enhancing Semantic Features
Authors:
Jaeik Jeon,
Seongmin Ha,
Yeonggul Jang,
Yeonyee E. Yoon,
Jiyeon Kim,
Hyunseok Jeong,
Dawun Jeong,
Youngtaek Hong,
Seung-Ah Lee Hyuk-Jae Chang
Abstract:
In echocardiographic view classification, accurately detecting out-of-distribution (OOD) data is essential but challenging, especially given the subtle differences between in-distribution and OOD data. While conventional OOD detection methods, such as Mahalanobis distance (MD) are effective in far-OOD scenarios with clear distinctions between distributions, they struggle to discern the less obviou…
▽ More
In echocardiographic view classification, accurately detecting out-of-distribution (OOD) data is essential but challenging, especially given the subtle differences between in-distribution and OOD data. While conventional OOD detection methods, such as Mahalanobis distance (MD) are effective in far-OOD scenarios with clear distinctions between distributions, they struggle to discern the less obvious variations characteristic of echocardiographic data. In this study, we introduce a novel use of label smoothing to enhance semantic feature representation in echocardiographic images, demonstrating that these enriched semantic features are key for significantly improving near-OOD instance detection. By combining label smoothing with MD-based OOD detection, we establish a new benchmark for accuracy in echocardiographic OOD detection.
△ Less
Submitted 23 November, 2023; v1 submitted 31 August, 2023;
originally announced August 2023.
-
Watch Out! Smartwatches as criminal tool and digital forensic investigations
Authors:
Seungjae Jeon,
Jaehyun Chung,
Doowon Jeong
Abstract:
In the rapidly advancing technological landscape, smartwatches have materialized as multifunctional devices integral to our daily routines. Smartwatches store a substantial amount of personal information, potentially serving as repositories of digital evidence. Thus, digital forensic researchers have devoted considerable effort to exploring smartwatch forensic techniques. However, it has been obse…
▽ More
In the rapidly advancing technological landscape, smartwatches have materialized as multifunctional devices integral to our daily routines. Smartwatches store a substantial amount of personal information, potentially serving as repositories of digital evidence. Thus, digital forensic researchers have devoted considerable effort to exploring smartwatch forensic techniques. However, it has been observed that prior studies have primarily treated smartwatches as mere storage mediums for digital evidence, neglecting their potential role in criminal activities. This paper presents the information leakage perpetrated through smartwatches. We represent crime scenarios in an environment where smartphones are not available, considering that the perception that smartphones can be used as tools for criminal behavior prevails in many organizations, while the potential of similar-use smartwatches is often overlooked. We detail mechanisms for information leakage via file transfer and camera control using smartwatches. Additionally, we present methods to investigate each crime incident through smartwatch forensics. Finally, we describe the limitations of post-incident responses and propose proactive measures to prepare for potential crimes involving smartwatches. Keywords: Information Leakage, Smartwatch Forensics, Android Forensics, Mobile Device Management, Security Policy
△ Less
Submitted 17 August, 2023;
originally announced August 2023.
-
A Forensic Methodology for Detecting Image Manipulations
Authors:
Jiwon Lee,
Seungjae Jeon,
Yunji Park,
Jaehyun Chung,
Doowon Jeong
Abstract:
By applying artificial intelligence to image editing technology, it has become possible to generate high-quality images with minimal traces of manipulation. However, since these technologies can be misused for criminal activities such as dissemination of false information, destruction of evidence, and denial of facts, it is crucial to implement strong countermeasures. In this study, image file and…
▽ More
By applying artificial intelligence to image editing technology, it has become possible to generate high-quality images with minimal traces of manipulation. However, since these technologies can be misused for criminal activities such as dissemination of false information, destruction of evidence, and denial of facts, it is crucial to implement strong countermeasures. In this study, image file and mobile forensic artifacts analysis were conducted for detecting image manipulation. Image file analysis involves parsing the metadata of manipulated images (e.g., Exif, DQT, and Filename Signature) and comparing them with a Reference DB to detect manipulation. The Reference DB is a database that collects manipulation-related traces left in image metadata, which serves as a criterion for detecting image manipulation. In the mobile forensic artifacts analysis, packages related to image editing tools were extracted and analyzed to aid the detection of image manipulation. The proposed methodology overcomes the limitations of existing graphic feature-based analysis and combines with image processing techniques, providing the advantage of reducing false positives. The research results demonstrate the significant role of such methodology in digital forensic investigation and analysis. Additionally, We provide the code for parsing image metadata and the Reference DB along with the dataset of manipulated images, aiming to contribute to related research.
△ Less
Submitted 9 August, 2023;
originally announced August 2023.
-
Finding Tori: Self-supervised Learning for Analyzing Korean Folk Song
Authors:
Danbinaerin Han,
Rafael Caro Repetto,
Dasaem Jeong
Abstract:
In this paper, we introduce a computational analysis of the field recording dataset of approximately 700 hours of Korean folk songs, which were recorded around 1980-90s. Because most of the songs were sung by non-expert musicians without accompaniment, the dataset provides several challenges. To address this challenge, we utilized self-supervised learning with convolutional neural network based on…
▽ More
In this paper, we introduce a computational analysis of the field recording dataset of approximately 700 hours of Korean folk songs, which were recorded around 1980-90s. Because most of the songs were sung by non-expert musicians without accompaniment, the dataset provides several challenges. To address this challenge, we utilized self-supervised learning with convolutional neural network based on pitch contour, then analyzed how the musical concept of tori, a classification system defined by a specific scale, ornamental notes, and an idiomatic melodic contour, is captured by the model. The experimental result shows that our approach can better capture the characteristics of tori compared to traditional pitch histograms. Using our approaches, we have examined how musical discussions proposed in existing academia manifest in the actual field recordings of Korean folk songs.
△ Less
Submitted 4 August, 2023;
originally announced August 2023.
-
Cosmological Consequences of Kinetic Mixing between Photon and Dark Photon
Authors:
Sung Mook Lee,
Dong Woo Kang,
Jinn-Ouk Gong,
Donghui Jeong,
Dong-Won Jung,
Seong Chan Park
Abstract:
We study the kinetic mixing between the cosmic microwave background (CMB) photon and the birefringent dark photon as a source of cosmic birefringence. We show that indeed the birefringence of the dark photon propagates to the CMB photon, but the resulting birefringence may not be uniform over the sky. Moreover, our investigation sheds light on the essential role played by kinetic mixing in the gen…
▽ More
We study the kinetic mixing between the cosmic microwave background (CMB) photon and the birefringent dark photon as a source of cosmic birefringence. We show that indeed the birefringence of the dark photon propagates to the CMB photon, but the resulting birefringence may not be uniform over the sky. Moreover, our investigation sheds light on the essential role played by kinetic mixing in the generation of two fundamental characteristics of the CMB: circular polarization and spectral distortion.
△ Less
Submitted 27 July, 2023;
originally announced July 2023.
-
Proxy Anchor-based Unsupervised Learning for Continuous Generalized Category Discovery
Authors:
Hyungmin Kim,
Sungho Suh,
Daehwan Kim,
Daun Jeong,
Hansang Cho,
Junmo Kim
Abstract:
Recent advances in deep learning have significantly improved the performance of various computer vision applications. However, discovering novel categories in an incremental learning scenario remains a challenging problem due to the lack of prior knowledge about the number and nature of new categories. Existing methods for novel category discovery are limited by their reliance on labeled datasets…
▽ More
Recent advances in deep learning have significantly improved the performance of various computer vision applications. However, discovering novel categories in an incremental learning scenario remains a challenging problem due to the lack of prior knowledge about the number and nature of new categories. Existing methods for novel category discovery are limited by their reliance on labeled datasets and prior knowledge about the number of novel categories and the proportion of novel samples in the batch. To address the limitations and more accurately reflect real-world scenarios, in this paper, we propose a novel unsupervised class incremental learning approach for discovering novel categories on unlabeled sets without prior knowledge. The proposed method fine-tunes the feature extractor and proxy anchors on labeled sets, then splits samples into old and novel categories and clusters on the unlabeled dataset. Furthermore, the proxy anchors-based exemplar generates representative category vectors to mitigate catastrophic forgetting. Experimental results demonstrate that our proposed approach outperforms the state-of-the-art methods on fine-grained datasets under real-world scenarios.
△ Less
Submitted 2 November, 2023; v1 submitted 20 July, 2023;
originally announced July 2023.
-
HETDEX Public Source Catalog 1 -- Stacking 50K Lyman Alpha Emitters
Authors:
Dustin Davis,
Karl Gebhardt,
Erin Mentuch Cooper,
William P. Bowman,
Barbara Garcia Castanheira,
John Chisholm,
Robin Ciardullo,
Maximilian Fabricius,
Daniel J. Farrow,
Steven L. Finkelstein,
Caryl Gronwall,
Eric Gawiser,
Gary J. Hill,
Ulrich Hopp,
Lindsay R. House,
Donghui Jeong,
Wolfram Kollatschny,
Eiichiro Komatsu,
Chenxu Liu,
Maja Lujan Niemeyer,
Alberto Saldana-Lopez,
Shun Saito,
Donald P. Schneider,
Jan Snigula,
Sarah Tuttle
, et al. (3 additional authors not shown)
Abstract:
We describe the ensemble properties of the $1.9 < z < 3.5$ Lyman Alpha Emitters (LAEs) found in the HETDEX survey's first public data release, HETDEX Public Source Catalog 1 (Mentuch Cooper et al. 2023). Stacking the low-resolution ($R \sim$ 800) spectra greatly increases the signal-to-noise ratio, revealing spectral features otherwise hidden by noise, and we show that the stacked spectrum is repr…
▽ More
We describe the ensemble properties of the $1.9 < z < 3.5$ Lyman Alpha Emitters (LAEs) found in the HETDEX survey's first public data release, HETDEX Public Source Catalog 1 (Mentuch Cooper et al. 2023). Stacking the low-resolution ($R \sim$ 800) spectra greatly increases the signal-to-noise ratio, revealing spectral features otherwise hidden by noise, and we show that the stacked spectrum is representative of an average member of the set. The flux limited, Ly$α$ signal-to-noise ratio restricted stack of 50K HETDEX LAEs shows the ensemble biweight ``average" $z \sim 2.6$ LAE to be a blue (UV continuum slope $\sim -2.4$ and E(B-V) $< 0.1$), moderately bright (M$_{\text{UV}} \sim -19.7$) star forming galaxy with strong Ly$α$ emission (log $L_{Lyα}$ $\sim$ 42.8 and $W_λ$(Ly$α$) $\sim$ 114Å), and potentially significant leakage of ionizing radiation. The restframe UV light is dominated by a young, metal poor stellar population with an average age 5-15 Myr and metallicity of 0.2-0.3 Z$_{\odot}$.
△ Less
Submitted 6 July, 2023;
originally announced July 2023.
-
Combining piano performance dimensions for score difficulty classification
Authors:
Pedro Ramoneda,
Dasaem Jeong,
Vsevolod Eremenko,
Nazif Can Tamer,
Marius Miron,
Xavier Serra
Abstract:
Predicting the difficulty of playing a musical score is essential for structuring and exploring score collections. Despite its importance for music education, the automatic difficulty classification of piano scores is not yet solved, mainly due to the lack of annotated data and the subjectiveness of the annotations. This paper aims to advance the state-of-the-art in score difficulty classification…
▽ More
Predicting the difficulty of playing a musical score is essential for structuring and exploring score collections. Despite its importance for music education, the automatic difficulty classification of piano scores is not yet solved, mainly due to the lack of annotated data and the subjectiveness of the annotations. This paper aims to advance the state-of-the-art in score difficulty classification with two major contributions. To address the lack of data, we present Can I Play It? (CIPI) dataset, a machine-readable piano score dataset with difficulty annotations obtained from the renowned classical music publisher Henle Verlag. The dataset is created by matching public domain scores with difficulty labels from Henle Verlag, then reviewed and corrected by an expert pianist. As a second contribution, we explore various input representations from score information to pre-trained ML models for piano fingering and expressiveness inspired by the musicology definition of performance. We show that combining the outputs of multiple classifiers performs better than the classifiers on their own, pointing to the fact that the representations capture different aspects of difficulty. In addition, we conduct numerous experiments that lay a foundation for score difficulty classification and create a basis for future research. Our best-performing model reports a 39.47% balanced accuracy and 1.13 median square error across the nine difficulty levels proposed in this study. Code, dataset, and models are made available for reproducibility.
△ Less
Submitted 27 September, 2023; v1 submitted 14 June, 2023;
originally announced June 2023.
-
Unraveling Magnetic Anisotropy Energy in Ferromagnetic Monolayer on Ferroelectric ABO$_3$ via DFT and Machine Learning
Authors:
Dameul Jeong,
Seoung-Hun Kang,
Young-Kyun Kwon
Abstract:
Spin-based devices have attracted attention as an alternative to CMOS-based technology. However, one of the challenges in spintronics devices is reducing the spin-switching energy in ferromagnetic (FM) materials. To address this, we considered ferroelectric (FE) materials, which may affect the magnetic properties of FM materials. We explored various oxide perovskites ABO$_3$ as FE materials, onto…
▽ More
Spin-based devices have attracted attention as an alternative to CMOS-based technology. However, one of the challenges in spintronics devices is reducing the spin-switching energy in ferromagnetic (FM) materials. To address this, we considered ferroelectric (FE) materials, which may affect the magnetic properties of FM materials. We explored various oxide perovskites ABO$_3$ as FE materials, onto which a Fe monolayer was placed as the FM material. We evaluated the magnetic anisotropy energy (MAE) of the Fe monolayer while varying the polarization of ABO$_3$. Our analysis showed that the MAE depends on the magnetic dipole moment induced in the FE material at the interface between the FE and FM materials due to structural modifications. Machine learning techniques were also employed to identify universal behaviors of the MAE in the presence of FE layers, confirming the importance of magnetic moments near the interface in explaining the dependence of the MAE on FE materials.
△ Less
Submitted 13 June, 2023;
originally announced June 2023.
-
A study of audio mixing methods for piano transcription in violin-piano ensembles
Authors:
Hyemi Kim,
Jiyun Park,
Taegyun Kwon,
Dasaem Jeong,
Juhan Nam
Abstract:
While piano music transcription models have shown high performance for solo piano recordings, their performance degrades when applied to ensemble recordings. This study aims to analyze the impact of different data augmentation methods on piano transcription performance, specifically focusing on mixing techniques applied to violin-piano ensembles. We apply mixing methods that consider both harmonic…
▽ More
While piano music transcription models have shown high performance for solo piano recordings, their performance degrades when applied to ensemble recordings. This study aims to analyze the impact of different data augmentation methods on piano transcription performance, specifically focusing on mixing techniques applied to violin-piano ensembles. We apply mixing methods that consider both harmonic and temporal characteristics of the audio. To create datasets for this study, we generated the PFVN-synth dataset, which contains 7 hours of violin-piano ensemble audio by rendering MIDI files and corresponding labels, and also collected unaccompanied violin recordings and mixed them with the MAESTRO dataset. We evaluated the transcription results on both synthesized and real audio recordings datasets.
△ Less
Submitted 23 May, 2023;
originally announced May 2023.
-
Deposition and alignment of fiber suspensions by dip coating
Authors:
Deok-Hoon Jeong,
Langqi Xing,
Michael Ka Ho Lee,
Nathan Vani,
Alban Sauret
Abstract:
The dip coating of suspensions made of monodisperse non-Brownian spherical particles dispersed in a Newtonian fluid leads to different coating regimes depending on the ratio of the particle diameter to the thickness of the film entrained on the substrate. In particular, dilute particles dispersed in the liquid are entrained only above a threshold value of film thickness. In the case of anisotropic…
▽ More
The dip coating of suspensions made of monodisperse non-Brownian spherical particles dispersed in a Newtonian fluid leads to different coating regimes depending on the ratio of the particle diameter to the thickness of the film entrained on the substrate. In particular, dilute particles dispersed in the liquid are entrained only above a threshold value of film thickness. In the case of anisotropic particles, in particular fibers, the smallest characteristic dimension will control the entrainment of the particle. Furthermore, it is possible to control the orientation of the anisotropic particles depending on the substrate geometry. To test the hypotheses, we performed dip-coating experiments with dilute suspensions of non-Brownian fibers with different length-to-diameter aspect ratios. We characterize the number of fibers entrained on the surface of the substrate as a function of the withdrawal velocity, allowing us to estimate a threshold capillary number below which all the particles remain in the liquid bath. Besides, we measure the angular distribution of the entrained fibers for two different substrate geometries: flat plates and cylindrical rods. We then measure the film thickness for more concentrated fiber suspensions. The entrainment of the fibers on a flat plate and a cylindrical rod is primarily controlled by the smaller characteristic length of the fibers: their diameter. At first order, the entrainment threshold scales similarly to that of spherical particles. The length of the fibers only appears to have a minor influence on the entrainment threshold. No preferential alignment is observed for non-Brownian fibers on a flat plate, except for very thin films, whereas the fibers tend to align themselves along the axis of a cylindrical rod for a large enough ratio of the fiber length to the radius of the cylindrical rod.
△ Less
Submitted 1 May, 2023;
originally announced May 2023.
-
Using Dark Energy Explorers and Machine Learning to Enhance the Hobby-Eberly Telescope Dark Energy Experiment
Authors:
Lindsay R. House,
Karl Gebhardt,
Keely Finkelstein,
Erin Mentuch Cooper,
Dustin Davis,
Robin Ciardullo,
Daniel J Farrow,
Steven L. Finkelstein,
Caryl Gronwall,
Donghui Jeong,
L. Clifton Johnson,
Chenxu Liu,
Benjamin P. Thomas,
Gregory Zeimann
Abstract:
We present analysis using a citizen science campaign to improve the cosmological measures from the Hobby-Eberly Telescope Dark Energy Experiment (HETDEX). The goal of HETDEX is to measure the Hubble expansion rate, $H(z)$, and angular diameter distance, $D_A(z)$, at $z =$ 2.4, each to percent-level accuracy. This accuracy is determined primarily from the total number of detected Lyman-$α$ emitters…
▽ More
We present analysis using a citizen science campaign to improve the cosmological measures from the Hobby-Eberly Telescope Dark Energy Experiment (HETDEX). The goal of HETDEX is to measure the Hubble expansion rate, $H(z)$, and angular diameter distance, $D_A(z)$, at $z =$ 2.4, each to percent-level accuracy. This accuracy is determined primarily from the total number of detected Lyman-$α$ emitters (LAEs), the false positive rate due to noise, and the contamination due to [O II] emitting galaxies. This paper presents the citizen science project, Dark Energy Explorers, with the goal of increasing the number of LAEs, decreasing the number of false positives due to noise and the [O II] galaxies. Initial analysis shows that citizen science is an efficient and effective tool for classification most accurately done by the human eye, especially in combination with unsupervised machine learning. Three aspects from the citizen science campaign that have the most impact are 1) identifying individual problems with detections, 2) providing a clean sample with 100% visual identification above a signal-to-noise cut, and 3) providing labels for machine learning efforts. Since the end of 2022, Dark Energy Explorers has collected over three and a half million classifications by 11,000 volunteers in over 85 different countries around the world. By incorporating the results of the Dark Energy Explorers we expect to improve the accuracy on the $D_A(z)$ and $H(z)$ parameters at $z =$ 2.4 by 10 - 30%. While the primary goal is to improve on HETDEX, Dark Energy Explorers has already proven to be a uniquely powerful tool for science advancement and increasing accessibility to science worldwide.
△ Less
Submitted 14 April, 2023;
originally announced April 2023.
-
Focus or Not: A Baseline for Anomaly Event Detection On the Open Public Places with Satellite Images
Authors:
Yongjin Jeon,
Youngtack Oh,
Doyoung Jeong,
Hyunguk Choi,
Junsik Kim
Abstract:
In recent years, monitoring the world wide area with satellite images has been emerged as an important issue.
Site monitoring task can be divided into two independent tasks; 1) Change Detection and 2) Anomaly Event Detection.
Unlike to change detection research is actively conducted based on the numerous datasets(\eg LEVIR-CD, WHU-CD, S2Looking, xView2 and etc...) to meet up the expectations o…
▽ More
In recent years, monitoring the world wide area with satellite images has been emerged as an important issue.
Site monitoring task can be divided into two independent tasks; 1) Change Detection and 2) Anomaly Event Detection.
Unlike to change detection research is actively conducted based on the numerous datasets(\eg LEVIR-CD, WHU-CD, S2Looking, xView2 and etc...) to meet up the expectations of industries or governments, research on AI models for detecting anomaly events is passively and rarely conducted.
In this paper, we introduce a novel satellite imagery dataset(AED-RS) for detecting anomaly events on the open public places.
AED-RS Dataset contains satellite images of normal and abnormal situations of 8 open public places from all over the world.
Each places are labeled with different criteria based on the difference of characteristics of each places.
With this dataset, we introduce a baseline model for our dataset TB-FLOW, which can be trained in weakly-supervised manner and shows reasonable performance on the AED-RS Dataset compared with the other NF(Normalizing-Flow) based anomaly detection models. Our dataset and code will be publicly open in \url{https://github.com/SIAnalytics/RS_AnomalyDetection.git}.
△ Less
Submitted 4 April, 2023; v1 submitted 21 March, 2023;
originally announced March 2023.
-
Uncertainty-Aware Reward-based Deep Reinforcement Learning for Intent Analysis of Social Media Information
Authors:
Zhen Guo,
Qi Zhang,
Xinwei An,
Qisheng Zhang,
Audun Jøsang,
Lance M. Kaplan,
Feng Chen,
Dong H. Jeong,
Jin-Hee Cho
Abstract:
Due to various and serious adverse impacts of spreading fake news, it is often known that only people with malicious intent would propagate fake news. However, it is not necessarily true based on social science studies. Distinguishing the types of fake news spreaders based on their intent is critical because it will effectively guide how to intervene to mitigate the spread of fake news with differ…
▽ More
Due to various and serious adverse impacts of spreading fake news, it is often known that only people with malicious intent would propagate fake news. However, it is not necessarily true based on social science studies. Distinguishing the types of fake news spreaders based on their intent is critical because it will effectively guide how to intervene to mitigate the spread of fake news with different approaches. To this end, we propose an intent classification framework that can best identify the correct intent of fake news. We will leverage deep reinforcement learning (DRL) that can optimize the structural representation of each tweet by removing noisy words from the input sequence when appending an actor to the long short-term memory (LSTM) intent classifier. Policy gradient DRL model (e.g., REINFORCE) can lead the actor to a higher delayed reward. We also devise a new uncertainty-aware immediate reward using a subjective opinion that can explicitly deal with multidimensional uncertainty for effective decision-making. Via 600K training episodes from a fake news tweets dataset with an annotated intent class, we evaluate the performance of uncertainty-aware reward in DRL. Evaluation results demonstrate that our proposed framework efficiently reduces the number of selected words to maintain a high 95\% multi-class accuracy.
△ Less
Submitted 18 February, 2023;
originally announced February 2023.
-
HETDEX Public Source Catalog 1: 220K Sources Including Over 50K Lyman Alpha Emitters from an Untargeted Wide-area Spectroscopic Survey
Authors:
Erin Mentuch Cooper,
Karl Gebhardt,
Dustin Davis,
Daniel J. Farrow,
Chenxu Liu,
Gregory Zeimann,
Robin Ciardullo,
John J. Feldmeier,
Niv Drory,
Donghui Jeong,
Barbara Benda,
William P. Bowman,
Michael Boylan-Kolchin,
Oscar A. Chavez Ortiz,
Maya H. Debski,
Mona Dentler,
Maximilian Fabricius,
Rameen Farooq,
Steven L. Finkelstein,
Eric Gawiser,
Caryl Gronwall,
Gary J. Hill,
Ulrich Hopp,
Lindsay R. House,
Steven Janowiecki
, et al. (21 additional authors not shown)
Abstract:
We present the first publicly released catalog of sources obtained from the Hobby-Eberly Telescope Dark Energy Experiment (HETDEX). HETDEX is an integral field spectroscopic survey designed to measure the Hubble expansion parameter and angular diameter distance at 1.88<z<3.52 by using the spatial distribution of more than a million Ly-alpha-emitting galaxies over a total target area of 540 deg^2.…
▽ More
We present the first publicly released catalog of sources obtained from the Hobby-Eberly Telescope Dark Energy Experiment (HETDEX). HETDEX is an integral field spectroscopic survey designed to measure the Hubble expansion parameter and angular diameter distance at 1.88<z<3.52 by using the spatial distribution of more than a million Ly-alpha-emitting galaxies over a total target area of 540 deg^2. The catalog comes from contiguous fiber spectra coverage of 25 deg^2 of sky from January 2017 through June 2020, where object detection is performed through two complementary detection methods: one designed to search for line emission and the other a search for continuum emission. The HETDEX public release catalog is dominated by emission-line galaxies and includes 51,863 Lyα-emitting galaxy (LAE) identifications and 123,891 OII-emitting galaxies at z<0.5. Also included in the catalog are 37,916 stars, 5274 low-redshift (z<0.5) galaxies without emission lines, and 4976 active galactic nuclei. The catalog provides sky coordinates, redshifts, line identifications, classification information, line fluxes, OII and Ly-alpha line luminosities where applicable, and spectra for all identified sources processed by the HETDEX detection pipeline. Extensive testing demonstrates that HETDEX redshifts agree to within deltaz < 0.02, 96.1% of the time to those in external spectroscopic catalogs. We measure the photometric counterpart fraction in deep ancillary Hyper Suprime-Cam imaging and find that only 55.5% of the LAE sample has an r-band continuum counterpart down to a limiting magnitude of r~26.2 mag (AB) indicating that an LAE search of similar sensitivity with photometric pre-selection would miss nearly half of the HETDEX LAE catalog sample. Data access and details about the catalog can be found online at http://hetdex.org/.
△ Less
Submitted 4 January, 2023;
originally announced January 2023.
-
The HETDEX Survey: Emission Line Exploration and Source Classification
Authors:
Dustin Davis,
Karl Gebhardt,
Erin Mentuch Cooper,
Robin Ciardullo,
Maximilian Fabricius,
Daniel J. Farrow,
John J. Feldmeier,
Steven L. Finkelstein,
Eric Gawiser,
Caryl Gronwall,
Gary J. Hill,
Ulrich Hopp,
Lindsay R. House,
Donghui Jeong,
Wolfram Kollatschny,
Eiichiro Komatsu,
Martin Landriau,
Chenxu Liu,
Shun Saito,
Sarah Tuttle,
Isak G. B. Wold,
Gregory R. Zeimann,
Yechi Zhang
Abstract:
The Hobby-Eberly Telescope Dark Energy Experiment (HETDEX) is an untargeted spectroscopic survey that aims to measure the expansion rate of the Universe at $z \sim 2.4$ to 1% precision for both $H(z)$ and $D_A(z)$. HETDEX is in the process of mapping in excess of one million Lyman Alpha emitting (LAE) galaxies and a similar number of lower-z galaxies as a tracer of the large-scale structure. The s…
▽ More
The Hobby-Eberly Telescope Dark Energy Experiment (HETDEX) is an untargeted spectroscopic survey that aims to measure the expansion rate of the Universe at $z \sim 2.4$ to 1% precision for both $H(z)$ and $D_A(z)$. HETDEX is in the process of mapping in excess of one million Lyman Alpha emitting (LAE) galaxies and a similar number of lower-z galaxies as a tracer of the large-scale structure. The success of the measurement is predicated on the post-observation separation of galaxies with Ly$α$ emission from the lower-$z$ interloping galaxies, primarily [OII], with low contamination and high recovery rates. The Emission Line eXplorer (ELiXer) is the principal classification tool for HETDEX, providing a tunable balance between contamination and completeness as dictated by science needs. By combining multiple selection criteria, ELiXer improves upon the 20 Angstrom rest-frame equivalent width cut commonly used to distinguish LAEs from lower-$z$ [OII] emitting galaxies. Despite a spectral resolving power, R $\sim800$, that cannot resolve the [OII] doublet, we demonstrate the ability to distinguish LAEs from foreground galaxies with 98.1% accuracy. We estimate a contamination rate of Ly$α$ by [OII] of 1.2% and a Ly$α$ recovery rate of 99.1% using the default ELiXer configuration. These rates meet the HETDEX science requirements.
△ Less
Submitted 4 January, 2023;
originally announced January 2023.
-
PPO-UE: Proximal Policy Optimization via Uncertainty-Aware Exploration
Authors:
Qisheng Zhang,
Zhen Guo,
Audun Jøsang,
Lance M. Kaplan,
Feng Chen,
Dong H. Jeong,
Jin-Hee Cho
Abstract:
Proximal Policy Optimization (PPO) is a highly popular policy-based deep reinforcement learning (DRL) approach. However, we observe that the homogeneous exploration process in PPO could cause an unexpected stability issue in the training phase. To address this issue, we propose PPO-UE, a PPO variant equipped with self-adaptive uncertainty-aware explorations (UEs) based on a ratio uncertainty level…
▽ More
Proximal Policy Optimization (PPO) is a highly popular policy-based deep reinforcement learning (DRL) approach. However, we observe that the homogeneous exploration process in PPO could cause an unexpected stability issue in the training phase. To address this issue, we propose PPO-UE, a PPO variant equipped with self-adaptive uncertainty-aware explorations (UEs) based on a ratio uncertainty level. The proposed PPO-UE is designed to improve convergence speed and performance with an optimized ratio uncertainty level. Through extensive sensitivity analysis by varying the ratio uncertainty level, our proposed PPO-UE considerably outperforms the baseline PPO in Roboschool continuous control tasks.
△ Less
Submitted 12 December, 2022;
originally announced December 2022.
-
Search for subsolar-mass black hole binaries in the second part of Advanced LIGO's and Advanced Virgo's third observing run
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
C. Alléné,
A. Allocca,
P. A. Altin
, et al. (1680 additional authors not shown)
Abstract:
We describe a search for gravitational waves from compact binaries with at least one component with mass 0.2 $M_\odot$ -- $1.0 M_\odot$ and mass ratio $q \geq 0.1$ in Advanced LIGO and Advanced Virgo data collected between 1 November 2019, 15:00 UTC and 27 March 2020, 17:00 UTC. No signals were detected. The most significant candidate has a false alarm rate of 0.2 $\mathrm{yr}^{-1}$. We estimate t…
▽ More
We describe a search for gravitational waves from compact binaries with at least one component with mass 0.2 $M_\odot$ -- $1.0 M_\odot$ and mass ratio $q \geq 0.1$ in Advanced LIGO and Advanced Virgo data collected between 1 November 2019, 15:00 UTC and 27 March 2020, 17:00 UTC. No signals were detected. The most significant candidate has a false alarm rate of 0.2 $\mathrm{yr}^{-1}$. We estimate the sensitivity of our search over the entirety of Advanced LIGO's and Advanced Virgo's third observing run, and present the most stringent limits to date on the merger rate of binary black holes with at least one subsolar-mass component. We use the upper limits to constrain two fiducial scenarios that could produce subsolar-mass black holes: primordial black holes (PBH) and a model of dissipative dark matter. The PBH model uses recent prescriptions for the merger rate of PBH binaries that include a rate suppression factor to effectively account for PBH early binary disruptions. If the PBHs are monochromatically distributed, we can exclude a dark matter fraction in PBHs $f_\mathrm{PBH} \gtrsim 0.6$ (at 90% confidence) in the probed subsolar-mass range. However, if we allow for broad PBH mass distributions we are unable to rule out $f_\mathrm{PBH} = 1$. For the dissipative model, where the dark matter has chemistry that allows a small fraction to cool and collapse into black holes, we find an upper bound $f_{\mathrm{DBH}} < 10^{-5}$ on the fraction of atomic dark matter collapsed into black holes.
△ Less
Submitted 26 January, 2024; v1 submitted 2 December, 2022;
originally announced December 2022.
-
AI-KD: Adversarial learning and Implicit regularization for self-Knowledge Distillation
Authors:
Hyungmin Kim,
Sungho Suh,
Sunghyun Baek,
Daehwan Kim,
Daun Jeong,
Hansang Cho,
Junmo Kim
Abstract:
We present a novel adversarial penalized self-knowledge distillation method, named adversarial learning and implicit regularization for self-knowledge distillation (AI-KD), which regularizes the training procedure by adversarial learning and implicit distillations. Our model not only distills the deterministic and progressive knowledge which are from the pre-trained and previous epoch predictive p…
▽ More
We present a novel adversarial penalized self-knowledge distillation method, named adversarial learning and implicit regularization for self-knowledge distillation (AI-KD), which regularizes the training procedure by adversarial learning and implicit distillations. Our model not only distills the deterministic and progressive knowledge which are from the pre-trained and previous epoch predictive probabilities but also transfers the knowledge of the deterministic predictive distributions using adversarial learning. The motivation is that the self-knowledge distillation methods regularize the predictive probabilities with soft targets, but the exact distributions may be hard to predict. Our method deploys a discriminator to distinguish the distributions between the pre-trained and student models while the student model is trained to fool the discriminator in the trained procedure. Thus, the student model not only can learn the pre-trained model's predictive probabilities but also align the distributions between the pre-trained and student models. We demonstrate the effectiveness of the proposed method with network architectures on multiple datasets and show the proposed method achieves better performance than state-of-the-art methods.
△ Less
Submitted 21 March, 2024; v1 submitted 20 November, 2022;
originally announced November 2022.
-
Deep Distance Sensitivity Oracles
Authors:
Davin Jeong,
Allison Gunby-Mann,
Sarel Cohen,
Maximilian Katzmann,
Chau Pham,
Arnav Bhakta,
Tobias Friedrich,
Sang Chin
Abstract:
One of the most fundamental graph problems is finding a shortest path from a source to a target node. While in its basic forms the problem has been studied extensively and efficient algorithms are known, it becomes significantly harder as soon as parts of the graph are susceptible to failure. Although one can recompute a shortest replacement path after every outage, this is rather inefficient both…
▽ More
One of the most fundamental graph problems is finding a shortest path from a source to a target node. While in its basic forms the problem has been studied extensively and efficient algorithms are known, it becomes significantly harder as soon as parts of the graph are susceptible to failure. Although one can recompute a shortest replacement path after every outage, this is rather inefficient both in time and/or storage. One way to overcome this problem is to shift computational burden from the queries into a pre-processing step, where a data structure is computed that allows for fast querying of replacement paths, typically referred to as a Distance Sensitivity Oracle (DSO). While DSOs have been extensively studied in the theoretical computer science community, to the best of our knowledge this is the first work to construct DSOs using deep learning techniques. We show how to use deep learning to utilize a combinatorial structure of replacement paths. More specifically, we utilize the combinatorial structure of replacement paths as a concatenation of shortest paths and use deep learning to find the pivot nodes for stitching shortest paths into replacement paths.
△ Less
Submitted 18 October, 2023; v1 submitted 2 November, 2022;
originally announced November 2022.
-
Particulate suspension coating of capillary tubes
Authors:
Deok-Hoon Jeong,
Langqi Xing,
Jean-Baptiste Boutin,
Alban Sauret
Abstract:
The displacement of a suspension of particles by an immiscible fluid in a capillary tube or in a porous media is a canonical configuration that finds application in a large number of natural and industrial applications, including water purification, dispersion of colloids and microplastics, coating and functionalization of tubings. The influence of particles dispersed in the fluid on the interfaci…
▽ More
The displacement of a suspension of particles by an immiscible fluid in a capillary tube or in a porous media is a canonical configuration that finds application in a large number of natural and industrial applications, including water purification, dispersion of colloids and microplastics, coating and functionalization of tubings. The influence of particles dispersed in the fluid on the interfacial dynamics and on the properties of the liquid film left behind remain poorly understood. Here, we study the deposition of a coating film on the walls of a capillary tube induced by the translation of a suspension plug pushed by air. We identify the different deposition regimes as a function of the translation speed of the plug, the particle size, and the volume fraction of the suspension. The thickness of the coating film is characterized, and we show that similarly to dip coating, three coating regimes, liquid only, heterogeneous, and thick films, are observed. We also show that, at first order, the thickness of films thicker than the particle diameter can be predicted using the effective viscosity of the suspension. Nevertheless, we also report that for large particles and concentrated suspensions, a shear-induced migration mechanism leads to local variations in volume fraction and modifies the deposited film thickness and composition.
△ Less
Submitted 29 September, 2022;
originally announced September 2022.
-
Grouping-matrix based Graph Pooling with Adaptive Number of Clusters
Authors:
Sung Moon Ko,
Sungjun Cho,
Dae-Woong Jeong,
Sehui Han,
Moontae Lee,
Honglak Lee
Abstract:
Graph pooling is a crucial operation for encoding hierarchical structures within graphs. Most existing graph pooling approaches formulate the problem as a node clustering task which effectively captures the graph topology. Conventional methods ask users to specify an appropriate number of clusters as a hyperparameter, then assume that all input graphs share the same number of clusters. In inductiv…
▽ More
Graph pooling is a crucial operation for encoding hierarchical structures within graphs. Most existing graph pooling approaches formulate the problem as a node clustering task which effectively captures the graph topology. Conventional methods ask users to specify an appropriate number of clusters as a hyperparameter, then assume that all input graphs share the same number of clusters. In inductive settings where the number of clusters can vary, however, the model should be able to represent this variation in its pooling layers in order to learn suitable clusters. Thus we propose GMPool, a novel differentiable graph pooling architecture that automatically determines the appropriate number of clusters based on the input data. The main intuition involves a grouping matrix defined as a quadratic form of the pooling operator, which induces use of binary classification probabilities of pairwise combinations of nodes. GMPool obtains the pooling operator by first computing the grouping matrix, then decomposing it. Extensive evaluations on molecular property prediction tasks demonstrate that our method outperforms conventional methods.
△ Less
Submitted 7 September, 2022;
originally announced September 2022.