-
Metals in Star-Forming Galaxies with KCWI. I. Methodology and First Results on the Abundances of Iron, Magnesium, and Oxygen
Authors:
Zhuyun Zhuang,
Evan N. Kirby,
Charles C. Steidel,
Mithi A. C. de los Reyes,
Nikolaus Z. Prusinski,
N. Leethochawalit,
Minjung Park,
Charlie Conroy,
Evan H. Nuñez
Abstract:
Understanding the chemical enrichment of different elements is crucial to gaining a complete picture of galaxy chemical evolution. In this study, we present a new sample of 46 low-redshift, low-mass star-forming galaxies at $M_*\sim 10^{8-10}M_{\odot}$ along with two quiescent galaxies at $M_*\sim 10^{8.8}M_{\odot}$ observed with the Keck Cosmic Web Imager (KCWI), aiming to investigate the chemical evolution of galaxies in the transition zone between Local Group satellites and massive field galaxies. We develop a novel method to simultaneously determine stellar abundances of iron and magnesium in star-forming galaxies. With the gas-phase oxygen abundance (O/H)$_{\rm g}$ measured using the strong line method, we are able to make the first-ever apples-to-apples comparison of $α$ elements in the stars and the ISM. We find that the [Mg/H]$_*$-[O/H]$_{\rm g}$ relation is much tighter than the [Fe/H]$_*$-[O/H]$_{\rm g}$ relation, which can be explained by the similar production processes of $α$ elements. Most galaxies in our sample exhibit higher [O/H]$_{\rm g}$ than [Fe/H]$_*$ and [Mg/H]$_*$. In addition, we construct mass-metallicity relations (MZRs) measured as three different elements (Fe$_*$, Mg$_*$, O$_{\rm g}$). Compared to the gas O-MZR, the stellar Fe- and Mg-MZRs show larger scatter driven by variations in specific star formation rates (sSFR), with star-forming galaxies exhibiting higher sSFR and lower stellar abundances at fixed mass. The excess of [O/H]$_{\rm g}$ compared to stellar abundances as well as the anti-correlation between sSFR and stellar abundance suggests that galaxy quenching of intermediate-mass galaxies at $M_*\sim 10^{8-10}M_{\odot}$ is primarily driven by starvation.
Submitted 5 July, 2024;
originally announced July 2024.
-
CanonicalFusion: Generating Drivable 3D Human Avatars from Multiple Images
Authors:
Jisu Shin,
Junmyeong Lee,
Seongmin Lee,
Min-Gyu Park,
Ju-Mi Kang,
Ju Hong Yoon,
Hae-Gon Jeon
Abstract:
We present a novel framework for reconstructing animatable human avatars from multiple images, termed CanonicalFusion. Our central concept involves integrating individual reconstruction results into the canonical space. To be specific, we first predict Linear Blend Skinning (LBS) weight maps and depth maps using a shared-encoder-dual-decoder network, enabling direct canonicalization of the 3D mesh from the predicted depth maps. Here, instead of predicting high-dimensional skinning weights, we infer compressed skinning weights, i.e., a 3-dimensional vector, with the aid of pre-trained MLP networks. We also introduce a forward skinning-based differentiable rendering scheme to merge the reconstructed results from multiple images. This scheme refines the initial mesh by reposing the canonical mesh via forward skinning and by minimizing photometric and geometric errors between the rendered and the predicted results. Our optimization scheme considers the position and color of vertices as well as the joint angles for each image, thereby mitigating the negative effects of pose errors. We conduct extensive experiments to demonstrate the effectiveness of our method and compare our CanonicalFusion with state-of-the-art methods. Our source code is available at https://github.com/jsshin98/CanonicalFusion.
Submitted 5 July, 2024;
originally announced July 2024.
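As a rough illustration of the forward skinning step named in the CanonicalFusion abstract, linear blend skinning poses a canonical vertex as a weighted sum of the positions it would take under each joint's rigid transform. The sketch below is a generic textbook formulation in 2D, not the authors' implementation; the function names (`rigid_2d`, `apply_transform`, `linear_blend_skinning`) and the toy transforms are hypothetical.

```python
import math


def rigid_2d(angle_rad, tx, ty):
    """Return a 2x3 affine matrix: rotation by angle_rad, then translation (tx, ty)."""
    c, s = math.cos(angle_rad), math.sin(angle_rad)
    return [[c, -s, tx], [s, c, ty]]


def apply_transform(transform, point):
    """Apply a 2x3 affine matrix to a 2D point."""
    x, y = point
    return (
        transform[0][0] * x + transform[0][1] * y + transform[0][2],
        transform[1][0] * x + transform[1][1] * y + transform[1][2],
    )


def linear_blend_skinning(vertex, weights, transforms):
    """Pose a canonical vertex by blending its per-joint transformed positions."""
    if len(weights) != len(transforms):
        raise ValueError("one weight is required per joint transform")
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("skinning weights must sum to 1")
    posed = [apply_transform(t, vertex) for t in transforms]
    return (
        sum(w * p[0] for w, p in zip(weights, posed)),
        sum(w * p[1] for w, p in zip(weights, posed)),
    )


# Toy example (assumed values): one joint stays put, the other shifts by 2 along x.
identity = rigid_2d(0.0, 0.0, 0.0)
shift = rigid_2d(0.0, 2.0, 0.0)
posed_vertex = linear_blend_skinning((1.0, 0.0), [0.5, 0.5], [identity, shift])
```

With equal weights the vertex lands halfway between the two per-joint positions, which is the blending behaviour that per-vertex skinning weights control.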
-
AI Driven Laser Parameter Search: Inverse Design of Photonic Surfaces using Greedy Surrogate-based Optimization
Authors:
Luka Grbcic,
Minok Park,
Juliane Müller,
Vassilia Zorba,
Wibe Albert de Jong
Abstract:
Photonic surfaces designed with specific optical characteristics are becoming increasingly important for use in various energy harvesting and storage systems. In this study, we develop a surrogate-based optimization approach for designing such surfaces. The surrogate-based optimization framework employs the Random Forest algorithm and uses a greedy, prediction-based exploration strategy to identify the laser fabrication parameters that minimize the discrepancy relative to user-defined target optical characteristics. We demonstrate the approach on two synthetic benchmarks and two specific cases of photonic surface inverse design targets. It exhibits superior performance when compared to other optimization algorithms across all benchmarks. Additionally, we demonstrate a technique of inverse design warm starting for changed target optical characteristics which enhances the performance of the introduced approach.
Submitted 20 June, 2024;
originally announced July 2024.
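The greedy, prediction-based exploration described in the abstract above can be sketched as a loop: fit a surrogate to the points evaluated so far, pick the untested candidate with the lowest predicted discrepancy, evaluate it, and repeat. The sketch below is only an illustration under loud assumptions: it swaps the paper's Random Forest for a one-nearest-neighbour surrogate to stay dependency-free, and the objective, candidate grid, and function names are all hypothetical.

```python
def nearest_neighbor_predict(x, samples):
    """Stand-in surrogate (NOT a Random Forest): return the objective value
    of the already-evaluated sample closest to x."""
    return min(samples, key=lambda s: abs(s[0] - x))[1]


def greedy_surrogate_search(objective, candidates, initial, n_iter):
    """Greedy surrogate-based search over a finite 1D candidate set."""
    samples = [(x, objective(x)) for x in initial]
    evaluated = set(initial)
    for _ in range(n_iter):
        pool = [x for x in candidates if x not in evaluated]
        if not pool:
            break
        # Greedy step: trust the surrogate and test its most promising candidate.
        guess = min(pool, key=lambda x: nearest_neighbor_predict(x, samples))
        samples.append((guess, objective(guess)))
        evaluated.add(guess)
    # Return the best (parameter, discrepancy) pair actually evaluated.
    return min(samples, key=lambda s: s[1])


def toy_discrepancy(x):
    """Hypothetical discrepancy to a target, minimized at x = 3."""
    return (x - 3) ** 2


best = greedy_surrogate_search(toy_discrepancy, list(range(11)), [0, 10], n_iter=5)
```

A purely greedy rule like this exploits the surrogate without an explicit exploration term, so its behaviour depends heavily on how well the surrogate generalizes from the initial samples.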
-
Unconventional p-wave and finite-momentum superconductivity induced by altermagnetism through the formation of Bogoliubov Fermi surface
Authors:
SeungBeom Hong,
Moon Jip Park,
Kyoung-Min Kim
Abstract:
Altermagnets are an exotic class of magnetic materials wherein the Fermi surface exhibits a momentum-dependent spin-splitting while maintaining zero net magnetization. Previous studies have shown that this distinctive spin-splitting can induce chiral p-wave superconductors or Fulde-Ferrell superconducting states carrying finite momentum. However, the underlying mechanisms of such unconventional superconducting states remain incompletely understood. Here, we propose that the formation of the Bogoliubov Fermi surface through the exchange field can play a significant role in such phenomena. Through a systematic self-consistent mean-field analysis on the extended attractive Hubbard model combined with the d-wave spin-splitting induced by the exchange field, as observed in RuO$_2$, we demonstrate that the formation of the Bogoliubov Fermi surface suppresses conventional spin-singlet superconducting states with s-wave characteristics. In contrast, the chiral p-wave state maintains a fully gapped spectrum without the Fermi surface, thereby becoming the ground state in the strong field regime. In the intermediate regime, we find that the Fulde-Ferrell state becomes the predominant state through the optimization of available channels for Cooper pairing. Moreover, we illustrate how the prevalence of the chiral p-wave and Fulde-Ferrell states over the s-wave state changes under the variation of the field strength or chemical potential. Our findings provide valuable insights into potential pathways for realizing sought-after topological p-wave superconductivity and finite momentum pairing facilitated by altermagnetism.
Submitted 2 July, 2024;
originally announced July 2024.
-
SAM: Semi-Active Mechanism for Extensible Continuum Manipulator and Real-time Hysteresis Compensation Control Algorithm
Authors:
Junhyun Park,
Seonghyeok Jang,
Myeongbo Park,
Hyojae Park,
Jeonghyeon Yoon,
Minho Hwang
Abstract:
Cable-Driven Continuum Manipulators (CDCMs) enable scar-free procedures via natural orifices and improve target lesion accessibility through curved paths. However, CDCMs face limitations in workspace and control accuracy due to non-linear cable effects causing hysteresis. This paper introduces an extensible CDCM with a Semi-active Mechanism (SAM) to expand the workspace via translational motion without additional mechanical elements or actuation. We collect a hysteresis dataset using 8 fiducial markers and RGBD sensing. Based on this dataset, we develop a real-time hysteresis compensation control algorithm using a trained Temporal Convolutional Network (TCN) with 1 ms latency, effectively estimating the manipulator's hysteresis behavior. Performance validation through random trajectory tracking tests and box pointing tasks shows the proposed controller significantly reduces hysteresis by up to 69.5% in joint space and approximately 26% in the box pointing task.
Submitted 27 June, 2024; v1 submitted 26 June, 2024;
originally announced June 2024.
-
Terahertz photocurrent probe of quantum geometry and interactions in magic-angle twisted bilayer graphene
Authors:
Roshan Krishna Kumar,
Geng Li,
Riccardo Bertini,
Swati Chaudhary,
Krystian Nowakowski,
Jeong Min Park,
Sebastian Castilla,
Zhen Zhan,
Pierre A. Pantaleón,
Hitesh Agarwal,
Sergi Battle-Porro,
Eike Icking,
Matteo Ceccanti,
Antoine Reserbat-Plantey,
Giulia Piccinini,
Julien Barrier,
Ekaterina Khestanova,
Takashi Taniguchi,
Kenji Watanabe,
Christoph Stampfer,
Gil Refael,
Francisco Guinea,
Pablo Jarillo-Herrero,
Justin C. W. Song,
Petr Stepanov
, et al. (2 additional authors not shown)
Abstract:
Moiré materials represent strongly interacting electron systems bridging topological and correlated physics. Despite significant advances, decoding wavefunction properties underlying the quantum geometry remains challenging. Here, we utilize polarization-resolved photocurrent measurements to probe magic-angle twisted bilayer graphene, leveraging its sensitivity to the Berry connection that encompasses quantum "textures" of electron wavefunctions. Using terahertz light resonant with optical transitions of its flat bands, we observe bulk photocurrents driven by broken symmetries and reveal the interplay between electron interactions and quantum geometry. We observe inversion-breaking gapped states undetectable through quantum transport, sharp changes in the polarization axes caused by interaction-induced band renormalization, and recurring photocurrent patterns at integer fillings of the moiré unit cell that track the evolution of quantum geometry through the cascade of phase transitions. The large and tunable terahertz response intrinsic to flat-band systems offers direct insights into the quantum geometry of interacting electrons and paves the way for innovative terahertz quantum technologies.
Submitted 24 June, 2024;
originally announced June 2024.
-
Superfluid stiffness of twisted multilayer graphene superconductors
Authors:
Abhishek Banerjee,
Zeyu Hao,
Mary Kreidel,
Patrick Ledwith,
Isabelle Phinney,
Jeong Min Park,
Andrew M. Zimmerman,
Kenji Watanabe,
Takashi Taniguchi,
Robert M Westervelt,
Pablo Jarillo-Herrero,
Pavel A. Volkov,
Ashvin Vishwanath,
Kin Chung Fong,
Philip Kim
Abstract:
The robustness of the macroscopic quantum nature of a superconductor can be characterized by the superfluid stiffness, $ρ_s$, a quantity that describes the energy required to vary the phase of the macroscopic quantum wave function. In unconventional superconductors, such as cuprates, the low-temperature behavior of $ρ_s$ drastically differs from that of conventional superconductors due to quasiparticle excitations from gapless points (nodes) in momentum space. Intensive research on the recently discovered magic-angle twisted graphene family has revealed, in addition to superconducting states, strongly correlated electronic states associated with spontaneously broken symmetries, inviting the study of $ρ_s$ to uncover the potentially unconventional nature of its superconductivity. Here we report the measurement of $ρ_s$ in magic-angle twisted trilayer graphene (TTG), revealing unconventional nodal-gap superconductivity. Utilizing radio-frequency reflectometry techniques to measure the kinetic inductive response of superconducting TTG coupled to a microwave resonator, we find a linear temperature dependence of $ρ_s$ at low temperatures and nonlinear Meissner effects in the current bias dependence, both indicating nodal structures in the superconducting order parameter. Furthermore, the doping dependence shows a linear correlation between the zero temperature $ρ_s$ and the superconducting transition temperature $T_c$, reminiscent of Uemura's relation in cuprates, suggesting phase-coherence-limited superconductivity. Our results provide strong evidence for nodal superconductivity in TTG and put strong constraints on the mechanisms of these graphene-based superconductors.
Submitted 19 June, 2024;
originally announced June 2024.
-
Design Optimization of NOMA Aided Multi-STAR-RIS for Indoor Environments: A Convex Approximation Imitated Reinforcement Learning Approach
Authors:
Yu Min Park,
Sheikh Salman Hassan,
Yan Kyaw Tun,
Eui-Nam Huh,
Walid Saad,
Choong Seon Hong
Abstract:
Sixth-generation (6G) networks leverage simultaneously transmitting and reflecting reconfigurable intelligent surfaces (STAR-RISs) to overcome the limitations of traditional RISs. STAR-RISs offer 360-degree full-space coverage and optimized transmission and reflection for enhanced network performance and dynamic control of the indoor propagation environment. However, deploying STAR-RISs indoors presents challenges in interference mitigation, power consumption, and real-time configuration. In this work, a novel network architecture utilizing multiple access points (APs) and STAR-RISs is proposed for indoor communication. An optimization problem encompassing user assignment, access point beamforming, and STAR-RIS phase control for reflection and transmission is formulated. The inherent complexity of the formulated problem necessitates a decomposition approach for an efficient solution. This involves tackling different sub-problems with specialized techniques: a many-to-one matching algorithm is employed to assign users to appropriate access points, optimizing resource allocation. To facilitate efficient resource management, access points are grouped using a correlation-based K-means clustering algorithm. Multi-agent deep reinforcement learning (MADRL) is leveraged to optimize the control of the STAR-RIS. Within the proposed MADRL framework, a novel approach is introduced where each decision variable acts as an independent agent, enabling collaborative learning and decision-making. Additionally, the proposed MADRL approach incorporates convex approximation (CA). This technique utilizes suboptimal solutions from successive convex approximation (SCA) to accelerate policy learning for the agents, thereby leading to faster environment adaptation and convergence. Simulations demonstrate significant network utility improvements compared to baseline approaches.
Submitted 19 June, 2024;
originally announced June 2024.
-
AGN Feedback in Quiescent Galaxies at Cosmic Noon Traced by Ionized Gas Emission
Authors:
Letizia Bugiani,
Sirio Belli,
Minjung Park,
Rebecca L. Davies,
J. Trevor Mendel,
Benjamin D. Johnson,
Amir H. Khoram,
Chloë Benton,
Andrea Cimatti,
Charlie Conroy,
Razieh Emami,
Joel Leja,
Yijia Li,
Gabriel Maheson,
Elijah P. Mathews,
Rohan P. Naidu,
Erica J. Nelson,
Sandro Tacchella,
Bryan A. Terrazas,
Rainer Weinberger
Abstract:
We analyze ionized gas emission lines in deep rest-frame optical spectra of 16 quiescent galaxies at redshift $1.7<z<3.5$ observed with JWST/NIRSpec by the Blue Jay survey. Robust detection of emission lines in $75\%$ of the sample indicates the presence of ongoing ionizing sources in this passive population. The H$α$ line luminosities confirm that the population is quiescent, with star formation rates that are at least ten times lower than the main sequence of star formation. The quiescent sample is clearly separate from the star-forming population in line diagnostic diagrams, and occupies a region usually populated by active galactic nuclei (AGN). Analysis of the observed line ratios, equivalent widths, and velocity dispersions leads us to conclude that in most cases the gas is ionized by AGN activity, despite the lack of X-ray detections. A subset of the sample also hosts ionized and/or neutral outflows. Our results show, for the first time using a representative sample, that low-luminosity AGN are extremely common among quiescent galaxies at high redshift. These low-luminosity AGN may play a key role in quenching star formation and in keeping massive galaxies quiescent from Cosmic Noon to $z\sim0$.
Submitted 12 June, 2024;
originally announced June 2024.
-
Harnessing Business and Media Insights with Large Language Models
Authors:
Yujia Bao,
Ankit Parag Shah,
Neeru Narang,
Jonathan Rivers,
Rajeev Maksey,
Lan Guan,
Louise N. Barrere,
Shelley Evenson,
Rahul Basole,
Connie Miao,
Ankit Mehta,
Fabien Boulay,
Su Min Park,
Natalie E. Pearson,
Eldhose Joy,
Tiger He,
Sumiran Thakur,
Koustav Ghosal,
Josh On,
Phoebe Morrison,
Tim Major,
Eva Siqi Wang,
Gina Escobar,
Jiaheng Wei,
Tharindu Cyril Weerasooriya
, et al. (8 additional authors not shown)
Abstract:
This paper introduces Fortune Analytics Language Model (FALM). FALM empowers users with direct access to comprehensive business analysis, including market trends, company performance metrics, and expert insights. Unlike generic LLMs, FALM leverages a curated knowledge base built from professional journalism, enabling it to deliver precise and in-depth answers to intricate business questions. Users can further leverage natural language queries to directly visualize financial data, generating insightful charts and graphs to understand trends across diverse business sectors clearly. FALM fosters user trust and ensures output accuracy through three novel methods: 1) Time-aware reasoning guarantees accurate event registration and prioritizes recent updates. 2) Thematic trend analysis explicitly examines topic evolution over time, providing insights into emerging business landscapes. 3) Content referencing and task decomposition enhance answer fidelity and data visualization accuracy. We conduct both automated and human evaluations, demonstrating FALM's significant performance improvements over baseline methods while prioritizing responsible AI practices. These benchmarks establish FALM as a cutting-edge LLM in the business and media domains, with exceptional accuracy and trustworthiness.
Submitted 2 June, 2024;
originally announced June 2024.
-
Regularized Training with Generated Datasets for Name-Only Transfer of Vision-Language Models
Authors:
Minho Park,
Sunghyun Park,
Jooyeol Yun,
Jaegul Choo
Abstract:
Recent advancements in text-to-image generation have inspired researchers to generate datasets tailored for perception models using generative models, which prove particularly valuable in scenarios where real-world data is limited. In this study, our goal is to address the challenges when fine-tuning vision-language models (e.g., CLIP) on generated datasets. Specifically, we aim to fine-tune vision-language models to a specific classification model without access to any real images, also known as name-only transfer. However, despite the high fidelity of generated images, we observed a significant performance degradation when fine-tuning the model using the generated datasets due to the domain gap between real and generated images. To overcome the domain gap, we provide two regularization methods for training and post-training, respectively. First, we leverage the domain-agnostic knowledge from the original pre-trained vision-language model by forming a weight-space ensemble of the model fine-tuned on the generated dataset and the original pre-trained model at the post-training stage. Second, we reveal that fine-tuned models with high feature diversity achieve high performance in the real domain, which indicates that increasing feature diversity prevents learning the generated domain-specific knowledge. Thus, we encourage feature diversity by providing additional regularization at training time. Extensive experiments on various classification datasets and various text-to-image generation models demonstrate that our analysis and regularization techniques effectively mitigate the domain gap, which has long been overlooked, and enable us to achieve state-of-the-art performance by training with generated images. Code is available at https://github.com/pmh9960/regft-for-gen
Submitted 8 June, 2024;
originally announced June 2024.
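The weight-space ensemble mentioned in the abstract above is commonly realized as a linear interpolation between the pre-trained and fine-tuned parameters. The sketch below assumes that common form; the abstract does not state the exact rule, and the mixing coefficient `alpha`, the function name, and the toy parameter dictionaries are hypothetical.

```python
def weight_space_ensemble(pretrained, finetuned, alpha):
    """Interpolate parameters: (1 - alpha) * pretrained + alpha * finetuned.

    Both arguments map parameter names to flat lists of floats.
    alpha = 0 returns the pre-trained weights, alpha = 1 the fine-tuned ones.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    if pretrained.keys() != finetuned.keys():
        raise ValueError("both models must share the same parameter names")
    merged = {}
    for name in pretrained:
        if len(pretrained[name]) != len(finetuned[name]):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        merged[name] = [
            (1.0 - alpha) * p + alpha * f
            for p, f in zip(pretrained[name], finetuned[name])
        ]
    return merged


# Toy parameters (assumed values), standing in for real model state dicts.
pretrained_weights = {"layer.weight": [0.0, 2.0]}
finetuned_weights = {"layer.weight": [1.0, 4.0]}
merged_weights = weight_space_ensemble(pretrained_weights, finetuned_weights, alpha=0.5)
```

Because the merge happens in parameter space, the result is a single model with the same inference cost as either endpoint, unlike an output-space ensemble that runs both models.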
-
Accelerated protons produced by magnetic Penrose process in Sgr A*
Authors:
Myeonghwan Oh,
Myeong-Gu Park
Abstract:
Typical mechanisms to extract energy from a rotating black hole are the Blandford-Znajek process and the Penrose process. The Penrose process requires a special condition that is difficult to satisfy in common astrophysical situations. However, the magnetic Penrose process (MPP) does not require such a special condition, and can produce ultra-high energy cosmic rays. When neutrons decay near a rotating black hole, the MPP efficiency of the produced proton is maximized. The supermassive black hole in Sagittarius A* (Sgr A*) is likely to have a radiatively inefficient accretion flow that is hot enough to produce neutrons by nuclear reactions, which can be subsequently accelerated to high energy by the MPP. We calculate the production rate of accelerated protons from Sgr A* to estimate the gamma-ray flux at Earth produced by these accelerated protons and the flux of the accelerated protons themselves transported from Sgr A* to Earth. We find that these very high-energy gamma rays ($E_γ\gtrsim10\,\mathrm{TeV}$) amount to a significant fraction of the gamma-ray flux from HESS J1745-290 and the central molecular zone around $100\,\mathrm{TeV}$. The accelerated proton flux, when the dimensionless spin parameter $a_{*}=0.5$ and the magnetic field strength in the vicinity of the black hole $B_{0}=100\,\mathrm{G}$, is about $1.6-4.1\%$ of the cosmic ray proton flux from the KASCADE experiment at about $1\,\mathrm{PeV}$. Due to the finite decay time of neutrons which need to be transported from the accretion flow to the acceleration zone, our acceleration model can operate only around black holes with mass not much greater than $\sim10^8\,M_\odot$.
Submitted 11 July, 2024; v1 submitted 7 June, 2024;
originally announced June 2024.
-
BIPED: Pedagogically Informed Tutoring System for ESL Education
Authors:
Soonwoo Kwon,
Sojung Kim,
Minju Park,
Seunghyun Lee,
Kyuseok Kim
Abstract:
Large Language Models (LLMs) have a great potential to serve as readily available and cost-efficient Conversational Intelligent Tutoring Systems (CITS) for teaching L2 learners of English. Existing CITS, however, are designed to teach only simple concepts or lack the pedagogical depth necessary to address diverse learning strategies. To develop a more pedagogically informed CITS capable of teaching complex concepts, we construct a BIlingual PEDagogically-informed Tutoring Dataset (BIPED) of one-on-one, human-to-human English tutoring interactions. Through post-hoc analysis of the tutoring interactions, we come up with a lexicon of dialogue acts (34 tutor acts and 9 student acts), which we use to further annotate the collected dataset. Based on a two-step framework of first predicting the appropriate tutor act then generating the corresponding response, we implemented two CITS models using GPT-4 and SOLAR-KO, respectively. We experimentally demonstrate that the implemented models not only replicate the style of human teachers but also employ diverse and contextually appropriate pedagogical strategies.
Submitted 5 June, 2024;
originally announced June 2024.
-
Discrepancies Between JWST Observations and Simulations of Quenched Massive Galaxies at $z > 3$: A Comparative Study With IllustrisTNG and ASTRID
Authors:
Emma Jane Weller,
Fabio Pacucci,
Yueying Ni,
Lars Hernquist,
Minjung Park
Abstract:
Recent JWST observations have uncovered an unexpectedly large population of massive quiescent galaxies at $z>3$. Using the cosmological simulations IllustrisTNG and ASTRID, we identify analogous galaxies and investigate their abundance, formation, quenching mechanisms, and post-quenching evolution for stellar masses $9.5 < \log_{10}{(M_\star/{\rm M}_\odot)} < 12$. We apply three different quenching definitions and find that both simulations significantly underestimate the comoving number density of quenched massive galaxies at $z \gtrsim 3$ compared to JWST observations by up to $\sim 2$ dex. This fact highlights the necessity for improved physical models of AGN feedback in galaxy formation simulations. In both simulations, the high-$z$ quenched massive galaxies often host overmassive central black holes above the standard $M_{BH}-M_\star$ relation, implying that the AGN feedback plays a crucial role in quenching galaxies in the early Universe. The typical quenching timescales for these galaxies are $\sim 200-600$ Myr. IllustrisTNG primarily employs AGN kinetic feedback, while ASTRID relies on AGN thermal feedback, which is less effective and has a longer quenching timescale. We also study the post-quenching evolution of the high-$z$ massive quiescent galaxies and find that many experience subsequent reactivation of star formation, evolving into primary progenitors of $z=0$ brightest cluster galaxies.
Submitted 4 June, 2024;
originally announced June 2024.
-
Inverse design of photonic surfaces on Inconel via multi-fidelity machine learning ensemble framework and high throughput femtosecond laser processing
Authors:
Luka Grbcic,
Minok Park,
Mahmoud Elzouka,
Ravi Prasher,
Juliane Müller,
Costas P. Grigoropoulos,
Sean D. Lubner,
Vassilia Zorba,
Wibe Albert de Jong
Abstract:
We demonstrate a multi-fidelity (MF) machine learning ensemble framework for the inverse design of photonic surfaces, trained on a dataset of 11,759 samples that we fabricate using high throughput femtosecond laser processing. The MF ensemble combines an initial low fidelity model for generating design solutions, with a high fidelity model that refines these solutions through local optimization. The combined MF ensemble can generate multiple disparate sets of laser-processing parameters that can each produce the same target input spectral emissivity with high accuracy (root mean squared errors < 2%). SHapley Additive exPlanations analysis shows transparent model interpretability of the complex relationship between laser parameters and spectral emissivity. Finally, the MF ensemble is experimentally validated by fabricating and evaluating photonic surface designs that it generates for improved efficiency energy harvesting devices. Our approach provides a powerful tool for advancing the inverse design of photonic surfaces in energy harvesting applications.
Submitted 3 June, 2024;
originally announced June 2024.
-
Recover as It is Designed to Be: Recovering from Compatibility Mobile App Crashes by Reusing User Flows
Authors:
Donghwi Kim,
Hyungjun Yoon,
Chang Min Park,
Sujin Han,
Youngjin Kwon,
Steven Y. Ko,
Sung-Ju Lee
Abstract:
Android OS is severely fragmented by API updates and device vendors' OS customization, creating a market condition where vastly different OS versions coexist. This gives rise to compatibility crash problems where Android apps crash on certain Android versions but not on others. Although well-known, this problem is extremely challenging for app developers to overcome due to the sheer number of Android versions in the market that must be tested. We present RecoFlow, a framework for enabling app developers to automatically recover an app from a crash by programming user flows with our API and visual tools. RecoFlow tracks app feature usage with the user flows on user devices and recovers an app from a crash by replaying UI actions of the app feature disrupted by the crash. To prevent recurring compatibility crashes, RecoFlow executes a previously crashed app in compatibility mode that is enabled by our novel Android OS virtualization technique. Our evaluation with professional Android developers shows that our API and tools are easy to use and effective in recovering from compatibility crashes.
Submitted 3 June, 2024;
originally announced June 2024.
-
Posterior Label Smoothing for Node Classification
Authors:
Jaeseung Heo,
Moonjeong Park,
Dongwoo Kim
Abstract:
Soft labels can improve the generalization of a neural network classifier in many domains, such as image classification. Despite its success, the current literature has overlooked the efficiency of label smoothing in node classification with graph-structured data. In this work, we propose a simple yet effective label smoothing for the transductive node classification task. We design the soft label to encapsulate the local context of the target node through the neighborhood label distribution. We apply the smoothing method for seven baseline models to show its effectiveness. The label smoothing methods improve the classification accuracy in 10 node classification datasets in most cases. In the following analysis, we find that incorporating global label statistics in posterior computation is the key to the success of label smoothing. Further investigation reveals that the soft labels mitigate overfitting during training, leading to better generalization performance.
Submitted 1 June, 2024;
originally announced June 2024.
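As a rough illustration of building a soft label from the neighborhood label distribution, the sketch below mixes a node's one-hot label with the empirical label distribution of its neighbors. It is a simplified stand-in, not the posterior computation proposed in the paper; the function name `neighborhood_soft_label`, the mixing weight `alpha`, and the toy graph are all assumptions made for illustration.

```python
from collections import Counter


def neighborhood_soft_label(node, labels, adjacency, num_classes, alpha=0.5):
    """Mix a node's one-hot label with its neighbors' label distribution.

    alpha is a hypothetical mixing weight: 0 keeps the hard label,
    1 uses only the neighborhood distribution.
    """
    one_hot = [0.0] * num_classes
    one_hot[labels[node]] = 1.0

    neighbors = adjacency.get(node, [])
    if not neighbors:
        # Isolated node: nothing to smooth with, keep the hard label.
        return one_hot

    counts = Counter(labels[n] for n in neighbors)
    neigh_dist = [counts.get(c, 0) / len(neighbors) for c in range(num_classes)]
    return [(1 - alpha) * h + alpha * d for h, d in zip(one_hot, neigh_dist)]


# Toy graph (assumed): node 0 has class 0 and neighbors with classes 0, 1, 1.
labels = {0: 0, 1: 0, 2: 1, 3: 1}
adjacency = {0: [1, 2, 3], 1: [0], 2: [0], 3: [0]}
soft = neighborhood_soft_label(0, labels, adjacency, num_classes=2)
```

The soft label stays a valid probability distribution because it is a convex combination of two distributions.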
-
Exploring Exotic Decays of the Higgs Boson to Multi-Photons at the LHC via Multimodal Learning Approaches
Authors:
A. Hammad,
P. Ko,
Chih-Ting Lu,
Myeonghun Park
Abstract:
The Standard Model (SM) Higgs boson, the most recently discovered elementary particle, may still serve as a mediator between the SM sector and a new physics sector related to dark matter (DM). The Large Hadron Collider (LHC) has not yet fully constrained the physics associated with the Higgs boson, leaving room for such possibilities. Among the various potential mass scales of the dark sector, the sub-GeV mass range is particularly intriguing. This parameter space presents significant challenges for DM direct detection experiments that rely on nuclear recoils. Various innovative experimental methods are currently under investigation to explore this sub-GeV dark sector. The LHC, functioning as a Higgs factory, could explore this sector once the challenge of identifying DM signals is resolved. Due to the significantly lower mass of particles in the dark sector compared to the Higgs boson, these particles are expected to be highly boosted following the Higgs boson's decay. However, detecting and identifying these highly boosted particles remains a considerable challenge at the LHC, despite their eventual decay into SM particles. We employ a well-motivated leptophobic $Z^{\prime}_B$ model as a prototype to analyze the distinctive signatures from Higgs boson exotic decays into multi-photons. These signatures consist of collimated photons that fail to meet the photon isolation criteria, forming jet-like objects. Conventional analyses relying solely on the purity of energy deposits in the electromagnetic calorimeter would fail to detect these signatures, as they would be overwhelmed by background events from Quantum Chromodynamics. To effectively distinguish between such novel signal signatures and SM background events, we leverage advanced machine learning techniques, specifically the transformer encoder in a multimodal network structure.
Submitted 29 May, 2024;
originally announced May 2024.
-
Diffusion Rejection Sampling
Authors:
Byeonghu Na,
Yeongmin Kim,
Minsang Park,
Donghyeok Shin,
Wanmo Kang,
Il-Chul Moon
Abstract:
Recent advances in powerful pre-trained diffusion models encourage the development of methods to improve the sampling performance under well-trained diffusion models. This paper introduces Diffusion Rejection Sampling (DiffRS), which uses a rejection sampling scheme that aligns the sampling transition kernels with the true ones at each timestep. The proposed method can be viewed as a mechanism that evaluates the quality of samples at each intermediate timestep and refines them with varying effort depending on the sample. Theoretical analysis shows that DiffRS can achieve a tighter bound on sampling error compared to pre-trained models. Empirical results demonstrate the state-of-the-art performance of DiffRS on the benchmark datasets and the effectiveness of DiffRS for fast diffusion samplers and large-scale text-to-image diffusion models. Our code is available at https://github.com/aailabkaist/DiffRS.
Submitted 28 May, 2024;
originally announced May 2024.
-
Pseudo-Hermitian Topology of Multiband Non-Hermitian Systems
Authors:
Jung-Wan Ryu,
Jae-Ho Han,
Chang-Hwan Yi,
Hee Chul Park,
Moon Jip Park
Abstract:
The complex eigenenergies and non-orthogonal eigenstates of non-Hermitian systems exhibit unique topological phenomena that cannot appear in Hermitian systems. Representative examples are the non-Hermitian skin effect and exceptional points. In a two-dimensional parameter space, topological classifications of non-separable bands in multiband non-Hermitian systems can be established by invoking a permutation group, where the product of the permutation represents state exchange due to exceptional points in the space. We unveil in this work the role of pseudo-Hermitian lines in non-Hermitian topology for multiple bands. Contrary to current understanding, the non-separability of non-Hermitian multibands can be topologically non-trivial without exceptional points in two-dimensional space. Our work builds on the fundamental and comprehensive understanding of non-Hermitian multiband systems and also offers versatile applications and realizations of non-Hermitian systems without the need to consider exceptional points.
Submitted 27 May, 2024;
originally announced May 2024.
-
Diffusion Bridge AutoEncoders for Unsupervised Representation Learning
Authors:
Yeongmin Kim,
Kwanghyeon Lee,
Minsang Park,
Byeonghu Na,
Il-Chul Moon
Abstract:
Diffusion-based representation learning has achieved substantial attention due to its promising capabilities in latent representation and sample generation. Recent studies have employed an auxiliary encoder to identify a corresponding representation from a sample and to adjust the dimensionality of a latent variable z. Meanwhile, this auxiliary structure introduces an information split problem because the diffusion and the auxiliary encoder would divide the information from the sample into two representations for each model. Particularly, the information modeled by the diffusion becomes over-regularized because of the static prior distribution on xT. To address this problem, we introduce Diffusion Bridge AutoEncoders (DBAE), which enable z-dependent endpoint xT inference through a feed-forward architecture. This structure creates an information bottleneck at z, so xT becomes dependent on z in its generation. This results in two consequences: 1) z holds the full information of samples, and 2) xT becomes a learnable distribution that is no longer static. We propose an objective function for DBAE to enable both reconstruction and generative modeling, with their theoretical justification. Empirical evidence supports the effectiveness of the intended design in DBAE, which notably enhances downstream inference quality, reconstruction, and disentanglement. Additionally, DBAE generates high-fidelity samples in the unconditional generation.
Submitted 27 May, 2024;
originally announced May 2024.
-
Improving Multi-lingual Alignment Through Soft Contrastive Learning
Authors:
Minsu Park,
Seyeon Choi,
Chanyeol Choi,
Jun-Seong Kim,
Jy-yong Sohn
Abstract:
Making decent multi-lingual sentence representations is critical to achieving high performance in cross-lingual downstream tasks. In this work, we propose a novel method to align multi-lingual embeddings based on the similarity of sentences measured by a pre-trained mono-lingual embedding model. Given translation sentence pairs, we train a multi-lingual model in a way that the similarity between cross-lingual embeddings follows the similarity of sentences measured by the mono-lingual teacher model. Our method can be considered as contrastive learning with soft labels defined as the similarity between sentences. Our experimental results on five languages show that our contrastive loss with soft labels far outperforms conventional contrastive loss with hard labels in various benchmarks for bitext mining tasks and STS tasks. In addition, our method outperforms existing multi-lingual embeddings, including LaBSE, on the Tatoeba dataset. The code is available at https://github.com/YAI12xLinq-B/IMASCL
Submitted 28 May, 2024; v1 submitted 25 May, 2024;
originally announced May 2024.
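The idea of contrastive learning with soft labels can be sketched as a cross-entropy between two row-wise softmax distributions: one from the teacher's sentence similarities (the soft labels) and one from the student's cross-lingual similarities. The code below is a minimal sketch under assumptions, not the authors' implementation; the function names, the shared `temperature`, and the toy similarity matrices are hypothetical. A conventional hard-label contrastive loss corresponds to replacing the soft targets with a one-hot vector on the diagonal.

```python
import math


def softmax(row, temperature=1.0):
    """Numerically stable softmax over one row of similarity scores."""
    scaled = [v / temperature for v in row]
    m = max(scaled)
    exps = [math.exp(v - m) for v in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def soft_contrastive_loss(student_sim, teacher_sim, temperature=0.1):
    """Mean cross-entropy between teacher-derived soft labels and student predictions."""
    loss = 0.0
    for s_row, t_row in zip(student_sim, teacher_sim):
        target = softmax(t_row, temperature)  # soft labels from the teacher
        pred = softmax(s_row, temperature)    # student similarity distribution
        loss -= sum(t * math.log(p) for t, p in zip(target, pred))
    return loss / len(student_sim)


# Toy similarities (assumed): two sentences that the teacher considers related.
teacher_sim = [[1.0, 0.8], [0.8, 1.0]]
aligned_student = [[1.0, 0.8], [0.8, 1.0]]     # follows the teacher
hard_like_student = [[1.0, 0.0], [0.0, 1.0]]   # treats non-pairs as unrelated

loss_aligned = soft_contrastive_loss(aligned_student, teacher_sim)
loss_hard_like = soft_contrastive_loss(hard_like_student, teacher_sim)
```

In this toy case the student that reproduces the teacher's similarity structure gets the lower loss, since cross-entropy against a fixed target is minimized when the predicted distribution equals the target.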
-
Scalar Field Perturbation of Hairy Black Holes in EsGB theory
Authors:
Young-Hwan Hyun,
Boris Latosh,
Miok Park
Abstract:
We investigate scalar field perturbations of the hairy black holes involved with spontaneous symmetry breaking of the global U(1) symmetry in Einstein-scalar-Gauss-Bonnet theory for asymptotically flat spacetimes. We consider the mechanism that black holes without hairs become unstable at the critical point of the coupling constant and undergo a phase transition to hairy black holes in the symmetry-broken phase driven by spontaneous symmetry breaking. This transition occurs near the black hole horizon due to the diminishing influence of the Gauss-Bonnet term at infinity. To examine such a process, we introduce a scalar field perturbation on the newly formed background spacetime. We solve the linearized perturbation equation using Green's function method. We begin by solving the Green's function, incorporating the branch cut contribution. This allows us to analytically investigate the late-time behavior of the perturbation at both spatial and null infinity. We found that the late-time behavior only differs from the Schwarzschild black hole by a mass term. We then proceed to calculate the quasinormal modes (QNMs) numerically, which arise from the presence of poles in the Green's function. Our primary interest lies in utilizing QNMs to investigate the stability of the black hole solutions in both the symmetric and symmetry-broken phases. Consistent with the prior study, our analysis shows that hairy black holes in the symmetric phase become unstable when the quadratic coupling constant exceeds a critical value for a fixed value of the quartic coupling constant. In contrast, hairy black holes in the symmetry-broken phase are always stable at the critical value. These numerical results provide strong evidence for a dynamical process in which unstable black holes without hairs transition into stable hairy black holes in the symmetry-broken phase through the spontaneous symmetry breaking.
Submitted 14 May, 2024;
originally announced May 2024.
-
Revisiting Reactor Anti-Neutrino 5 MeV Bump with $^{13}$C Neutral-Current Interaction
Authors:
Pouya Bakhti,
Min-Gwa Park,
Meshkat Rajaee,
Chang Sub Shin,
Seodong Shin
Abstract:
For the first time, we systematically investigate the potential of neutrino-nucleus neutral current interactions with $^{13}$C to identify the origin of the 5 MeV bump observed in reactor anti-neutrino spectra in the inverse beta decay process. The distinctive signal is obtained from the de-excitation of $^{13}$C$^*$ into the ground state emitting a 3.685 MeV photon in various liquid scintillator detectors. Such an interaction predominantly occurs for the reactor anti-neutrinos within the energy range coinciding with the 5 MeV bump. A detector with a 95\% level photon and electron separation capability and small thorium contamination below $5 \times 10^{-17}$ gr/gr, located at a site with an overburden of about a few hundred m.w.e., such as the locations of the near detectors of RENO and Daya Bay, will have great sensitivity to resolve the 5 MeV bump. In addition, we propose a novel approach to track the time evolution of reactor isotopes by analyzing our $^{13}$C signal, shedding light on the contributions from $^{235}$U or $^{239}$Pu to the observed bump. This provides an extra powerful tool in both discriminating the flux models and testing any new physics possibilities for the 5 MeV bump at 3$σ$ to 5$σ$ level with much smaller systematic uncertainties, assuming 10 kt.year of data collection. Our detector requirements are realistic, aligning well with recent studies conducted for existing or forthcoming experiments.
Submitted 14 May, 2024;
originally announced May 2024.
-
Inclusive content reduces racial and gender biases, yet non-inclusive content dominates popular media outlets
Authors:
Nouar AlDahoul,
Hazem Ibrahim,
Minsu Park,
Talal Rahwan,
Yasir Zaki
Abstract:
Images are often termed as representations of perceived reality. As such, racial and gender biases in popular media imagery could play a vital role in shaping people's perceptions of society. While inquiries into such biases have examined the frequency at which different racial and gender groups appear in different forms of media, the literature still lacks a large-scale longitudinal study that further examines the manner in which these groups are portrayed. To fill this gap, we examine three media forms, namely fashion magazines, movie posters, and advertisements. To do so, we collect a large dataset comprising over 300,000 images spanning over five decades and utilize state-of-the-art machine learning models to not only classify race and gender but also identify the posture, emotional state, and body composition of the person featured in each image. We find that racial minorities appear far less frequently than their White counterparts, and when they do appear, they are portrayed less prominently and tend to convey more negative emotions. We also find that women are more likely to be portrayed with their full bodies in images, whereas men are more frequently presented with their faces. This disparity exemplifies face-ism, where emphasizing faces over bodies has been linked to perceptions of higher competence and intelligence. Finally, through a series of survey experiments, we show that exposure to inclusive content, rather than racially and gender-homogenized content, significantly reduces perception biases towards minorities in areas such as household income, hiring merit, beauty standards, leadership positions, and the representation of women in the workplace. Taken together, our findings demonstrate that racial and gender biases in media continue to be an ongoing problem that may exacerbate existing stereotypes.
Submitted 10 May, 2024;
originally announced May 2024.
-
Non-Bloch band theory of sub-symmetry-protected topological phases
Authors:
Sonu Verma,
Moon Jip Park
Abstract:
Bulk-boundary correspondence (BBC) of symmetry-protected topological (SPT) phases relates the non-trivial topological invariant of the bulk to the number of topologically protected boundary states. Recently, a finer classification of SPT phases has been discovered, known as sub-symmetry-protected topological (sub-SPT) phases. In sub-SPT phases, a fraction of the boundary states is protected by the sub-symmetry of the system, even when the full symmetry is broken. While the conventional topological invariant derived from the Bloch band is not applicable to describe the BBC in these systems, we propose to use the non-Bloch topological band theory to describe the BBC of sub-SPT phases. Using the concept of the generalized Brillouin zone (GBZ), where Bloch momenta are generalized to take complex values, we show that the non-Bloch band theory naturally gives rise to a non-Bloch topological invariant, establishing the BBC in both SPT and sub-SPT phases. In a one-dimensional system, we define the winding number, whose physical meaning corresponds to the reflection amplitude in the scattering matrix. Furthermore, the non-Bloch topological invariant characterizes the hidden intrinsic topology of the GBZ under translation symmetry-breaking boundary conditions. The topological phase transitions are characterized by the generalized momenta touching the GBZ, which accompanies the emergence of diabolic or band-touching points. Additionally, we discuss the BBCs in the presence of local or global full-symmetry or sub-symmetry-breaking deformations.
Submitted 10 May, 2024;
originally announced May 2024.
-
Cue: A Fast and Flexible Photoionization Emulator for Modeling Nebular Emission Powered By Almost Any Ionizing Source
Authors:
Yijia Li,
Joel Leja,
Benjamin D. Johnson,
Sandro Tacchella,
Rebecca Davies,
Sirio Belli,
Minjung Park,
Razieh Emami
Abstract:
The complex physics governing nebular emission in galaxies, particularly in the early universe, often defy simple low-dimensional models. This has proven to be a significant barrier in understanding the (often diverse) ionizing sources powering this emission. We present Cue, a highly flexible tool for interpreting nebular emission across a wide range of abundances and ionizing conditions of galaxies at different redshifts. Unlike typical nebular models used to interpret extragalactic nebular emission, our model does not require a specific ionizing spectrum as a source, instead approximating the ionizing spectrum with a 4-part piece-wise power-law. We train a neural net emulator based on the CLOUDY photoionization modeling code and make self-consistent nebular continuum and line emission predictions. Along with the flexible ionizing spectra, we allow freedom in [O/H], [N/O], [C/O], gas density, and total ionizing photon budget. This flexibility allows us to either marginalize over or directly measure the incident ionizing radiation, thereby directly interrogating the source of the ionizing photons in distant galaxies via their nebular emission. Our emulator demonstrates a high accuracy, with $\sim$1% uncertainty in predicting the nebular continuum and $\sim$5% uncertainty in the emission lines. Mock tests suggest Cue is well-calibrated and produces useful constraints on the ionizing spectra when $S/N (\mathrm{H}_α) \gtrsim 10$, and furthermore capable of distinguishing between the ionizing spectra predicted by single and binary stellar models. The compute efficiency of neural networks facilitates future applications of Cue for rapid modeling of the nebular emission in large samples and Monte Carlo sampling techniques.
Submitted 7 May, 2024;
originally announced May 2024.
-
Impacts of bar-driven shear and shocks on star formation
Authors:
Taehyun Kim,
Dimitri A. Gadotti,
Miguel Querejeta,
Isabel Pérez,
Almudena Zurita,
Justus Neumann,
Glenn van de Ven,
Jairo Méndez-Abreu,
Adriana de Lorenzo-Cáceres,
Patricia Sánchez-Blázquez,
Francesca Fragkoudi,
Lucimara P. Martins,
Luiz A. Silva-Lima,
Woong-Tae Kim,
Myeong-gu Park
Abstract:
Bars drive gas inflow. As the gas flows inwards, shocks and shear occur along the bar dust lanes. Such shocks and shear can affect the star formation and change the gas properties. For four barred galaxies, we present Hα velocity gradient maps that highlight bar-driven shocks and shear using data from the PHANGS-MUSE and PHANGS-ALMA surveys which allow us to study bar kinematics in unprecedented detail. Velocity gradients are enhanced along the bar dust lanes, where shocks and shear are shown to occur in numerical simulations. Velocity gradient maps also efficiently pick up expanding shells around HII regions. We put pseudo slits on the regions where velocity gradients are enhanced and find that Hα and CO velocities jump up to ~170 km/s, even after removing the effects of circular motions due to the galaxy rotation. Enhanced velocity gradients either coincide with the peak of CO intensity along the bar dust lanes or are slightly offset from CO intensity peaks, depending on the objects. Using the BPT diagnostic, we identify the source of ionization on each spaxel and find that star formation is inhibited in the high velocity gradient regions of the bar, and the majority of those regions are classified as LINER or composite. This implies that star formation is inhibited where bar-driven shear and shocks are strong. Our results are consistent with the results from the numerical simulations that show star formation is inhibited in the bar where shear force is strong.
Submitted 30 April, 2024;
originally announced May 2024.
-
Widespread rapid quenching at cosmic noon revealed by JWST deep spectroscopy
Authors:
Minjung Park,
Sirio Belli,
Charlie Conroy,
Benjamin D. Johnson,
Rebecca L. Davies,
Joel Leja,
Sandro Tacchella,
J. Trevor Mendel,
Chloë Benton,
Letizia Bugiani,
Razieh Emami,
Amirhossein Khoram,
Yijia Li,
Gabriel Maheson,
Elijah P. Mathews,
Rohan P. Naidu,
Erica J. Nelson,
Bryan A. Terrazas,
Rainer Weinberger
Abstract:
Massive quiescent galaxies in the young universe are expected to be quenched rapidly, but it is unclear whether they all experience starbursts before quenching and what physical mechanism drives rapid quenching. We study 16 massive quiescent galaxies ($\log(M_\star/M_\odot) > 10$) at $z\sim2$ selected from a representative sample of the Blue Jay survey. We reconstruct their star formation histories by fitting spectral energy distribution models to the JWST/NIRSpec $R\sim1000$ spectra. We find that massive quiescent galaxies can be split into three categories with roughly equal numbers of galaxies according to their SFHs: 1) Relatively old galaxies quenched at early epochs; 2) Galaxies that are rapidly and recently quenched after a flat or bursty formation history (depending on the assumed prior); 3) Galaxies that are rapidly and recently quenched after a major starburst. Most recently quenched galaxies show neutral gas outflows, probed by blueshifted $\rm Na\,I\,D$ absorption, and ionized gas emission, with line ratios consistent with active galactic nucleus (AGN) diagnostics. This suggests that AGN activity drives multi-phase gas outflows, leading to rapid quenching. By tracing back the SFHs of the entire sample, we predict the number density of massive quiescent galaxies at $z=4-6$: $n=3.0\pm1.4\times10^{-5}\,\rm Mpc^{-3}$. The two oldest massive quiescent galaxies in our sample appear to have extremely early formation and quenching ($z\gtrsim6$), possibly descendants of early post-starbursts at $z>3$. These galaxies still show neutral gas reservoirs and low-level star formation, consistent with weak H$α$ emission, perhaps because the ejective AGN feedback that caused rapid quenching has weakened over time.
Submitted 27 April, 2024;
originally announced April 2024.
-
How to Parameterize Asymmetric Quantization Ranges for Quantization-Aware Training
Authors:
Jaeseong You,
Minseop Park,
Kyunggeun Lee,
Seokjun An,
Chirag Patel,
Markus Nage
Abstract:
This paper investigates three different parameterizations of asymmetric uniform quantization for quantization-aware training: (1) scale and offset, (2) minimum and maximum, and (3) beta and gamma. We perform a comprehensive comparative analysis of these parameterizations' influence on quantization-aware training, using both controlled experiments and real-world large language models. Our particular focus is on their changing behavior in response to critical training hyperparameters, bit width and learning rate. Based on our investigation, we propose best practices to stabilize and accelerate quantization-aware training with learnable asymmetric quantization ranges.
Submitted 25 April, 2024;
originally announced April 2024.
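To make the first two parameterizations concrete, the sketch below shows asymmetric uniform quantization expressed with a scale and an integer offset, plus a helper that derives those values from a minimum and maximum. It follows one common convention and is not taken from the paper; the function names and the 8-bit default are assumptions, and the beta/gamma parameterization is left out because the abstract does not define it.

```python
def minmax_to_scale_offset(x_min, x_max, bits=8):
    """Convert a (minimum, maximum) range into a (scale, offset) pair."""
    levels = 2 ** bits - 1
    scale = (x_max - x_min) / levels
    offset = round(-x_min / scale)  # integer grid index that represents 0.0
    return scale, offset


def fake_quantize(x, scale, offset, bits=8):
    """Quantize x onto the integer grid [0, 2^bits - 1], then dequantize."""
    q_max = 2 ** bits - 1
    q = round(x / scale) + offset
    q = max(0, min(q_max, q))  # clamp to the representable grid
    return (q - offset) * scale


# Assumed example range: [-1, 3] with 8 bits.
scale, offset = minmax_to_scale_offset(-1.0, 3.0)
values = [-1.0, -0.25, 0.0, 1.5, 3.0]
recovered = [fake_quantize(v, scale, offset) for v in values]
```

Both parameterizations describe the same quantization grid. In quantization-aware training they differ in which quantities receive gradients, which is the behavior the paper compares across bit widths and learning rates.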
-
Low-Light Image Enhancement Framework for Improved Object Detection in Fisheye Lens Datasets
Authors:
Dai Quoc Tran,
Armstrong Aboah,
Yuntae Jeon,
Maged Shoman,
Minsoo Park,
Seunghee Park
Abstract:
This study addresses the evolving challenges in urban traffic monitoring detection systems based on fisheye lens cameras by proposing a framework that improves the efficacy and accuracy of these systems. In the context of urban infrastructure and transportation management, advanced traffic monitoring systems have become critical for managing the complexities of urbanization and increasing vehicle density. Traditional monitoring methods, which rely on static cameras with narrow fields of view, are ineffective in dynamic urban environments, necessitating the installation of multiple cameras, which raises costs. Fisheye lenses, which were recently introduced, provide wide and omnidirectional coverage in a single frame, making them a transformative solution. However, issues such as distorted views and blurriness arise, preventing accurate object detection on these images. Motivated by these challenges, this study proposes a novel approach that combines a transformer-based image enhancement framework and an ensemble learning technique to address these challenges and improve traffic monitoring accuracy, making significant contributions to the future of intelligent traffic management systems. Our proposed methodological framework won 5th place in the 2024 AI City Challenge, Track 4, with an F1 score of 0.5965 on experimental validation data. The experimental results demonstrate the effectiveness, efficiency, and robustness of the proposed system. Our code is publicly available at https://github.com/daitranskku/AIC2024-TRACK4-TEAM15.
Submitted 15 April, 2024;
originally announced April 2024.
-
HyperCLOVA X Technical Report
Authors:
Kang Min Yoo,
Jaegeun Han,
Sookyo In,
Heewon Jeon,
Jisu Jeong,
Jaewook Kang,
Hyunwook Kim,
Kyung-Min Kim,
Munhyong Kim,
Sungju Kim,
Donghyun Kwak,
Hanock Kwak,
Se Jung Kwon,
Bado Lee,
Dongsoo Lee,
Gichang Lee,
Jooho Lee,
Baeseong Park,
Seongjin Shin,
Joonsang Yu,
Seolki Baek,
Sumin Byeon,
Eungsup Cho,
Dooseok Choe,
Jeesung Han
, et al. (371 additional authors not shown)
Abstract:
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment to responsible AI. The model is evaluated across various benchmarks, including comprehensive reasoning, knowledge, commonsense, factuality, coding, math, chatting, instruction-following, and harmlessness, in both Korean and English. HyperCLOVA X exhibits strong reasoning capabilities in Korean backed by a deep understanding of the language and cultural nuances. Further analysis of the inherent bilingual nature and its extension to multilingualism highlights the model's cross-lingual proficiency and strong generalization ability to untargeted languages, including machine translation between several language pairs and cross-lingual inference tasks. We believe that HyperCLOVA X can provide helpful guidance for regions or countries in developing their sovereign LLMs.
Submitted 13 April, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
Strong interactions and isospin symmetry breaking in a supermoiré lattice
Authors:
Yonglong Xie,
Andrew T. Pierce,
Jeong Min Park,
Daniel E. Parker,
Jie Wang,
Patrick Ledwith,
Zhuozhen Cai,
Kenji Watanabe,
Takashi Taniguchi,
Eslam Khalaf,
Ashvin Vishwanath,
Pablo Jarillo-Herrero,
Amir Yacoby
Abstract:
In multilayer moiré heterostructures, the interference of multiple twist angles ubiquitously leads to tunable ultra-long-wavelength patterns known as supermoiré lattices. However, their impact on the system's many-body electronic phase diagram remains largely unexplored. We present local compressibility measurements revealing numerous incompressible states resulting from supermoiré-lattice-scale isospin symmetry breaking driven by strong interactions. By using the supermoiré lattice occupancy as a probe of isospin symmetry, we observe an unexpected doubling of the miniband filling near $ν=-2$, possibly indicating a hidden phase transition or normal-state pairing proximal to the superconducting phase. Our work establishes supermoiré lattices as a tunable parameter for designing novel quantum phases and an effective tool for unraveling correlated phenomena in moiré materials.
Submitted 1 April, 2024;
originally announced April 2024.
-
Augmented Reality based Simulated Data (ARSim) with multi-view consistency for AV perception networks
Authors:
Aqeel Anwar,
Tae Eun Choe,
Zian Wang,
Sanja Fidler,
Minwoo Park
Abstract:
Detecting a diverse range of objects under various driving scenarios is essential for the effectiveness of autonomous driving systems. However, the real-world data collected often lacks the necessary diversity, presenting a long-tail distribution. Although synthetic data has been utilized to overcome this issue by generating virtual scenes, it faces hurdles such as a significant domain gap and the substantial effort required from 3D artists to create realistic environments. To overcome these challenges, we present ARSim, a fully automated, comprehensive, modular framework designed to enhance real multi-view image data with 3D synthetic objects of interest. The proposed method integrates domain adaptation and randomization strategies to address covariate shift between real and simulated data by inferring essential domain attributes from real data and employing simulation-based randomization for other attributes. We construct a simplified virtual scene using real data and strategically place 3D synthetic assets within it. Illumination is achieved by estimating light distribution from multiple images capturing the surroundings of the vehicle. Camera parameters from real data are employed to render synthetic assets in each frame. The resulting augmented multi-view consistent dataset is used to train a multi-camera perception network for autonomous vehicles. Experimental results on various AV perception tasks demonstrate the superior performance of networks trained on the augmented dataset.
Submitted 22 March, 2024;
originally announced March 2024.
-
Empowering Personalized Learning through a Conversation-based Tutoring System with Student Modeling
Authors:
Minju Park,
Sojung Kim,
Seunghyun Lee,
Soonwoo Kwon,
Kyuseok Kim
Abstract:
As recent Large Language Models (LLMs) become increasingly competent in zero-shot and few-shot reasoning across various domains, educators are showing a growing interest in leveraging these LLMs in conversation-based tutoring systems. However, building a conversation-based personalized tutoring system poses considerable challenges in accurately assessing the student and strategically incorporating the assessment into teaching within the conversation. In this paper, we discuss design considerations for a personalized tutoring system that involves the following two key components: (1) student modeling with diagnostic components, and (2) a conversation-based tutor utilizing an LLM with prompt engineering that incorporates student assessment outcomes and various instructional strategies. Based on these design considerations, we created a proof-of-concept tutoring system focused on personalization and tested it with 20 participants. The results substantiate that our system's framework facilitates personalization, with particular emphasis on the elements constituting student modeling. A web demo of our system is available at http://rlearning-its.com.
Submitted 20 March, 2024;
originally announced March 2024.
-
Mitigating Oversmoothing Through Reverse Process of GNNs for Heterophilic Graphs
Authors:
MoonJeong Park,
Jaeseung Heo,
Dongwoo Kim
Abstract:
Graph Neural Networks (GNNs) resemble a diffusion process, leading to the over-smoothing of learned representations when stacking many layers. Hence, the reverse process of message passing can produce distinguishable node representations by inverting the forward message propagation. The distinguishable representations can help us to better classify neighboring nodes with different labels, such as in heterophilic graphs. In this work, we apply the design principle of the reverse process to three variants of GNNs. Through experiments on heterophilic graph data, where adjacent nodes need to have different representations for successful classification, we show that the reverse process significantly improves the prediction performance in many cases. Additional analysis reveals that the reverse mechanism can mitigate over-smoothing over hundreds of layers. Our code is available at https://github.com/ml-postech/reverse-gnn.
Submitted 11 June, 2024; v1 submitted 11 March, 2024;
originally announced March 2024.
-
ICLN: Input Convex Loss Network for Decision Focused Learning
Authors:
Haeun Jeon,
Hyunglip Bae,
Minsu Park,
Chanyeong Kim,
Woo Chang Kim
Abstract:
In decision-making problems under uncertainty, predicting unknown parameters is often considered independent of the optimization part. Decision-focused Learning (DFL) is a task-oriented framework that integrates prediction and optimization by adapting the predictive model to give better decisions for the corresponding task. Here, an inevitable challenge arises when computing gradients of the optimal decision with respect to the parameters. Existing research copes with this issue by smoothly reforming surrogate optimization or by constructing surrogate loss functions that mimic the task loss. However, these approaches are applied to restricted optimization domains or build functions in a local manner, leading to a large computational time. In this paper, we propose Input Convex Loss Network (ICLN), a novel global surrogate loss which can be implemented in a general DFL paradigm. ICLN learns the task loss via Input Convex Neural Networks, which are guaranteed to be convex for some inputs, while keeping the global structure for the other inputs. This enables ICLN to admit general DFL through only a single surrogate loss without any sense for choosing appropriate parametric forms. We confirm the effectiveness and flexibility of ICLN by evaluating our proposed model with three stochastic decision-making problems.
Submitted 4 March, 2024;
originally announced March 2024.
-
eXponential FAmily Dynamical Systems (XFADS): Large-scale nonlinear Gaussian state-space modeling
Authors:
Matthew Dowling,
Yuan Zhao,
Il Memming Park
Abstract:
State-space graphical models and the variational autoencoder framework provide a principled apparatus for learning dynamical systems from data. State-of-the-art probabilistic approaches are often able to scale to large problems at the cost of flexibility of the variational posterior or expressivity of the dynamics model. However, those consolidations can be detrimental if the ultimate goal is to learn a generative model capable of explaining the spatiotemporal structure of the data and making accurate forecasts. We introduce a low-rank structured variational autoencoding framework for nonlinear Gaussian state-space graphical models capable of capturing dense covariance structures that are important for learning dynamical systems with predictive capabilities. Our inference algorithm exploits the covariance structures that arise naturally from sample based approximate Gaussian message passing and low-rank amortized posterior updates -- effectively performing approximate variational smoothing with time complexity scaling linearly in the state dimensionality. In comparisons with other deep state-space model architectures our approach consistently demonstrates the ability to learn a more predictive generative model. Furthermore, when applied to neural physiological recordings, our approach is able to learn a dynamical system capable of forecasting population spiking and behavioral correlates from a small portion of single trials.
Submitted 31 May, 2024; v1 submitted 2 March, 2024;
originally announced March 2024.
-
Training Unbiased Diffusion Models From Biased Dataset
Authors:
Yeongmin Kim,
Byeonghu Na,
Minsang Park,
JoonHo Jang,
Dongjun Kim,
Wanmo Kang,
Il-Chul Moon
Abstract:
With significant advancements in diffusion models, addressing the potential risks of dataset bias becomes increasingly important. Since generated outputs directly suffer from dataset bias, mitigating latent bias becomes a key factor in improving sample quality and proportion. This paper proposes time-dependent importance reweighting to mitigate the bias for the diffusion models. We demonstrate that the time-dependent density ratio becomes more precise than previous approaches, thereby minimizing error propagation in generative learning. While directly applying it to score-matching is intractable, we discover that using the time-dependent density ratio both for reweighting and score correction can lead to a tractable form of the objective function to regenerate the unbiased data density. Furthermore, we theoretically establish a connection with traditional score-matching, and we demonstrate its convergence to an unbiased distribution. The experimental evidence supports the usefulness of the proposed method, which outperforms baselines including time-independent importance reweighting on CIFAR-10, CIFAR-100, FFHQ, and CelebA with various bias settings. Our code is available at https://github.com/alsdudrla10/TIW-DSM.
Submitted 2 March, 2024;
originally announced March 2024.
-
REPrune: Channel Pruning via Kernel Representative Selection
Authors:
Mincheol Park,
Dongjin Kim,
Cheonjun Park,
Yuna Park,
Gyeong Eun Gong,
Won Woo Ro,
Suhyun Kim
Abstract:
Channel pruning is widely accepted to accelerate modern convolutional neural networks (CNNs). The resulting pruned model benefits from its immediate deployment on general-purpose software and hardware resources. However, its large pruning granularity, specifically at the unit of a convolution filter, often leads to undesirable accuracy drops due to the inflexibility of deciding how and where to introduce sparsity to the CNNs. In this paper, we propose REPrune, a novel channel pruning technique that emulates kernel pruning, fully exploiting the finer but structured granularity. REPrune identifies similar kernels within each channel using agglomerative clustering. Then, it selects filters that maximize the incorporation of kernel representatives while optimizing the maximum cluster coverage problem. By integrating with a simultaneous training-pruning paradigm, REPrune promotes efficient, progressive pruning throughout training CNNs, avoiding the conventional train-prune-finetune sequence. Experimental results highlight that REPrune performs better in computer vision tasks than existing methods, effectively achieving a balance between acceleration ratio and performance retention.
Submitted 8 March, 2024; v1 submitted 27 February, 2024;
originally announced February 2024.
-
Eigenstate switching of topologically ordered states using non-Hermitian perturbations
Authors:
Cheol Hun Yeom,
Beom Hyun Kim,
Moon Jip Park
Abstract:
Topologically ordered phases have degenerate ground states that are robust against local perturbations, providing a promising platform for fault-tolerant quantum computation. Despite the non-local feature of the topological order, we find that local non-Hermitian perturbations can induce transitions between the topologically ordered ground states. In this work, we study the toric code in the presence of non-Hermitian perturbations. By controlling the non-Hermiticity, we show that non-orthogonal ground states can exhibit an eigenstate coalescence and have a spectral singularity, known as an exceptional point (EP). We explore the potential of EPs in the control of topological order. Adiabatically encircling EPs allows for the controlled switching of eigenstates, enabling dynamic manipulation between the ground state degeneracy. Interestingly, we show a property of our scheme that arbitrary strengths of local perturbations can induce the EP and eigenstate switching. Finally, we also show the orientation-dependent behavior of non-adiabatic transitions (NAT) during the dynamic encirclement around an EP. Our work shows that control of the non-Hermiticity can serve as a promising strategy for fault-tolerant quantum information processing.
Submitted 27 February, 2024;
originally announced February 2024.
-
Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram
Authors:
Yeongyeon Na,
Minje Park,
Yunwon Tae,
Sunghoon Joo
Abstract:
Electrocardiograms (ECG) are widely employed as a diagnostic tool for monitoring electrical signals originating from a heart. Recent machine learning research efforts have focused on the application of screening various diseases using ECG signals. However, adapting to the application of screening disease is challenging in that labeled ECG data are limited. Achieving general representation through self-supervised learning (SSL) is a well-known approach to overcome the scarcity of labeled data; however, a naive application of SSL to ECG data, without considering the spatial-temporal relationships inherent in ECG signals, may yield suboptimal results. In this paper, we introduce ST-MEM (Spatio-Temporal Masked Electrocardiogram Modeling), designed to learn spatio-temporal features by reconstructing masked 12-lead ECG data. ST-MEM outperforms other SSL baseline methods in various experimental settings for arrhythmia classification tasks. Moreover, we demonstrate that ST-MEM is adaptable to various lead combinations. Through quantitative and qualitative analysis, we show a spatio-temporal relationship within ECG data. Our code is available at https://github.com/bakqui/ST-MEM.
Submitted 19 March, 2024; v1 submitted 2 February, 2024;
originally announced February 2024.
-
Gradient Alignment with Prototype Feature for Fully Test-time Adaptation
Authors:
Juhyeon Shin,
Jonghyun Lee,
Saehyung Lee,
Minjun Park,
Dongjun Lee,
Uiwon Hwang,
Sungroh Yoon
Abstract:
In the context of Test-time Adaptation (TTA), we propose a regularizer, dubbed Gradient Alignment with Prototype feature (GAP), which alleviates the inappropriate guidance that the entropy minimization loss receives from misclassified pseudo labels. We developed a gradient alignment loss to precisely manage the adaptation process, ensuring that changes made for some data don't negatively impact the model's performance on other data. We introduce a prototype feature of a class as a proxy measure of the negative impact. To make the GAP regularizer feasible under the TTA constraints, where the model can only access test data without labels, we tailored its formula in two ways: approximating prototype features with weight vectors of the classifier, and calculating gradients without back-propagation. We demonstrate that GAP significantly improves TTA methods across various datasets, which proves its versatility and effectiveness.
Submitted 14 February, 2024;
originally announced February 2024.
-
H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields
Authors:
Minyoung Park,
Mirae Do,
YeonJae Shin,
Jaeseok Yoo,
Jongkwang Hong,
Joongrock Kim,
Chul Lee
Abstract:
Advanced techniques using Neural Radiance Fields (NeRF), Signed Distance Fields (SDF), and Occupancy Fields have recently emerged as solutions for 3D indoor scene reconstruction. We introduce a novel two-phase learning approach, H2O-SDF, that discriminates between object and non-object regions within indoor environments. This method achieves a nuanced balance, carefully preserving the geometric integrity of room layouts while also capturing intricate surface details of specific objects. A cornerstone of our two-phase learning framework is the introduction of the Object Surface Field (OSF), a novel concept designed to mitigate the persistent vanishing gradient problem that has previously hindered the capture of high-frequency details in other methods. Our proposed approach is validated through several experiments that include ablation studies.
Submitted 8 March, 2024; v1 submitted 12 February, 2024;
originally announced February 2024.
-
Rotating Black Holes in a Viable Lorentz-Violating Gravity: Finding Exact Solutions Without Tears
Authors:
Deniz O. Devecioglu,
Mu-In Park
Abstract:
We introduce a two-step procedure for finding Kerr-type rotating black hole solutions without tears. Considering the low-energy sector of Horava gravity as a viable Lorentz-violating gravity in four dimensions which admits a different speed of gravity, we find the exact rotating black hole solutions (with or without cosmological constant). We find that the singular region extends into the r < 0 region from the ring singularity at r = 0 in Boyer-Lindquist coordinates. There are two Killing horizons where g^rr = 0 and the black hole thermodynamics laws are still valid. We find the rotating black hole solutions with electromagnetic charges only when we consider the noble electromagnetic couplings, in such a way that the speed of light is the same as the speed of gravity. With the noble choice of couplings, our Lorentz-violating gravity can be consistent with the recently-observed time delay of the coincident GW and GRB signals. Furthermore, in the Appendices, we show (a) the uniqueness of the invariant line element ds^2 under Diff_F, contrary to the LV action, (b) that the solutions are of Petrov type I with four distinct principal null vectors, and (c) that the Hamilton-Jacobi equation for geodesic particles is not separable.
Submitted 3 February, 2024;
originally announced February 2024.
-
Tunable interplay between light and heavy electrons in twisted trilayer graphene
Authors:
Andrew T. Pierce,
Yonglong Xie,
Jeong Min Park,
Zhuozhen Cai,
Kenji Watanabe,
Takashi Taniguchi,
Pablo Jarillo-Herrero,
Amir Yacoby
Abstract:
In strongly interacting systems with multiple energy bands, the interplay between electrons with different effective masses and the enlarged Hilbert space drives intricate correlated phenomena that do not occur in single-band systems. Recently, magic-angle twisted trilayer graphene (MATTG) has emerged as a promising tunable platform for such investigations: the system hosts both slowly dispersing, "heavy" electrons inhabiting its flat bands as well as delocalized "light" bands that disperse as free Dirac fermions. Most remarkably, superconductivity in twisted trilayer graphene and multilayer analogues with additional dispersive bands exhibits Pauli limit violation and spans a wider range of phase space compared to that in twisted bilayer graphene, where the dispersive bands are absent. This suggests that the interactions between different bands may play a fundamental role in stabilizing correlated phases in twisted graphene multilayers. Here, we elucidate the interplay between the light and heavy electrons in MATTG as a function of doping and magnetic field by performing local compressibility measurements with a scanning single-electron-transistor microscope. We establish that commonly observed resistive features near moiré band fillings $ν$=-2, 1, 2 and 3 host a finite population of light Dirac electrons at the Fermi level despite a gap opening in the flat band sector. At higher magnetic field and near charge neutrality, we discover a new type of phase transition sequence that is robust over nearly 10 micrometers but exhibits complex spatial dependence. Mean-field calculations establish that these transitions arise from the competing population of the two subsystems and that the Dirac sector can be viewed as a new flavor analogous to the spin and valley degrees of freedom.
Submitted 22 January, 2024;
originally announced January 2024.
-
Joint UAV Deployment and Resource Allocation in THz-Assisted MEC-Enabled Integrated Space-Air-Ground Networks
Authors:
Yan Kyaw Tun,
György Dán,
Yu Min Park,
Choong Seon Hong
Abstract:
Multi-access edge computing (MEC)-enabled integrated space-air-ground (SAG) networks have drawn much attention recently, as they can provide communication and computing services to wireless devices in areas that lack terrestrial base stations (TBSs). Leveraging the ample bandwidth in the terahertz (THz) spectrum, in this paper, we propose MEC-enabled integrated SAG networks with collaboration among unmanned aerial vehicles (UAVs). We then formulate the problem of minimizing the energy consumption of devices and UAVs in the proposed MEC-enabled integrated SAG networks by optimizing task offloading decisions, THz sub-band assignment, transmit power control, and UAV deployment. The formulated problem is a mixed-integer nonlinear programming (MINLP) problem with a non-convex structure, which is challenging to solve. We thus propose a block coordinate descent (BCD) approach to decompose the problem into four sub-problems: 1) device task offloading decision problem, 2) THz sub-band assignment and power control problem, 3) UAV deployment problem, and 4) UAV task offloading decision problem. We then propose to use a matching game, concave-convex procedure (CCP) method, successive convex approximation (SCA), and block successive upper-bound minimization (BSUM) approaches for solving the individual sub-problems. Finally, extensive simulations are performed to demonstrate the effectiveness of our proposed algorithm.
Submitted 21 January, 2024;
originally announced January 2024.
-
Cosmic evolution of black hole-spin and galaxy orientations: clues from the NewHorizon and Galactica simulations
Authors:
Sebastien Peirani,
Yasushi Suto,
Ricarda S. Beckmann,
Marta Volonteri,
Yen-Ting Lin,
Yohan Dubois,
Sukyoung K. Yi,
Christophe Pichon,
Katarina Kraljic,
Minjung Park,
Julien Devriendt,
San Han,
Wei-Huai Chen
Abstract:
(Reduced) Using the recent cosmological high-resolution zoom-in simulations, NewHorizon and Galactica, in which the evolution of black hole spin is followed on the fly, we have tracked the cosmic history of a hundred black holes (BHs) with a mass greater than 2x10^4 Ms. For each of them, we have studied the variations of the three-dimensional angle (Psi) subtended between the BH spins and the angular momentum vectors of their host galaxies. The analysis of the individual evolution of the most massive BHs suggests that they generally pass through three different regimes. First, for a short period after their birth, low mass BHs (<3x10^4 Ms) are rapidly spun up by gas accretion and their spin tends to be aligned with their host galaxy spin. Then follows a second phase in which the accretion of gas onto low mass BHs (<10^5 Ms) is quite chaotic and inefficient, reflecting the complex and disturbed morphologies of forming proto-galaxies at high redshifts. The variations of Psi are rather erratic during this phase and are mainly driven by the rapid changes of the direction of the galaxy angular momentum. Then, in a third and long phase, BHs are generally well settled in the center of galaxies around which the gas accretion becomes much more coherent (>10^5 Ms). In this case, the BH spins tend to be well aligned with the angular momentum of their host galaxy and this configuration is generally stable even though BH merger episodes can temporarily induce misalignment. We have also derived the distributions of cos(Psi) at different redshifts and found that BHs and galaxy spins are generally aligned. Finally, based on a Monte Carlo method, we also predict statistics for the 2-d projected spin-orbit angles lambda. In particular, the distribution of lambda traces well the alignment tendency in the 3-d analysis. Such predictions provide an interesting background for future observational analyses.
Submitted 25 March, 2024; v1 submitted 8 January, 2024;
originally announced January 2024.
-
Observational Signatures of AGN Feedback in the Morphology and the Ionization States of Milky Way-like Galaxies
Authors:
Nadia Qutob,
Razieh Emami,
Kung-Yi Su,
Randall Smith,
Lars Hernquist,
Dian P. Triani,
Cameron Hummels,
Drummond Fielding,
Philip F. Hopkins,
Rachel S. Somerville,
David R. Ballantyne,
Mark Vogelsberger,
Grant Tremblay,
James F. Steiner,
Douglas Finkbeiner,
Ramesh Narayan,
Minjung Park,
Josh Grindlay,
Priyamvada Natarajan,
Christopher C. Hayward,
Dušan Kereš,
Sam B. Ponnada,
Sirio Belli,
Rebecca Davies,
Gabriel Maheson
, et al. (2 additional authors not shown)
Abstract:
We present an in-depth analysis of the signatures of different AGN jet models that induce quiescence in galaxies with a halo mass of $10^{12} M_\odot$. Three jet models (cosmic ray-dominant, hot thermal, and precessing kinetic jets) are studied at two energy flux levels each and compared to a jet-free, stellar feedback-only simulation. We examine the distribution of Mg II, O VI, and O VIII ions, alongside gas temperature and density profiles. Low-energy ions, like Mg II, concentrate in the ISM, while higher-energy ions, e.g., O VIII, prevail at the AGN jet cocoon's edge. High-energy flux jets display an isotropic ion distribution with lower overall density. High-energy thermal or cosmic ray jets pressurize at smaller radii, significantly suppressing core density. The cosmic ray jet provides extra pressure support, extending cool and warm gas distribution. A break in the ion-to-mass ratio slope in O VI and O VIII is demonstrated in the ISM-to-CGM transition (between 10 and 30 kpc), growing smoothly towards the CGM at greater distances.
Submitted 22 December, 2023;
originally announced December 2023.
-
Effect of Resonant Acoustic Powder Mixing on Delay Time of W-KClO4-BaCrO4 Mixtures
Authors:
Kyungmin Kwon,
Seunghwan Ryu,
Soyun Joo,
Youngjoon Han,
Donghyeon Baek,
Moonsoo Park,
Dongwon Kim,
Seungbum Hong
Abstract:
This study investigates the impact of resonant acoustic powder mixing on the delay time of the W-KClO4-BaCrO4 (WKB) mixture and its potential implications for powder and material synthesis. Through thermal analysis, an inverse linear relationship was found between thermal conductivity and delay time, allowing us to use thermal conductivity as a reliable proxy for the delay time. By comparing the thermal conductivity of WKB mixtures mixed manually and using an acoustic powder mixer, we found that acoustic powder mixing resulted in minimal deviations in thermal conductivity, indicating more uniform mixing. Furthermore, DSC analysis and Sestak-Berggren modeling demonstrated consistent reaction dynamics with a constant activation energy as the reaction progressed in samples mixed using acoustic waves. These findings underscore the critical role of uniform powder mixing in enhancing the thermodynamic quality of the WKB mixture and emphasize the importance of developing novel methods for powder and material synthesis.
Submitted 20 December, 2023;
originally announced December 2023.