subscribe to arXiv mailings

Good plasmons in a bad metal

Authors: Francesco L. Ruta, Yinming Shao, Swagata Acharya, Anqi Mu, Na Hyun Jo, Sae Hee Ryu, Daria Balatsky, Dimitar Pashov, Brian S. Y. Kim, Mikhail I. Katsnelson, James G. Analytis, Eli Rotenberg, Andrew J. Millis, Mark van Schilfgaarde, D. N. Basov

Abstract: Correlated materials may exhibit unusually high resistivity increasing linearly in temperature, breaking through the Mott-Ioffe-Regel bound, above which coherent quasiparticles are destroyed. The fate of collective charge excitations, or plasmons, in these systems is a subject of debate. Several studies suggest plasmons are overdamped while others detect unrenormalized plasmons. Here, we present d… ▽ More Correlated materials may exhibit unusually high resistivity increasing linearly in temperature, breaking through the Mott-Ioffe-Regel bound, above which coherent quasiparticles are destroyed. The fate of collective charge excitations, or plasmons, in these systems is a subject of debate. Several studies suggest plasmons are overdamped while others detect unrenormalized plasmons. Here, we present direct optical images of low-loss hyperbolic plasmon polaritons (HPPs) in the correlated van der Waals metal MoOCl2. HPPs are plasmon-photon modes that waveguide through extremely anisotropic media and are remarkably long-lived in MoOCl2. Many-body theory supported by photoemission results reveals that MoOCl2 is in an orbital-selective and highly incoherent Peierls phase. Different orbitals acquire markedly different bonding-antibonding character, producing a highly-anisotropic, isolated Fermi surface. The Fermi surface is further reconstructed and made partly incoherent by electronic interactions, renormalizing the plasma frequency. HPPs remain long-lived in spite of this, allowing us to uncover previously unseen imprints of electronic correlations on plasmonic collective modes. △ Less

Submitted 9 June, 2024; originally announced June 2024.

Comments: 32 pages, 16 figures

arXiv:2405.19286 [pdf, other]

EDGE: A new model for Nuclear Star Cluster formation in dwarf galaxies

Authors: Emily I. Gray, Justin I. Read, Ethan Taylor, Matthew D. A. Orkney, Martin P. Rey, Robert M. Yates, Stacy Y. Kim, Noelia E. D. Noël, Oscar Agertz, Eric Andersson, Andrew Pontzen

Abstract: Nuclear Star Clusters (NSCs) are amongst the densest stellar systems in the Universe and are found at the centres of many bright spiral and elliptical galaxies, and up to ${\sim}$40% of dwarf galaxies. However, their formation mechanisms, and possible links to globular clusters (GCs), remain debated. This paper uses the EDGE simulations - a collection of zoom-in, cosmological simulations of isolat… ▽ More Nuclear Star Clusters (NSCs) are amongst the densest stellar systems in the Universe and are found at the centres of many bright spiral and elliptical galaxies, and up to ${\sim}$40% of dwarf galaxies. However, their formation mechanisms, and possible links to globular clusters (GCs), remain debated. This paper uses the EDGE simulations - a collection of zoom-in, cosmological simulations of isolated dwarf galaxies -- to present a new formation mechanism for NSCs. We find that, at a gas spatial and mass resolution of ${\sim}3\,$pc and ${\sim}161$ M$_\odot$, respectively, NSCs naturally emerge in a subset of our EDGE dwarfs with redshift-zero halo masses of $\rm{M}_{\rm{r}200\rm{c}} \sim 5 \times 10^9$ M$_\odot$. These dwarfs are quenched by reionisation, but retain a significant reservoir of gas that is unable to cool and form stars. Sometime after reionisation, the dwarfs then undergo a major (${\sim}$1:1) merger that excites rapid gas cooling, leading to a significant starburst. An NSC forms in this starburst that then quenches star formation thereafter. The result is a nucleated dwarf that has two stellar populations with distinct age: one pre-reionisation and one post-reionisation. Our mechanism is unique for two key reasons. Firstly, the low mass of the host dwarf means that NSCs, formed in this way, can accrete onto galaxies of almost all masses, potentially seeding the formation of NSCs everywhere. Secondly, our model predicts that NSCs should have at least two stellar populations with a large ($\gtrsim$1 billion year) age separation. This yields a predicted colour magnitude diagram for our nucleated dwarfs that has two distinct main sequence turnoffs. Several GCs orbiting the Milky Way, including Omega Centauri and M54, show exactly this behaviour, suggesting that they may, in fact, be accreted NSCs. △ Less

Submitted 29 May, 2024; originally announced May 2024.

Comments: Main text 12 pages, 8 figures. Submitted to MNRAS

arXiv:2405.05961 [pdf, other]

Towards comprehensive coverage of chemical space: Quantum mechanical properties of 836k constitutional and conformational closed shell neutral isomers consisting of HCNOFSiPSClBr

Authors: Danish Khan, Anouar Benali, Scott Y. H. Kim, Guido Falk von Rudorff, O. Anatole von Lilienfeld

Abstract: The Vector-QM24 (VQM24) dataset attempts to more comprehensively cover all possible neutral closed shell small organic and inorganic molecules and their conformers at state of the art level of theory. We have used density functional theory ($ω$B97X-D3/cc-pVDZ) to optimize 577k conformational isomers corresponding to 258k constitutional isomers.Isomers included contain up to five heavy atoms (non-h… ▽ More The Vector-QM24 (VQM24) dataset attempts to more comprehensively cover all possible neutral closed shell small organic and inorganic molecules and their conformers at state of the art level of theory. We have used density functional theory ($ω$B97X-D3/cc-pVDZ) to optimize 577k conformational isomers corresponding to 258k constitutional isomers.Isomers included contain up to five heavy atoms (non-hydrogen) consisting of $p$-block elements C, N, O, F, Si, P, S, Cl, Br. Single point diffusion quantum Monte Carlo (DMC@PBE0(ccECP/cc-pVQZ)) energies are reported for the sub-set of the lowest conformers of 10,793 molecules with up to 4 heavy atoms.This dataset has been systematically generated by considering all combinatorially possible stoichiometries, and graphs (according to Lewis rules as implemented in the {\tt SURGE} package), along with all stable conformers identified by GFN2-xTB. Apart from graphs, geometries, rotational constants, and vibrational normal modes, VQM24 includes internal, atomization, electron-electron repulsion, exchange correlation, dispersion, vibrational frequency, Gibbs free, enthalpy, ZPV, molecular orbital energies; as well as entropy, and heat capacities. Electronic properties include multipole moments (dipole, quadrupole, octupole, hexadecapole), electrostatic potentials at nuclei (alchemical potential), Mulliken charges, and molecular wavefunctions. VQM24 represents a highly accurate and unbiased dataset of molecules, ideal for testing and training transferable, scalable, and generative ML models of real quantum systems. △ Less

Submitted 13 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

arXiv:2405.03927 [pdf, other]

Codexity: Secure AI-assisted Code Generation

Authors: Sung Yong Kim, Zhiyu Fan, Yannic Noller, Abhik Roychoudhury

Abstract: Despite the impressive performance of Large Language Models (LLMs) in software development activities, recent studies show the concern of introducing vulnerabilities into software codebase by AI programming assistants (e.g., Copilot, CodeWhisperer). In this work, we present Codexity, a security-focused code generation framework integrated with five LLMs. Codexity leverages the feedback of static a… ▽ More Despite the impressive performance of Large Language Models (LLMs) in software development activities, recent studies show the concern of introducing vulnerabilities into software codebase by AI programming assistants (e.g., Copilot, CodeWhisperer). In this work, we present Codexity, a security-focused code generation framework integrated with five LLMs. Codexity leverages the feedback of static analysis tools such as Infer and CppCheck to mitigate security vulnerabilities in LLM-generated programs. Our evaluation in a real-world benchmark with 751 automatically generated vulnerable subjects demonstrates Codexity can prevent 60% of the vulnerabilities being exposed to the software developer. △ Less

Submitted 6 May, 2024; originally announced May 2024.

arXiv:2405.00623 [pdf, other]

doi 10.1145/3630106.3658941

"I'm Not Sure, But...": Examining the Impact of Large Language Models' Uncertainty Expression on User Reliance and Trust

Authors: Sunnie S. Y. Kim, Q. Vera Liao, Mihaela Vorvoreanu, Stephanie Ballard, Jennifer Wortman Vaughan

Abstract: Widely deployed large language models (LLMs) can produce convincing yet incorrect outputs, potentially misleading users who may rely on them as if they were correct. To reduce such overreliance, there have been calls for LLMs to communicate their uncertainty to end users. However, there has been little empirical work examining how users perceive and act upon LLMs' expressions of uncertainty. We ex… ▽ More Widely deployed large language models (LLMs) can produce convincing yet incorrect outputs, potentially misleading users who may rely on them as if they were correct. To reduce such overreliance, there have been calls for LLMs to communicate their uncertainty to end users. However, there has been little empirical work examining how users perceive and act upon LLMs' expressions of uncertainty. We explore this question through a large-scale, pre-registered, human-subject experiment (N=404) in which participants answer medical questions with or without access to responses from a fictional LLM-infused search engine. Using both behavioral and self-reported measures, we examine how different natural language expressions of uncertainty impact participants' reliance, trust, and overall task performance. We find that first-person expressions (e.g., "I'm not sure, but...") decrease participants' confidence in the system and tendency to agree with the system's answers, while increasing participants' accuracy. An exploratory analysis suggests that this increase can be attributed to reduced (but not fully eliminated) overreliance on incorrect answers. While we observe similar effects for uncertainty expressed from a general perspective (e.g., "It's not clear, but..."), these effects are weaker and not statistically significant. Our findings suggest that using natural language expressions of uncertainty may be an effective approach for reducing overreliance on LLMs, but that the precise language used matters. This highlights the importance of user testing before deploying LLMs at scale. △ Less

Submitted 15 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

Comments: Accepted to FAccT 2024. This version includes the appendix

arXiv:2404.05238 [pdf, other]

Allowing humans to interactively guide machines where to look does not always improve human-AI team's classification accuracy

Authors: Giang Nguyen, Mohammad Reza Taesiri, Sunnie S. Y. Kim, Anh Nguyen

Abstract: Via thousands of papers in Explainable AI (XAI), attention maps \cite{vaswani2017attention} and feature importance maps \cite{bansal2020sam} have been established as a common means for finding how important each input feature is to an AI's decisions. It is an interesting, unexplored question whether allowing users to edit the feature importance at test time would improve a human-AI team's accuracy… ▽ More Via thousands of papers in Explainable AI (XAI), attention maps \cite{vaswani2017attention} and feature importance maps \cite{bansal2020sam} have been established as a common means for finding how important each input feature is to an AI's decisions. It is an interesting, unexplored question whether allowing users to edit the feature importance at test time would improve a human-AI team's accuracy on downstream tasks. In this paper, we address this question by leveraging CHM-Corr, a state-of-the-art, ante-hoc explainable classifier \cite{taesiri2022visual} that first predicts patch-wise correspondences between the input and training-set images, and then bases on them to make classification decisions. We build CHM-Corr++, an interactive interface for CHM-Corr, enabling users to edit the feature importance map provided by CHM-Corr and observe updated model decisions. Via CHM-Corr++, users can gain insights into if, when, and how the model changes its outputs, improving their understanding beyond static explanations. However, our study with 18 expert users who performed 1,400 decisions finds no statistical significance that our interactive approach improves user accuracy on CUB-200 bird image classification over static explanations. This challenges the hypothesis that interactivity can boost human-AI team accuracy and raises needs for future research. We open-source CHM-Corr++, an interactive tool for editing image classifier attention (see an interactive demo here: http://137.184.82.109:7080/). We release code and data on github: https://github.com/anguyen8/chm-corr-interactive. △ Less

Submitted 20 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

Comments: Accepted for presentation at the XAI4CV Workshop, part of the CVPR 2024 proceedings

arXiv:2403.10701 [pdf, other]

IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation

Authors: Yizhi Song, Zhifei Zhang, Zhe Lin, Scott Cohen, Brian Price, Jianming Zhang, Soo Ye Kim, He Zhang, Wei Xiong, Daniel Aliaga

Abstract: Generative object compositing emerges as a promising new avenue for compositional image editing. However, the requirement of object identity preservation poses a significant challenge, limiting practical usage of most existing methods. In response, this paper introduces IMPRINT, a novel diffusion-based generative model trained with a two-stage learning framework that decouples learning of identity… ▽ More Generative object compositing emerges as a promising new avenue for compositional image editing. However, the requirement of object identity preservation poses a significant challenge, limiting practical usage of most existing methods. In response, this paper introduces IMPRINT, a novel diffusion-based generative model trained with a two-stage learning framework that decouples learning of identity preservation from that of compositing. The first stage is targeted for context-agnostic, identity-preserving pretraining of the object encoder, enabling the encoder to learn an embedding that is both view-invariant and conducive to enhanced detail preservation. The subsequent stage leverages this representation to learn seamless harmonization of the object composited to the background. In addition, IMPRINT incorporates a shape-guidance mechanism offering user-directed control over the compositing process. Extensive experiments demonstrate that IMPRINT significantly outperforms existing methods and various baselines on identity preservation and composition quality. △ Less

Submitted 15 March, 2024; originally announced March 2024.

arXiv:2403.02786 [pdf, other]

Semi-Supervised Graph Representation Learning with Human-centric Explanation for Predicting Fatty Liver Disease

Authors: So Yeon Kim, Sehee Wang, Eun Kyung Choe

Abstract: Addressing the challenge of limited labeled data in clinical settings, particularly in the prediction of fatty liver disease, this study explores the potential of graph representation learning within a semi-supervised learning framework. Leveraging graph neural networks (GNNs), our approach constructs a subject similarity graph to identify risk patterns from health checkup data. The effectiveness… ▽ More Addressing the challenge of limited labeled data in clinical settings, particularly in the prediction of fatty liver disease, this study explores the potential of graph representation learning within a semi-supervised learning framework. Leveraging graph neural networks (GNNs), our approach constructs a subject similarity graph to identify risk patterns from health checkup data. The effectiveness of various GNN approaches in this context is demonstrated, even with minimal labeled samples. Central to our methodology is the inclusion of human-centric explanations through explainable GNNs, providing personalized feature importance scores for enhanced interpretability and clinical relevance, thereby underscoring the potential of our approach in advancing healthcare practices with a keen focus on graph representation learning and human-centric explanation. △ Less

Submitted 5 March, 2024; originally announced March 2024.

Comments: Paper accepted in Human-Centric Representation Learning workshop at AAAI 2024 (https://hcrl-workshop.github.io/2024/)

arXiv:2402.06929 [pdf, other]

doi 10.33140/JEEE.03.01.14

Making a prototype of Seoul historical sites chatbot using Langchain

Authors: Jae Young Suh, Minsoo Kwak, Soo Yong Kim, Hyoungseo Cho

Abstract: In this paper, we are going to share a draft of the development of a conversational agent created to disseminate information about historical sites located in the Seoul. The primary objective of the agent is to increase awareness among visitors who are not familiar with Seoul, about the presence and precise locations of valuable cultural heritage sites. It aims to promote a basic understanding of… ▽ More In this paper, we are going to share a draft of the development of a conversational agent created to disseminate information about historical sites located in the Seoul. The primary objective of the agent is to increase awareness among visitors who are not familiar with Seoul, about the presence and precise locations of valuable cultural heritage sites. It aims to promote a basic understanding of Korea's rich and diverse cultural history. The agent is thoughtfully designed for accessibility in English and utilizes data generously provided by the Seoul Metropolitan Government. Despite the limited data volume, it consistently delivers reliable and accurate responses, seamlessly aligning with the available information. We have meticulously detailed the methodologies employed in creating this agent and provided a comprehensive overview of its underlying structure within the paper. Additionally, we delve into potential improvements to enhance this initial version of the system, with a primary emphasis on expanding the available data through our prompting. In conclusion, we provide an in-depth discussion of our expectations regarding the future impact of this agent in promoting and facilitating the sharing of historical sites. △ Less

Submitted 10 February, 2024; originally announced February 2024.

Comments: 4 pages, 4 figures, draft

arXiv:2402.05350 [pdf, other]

Descanning: From Scanned to the Original Images with a Color Correction Diffusion Model

Authors: Junghun Cha, Ali Haider, Seoyun Yang, Hoeyeong Jin, Subin Yang, A. F. M. Shahab Uddin, Jaehyoung Kim, Soo Ye Kim, Sung-Ho Bae

Abstract: A significant volume of analog information, i.e., documents and images, have been digitized in the form of scanned copies for storing, sharing, and/or analyzing in the digital world. However, the quality of such contents is severely degraded by various distortions caused by printing, storing, and scanning processes in the physical world. Although restoring high-quality content from scanned copies… ▽ More A significant volume of analog information, i.e., documents and images, have been digitized in the form of scanned copies for storing, sharing, and/or analyzing in the digital world. However, the quality of such contents is severely degraded by various distortions caused by printing, storing, and scanning processes in the physical world. Although restoring high-quality content from scanned copies has become an indispensable task for many products, it has not been systematically explored, and to the best of our knowledge, no public datasets are available. In this paper, we define this problem as Descanning and introduce a new high-quality and large-scale dataset named DESCAN-18K. It contains 18K pairs of original and scanned images collected in the wild containing multiple complex degradations. In order to eliminate such complex degradations, we propose a new image restoration model called DescanDiffusion consisting of a color encoder that corrects the global color degradation and a conditional denoising diffusion probabilistic model (DDPM) that removes local degradations. To further improve the generalization ability of DescanDiffusion, we also design a synthetic data generation scheme by reproducing prominent degradations in scanned images. We demonstrate that our DescanDiffusion outperforms other baselines including commercial restoration products, objectively and subjectively, via comprehensive experiments and analyses. △ Less

Submitted 7 February, 2024; originally announced February 2024.

Comments: Accepted to AAAI 2024

arXiv:2309.12768 [pdf, other]

WiCV@CVPR2023: The Eleventh Women In Computer Vision Workshop at the Annual CVPR Conference

Authors: Doris Antensteiner, Marah Halawa, Asra Aslam, Ivaxi Sheth, Sachini Herath, Ziqi Huang, Sunnie S. Y. Kim, Aparna Akula, Xin Wang

Abstract: In this paper, we present the details of Women in Computer Vision Workshop - WiCV 2023, organized alongside the hybrid CVPR 2023 in Vancouver, Canada. WiCV aims to amplify the voices of underrepresented women in the computer vision community, fostering increased visibility in both academia and industry. We believe that such events play a vital role in addressing gender imbalances within the field.… ▽ More In this paper, we present the details of Women in Computer Vision Workshop - WiCV 2023, organized alongside the hybrid CVPR 2023 in Vancouver, Canada. WiCV aims to amplify the voices of underrepresented women in the computer vision community, fostering increased visibility in both academia and industry. We believe that such events play a vital role in addressing gender imbalances within the field. The annual WiCV@CVPR workshop offers a) opportunity for collaboration between researchers from minority groups, b) mentorship for female junior researchers, c) financial support to presenters to alleviate finanacial burdens and d) a diverse array of role models who can inspire younger researchers at the outset of their careers. In this paper, we present a comprehensive report on the workshop program, historical trends from the past WiCV@CVPR events, and a summary of statistics related to presenters, attendees, and sponsorship for the WiCV 2023 workshop. △ Less

Submitted 22 September, 2023; originally announced September 2023.

arXiv:2309.00041 [pdf, other]

EDGE -- Dark matter or astrophysics? Breaking dark matter heating degeneracies with HI rotation in faint dwarf galaxies

Authors: Martin P. Rey, Matthew D. A. Orkney, Justin I. Read, Payel Das, Oscar Agertz, Andrew Pontzen, Anastasia A. Ponomareva, Stacy Y. Kim, William McClymont

Abstract: Low-mass dwarf galaxies are expected to reside within dark matter haloes that have a pristine, `cuspy' density profile within their stellar half-light radii. This is because they form too few stars to significantly drive dark matter heating through supernova-driven outflows. Here, we study such simulated faint systems ($10^4 \leq M_{\star} \leq 2\times 10^6 \, M_\mathrm{\odot}$) drawn from high-re… ▽ More Low-mass dwarf galaxies are expected to reside within dark matter haloes that have a pristine, `cuspy' density profile within their stellar half-light radii. This is because they form too few stars to significantly drive dark matter heating through supernova-driven outflows. Here, we study such simulated faint systems ($10^4 \leq M_{\star} \leq 2\times 10^6 \, M_\mathrm{\odot}$) drawn from high-resolution (3 pc) cosmological simulations from the `Engineering Dwarf Galaxies at the Edge of galaxy formation' (EDGE) project. We confirm that these objects have steep and rising inner dark matter density profiles at $z=0$, little affected by galaxy formation effects. But five dwarf galaxies from the suite also showcase a detectable HI reservoir ($M_{\mathrm{HI}}\approx 10^{5}-10^{6} \, M_\mathrm{\odot}$), analogous to the observed population of faint, HI-bearing dwarf galaxies. These reservoirs exhibit episodes of ordered rotation, opening windows for rotation curve analysis. Within actively star-forming dwarfs, stellar feedback easily disrupts the tenuous HI discs ($v_φ \approx 10\, \mathrm{km} \, \mathrm{s}^{-1}$), making rotation short-lived ($\ll 150 \, \mathrm{Myr}$) and more challenging to interpret for dark matter inferences. In contrast, we highlight a long-lived ($\geq 500 \, \mathrm{Myr}$) and easy-to-interpret HI rotation curve extending to $\approx 2\, r_{1/2, \text{3D}}$ in a quiescent dwarf, that has not formed new stars since $z=4$. This stable gas disc is supported by an oblate dark matter halo shape that drives high-angular momentum gas flows. Our results strongly motivate further searches for HI in rotation curves in the observed population of HI-bearing low-mass dwarfs, that provide a key regime to disentangle the respective roles of dark matter microphysics and galaxy formation effects in driving dark matter heating. △ Less

Submitted 16 March, 2024; v1 submitted 31 August, 2023; originally announced September 2023.

Comments: Matching published version in MNRAS, results unchanged. Now includes an inference of the dark matter profile from the characterised HI rotation curves

arXiv:2307.05130 [pdf, other]

EDGE: The direct link between mass growth history and the extended stellar haloes of the faintest dwarf galaxies

Authors: Alex Goater, Justin I. Read, Noelia E. D. Noël, Matthew D. A. Orkney, Stacy Y. Kim, Martin P. Rey, Eric P. Andersson, Oscar Agertz, Andrew Pontzen, Roberta Vieliute, Dhairya Kataria, Kiah Jeneway

Abstract: Ultra-faint dwarf galaxies (UFDs) are commonly found in close proximity to the Milky Way and other massive spiral galaxies. As such, their projected stellar ellipticity and extended light distributions are often thought to owe to tidal forces. In this paper, we study the projected stellar ellipticities and faint stellar outskirts of tidally isolated ultra-faints drawn from the 'Engineering Dwarfs… ▽ More Ultra-faint dwarf galaxies (UFDs) are commonly found in close proximity to the Milky Way and other massive spiral galaxies. As such, their projected stellar ellipticity and extended light distributions are often thought to owe to tidal forces. In this paper, we study the projected stellar ellipticities and faint stellar outskirts of tidally isolated ultra-faints drawn from the 'Engineering Dwarfs at Galaxy Formation's Edge' (EDGE) cosmological simulation suite. Despite their tidal isolation, our simulated dwarfs exhibit a wide range of projected ellipticities ($0.03 < \varepsilon < 0.85$), with many possessing anisotropic extended stellar haloes that mimic tidal tails, but owe instead to late-time accretion of lower mass companions. Furthermore, we find a strong causal relationship between ellipticity and formation time of an UFD, which is robust to a wide variation in the feedback model. We show that the distribution of projected ellipticities in our suite of simulated EDGE dwarfs matches well with that of 21 Local Group dwarf galaxies. Given the ellipticity in EDGE arises from an ex-situ accretion origin, the agreement in shape indicates the ellipticities of some observed dwarfs may also originate from a similar non-tidal scenario. The orbital parameters of these observed dwarfs further support that they are not currently tidally disrupting. If the baryonic content in these galaxies is still tidally intact, then the same may be true for their dark matter content, making these galaxies in our Local Group pristine laboratories for testing dark matter and galaxy formation models. △ Less

Submitted 11 July, 2023; originally announced July 2023.

Comments: 10 pages, 4 figures; submitted to MNRAS

arXiv:2306.04674 [pdf, other]

Milky Way satellite velocities reveal the Dark Matter power spectrum at small scales

Authors: Ivan Esteban, Annika H. G. Peter, Stacy Y. Kim

Abstract: Dark Matter (DM) properties at small scales remain uncertain. Recent theoretical and observational advances have provided the tools to narrow them down. Here, we show for the first time that the correlation between internal velocities and sizes of dwarf galaxies is a sharp probe of small-scale DM properties. We study modified DM power spectra, motivated by DM production during inflation. Using sem… ▽ More Dark Matter (DM) properties at small scales remain uncertain. Recent theoretical and observational advances have provided the tools to narrow them down. Here, we show for the first time that the correlation between internal velocities and sizes of dwarf galaxies is a sharp probe of small-scale DM properties. We study modified DM power spectra, motivated by DM production during inflation. Using semi-analytic models and scaling relations, we show that such models can change the kinematics and structure of dwarf galaxies without strongly affecting their total abundance. We analyze data from Milky Way classical satellite galaxies and those discovered with the Sloan Digital Sky Survey (SDSS), finding that the DM power spectrum at comoving scales ${4\, \mathrm{Mpc}^{-1} < k < 37\,\mathrm{Mpc}^{-1}}$ cannot deviate by more than a factor of 2 from scale invariance. Our results are robust against baryonic uncertainties such as the stellar mass-halo mass relation, halo occupation fraction, and subhalo tidal disruption; allowing us to independently constrain them. This work thus opens a window to probe both dwarf galaxy formation models and small-scale DM properties. △ Less

Submitted 7 June, 2023; originally announced June 2023.

Comments: 10 pages + appendices. Comments welcome!

arXiv:2305.08598 [pdf, other]

doi 10.1145/3593013.3593978

Humans, AI, and Context: Understanding End-Users' Trust in a Real-World Computer Vision Application

Authors: Sunnie S. Y. Kim, Elizabeth Anne Watkins, Olga Russakovsky, Ruth Fong, Andrés Monroy-Hernández

Abstract: Trust is an important factor in people's interactions with AI systems. However, there is a lack of empirical studies examining how real end-users trust or distrust the AI system they interact with. Most research investigates one aspect of trust in lab settings with hypothetical end-users. In this paper, we provide a holistic and nuanced understanding of trust in AI through a qualitative case study… ▽ More Trust is an important factor in people's interactions with AI systems. However, there is a lack of empirical studies examining how real end-users trust or distrust the AI system they interact with. Most research investigates one aspect of trust in lab settings with hypothetical end-users. In this paper, we provide a holistic and nuanced understanding of trust in AI through a qualitative case study of a real-world computer vision application. We report findings from interviews with 20 end-users of a popular, AI-based bird identification app where we inquired about their trust in the app from many angles. We find participants perceived the app as trustworthy and trusted it, but selectively accepted app outputs after engaging in verification behaviors, and decided against app adoption in certain high-stakes scenarios. We also find domain knowledge and context are important factors for trust-related assessment and decision-making. We discuss the implications of our findings and provide recommendations for future research on trust in AI. △ Less

Submitted 15 May, 2023; originally announced May 2023.

Comments: FAccT 2023

arXiv:2304.04461 [pdf, other]

Modernizing Old Photos Using Multiple References via Photorealistic Style Transfer

Authors: Agus Gunawan, Soo Ye Kim, Hyeonjun Sim, Jae-Ho Lee, Munchurl Kim

Abstract: This paper firstly presents old photo modernization using multiple references by performing stylization and enhancement in a unified manner. In order to modernize old photos, we propose a novel multi-reference-based old photo modernization (MROPM) framework consisting of a network MROPM-Net and a novel synthetic data generation scheme. MROPM-Net stylizes old photos using multiple references via ph… ▽ More This paper firstly presents old photo modernization using multiple references by performing stylization and enhancement in a unified manner. In order to modernize old photos, we propose a novel multi-reference-based old photo modernization (MROPM) framework consisting of a network MROPM-Net and a novel synthetic data generation scheme. MROPM-Net stylizes old photos using multiple references via photorealistic style transfer (PST) and further enhances the results to produce modern-looking images. Meanwhile, the synthetic data generation scheme trains the network to effectively utilize multiple references to perform modernization. To evaluate the performance, we propose a new old photos benchmark dataset (CHD) consisting of diverse natural indoor and outdoor scenes. Extensive experiments show that the proposed method outperforms other baselines in performing modernization on real old photos, even though no old photos were used during training. Moreover, our method can appropriately select styles from multiple references for each semantic region in the old photo to further improve the modernization performance. △ Less

Submitted 10 April, 2023; originally announced April 2023.

Comments: Accepted to CVPR 2023. Website: https://kaist-viclab.github.io/old-photo-modernization

arXiv:2303.15632 [pdf, other]

UFO: A unified method for controlling Understandability and Faithfulness Objectives in concept-based explanations for CNNs

Authors: Vikram V. Ramaswamy, Sunnie S. Y. Kim, Ruth Fong, Olga Russakovsky

Abstract: Concept-based explanations for convolutional neural networks (CNNs) aim to explain model behavior and outputs using a pre-defined set of semantic concepts (e.g., the model recognizes scene class ``bedroom'' based on the presence of concepts ``bed'' and ``pillow''). However, they often do not faithfully (i.e., accurately) characterize the model's behavior and can be too complex for people to unders… ▽ More Concept-based explanations for convolutional neural networks (CNNs) aim to explain model behavior and outputs using a pre-defined set of semantic concepts (e.g., the model recognizes scene class ``bedroom'' based on the presence of concepts ``bed'' and ``pillow''). However, they often do not faithfully (i.e., accurately) characterize the model's behavior and can be too complex for people to understand. Further, little is known about how faithful and understandable different explanation methods are, and how to control these two properties. In this work, we propose UFO, a unified method for controlling Understandability and Faithfulness Objectives in concept-based explanations. UFO formalizes understandability and faithfulness as mathematical objectives and unifies most existing concept-based explanations methods for CNNs. Using UFO, we systematically investigate how explanations change as we turn the knobs of faithfulness and understandability. Our experiments demonstrate a faithfulness-vs-understandability tradeoff: increasing understandability reduces faithfulness. We also provide insights into the ``disagreement problem'' in explainable machine learning, by analyzing when and how concept-based explanations disagree with each other. △ Less

Submitted 27 March, 2023; originally announced March 2023.

arXiv:2303.04969 [pdf, other]

Björling problem for zero mean curvature surfaces in the three-dimensional light cone

Authors: Joseph Cho, So Young Kim, Dami Lee, Wonjoo Lee, Seong-Deog Yang

Abstract: We solve the Björling problem for zero mean curvature surfaces in the three-dimensional light cone. As an application, we construct and classify all rotational zero mean curvature surfaces. We solve the Björling problem for zero mean curvature surfaces in the three-dimensional light cone. As an application, we construct and classify all rotational zero mean curvature surfaces. △ Less

Submitted 8 March, 2023; originally announced March 2023.

Comments: 15 pages, 5 figures

MSC Class: (2020): 53A10 (Primary) 53B30 (Secondary)

arXiv:2303.01528 [pdf, other]

doi 10.3847/1538-4357/acdcf6

The PAndAS View of the Andromeda Satellite System. IV Global properties

Authors: Amandine Doliva-Dolinsky, Nicolas F. Martin, Zhen Yuan, Alessandro Savino, Daniel R. Weisz, Annette M. N. Ferguson, Rodrigo A. Ibata, Stacy Y. Kim, Geraint F. Lewis, Alan W. McConnachie, Guillaume F. Thomas

Abstract: We build a statistical framework to infer the global properties of the satellite system of the Andromeda galaxy (M31) from the properties of individual dwarf galaxies located in the Pan-Andromeda Archaelogical Survey (PAndAS) and the previously determined completeness of the survey. Using forward modeling, we infer the slope of the luminosity function of the satellite system, the slope of its spat… ▽ More We build a statistical framework to infer the global properties of the satellite system of the Andromeda galaxy (M31) from the properties of individual dwarf galaxies located in the Pan-Andromeda Archaelogical Survey (PAndAS) and the previously determined completeness of the survey. Using forward modeling, we infer the slope of the luminosity function of the satellite system, the slope of its spatial density distribution, and the size-luminosity relation followed by the dwarf galaxies. We find that the slope of the luminosity function is $β=-1.5\pm0.1$. Combined with the spatial density profile, it implies that, when accounting for survey incompleteness, M31 hosts $92_{-26}^{+19}$ dwarf galaxies with $M_\textrm{V}<-5.5$ and a sky-projected distance from M31 between 30 and 300kpc. We conclude that many faint or distant dwarf galaxies remain to be discovered around Andromeda, especially outside the PAndAS footprint. Finally, we use our model to test if the higher number of satellites situated in the hemisphere facing the Milky Way could be explained simply by the detection limits of dwarf galaxy searches. We rule this out at $>99.9\%$ confidence and conclude that this anisotropy is an intrinsic feature of the M31 satellite system. The statistical framework we present here is a powerful tool to robustly constrain the properties of a satellite system and compare those across hosts, especially considering the upcoming start of the Euclid or Rubin large photometric surveys that are expected to uncover a large number of dwarf galaxies in the Local Volume. △ Less

Submitted 2 March, 2023; originally announced March 2023.

Comments: Submitted to ApJ - 12 pages, 6 figures, 2 tables

arXiv:2302.14331 [pdf]

Lifetime-configurable soft robots via photodegradable silicone elastomer composites

Authors: Min-Ha Oh, Young-Hwan Kim, Seung-Min Lee, Gyeong-Seok Hwang, Kyung-Sub Kim, Jae-Young Bae, Ju-Young Kim, Ju-Yong Lee, Yu-Chan Kim, Sang Yup Kim, Seung-Kyun Kang

Abstract: Developing soft robots that can control their own life-cycle and degrade on-demand while maintaining hyper-elasticity is a significant research challenge. On-demand degradable soft robots, which conserve their original functionality during operation and rapidly degrade under specific external stimulation, present the opportunity to self-direct the disappearance of temporary robots. This study prop… ▽ More Developing soft robots that can control their own life-cycle and degrade on-demand while maintaining hyper-elasticity is a significant research challenge. On-demand degradable soft robots, which conserve their original functionality during operation and rapidly degrade under specific external stimulation, present the opportunity to self-direct the disappearance of temporary robots. This study proposes soft robots and materials that exhibit excellent mechanical stretchability and can degrade under ultraviolet (UV) light by mixing a fluoride-generating diphenyliodonium hexafluorophosphate (DPI-HFP) with a silicone resin. Spectroscopic analysis revealed the mechanism of Si-O-Si backbone cleavage using fluoride ion (F-), which was generated from UV exposed DPI-HFP. Furthermore, photo-differential scanning calorimetry (DSC) based thermal analysis indicated increased decomposition kinetics at increased temperatures. Additionally, we demonstrated a robotics application of this composite by fabricating a gaiting robot. The integration of soft electronics, including strain sensors, temperature sensors, and photodetectors, expanded the robotic functionalities. This study provides a simple yet novel strategy for designing lifecycle mimicking soft robotics that can be applied to reduce soft robotics waste, explore hazardous areas where retrieval of robots is impossible, and ensure hardware security with on-demand destructive material platforms. △ Less

Submitted 28 February, 2023; originally announced February 2023.

Comments: 58 pages, 6 figures, 2 Supplementary Text, 15 Supplementary figures, 1 movie

arXiv:2302.12818 [pdf, other]

doi 10.1093/mnras/stad2516

EDGE: The shape of dark matter haloes in the faintest galaxies

Authors: Matthew D. A. Orkney, Ethan Taylor, Justin I. Read, Martin P. Rey, Andrew Pontzen, Oscar Agertz, Stacy Y. Kim, Maxime Delorme

Abstract: Collisionless Dark Matter Only (DMO) structure formation simulations predict that Dark Matter (DM) haloes are prolate in their centres and triaxial towards their outskirts. The addition of gas condensation transforms the central DM shape to be rounder and more oblate. It is not clear, however, whether such shape transformations occur in `ultra-faint' dwarfs, which have extremely low baryon fractio… ▽ More Collisionless Dark Matter Only (DMO) structure formation simulations predict that Dark Matter (DM) haloes are prolate in their centres and triaxial towards their outskirts. The addition of gas condensation transforms the central DM shape to be rounder and more oblate. It is not clear, however, whether such shape transformations occur in `ultra-faint' dwarfs, which have extremely low baryon fractions. We present the first study of the shape and velocity anisotropy of ultra-faint dwarf galaxies that have gas mass fractions of $f_{\rm gas}(r<R_{\rm half}) < 0.06$. These dwarfs are drawn from the Engineering Dwarfs at Galaxy formation's Edge (EDGE) project, using high resolution simulations that allow us to resolve DM halo shapes within the half light radius ($\sim 100\,$pc). We show that gas-poor ultra-faints ($M_{\rm 200c} \leqslant 1.5\times10^9\,$M$_\odot$; $f_{\rm gas} < 10^{-5}$) retain their pristine prolate DM halo shape even when gas, star formation and feedback are included. This could provide a new and robust test of DM models. By contrast, gas-rich ultra-faints ($M_{\rm 200c} > 3\times10^9\,$M$_\odot$; $f_{\rm gas} > 10^{-4}$) become rounder and more oblate within $\sim 10$ half light radii. Finally, we find that most of our simulated dwarfs have significant radial velocity anisotropy that rises to $\tildeβ > 0.5$ at $R \gtrsim 3 R_{\rm half}$. The one exception is a dwarf that forms a rotating gas/stellar disc because of a planar, major merger. Such strong anisotropy should be taken into account when building mass models of gas-poor ultra-faints. △ Less

Submitted 5 September, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

Comments: 16 pages and 11 figures (excluding appendices), accepted by MNRAS

arXiv:2212.14389 [pdf, other]

Controllable Mechanical-domain Energy Accumulators

Authors: Sung Y. Kim, David J. Braun

Abstract: Springs are efficient in storing and returning elastic potential energy but are unable to hold the energy they store in the absence of an external load. Lockable springs use clutches to hold elastic potential energy in the absence of an external load, but have not yet been widely adopted in applications, partly because clutches introduce design complexity, reduce energy efficiency, and typically d… ▽ More Springs are efficient in storing and returning elastic potential energy but are unable to hold the energy they store in the absence of an external load. Lockable springs use clutches to hold elastic potential energy in the absence of an external load, but have not yet been widely adopted in applications, partly because clutches introduce design complexity, reduce energy efficiency, and typically do not afford high fidelity control over the energy stored by the spring. Here, we present the design of a novel lockable compression spring that uses a small capstan clutch to passively lock a mechanical spring. The capstan clutch can lock over 1000 N force at any arbitrary deflection, unlock the spring in less than 10 ms with a control force less than 1 % of the maximal spring force, and provide an 80 % energy storage and return efficiency (comparable to a highly efficient electric motor operated at constant nominal speed). By retaining the form factor of a regular spring while providing high-fidelity locking capability even under large spring forces, the proposed design could facilitate the development of energy-efficient spring-based actuators and robots. △ Less

Submitted 21 February, 2023; v1 submitted 29 December, 2022; originally announced December 2022.

Comments: Accepted for presentation at the 2023 IEEE International Conference on Robotics and Automation

arXiv:2212.12118 [pdf, other]

doi 10.1103/PhysRevB.107.045112

Dimensional crossover of charge order in IrTe$_2$ with strong interlayer coupling

Authors: Hyoung Kug Kim, So Young Kim, C. J. Won, Sang-Wook Cheong, Jonghwan Kim, Jun Sung Kim, Tae-Hwan Kim

Abstract: Tuning dimensionality in van der Waals materials with finite interlayer coupling has introduced various electronic phase transitions by conventional mechanical exfoliation. Particularly when the electronic order is tied to the modulation of the interlayer coupling, such dimensional tunability has a strong impact on its stability and properties, which has rarely been investigated experimentally. He… ▽ More Tuning dimensionality in van der Waals materials with finite interlayer coupling has introduced various electronic phase transitions by conventional mechanical exfoliation. Particularly when the electronic order is tied to the modulation of the interlayer coupling, such dimensional tunability has a strong impact on its stability and properties, which has rarely been investigated experimentally. Here, we demonstrate a dimensional crossover of charge order in IrTe$_2$ from genuine two- to quasi-three-dimension using low-temperature scanning tunneling microscopy and spectroscopy. Employing atomically thin IrTe$_2$ flakes ranging from monolayer to multilayer, we observe a gradual phase transition of charge order and exponential decay of Coulomb gap with increasing thickness. Moreover, we find a suppression of the density of states emerging at an abrupt lateral interface between two- and three-dimension. These findings are attributed to the interplay between the strongly coupled layers and substrate-driven perturbation, which can provide a new insight into the dimensional crossover of strongly coupled layered materials with hidden electronic phases. △ Less

Submitted 22 December, 2022; originally announced December 2022.

Comments: 8 pages, 6 figures

Journal ref: Phys. Rev. B 107, 045112 (2023)

arXiv:2212.00932 [pdf, other]

ObjectStitch: Generative Object Compositing

Authors: Yizhi Song, Zhifei Zhang, Zhe Lin, Scott Cohen, Brian Price, Jianming Zhang, Soo Ye Kim, Daniel Aliaga

Abstract: Object compositing based on 2D images is a challenging problem since it typically involves multiple processing stages such as color harmonization, geometry correction and shadow generation to generate realistic results. Furthermore, annotating training data pairs for compositing requires substantial manual effort from professionals, and is hardly scalable. Thus, with the recent advances in generat… ▽ More Object compositing based on 2D images is a challenging problem since it typically involves multiple processing stages such as color harmonization, geometry correction and shadow generation to generate realistic results. Furthermore, annotating training data pairs for compositing requires substantial manual effort from professionals, and is hardly scalable. Thus, with the recent advances in generative models, in this work, we propose a self-supervised framework for object compositing by leveraging the power of conditional diffusion models. Our framework can hollistically address the object compositing task in a unified model, transforming the viewpoint, geometry, color and shadow of the generated object while requiring no manual labeling. To preserve the input object's characteristics, we introduce a content adaptor that helps to maintain categorical semantics and object appearance. A data augmentation method is further adopted to improve the fidelity of the generator. Our method outperforms relevant baselines in both realism and faithfulness of the synthesized result images in a user study on various real-world images. △ Less

Submitted 5 December, 2022; v1 submitted 1 December, 2022; originally announced December 2022.

arXiv:2210.03735 [pdf, other]

doi 10.1145/3544548.3581001

"Help Me Help the AI": Understanding How Explainability Can Support Human-AI Interaction

Authors: Sunnie S. Y. Kim, Elizabeth Anne Watkins, Olga Russakovsky, Ruth Fong, Andrés Monroy-Hernández

Abstract: Despite the proliferation of explainable AI (XAI) methods, little is understood about end-users' explainability needs and behaviors around XAI explanations. To address this gap and contribute to understanding how explainability can support human-AI interaction, we conducted a mixed-methods study with 20 end-users of a real-world AI application, the Merlin bird identification app, and inquired abou… ▽ More Despite the proliferation of explainable AI (XAI) methods, little is understood about end-users' explainability needs and behaviors around XAI explanations. To address this gap and contribute to understanding how explainability can support human-AI interaction, we conducted a mixed-methods study with 20 end-users of a real-world AI application, the Merlin bird identification app, and inquired about their XAI needs, uses, and perceptions. We found that participants desire practically useful information that can improve their collaboration with the AI, more so than technical system details. Relatedly, participants intended to use XAI explanations for various purposes beyond understanding the AI's outputs: calibrating trust, improving their task skills, changing their behavior to supply better inputs to the AI, and giving constructive feedback to developers. Finally, among existing XAI approaches, participants preferred part-based explanations that resemble human reasoning and explanations. We discuss the implications of our findings and provide recommendations for future XAI design. △ Less

Submitted 16 February, 2023; v1 submitted 2 October, 2022; originally announced October 2022.

Comments: CHI 2023

Journal ref: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (CHI '23), April 23-28, 2023, Hamburg, Germany. ACM, New York, NY, USA

arXiv:2209.15155 [pdf]

doi 10.1126/science.adf1065

Negative refraction in hyperbolic hetero-bicrystals

Authors: A. J. Sternbach, S. L. Moore, A. Rikhter, S. Zhang, R. Jing, Y. Shao, B. S. Y. Kim, S. Xu, S. Liu, J. H. Edgar, A. Rubio, C. Dean, J. Hone, M. M. Fogler, D. N. Basov

Abstract: We visualized negative refraction of phonon polaritons, which occurs at the interface between two natural crystals. The polaritons - hybrids of infrared photons and lattice vibrations - form collimated rays that display negative refraction when passing through a planar interface between the two hyperbolic van der Waals materials: molybdenum oxide ($MoO_3$) and isotopically pure hexagonal boron nit… ▽ More We visualized negative refraction of phonon polaritons, which occurs at the interface between two natural crystals. The polaritons - hybrids of infrared photons and lattice vibrations - form collimated rays that display negative refraction when passing through a planar interface between the two hyperbolic van der Waals materials: molybdenum oxide ($MoO_3$) and isotopically pure hexagonal boron nitride ($h^{11}BN$). At a special frequency $ω_0$, these rays can circulate along closed diamond-shaped trajectories. We have shown that polariton eigenmodes display regions of both positive and negative dispersion interrupted by multiple gaps that result from polaritonic level repulsion and strong coupling. △ Less

Submitted 7 July, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

arXiv:2209.15022 [pdf, other]

doi 10.1093/mnras/stad752

Andromeda XXV -- a dwarf galaxy with a low central dark matter density

Authors: Emily J. E. Charles, Michelle L. M. Collins, R. Michael Rich, Justin I. Read, Stacy Y. Kim, Rodrigo A. Ibata, Nicolas F. Martin, Scott C. Chapman, Eduardo Balbinot, Daniel R. Weisz

Abstract: Andromeda (And) XXV has previously been reported as a dwarf spheroidal galaxy (dSph) with little-to-no dark matter. However, the uncertainties on this result were significant. In this study, we double the number of member stars and re-derive the kinematics and mass of And XXV. We find that And XXV has a systemic velocity of $ν_\mathrm{r}=-107.7\pm1.0 \mathrm{~km s}^{-1}$ and a velocity dispersion… ▽ More Andromeda (And) XXV has previously been reported as a dwarf spheroidal galaxy (dSph) with little-to-no dark matter. However, the uncertainties on this result were significant. In this study, we double the number of member stars and re-derive the kinematics and mass of And XXV. We find that And XXV has a systemic velocity of $ν_\mathrm{r}=-107.7\pm1.0 \mathrm{~km s}^{-1}$ and a velocity dispersion of $σ_ν=4.5\pm1.0\mathrm{~km s}^{-1}$. With this better constrained velocity dispersion, we derive a mass contained within the half-light radius of $M(r< r_\mathrm{h})=6.9^{+3.2}_{-2.8}\times10^6\mathrm{~M}_\odot$. This mass corresponds to a mass-to-light ratio of $\mathrm{[M/L]}_\mathrm{r_\mathrm{h}}=37^{+17}_{-15}\mathrm{~M}_\odot/\mathrm{L}_\odot$, demonstrating, for the first time, that And XXV has an unambiguous dark matter component. We also measure the metallicity of And XXV to be $\mathrm{[Fe/H]}=-1.9\pm0.1$$\mathrm{~}$dex, which is in agreement with previous results. Finally, we extend the analysis of And XXV to include mass modelling using GravSphere. We find that And XXV has a low central dark matter density, $ρ_\mathrm{DM}(150\mathrm{pc})= 2.7^{+1.8}_{-1.6}\times10^7\mathrm{~M}_\odot\mathrm{kpc}^{-3}$, making And XXV a clear outlier when compared to other Local Group (LG) dSphs of the similar stellar mass. In a companion paper, we will explore whether some combination of dark matter cusp-core transformations and/or tides can explain And XXV's low density. △ Less

Submitted 29 September, 2022; originally announced September 2022.

Comments: 13 pages, 8 figures (7 main, 1 appendix). Submitted to MNRAS

arXiv:2208.02255 [pdf, other]

doi 10.3847/1538-4357/acc582

Streams on FIRE: Populations of Detectable Stellar Streams in the Milky Way and FIRE

Authors: Nora Shipp, Nondh Panithanpaisal, Lina Necib, Robyn Sanderson, Denis Erkal, Ting S. Li, Isaiah B. Santistevan, Andrew Wetzel, Lara R. Cullinane, Alexander P. Ji, Sergey E. Koposov, Kyler Kuehn, Geraint F. Lewis, Andrew B. Pace, Daniel B. Zucker, Joss Bland-Hawthorn, Emily C. Cunningham, Stacy Y. Kim, Sophia Lilleengen, Jorge Moreno, Sanjib Sharma

Abstract: We present the first detailed study comparing the populations of stellar streams in cosmological simulations to observed Milky Way dwarf galaxy streams. In particular, we compare streams identified around Milky Way analogs in the FIRE-2 simulations to stellar streams observed by the Southern Stellar Stream Spectroscopic Survey (S5). For an accurate comparison between the stream populations, we pro… ▽ More We present the first detailed study comparing the populations of stellar streams in cosmological simulations to observed Milky Way dwarf galaxy streams. In particular, we compare streams identified around Milky Way analogs in the FIRE-2 simulations to stellar streams observed by the Southern Stellar Stream Spectroscopic Survey (S5). For an accurate comparison between the stream populations, we produce mock Dark Energy Survey (DES) observations of the FIRE streams and estimate the detectability of their tidal tails and progenitors. The number and stellar mass distributions of detectable stellar streams is consistent between observations and simulations. However, there are discrepancies in the distributions of pericenters and apocenters, with the detectable FIRE streams, on average, forming at larger pericenters (out to > 110 kpc) and surviving only at larger apocenters (> 40 kpc) than those observed in the Milky Way. We find that the population of high-stellar mass dwarf galaxy streams in the Milky Way is incomplete. Interestingly, a large fraction of the FIRE streams would only be detected as satellites in DES-like observations, since their tidal tails are too low-surface brightness to be detectable. We thus predict a population of yet-undetected tidal tails around Milky Way satellites, as well as a population of fully undetected low surface brightness stellar streams, and estimate their detectability with the Rubin Observatory. Finally, we discuss the causes and implications of the discrepancies between the stream populations in FIRE and the Milky Way, and explore future avenues for tests of satellite disruption in cosmological simulations. △ Less

Submitted 3 August, 2022; originally announced August 2022.

Comments: 24 pages, 13 figures, 3 tables, submitted to ApJ

arXiv:2207.09615 [pdf, other]

Overlooked factors in concept-based explanations: Dataset choice, concept learnability, and human capability

Authors: Vikram V. Ramaswamy, Sunnie S. Y. Kim, Ruth Fong, Olga Russakovsky

Abstract: Concept-based interpretability methods aim to explain deep neural network model predictions using a predefined set of semantic concepts. These methods evaluate a trained model on a new, "probe" dataset and correlate model predictions with the visual concepts labeled in that dataset. Despite their popularity, they suffer from limitations that are not well-understood and articulated by the literatur… ▽ More Concept-based interpretability methods aim to explain deep neural network model predictions using a predefined set of semantic concepts. These methods evaluate a trained model on a new, "probe" dataset and correlate model predictions with the visual concepts labeled in that dataset. Despite their popularity, they suffer from limitations that are not well-understood and articulated by the literature. In this work, we analyze three commonly overlooked factors in concept-based explanations. First, the choice of the probe dataset has a profound impact on the generated explanations. Our analysis reveals that different probe datasets may lead to very different explanations, and suggests that the explanations are not generalizable outside the probe dataset. Second, we find that concepts in the probe dataset are often less salient and harder to learn than the classes they claim to explain, calling into question the correctness of the explanations. We argue that only visually salient concepts should be used in concept-based explanations. Finally, while existing methods use hundreds or even thousands of concepts, our human studies reveal a much stricter upper bound of 32 concepts or less, beyond which the explanations are much less practically useful. We make suggestions for future development and analysis of concept-based interpretability methods. Code for our analysis and user interface can be found at \url{https://github.com/princetonvisualai/OverlookedFactors} △ Less

Submitted 12 May, 2023; v1 submitted 19 July, 2022; originally announced July 2022.

Comments: Published at CVPR 2023

arXiv:2207.08287 [pdf]

Spatial Distribution of Solar PV Deployment: An Application of the Region-Based Convolutional Neural Network

Authors: Serena Y. Kim, Koushik Ganesan, Crystal Soderman, Raven O'Rourke

Abstract: This paper presents a comprehensive analysis of the social and environmental determinants of solar photovoltaic (PV) deployment rates in Colorado, USA. Using 652,795 satellite imagery and computer vision frameworks based on a convolutional neural network, we estimated the proportion of households with solar PV systems and the roof areas covered by solar panels. At the census block group level, 7%… ▽ More This paper presents a comprehensive analysis of the social and environmental determinants of solar photovoltaic (PV) deployment rates in Colorado, USA. Using 652,795 satellite imagery and computer vision frameworks based on a convolutional neural network, we estimated the proportion of households with solar PV systems and the roof areas covered by solar panels. At the census block group level, 7% of Coloradan households have a rooftop PV system, and 2.5% of roof areas in Colorado are covered by solar panels as of 2021. Our machine learning models predict solar PV deployment based on 43 natural and social characteristics of neighborhoods. Using four algorithms (Random Forest, CATBoost, LightGBM, XGBoost), we find that the share of Democratic party votes, hail risks, strong wind risks, median home value, and solar PV permitting timelines are the most important predictors of solar PV count per household. In addition to the size of the houses, PV-to-roof area ratio is highly dependent on solar PV permitting timelines, proportion of renters and multifamily housing, and winter weather risks. We also find racial and ethnic disparities in rooftop solar deployment. The average marginal effects of median household income on solar deployment are lower in communities with a greater proportion of African American and Hispanic residents and are higher in communities with a greater proportion of White and Asian residents. In the ongoing energy transition, knowing the key predictors of solar deployment can better inform business and policy decision making for more efficient and equitable grid infrastructure investment and distributed energy resource management. △ Less

Submitted 17 July, 2022; originally announced July 2022.

arXiv:2207.02516 [pdf, other]

Ask Me What You Need: Product Retrieval using Knowledge from GPT-3

Authors: Su Young Kim, Hyeonjin Park, Kyuyong Shin, Kyung-Min Kim

Abstract: As online merchandise become more common, many studies focus on embedding-based methods where queries and products are represented in the semantic space. These methods alleviate the problem of vocab mismatch between the language of queries and products. However, past studies usually dealt with queries that precisely describe the product, and there still exists the need to answer imprecise queries… ▽ More As online merchandise become more common, many studies focus on embedding-based methods where queries and products are represented in the semantic space. These methods alleviate the problem of vocab mismatch between the language of queries and products. However, past studies usually dealt with queries that precisely describe the product, and there still exists the need to answer imprecise queries that may require common sense knowledge, i.e., 'what should I get my mom for Mother's Day.' In this paper, we propose a GPT-3 based product retrieval system that leverages the knowledge-base (KB) of GPT-3 for question answering; users do not need to know the specific illustrative keywords for a product when querying. Our method tunes prompt tokens of GPT-3 to prompt knowledge and render answers that are mapped directly to products without further processing. Our method shows consistent performance improvement on two real-world and one public dataset, compared to the baseline methods. We provide an in-depth discussion on leveraging GPT-3 knowledge into a question answering based retrieval system. △ Less

Submitted 6 July, 2022; originally announced July 2022.

Comments: Accepted to DLP-KDD 2022 Workshop

arXiv:2206.14075 [pdf, other]

Mapping charge capture and acceleration in a plasma wakefield of a proton bunch using variable emittance electron beam injection

Authors: E. Granados, L. Verra, A. -M. Bachmann, E. Chevallay, S. Doebert, V. Fedosseev, F. Friebel, S. Gessner, E. Gschwendtner, S. Y. Kim, S. Mazzoni, J. T. Moody, M. Turner

Abstract: In the Phase 2 of the AWAKE first experimental run (from May to November 2018), an electron beam was used to probe and test proton-driven wakefield acceleration in a rubidium plasma column. In this work, we analyze the overall charge capture and shot-to-shot reproducibility of the proton-driven plasma wakefield accelerator with various electron bunch injection parameters. The witness electron bunc… ▽ More In the Phase 2 of the AWAKE first experimental run (from May to November 2018), an electron beam was used to probe and test proton-driven wakefield acceleration in a rubidium plasma column. In this work, we analyze the overall charge capture and shot-to-shot reproducibility of the proton-driven plasma wakefield accelerator with various electron bunch injection parameters. The witness electron bunches were produced using an RF-gun equipped with a Cs2Te photocathode illuminated by a tailorable ultrafast deep ultraviolet (UV) laser pulse. The construction of the UV beam optical system enabled appropriate transverse beam shaping and control of its pulse duration, size, and position on the photocathode, as well as time delay with respect to the ionizing laser pulse that seeds the plasma wakefields in the proton bunches. Variable photocathode illumination provided the required flexibility to produce electron bunches with variable charge, emittance, and injection trajectory into the plasma column. We demonstrate charge capture rates exceeding 15% (40 pC of GeV accelerated charge for a 385 pC injected electron bunch) under optimized electron injection conditions. △ Less

Submitted 28 June, 2022; originally announced June 2022.

arXiv:2206.12754 [pdf]

Atomically imprinted graphene plasmonic cavities

Authors: Brian S. Y. Kim, Aaron J. Sternbach, Min Sup Choi, Zhiyuan Sun, Francesco L. Ruta, Yinming Shao, Alexander S. McLeod, Lin Xiong, Yinan Dong, Anjaly Rajendran, Song Liu, Ankur Nipane, Sang Hoon Chae, Amirali Zangiabadi, Xiaodong Xu, Andrew J. Millis, P. James Schuck, Cory. R. Dean, James C. Hone, D. N. Basov

Abstract: Plasmon polaritons in van der Waals (vdW) materials hold promise for next-generation photonics. The ability to deterministically imprint spatial patterns of high carrier density in cavities and circuitry with nanoscale features underlies future progress in nonlinear nanophotonics and strong light-matter interactions. Here, we demonstrate a general strategy to atomically imprint low-loss graphene p… ▽ More Plasmon polaritons in van der Waals (vdW) materials hold promise for next-generation photonics. The ability to deterministically imprint spatial patterns of high carrier density in cavities and circuitry with nanoscale features underlies future progress in nonlinear nanophotonics and strong light-matter interactions. Here, we demonstrate a general strategy to atomically imprint low-loss graphene plasmonic structures using oxidation-activated charge transfer (OCT). We cover graphene with a monolayer of WSe$_2$, which is subsequently oxidized into high work-function WOx to activate charge transfer. Nano-infrared imaging reveals low-loss plasmon polaritons at the WOx/graphene interface. We insert WSe$_2$ spacers to precisely control the OCT-induced carrier density and achieve a near-intrinsic quality factor of plasmons. Finally, we imprint canonical plasmonic cavities exhibiting laterally abrupt doping profiles with single-digit nanoscale precision via programmable OCT. Specifically, we demonstrate technologically appealing but elusive plasmonic whispering-gallery resonators based on free-standing graphene encapsulated in WOx. Our results open avenues for novel quantum photonic architectures incorporating two-dimensional materials. △ Less

Submitted 25 June, 2022; originally announced June 2022.

Comments: 17 pages, 4 figures

arXiv:2206.07690 [pdf, other]

ELUDE: Generating interpretable explanations via a decomposition into labelled and unlabelled features

Authors: Vikram V. Ramaswamy, Sunnie S. Y. Kim, Nicole Meister, Ruth Fong, Olga Russakovsky

Abstract: Deep learning models have achieved remarkable success in different areas of machine learning over the past decade; however, the size and complexity of these models make them difficult to understand. In an effort to make them more interpretable, several recent works focus on explaining parts of a deep neural network through human-interpretable, semantic attributes. However, it may be impossible to… ▽ More Deep learning models have achieved remarkable success in different areas of machine learning over the past decade; however, the size and complexity of these models make them difficult to understand. In an effort to make them more interpretable, several recent works focus on explaining parts of a deep neural network through human-interpretable, semantic attributes. However, it may be impossible to completely explain complex models using only semantic attributes. In this work, we propose to augment these attributes with a small set of uninterpretable features. Specifically, we develop a novel explanation framework ELUDE (Explanation via Labelled and Unlabelled DEcomposition) that decomposes a model's prediction into two parts: one that is explainable through a linear combination of the semantic attributes, and another that is dependent on the set of uninterpretable features. By identifying the latter, we are able to analyze the "unexplained" portion of the model, obtaining insights into the information used by the model. We show that the set of unlabelled features can generalize to multiple models trained with the same feature space and compare our work to two popular attribute-oriented methods, Interpretable Basis Decomposition and Concept Bottleneck, and discuss the additional insights ELUDE provides. △ Less

Submitted 16 June, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

arXiv:2206.03048 [pdf, other]

Layered Depth Refinement with Mask Guidance

Authors: Soo Ye Kim, Jianming Zhang, Simon Niklaus, Yifei Fan, Simon Chen, Zhe Lin, Munchurl Kim

Abstract: Depth maps are used in a wide range of applications from 3D rendering to 2D image effects such as Bokeh. However, those predicted by single image depth estimation (SIDE) models often fail to capture isolated holes in objects and/or have inaccurate boundary regions. Meanwhile, high-quality masks are much easier to obtain, using commercial auto-masking tools or off-the-shelf methods of segmentation… ▽ More Depth maps are used in a wide range of applications from 3D rendering to 2D image effects such as Bokeh. However, those predicted by single image depth estimation (SIDE) models often fail to capture isolated holes in objects and/or have inaccurate boundary regions. Meanwhile, high-quality masks are much easier to obtain, using commercial auto-masking tools or off-the-shelf methods of segmentation and matting or even by manual editing. Hence, in this paper, we formulate a novel problem of mask-guided depth refinement that utilizes a generic mask to refine the depth prediction of SIDE models. Our framework performs layered refinement and inpainting/outpainting, decomposing the depth map into two separate layers signified by the mask and the inverse mask. As datasets with both depth and mask annotations are scarce, we propose a self-supervised learning scheme that uses arbitrary masks and RGB-D datasets. We empirically show that our method is robust to different types of masks and initial depth predictions, accurately refining depth values in inner and outer mask boundary regions. We further analyze our model with an ablation study and demonstrate results on real applications. More information can be found at https://sooyekim.github.io/MaskDepth/ . △ Less

Submitted 7 June, 2022; originally announced June 2022.

Comments: Accepted to CVPR 2022 (camera-ready version)

arXiv:2206.01828 [pdf]

doi 10.1126/sciadv.add6169

Infrared Plasmons Propagate through a Hyperbolic Nodal Metal

Authors: Yinming Shao, Aaron J. Sternbach, Brian S. Y. Kim, Andrey A. Rikhter, Xinyi Xu, Umberto De Giovannini, Ran Jing, Sang Hoon Chae, Zhiyuan Sun, Seng Huat Lee, Yanglin Zhu, Zhiqiang Mao, J. Hone, Raquel Queiroz, A. J. Millis, P. James Schuck, A. Rubio, M. M. Fogler, D. N. Basov

Abstract: Metals are canonical plasmonic media at infrared and optical wavelengths, allowing one to guide and manipulate light at the nano-scale. A special form of optical waveguiding is afforded by highly anisotropic crystals revealing the opposite signs of the dielectric functions along orthogonal directions. These media are classified as hyperbolic and include crystalline insulators, semiconductors and a… ▽ More Metals are canonical plasmonic media at infrared and optical wavelengths, allowing one to guide and manipulate light at the nano-scale. A special form of optical waveguiding is afforded by highly anisotropic crystals revealing the opposite signs of the dielectric functions along orthogonal directions. These media are classified as hyperbolic and include crystalline insulators, semiconductors and artificial metamaterials. Layered anisotropic metals are also anticipated to support hyperbolic waveguiding. Yet this behavior remains elusive, primarily because interband losses arrest the propagation of infrared modes. Here, we report on the observation of propagating hyperbolic waves in a prototypical layered nodal-line semimetal ZrSiSe. The observed waveguiding originates from polaritonic hybridization between near-infrared light and nodal-line plasmons. Unique nodal electronic structures simultaneously suppress interband loss and boost the plasmonic response, ultimately enabling the propagation of infrared modes through the bulk of the crystal. △ Less

Submitted 3 June, 2022; originally announced June 2022.

Journal ref: Sci. Adv. 8, eadd6169 (2022)

arXiv:2205.06859 [pdf]

doi 10.48550/arXiv.2205.06859

Large-Area Intercalated 2D-Pb/Graphene Heterostructure as a Platform for Generating Spin-Orbit Torque

Authors: Alexander Vera, Boyang Zheng, Wilson Yanez, Kaijie Yang, Seong Yeoul Kim, Jimmy C. Kotsakidis, Hesham El-Sherif, Gopi Krishnan, Roland J. Koch, Timothy A. Bowen, Chengye Dong, Yuanxi Wang, Maxwell Wetherington, Eli Rotenberg, Nabil Bassim, Adam L. Friedman, Robert M. Wallace, Chaoxing Liu, Nitin Samarth, Vincent H. Crespi, Joshua A. Robinson

Abstract: A scalable platform to synthesize ultrathin heavy metals may enable high efficiency charge-to-spin conversion for next-generation spintronics. Here we report centimeter-scale synthesis of air-stable, epitaxially registered monolayer Pb underneath bilayer graphene on SiC (0001) by confinement heteroepitaxy (CHet). Diffraction, spectroscopy, and microscopy reveal CHet-based Pb intercalation predomin… ▽ More A scalable platform to synthesize ultrathin heavy metals may enable high efficiency charge-to-spin conversion for next-generation spintronics. Here we report centimeter-scale synthesis of air-stable, epitaxially registered monolayer Pb underneath bilayer graphene on SiC (0001) by confinement heteroepitaxy (CHet). Diffraction, spectroscopy, and microscopy reveal CHet-based Pb intercalation predominantly exhibits a mottled hexagonal superstructure due to an ordered network of Frenkel-Kontorova-like domain walls. The system's air stability enables ex-situ spin torque ferromagnetic resonance (ST-FMR) measurements that demonstrate charge-to-spin conversion in graphene/Pb/ferromagnet heterostructures with a 1.5x increase in the effective field ratio compared to control samples. △ Less

Submitted 27 March, 2024; v1 submitted 13 May, 2022; originally announced May 2022.

Comments: 27 pages, 5 figures. Supporting Information included (16 pages, 14 figures, 1 table)

arXiv:2204.09215 [pdf, other]

doi 10.1103/PhysRevD.106.012005

Measurement of cosmogenic $^9$Li and $^8$He production rates at RENO

Authors: H. G. Lee, J. H. Choi, H. I. Jang, J. S. Jang, S. H. Jeon, K. K. Joo, D. E. Jung, J. G. Kim, J. H. Kim, J. Y. Kim, S. B. Kim, S. Y. Kim, W. Kim, E. Kwon, D. H. Lee, W. J. Lee, I. T. Lim, D. H. Moon, M. Y. Pac, J. S. Park, R. G. Park, H. Seo, J. W. Seo, C. D. Shin, B. S. Yang , et al. (4 additional authors not shown)

Abstract: We report the measured production rates of unstable isotopes $^9$Li and $^8$He produced by cosmic muon spallation on $^{12}$C using two identical detectors of the RENO experiment. Their beta-decays accompanied by a neutron make a significant contribution to backgrounds of reactor antineutrino events in precise determination of the smallest neutrino mixing angle. The mean muon energy of its near (f… ▽ More We report the measured production rates of unstable isotopes $^9$Li and $^8$He produced by cosmic muon spallation on $^{12}$C using two identical detectors of the RENO experiment. Their beta-decays accompanied by a neutron make a significant contribution to backgrounds of reactor antineutrino events in precise determination of the smallest neutrino mixing angle. The mean muon energy of its near (far) detector with an overburden of 120 (450) m.w.e. is estimated as 33.1 +- 2.3 (73.6 +- 4.4) GeV. Based on roughly 3100 days of data, the cosmogenic production rate of $^9$Li ($^8$He) isotope is measured to be 44.2 +- 3.1 (10.6 +- 7.4) per day at near detector and 10.0 +- 1.1 (2.1 +- 1.5) per day at far detector. This corresponds to yields of $^9$Li ($^8$He), 4.80 +- 0.36 (1.15 +- 0.81) and 9.9 +- 1.1 (2.1 +- 1.5) at near and far detectors, respectively, in a unit of 10$^{-8}$ $μ^{-1}$ g${^-1}$ cm${^2}$. Combining the measured $^9$Li yields with other available underground measurements, an excellent power-law relationship of the yield with respect to the mean muon energy is found to have an exponent of $α$ = 0.75 +- 0.05. △ Less

Submitted 2 July, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

Comments: 11 pages, 14 figures

arXiv:2201.13434 [pdf, other]

doi 10.1093/mnras/stac1755

EDGE: the puzzling ellipticity of Eridanus II's star cluster and its implications for dark matter at the heart of an ultra-faint dwarf

Authors: Matthew D. A. Orkney, Justin I. Read, Oscar Agertz, Andrew Pontzen, Martin P. Rey, Alex Goater, Ethan Taylor, Stacy Y. Kim, Maxime Delorme

Abstract: The Eridanus II (EriII) 'ultra-faint' dwarf has a large ($15\,\text{pc}$) and low mass ($4.3\times10^3\,\text{M}_\odot$) star cluster (SC) offset from its centre by $23\pm3\,\text{pc}$ in projection. Its size and offset are naturally explained if EriII has a central dark matter core, but such a core may be challenging to explain in a $Λ$CDM cosmology. In this paper, we revisit the survival and evo… ▽ More The Eridanus II (EriII) 'ultra-faint' dwarf has a large ($15\,\text{pc}$) and low mass ($4.3\times10^3\,\text{M}_\odot$) star cluster (SC) offset from its centre by $23\pm3\,\text{pc}$ in projection. Its size and offset are naturally explained if EriII has a central dark matter core, but such a core may be challenging to explain in a $Λ$CDM cosmology. In this paper, we revisit the survival and evolution of EriII's SC, focussing for the first time on its puzzlingly large ellipticity ($0.31^{+0.05}_{-0.06}$). We perform a suite of 960 direct $N$-body simulations of SCs, orbiting within a range of spherical background potentials fit to ultra-faint dwarf (UFD) galaxy simulations. We find only two scenarios that come close to explaining EriII's SC. In the first, EriII has a low density dark matter core (of size $\sim70\,\text{pc}$ and density $\lesssim2\times10^8\,\text{M}_{\odot}\,\text{kpc}^{-3}$). In this model, the high ellipticity of EriII's SC is set at birth, with the lack of tidal forces in the core allowing its ellipticity to remain frozen in for long times. In the second, EriII's SC orbits in a partial core, with its high ellipticity owing to its imminent tidal destruction. However, this latter model struggles to reproduce the large size of EriII's SC, and it predicts substantial tidal tails around EriII's SC that should have already been seen in the data. This leads us to favour the cored model. We discuss potential caveats to these findings, and the implications of the cored model for galaxy formation and the nature of dark matter. △ Less

Submitted 1 August, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

Comments: 16 pages, 12 figures + appendices. Published with MNRAS. Comments welcome

arXiv:2112.14974 [pdf, ps, other]

Q-effectiveness for holomorphic subelliptic multipliers

Authors: Dmitri Zaitsev, Sung Yeon Kim

Abstract: We provide a solution to the effectiveness problem in Kohn's algorithm for generating holomorphic subelliptic multipliers for $(0,q)$ forms for arbitrary $q$. As an application, we obtain subelliptic estimates for $(0,q)$ forms with effectively controlled order $ε>0$ (the Sobolev exponent) for domains given by sums of squares of holomorphic functions (J.J. Kohn called them "special domains"). Thes… ▽ More We provide a solution to the effectiveness problem in Kohn's algorithm for generating holomorphic subelliptic multipliers for $(0,q)$ forms for arbitrary $q$. As an application, we obtain subelliptic estimates for $(0,q)$ forms with effectively controlled order $ε>0$ (the Sobolev exponent) for domains given by sums of squares of holomorphic functions (J.J. Kohn called them "special domains"). These domains are of particular interest due to their relation with complex and algebraic geometry. Our methods include triangular resolutions introduced by the authors in their previous work. △ Less

Submitted 30 December, 2021; originally announced December 2021.

Comments: arXiv admin note: substantial text overlap with arXiv:2003.06482

MSC Class: 2010; 32T25; 32T27; 32W05; 32S05; 32S10; 32S45; 32B10; 32V15; 32V35; 32V40

arXiv:2112.03280 [pdf, other]

doi 10.1093/mnras/stac502

EDGE: What shapes the relationship between HI and stellar observables in faint dwarf galaxies?

Authors: Martin P. Rey, Andrew Pontzen, Oscar Agertz, Matthew D. A. Orkney, Justin I. Read, Amélie Saintonge, Stacy Y. Kim, Payel Das

Abstract: We show how the interplay between feedback and mass-growth histories introduces scatter in the relationship between stellar and neutral gas properties of field faint dwarf galaxies ($M_{\star} \lessapprox 10^{6} M_{\odot}$). Across a suite of cosmological, high-resolution zoomed simulations, we find that dwarf galaxies of stellar masses $10^5 \leq M_{\star} \leq 10^{6} M_{\odot}$ are bimodal in th… ▽ More We show how the interplay between feedback and mass-growth histories introduces scatter in the relationship between stellar and neutral gas properties of field faint dwarf galaxies ($M_{\star} \lessapprox 10^{6} M_{\odot}$). Across a suite of cosmological, high-resolution zoomed simulations, we find that dwarf galaxies of stellar masses $10^5 \leq M_{\star} \leq 10^{6} M_{\odot}$ are bimodal in their cold gas content, being either HI-rich or HI-deficient. This bimodality is generated through the coupling between (i) the modulation of HI contents by the background of ultraviolet radiation (UVB) at late times and (ii) the significant scatter in the stellar-mass-halo-mass relationship induced by reionization. Furthermore, our HI-rich dwarfs exhibit disturbed and time-variable neutral gas distributions primarily due to stellar feedback. Over the last four billion years, we observe order-of-magnitude changes around the median $M_{HI}$, factor-of-a-few variations in HI spatial extents, and spatial offsets between HI and stellar components regularly exceeding the galaxies' optical sizes. Time variability introduces further scatter in the $M_{\star}-M_{HI}$ relation and affects a galaxy's detectability in HI at any given time. These effects will need to be accounted for when interpreting observations of the population of faint, HI-bearing dwarfs by the combination of optical and radio wide, deep surveys. △ Less

Submitted 12 March, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

Comments: Matching MNRAS-accepted version after minor revisions. Results unchanged

arXiv:2112.03184 [pdf, other]

HIVE: Evaluating the Human Interpretability of Visual Explanations

Authors: Sunnie S. Y. Kim, Nicole Meister, Vikram V. Ramaswamy, Ruth Fong, Olga Russakovsky

Abstract: As AI technology is increasingly applied to high-impact, high-risk domains, there have been a number of new methods aimed at making AI models more human interpretable. Despite the recent growth of interpretability work, there is a lack of systematic evaluation of proposed techniques. In this work, we introduce HIVE (Human Interpretability of Visual Explanations), a novel human evaluation framework… ▽ More As AI technology is increasingly applied to high-impact, high-risk domains, there have been a number of new methods aimed at making AI models more human interpretable. Despite the recent growth of interpretability work, there is a lack of systematic evaluation of proposed techniques. In this work, we introduce HIVE (Human Interpretability of Visual Explanations), a novel human evaluation framework that assesses the utility of explanations to human users in AI-assisted decision making scenarios, and enables falsifiable hypothesis testing, cross-method comparison, and human-centered evaluation of visual interpretability methods. To the best of our knowledge, this is the first work of its kind. Using HIVE, we conduct IRB-approved human studies with nearly 1000 participants and evaluate four methods that represent the diversity of computer vision interpretability works: GradCAM, BagNet, ProtoPNet, and ProtoTree. Our results suggest that explanations engender human trust, even for incorrect predictions, yet are not distinct enough for users to distinguish between correct and incorrect predictions. We open-source HIVE to enable future studies and encourage more human-centered approaches to interpretability research. △ Less

Submitted 21 July, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

Comments: ECCV 2022. Code and supplementary material are at https://princetonvisualai.github.io/HIVE

arXiv:2112.00321 [pdf]

doi 10.1038/s41467-021-27524-w

Deep-ultraviolet electroluminescence and photocurrent generation in graphene/hBN/graphene heterostructures

Authors: Su-Beom Song, Sangho Yoon, So Young Kim, Sera Yang, Seung-Young Seo, Soonyoung Cha, Hyeon-Woo Jeong, Kenji Watanabe, Takashi Taniguchi, Gil-Ho Lee, Jun Sung Kim, Moon-Ho Jo, Jonghwan Kim

Abstract: Hexagonal boron nitride (hBN) is a van der Waals semiconductor with a wide bandgap of ~ 5.96 eV. Despite the indirect bandgap characteristics of hBN, charge carriers excited by high energy electrons or photons efficiently emit luminescence at deep-ultraviolet (DUV) frequencies via strong electron-phonon interaction, suggesting potential DUV light emitting device applications. However, electrolumin… ▽ More Hexagonal boron nitride (hBN) is a van der Waals semiconductor with a wide bandgap of ~ 5.96 eV. Despite the indirect bandgap characteristics of hBN, charge carriers excited by high energy electrons or photons efficiently emit luminescence at deep-ultraviolet (DUV) frequencies via strong electron-phonon interaction, suggesting potential DUV light emitting device applications. However, electroluminescence from hBN has not been demonstrated at DUV frequencies so far. In this study, we report DUV electroluminescence and photocurrent generation in graphene/hBN/graphene heterostructures at room temperature. Tunneling carrier injection from graphene electrodes into the band edges of hBN enables prominent electroluminescence at DUV frequencies. On the other hand, under DUV laser illumination and external bias voltage, graphene electrodes efficiently collect photo-excited carriers in hBN, which generates high photocurrent. Laser excitation micro-spectroscopy shows that the radiative recombination and photocarrier excitation processes in the heterostructures mainly originate from the pristine structure and the stacking faults in hBN. Our work provides a pathway toward efficient DUV light emitting and detection devices based on hBN. △ Less

Submitted 1 December, 2021; originally announced December 2021.

Comments: 29 pages, 4 figures, Su-Beom Song and Sangho Yoon contributed equally to this work, To whom correspondence should be addressed: jonghwankim@postech.ac.kr

arXiv:2111.11294 [pdf, other]

Scaling Law for Recommendation Models: Towards General-purpose User Representations

Authors: Kyuyong Shin, Hanock Kwak, Su Young Kim, Max Nihlen Ramstrom, Jisu Jeong, Jung-Woo Ha, Kyung-Min Kim

Abstract: Recent advancement of large-scale pretrained models such as BERT, GPT-3, CLIP, and Gopher, has shown astonishing achievements across various task domains. Unlike vision recognition and language models, studies on general-purpose user representation at scale still remain underexplored. Here we explore the possibility of general-purpose user representation learning by training a universal user encod… ▽ More Recent advancement of large-scale pretrained models such as BERT, GPT-3, CLIP, and Gopher, has shown astonishing achievements across various task domains. Unlike vision recognition and language models, studies on general-purpose user representation at scale still remain underexplored. Here we explore the possibility of general-purpose user representation learning by training a universal user encoder at large scales. We demonstrate that the scaling law is present in user representation learning areas, where the training error scales as a power-law with the amount of computation. Our Contrastive Learning User Encoder (CLUE), optimizes task-agnostic objectives, and the resulting user embeddings stretch our expectation of what is possible to do in various downstream tasks. CLUE also shows great transferability to other domains and companies, as performances on an online experiment shows significant improvements in Click-Through-Rate (CTR). Furthermore, we also investigate how the model performance is influenced by the scale factors, such as training data size, model capacity, sequence length, and batch size. Finally, we discuss the broader impacts of CLUE in general. △ Less

Submitted 22 November, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

Comments: Accepted at AAAI 2023. This version includes the technical appendix

arXiv:2111.07482 [pdf, other]

doi 10.1140/epjc/s10052-022-10284-2

Characterization of the correlated background for a sterile neutrino search using the first dataset of the JSNS$^2$ experiment

Authors: Y. Hino, S. Ajimura, M. K. Cheoun, J. H. Choi, T. Dodo, H. Furuta, J. Goh, K. Haga, M. Harada, S. Hasegawa, T. Hiraiwa, W. Hwang, H. I. Jang, J. S. Jang, H. Jeon, S. Jeon, K. K. Joo, J. R. Jordan, D. E. Jung, S. K. Kang, Y. Kasugai, T. Kawasaki, E. J. Kim, J. Y. Kim, S. B. Kim , et al. (40 additional authors not shown)

Abstract: JSNS$^2$ (J-PARC Sterile Neutrino Search at J-PARC Spallation Neutron Source) is an experiment that is searching for sterile neutrinos via the observation of $\barν_μ \to \barν_{e}$ appearance oscillations using muon decay-at-rest neutrinos. Before dedicated data taking in the first-half of 2021, we performed a commissioning run for 10 days in June 2020. Using the data obtained in this commissioni… ▽ More JSNS$^2$ (J-PARC Sterile Neutrino Search at J-PARC Spallation Neutron Source) is an experiment that is searching for sterile neutrinos via the observation of $\barν_μ \to \barν_{e}$ appearance oscillations using muon decay-at-rest neutrinos. Before dedicated data taking in the first-half of 2021, we performed a commissioning run for 10 days in June 2020. Using the data obtained in this commissioning run, in this paper, we present an estimate of the correlated background which imitates the $\barν_{e}$ signal in a sterile neutrino search. In addition, in order to demonstrate future prospects of the JSNS$^2$ experiment, possible pulse shape discrimination improvements towards reducing cosmic ray induced fast neutron background are described. △ Less

Submitted 11 March, 2022; v1 submitted 14 November, 2021; originally announced November 2021.

Comments: 7 pages, 3 figures

arXiv:2110.15460 [pdf, other]

doi 10.2514/6.2021-4117

Human-Computer Interaction Glow Up: Examining Operational Trust and Intention Towards Mars Autonomous Systems

Authors: Thomas Chan, Jeremy Argueta, Jazlyn Armendariz, Allison Graham, Sarah Hwang, Basak Ramaswamy, So Young Kim, Scott Davidoff

Abstract: Tactful coordination on earth between hundreds of operators from diverse disciplines and backgrounds is needed to ensure that Martian rovers have a high likelihood of achieving their science goals while enduring the harsh environment of the red planet. The operations team includes many individuals, each with independent and overlapping objectives, working to decide what to execute on the Mars surf… ▽ More Tactful coordination on earth between hundreds of operators from diverse disciplines and backgrounds is needed to ensure that Martian rovers have a high likelihood of achieving their science goals while enduring the harsh environment of the red planet. The operations team includes many individuals, each with independent and overlapping objectives, working to decide what to execute on the Mars surface during the next planning period. The team must work together to understand each other's objectives and constraints within a fixed time period, often requiring frequent revision. This study examines the challenges faced during Mars surface operations, from high-level science objectives to formulating a valid, safe, and optimal activity plan that is ready to be radiated to the rover. Through this examination, we aim to illuminate how planning intent can be formulated and effectively communicated to future spacecrafts that will become more and more autonomous. Our findings reveal the intricate nature of human-to-human interactions that require a large array of soft skills and core competencies to communicate concurrently with science and engineering teams during plan formulation. Additionally, our findings exposed significant challenges in eliciting planning intent from operators, which will intensify in the future, as operators on the ground asynchronously co-operate the rover with the on board autonomy. Building a marvellous robot and landing it onto the Mars surface are remarkable feats -however, ensuring that scientists can get the best out of the mission is an ongoing challenge and will not cease to be a difficult task with increased autonomy. △ Less

Submitted 28 October, 2021; originally announced October 2021.

Comments: 9 pages, 1 figure, to appear in Proceedings of the 2021 American Institute of Aeronautics and Astronautics ASCEND Conference (AIAA ASCEND 2021)

arXiv:2106.09050 [pdf, other]

The Milky Way satellite velocity function is a sharp probe of small-scale structure problems

Authors: Stacy Y. Kim, Annika H. G. Peter

Abstract: Twenty years ago, the mismatch between the observed number of Milky Way satellite galaxies and the predicted number of cold dark matter (CDM) subhalos was dubbed the ``missing satellites problem". Although mostly framed since in terms of satellite counts in luminosity space, the missing satellites problem was originally posed in velocity space. Importantly, the stellar velocity dispersion function… ▽ More Twenty years ago, the mismatch between the observed number of Milky Way satellite galaxies and the predicted number of cold dark matter (CDM) subhalos was dubbed the ``missing satellites problem". Although mostly framed since in terms of satellite counts in luminosity space, the missing satellites problem was originally posed in velocity space. Importantly, the stellar velocity dispersion function encodes information about the density profile of satellites as well as their abundance. In this work, we completeness correct the MW satellite stellar velocity dispersion function down to its ultrafaint dwarfs ($L \gtrsim 340$ L$_\odot$). Our most conservative completeness correction is in good agreement with a simple CDM model in which massive, classical satellites (M$_{\rm 200} \gtrsim 5 \times 10^8~$M$_\odot$) have baryon-driven cores, while lower mass, ultrafaint satellites inhabit cuspy halos that are not strongly tidally stripped. Tidal destruction of satellites by the MW's disk must be minimal, otherwise the completeness-corrected velocity function exceeds any plausible CDM prediction -- a ``too many satellites" problem. We rule out non-core-collapsing self-interacting dark matter models with a constant cross section $\gtrsim$ 0.1 cm$^2$/g. Constraints on warm dark matter are stronger than those based on the luminosity function due to its additional sensitivity to subhalo central densities, which suppresses number counts by up to an additional 30%. A thermal relic mass $\gtrsim$ 6 keV is preferred. Reducing uncertainties on stellar velocity dispersion measurements and the amount of tidal stripping experienced by the faintest dwarfs is key to determining the severity of the too many satellites problem. △ Less

Submitted 4 August, 2022; v1 submitted 16 June, 2021; originally announced June 2021.

Comments: 20 pages, 13 figures. Key results are summarized in Figure 7. Updated following submission to MNRAS and reviewer's comments. Comments welcome!

arXiv:2106.00815 [pdf, other]

Cleaning and Structuring the Label Space of the iMet Collection 2020

Authors: Vivien Nguyen, Sunnie S. Y. Kim

Abstract: The iMet 2020 dataset is a valuable resource in the space of fine-grained art attribution recognition, but we believe it has yet to reach its true potential. We document the unique properties of the dataset and observe that many of the attribute labels are noisy, more than is implied by the dataset description. Oftentimes, there are also semantic relationships between the labels (e.g., identical,… ▽ More The iMet 2020 dataset is a valuable resource in the space of fine-grained art attribution recognition, but we believe it has yet to reach its true potential. We document the unique properties of the dataset and observe that many of the attribute labels are noisy, more than is implied by the dataset description. Oftentimes, there are also semantic relationships between the labels (e.g., identical, mutual exclusion, subsumption, overlap with uncertainty) which we believe are underutilized. We propose an approach to cleaning and structuring the iMet 2020 labels, and discuss the implications and value of doing so. Further, we demonstrate the benefits of our proposed approach through several experiments. Our code and cleaned labels are available at https://github.com/sunniesuhyoung/iMet2020cleaned. △ Less

Submitted 1 June, 2021; originally announced June 2021.

Comments: A shorter version of this work was accepted to the CVPR 2021 FGVC Workshop

arXiv:2104.13582 [pdf, other]

doi 10.5281/zenodo.4834352

[Re] Don't Judge an Object by Its Context: Learning to Overcome Contextual Bias

Authors: Sunnie S. Y. Kim, Sharon Zhang, Nicole Meister, Olga Russakovsky

Abstract: Singh et al. (2020) point out the dangers of contextual bias in visual recognition datasets. They propose two methods, CAM-based and feature-split, that better recognize an object or attribute in the absence of its typical context while maintaining competitive within-context accuracy. To verify their performance, we attempted to reproduce all 12 tables in the original paper, including those in the… ▽ More Singh et al. (2020) point out the dangers of contextual bias in visual recognition datasets. They propose two methods, CAM-based and feature-split, that better recognize an object or attribute in the absence of its typical context while maintaining competitive within-context accuracy. To verify their performance, we attempted to reproduce all 12 tables in the original paper, including those in the appendix. We also conducted additional experiments to better understand the proposed methods, including increasing the regularization in CAM-based and removing the weighted loss in feature-split. As the original code was not made available, we implemented the entire pipeline from scratch in PyTorch 1.7.0. Our implementation is based on the paper and email exchanges with the authors. We found that both proposed methods in the original paper help mitigate contextual bias, although for some methods, we could not completely replicate the quantitative results in the paper even after completing an extensive hyperparameter search. For example, on COCO-Stuff, DeepFashion, and UnRel, our feature-split model achieved an increase in accuracy on out-of-context images over the standard baseline, whereas on AwA, we saw a drop in performance. For the proposed CAM-based method, we were able to reproduce the original paper's results to within 0.5$\%$ mAP. Our implementation can be found at https://github.com/princetonvisualai/ContextualBias. △ Less

Submitted 28 April, 2021; originally announced April 2021.

Comments: ML Reproducibility Challenge 2020. Accepted for publication in the ReScience C journal

arXiv:2101.05269 [pdf, other]

doi 10.3847/1538-4357/abf7c4

Supernova Model Discrimination with Hyper-Kamiokande

Authors: Hyper-Kamiokande Collaboration, :, K. Abe, P. Adrich, H. Aihara, R. Akutsu, I. Alekseev, A. Ali, F. Ameli, I. Anghel, L. H. V. Anthony, M. Antonova, A. Araya, Y. Asaoka, Y. Ashida, V. Aushev, F. Ballester, I. Bandac, M. Barbi, G. J. Barker, G. Barr, M. Batkiewicz-Kwasniak, M. Bellato, V. Berardi, M. Bergevin , et al. (478 additional authors not shown)

Abstract: Core-collapse supernovae are among the most magnificent events in the observable universe. They produce many of the chemical elements necessary for life to exist and their remnants -- neutron stars and black holes -- are interesting astrophysical objects in their own right. However, despite millennia of observations and almost a century of astrophysical study, the explosion mechanism of core-colla… ▽ More Core-collapse supernovae are among the most magnificent events in the observable universe. They produce many of the chemical elements necessary for life to exist and their remnants -- neutron stars and black holes -- are interesting astrophysical objects in their own right. However, despite millennia of observations and almost a century of astrophysical study, the explosion mechanism of core-collapse supernovae is not yet well understood. Hyper-Kamiokande is a next-generation neutrino detector that will be able to observe the neutrino flux from the next galactic core-collapse supernova in unprecedented detail. We focus on the first 500 ms of the neutrino burst, corresponding to the accretion phase, and use a newly-developed, high-precision supernova event generator to simulate Hyper-Kamiokande's response to five different supernova models. We show that Hyper-Kamiokande will be able to distinguish between these models with high accuracy for a supernova at a distance of up to 100 kpc. Once the next galactic supernova happens, this ability will be a powerful tool for guiding simulations towards a precise reproduction of the explosion mechanism observed in nature. △ Less

Submitted 20 July, 2021; v1 submitted 13 January, 2021; originally announced January 2021.

Comments: 21 pages, 7 figures. Article based on thesis published as arXiv:2002.01649. v2: added references and some explanations in response to reviewer comments

Journal ref: Astrophys.J. 916 (2021) 15

Showing 1–50 of 107 results for author: Kim, S Y