-
Good plasmons in a bad metal
Authors:
Francesco L. Ruta,
Yinming Shao,
Swagata Acharya,
Anqi Mu,
Na Hyun Jo,
Sae Hee Ryu,
Daria Balatsky,
Dimitar Pashov,
Brian S. Y. Kim,
Mikhail I. Katsnelson,
James G. Analytis,
Eli Rotenberg,
Andrew J. Millis,
Mark van Schilfgaarde,
D. N. Basov
Abstract:
Correlated materials may exhibit unusually high resistivity increasing linearly in temperature, breaking through the Mott-Ioffe-Regel bound, above which coherent quasiparticles are destroyed. The fate of collective charge excitations, or plasmons, in these systems is a subject of debate. Several studies suggest plasmons are overdamped while others detect unrenormalized plasmons. Here, we present d…
▽ More
Correlated materials may exhibit unusually high resistivity increasing linearly in temperature, breaking through the Mott-Ioffe-Regel bound, above which coherent quasiparticles are destroyed. The fate of collective charge excitations, or plasmons, in these systems is a subject of debate. Several studies suggest plasmons are overdamped while others detect unrenormalized plasmons. Here, we present direct optical images of low-loss hyperbolic plasmon polaritons (HPPs) in the correlated van der Waals metal MoOCl2. HPPs are plasmon-photon modes that waveguide through extremely anisotropic media and are remarkably long-lived in MoOCl2. Many-body theory supported by photoemission results reveals that MoOCl2 is in an orbital-selective and highly incoherent Peierls phase. Different orbitals acquire markedly different bonding-antibonding character, producing a highly-anisotropic, isolated Fermi surface. The Fermi surface is further reconstructed and made partly incoherent by electronic interactions, renormalizing the plasma frequency. HPPs remain long-lived in spite of this, allowing us to uncover previously unseen imprints of electronic correlations on plasmonic collective modes.
△ Less
Submitted 9 June, 2024;
originally announced June 2024.
-
EDGE: A new model for Nuclear Star Cluster formation in dwarf galaxies
Authors:
Emily I. Gray,
Justin I. Read,
Ethan Taylor,
Matthew D. A. Orkney,
Martin P. Rey,
Robert M. Yates,
Stacy Y. Kim,
Noelia E. D. Noël,
Oscar Agertz,
Eric Andersson,
Andrew Pontzen
Abstract:
Nuclear Star Clusters (NSCs) are amongst the densest stellar systems in the Universe and are found at the centres of many bright spiral and elliptical galaxies, and up to ${\sim}$40% of dwarf galaxies. However, their formation mechanisms, and possible links to globular clusters (GCs), remain debated. This paper uses the EDGE simulations - a collection of zoom-in, cosmological simulations of isolat…
▽ More
Nuclear Star Clusters (NSCs) are amongst the densest stellar systems in the Universe and are found at the centres of many bright spiral and elliptical galaxies, and up to ${\sim}$40% of dwarf galaxies. However, their formation mechanisms, and possible links to globular clusters (GCs), remain debated. This paper uses the EDGE simulations - a collection of zoom-in, cosmological simulations of isolated dwarf galaxies -- to present a new formation mechanism for NSCs. We find that, at a gas spatial and mass resolution of ${\sim}3\,$pc and ${\sim}161$ M$_\odot$, respectively, NSCs naturally emerge in a subset of our EDGE dwarfs with redshift-zero halo masses of $\rm{M}_{\rm{r}200\rm{c}} \sim 5 \times 10^9$ M$_\odot$. These dwarfs are quenched by reionisation, but retain a significant reservoir of gas that is unable to cool and form stars. Sometime after reionisation, the dwarfs then undergo a major (${\sim}$1:1) merger that excites rapid gas cooling, leading to a significant starburst. An NSC forms in this starburst that then quenches star formation thereafter. The result is a nucleated dwarf that has two stellar populations with distinct age: one pre-reionisation and one post-reionisation. Our mechanism is unique for two key reasons. Firstly, the low mass of the host dwarf means that NSCs, formed in this way, can accrete onto galaxies of almost all masses, potentially seeding the formation of NSCs everywhere. Secondly, our model predicts that NSCs should have at least two stellar populations with a large ($\gtrsim$1 billion year) age separation. This yields a predicted colour magnitude diagram for our nucleated dwarfs that has two distinct main sequence turnoffs. Several GCs orbiting the Milky Way, including Omega Centauri and M54, show exactly this behaviour, suggesting that they may, in fact, be accreted NSCs.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Towards comprehensive coverage of chemical space: Quantum mechanical properties of 836k constitutional and conformational closed shell neutral isomers consisting of HCNOFSiPSClBr
Authors:
Danish Khan,
Anouar Benali,
Scott Y. H. Kim,
Guido Falk von Rudorff,
O. Anatole von Lilienfeld
Abstract:
The Vector-QM24 (VQM24) dataset attempts to more comprehensively cover all possible neutral closed shell small organic and inorganic molecules and their conformers at state of the art level of theory. We have used density functional theory ($ω$B97X-D3/cc-pVDZ) to optimize 577k conformational isomers corresponding to 258k constitutional isomers.Isomers included contain up to five heavy atoms (non-h…
▽ More
The Vector-QM24 (VQM24) dataset attempts to more comprehensively cover all possible neutral closed shell small organic and inorganic molecules and their conformers at state of the art level of theory. We have used density functional theory ($ω$B97X-D3/cc-pVDZ) to optimize 577k conformational isomers corresponding to 258k constitutional isomers.Isomers included contain up to five heavy atoms (non-hydrogen) consisting of $p$-block elements C, N, O, F, Si, P, S, Cl, Br. Single point diffusion quantum Monte Carlo (DMC@PBE0(ccECP/cc-pVQZ)) energies are reported for the sub-set of the lowest conformers of 10,793 molecules with up to 4 heavy atoms.This dataset has been systematically generated by considering all combinatorially possible stoichiometries, and graphs (according to Lewis rules as implemented in the {\tt SURGE} package), along with all stable conformers identified by GFN2-xTB. Apart from graphs, geometries, rotational constants, and vibrational normal modes, VQM24 includes internal, atomization, electron-electron repulsion, exchange correlation, dispersion, vibrational frequency, Gibbs free, enthalpy, ZPV, molecular orbital energies; as well as entropy, and heat capacities. Electronic properties include multipole moments (dipole, quadrupole, octupole, hexadecapole), electrostatic potentials at nuclei (alchemical potential), Mulliken charges, and molecular wavefunctions. VQM24 represents a highly accurate and unbiased dataset of molecules, ideal for testing and training transferable, scalable, and generative ML models of real quantum systems.
△ Less
Submitted 13 May, 2024; v1 submitted 9 May, 2024;
originally announced May 2024.
-
Codexity: Secure AI-assisted Code Generation
Authors:
Sung Yong Kim,
Zhiyu Fan,
Yannic Noller,
Abhik Roychoudhury
Abstract:
Despite the impressive performance of Large Language Models (LLMs) in software development activities, recent studies show the concern of introducing vulnerabilities into software codebase by AI programming assistants (e.g., Copilot, CodeWhisperer). In this work, we present Codexity, a security-focused code generation framework integrated with five LLMs. Codexity leverages the feedback of static a…
▽ More
Despite the impressive performance of Large Language Models (LLMs) in software development activities, recent studies show the concern of introducing vulnerabilities into software codebase by AI programming assistants (e.g., Copilot, CodeWhisperer). In this work, we present Codexity, a security-focused code generation framework integrated with five LLMs. Codexity leverages the feedback of static analysis tools such as Infer and CppCheck to mitigate security vulnerabilities in LLM-generated programs. Our evaluation in a real-world benchmark with 751 automatically generated vulnerable subjects demonstrates Codexity can prevent 60% of the vulnerabilities being exposed to the software developer.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
"I'm Not Sure, But...": Examining the Impact of Large Language Models' Uncertainty Expression on User Reliance and Trust
Authors:
Sunnie S. Y. Kim,
Q. Vera Liao,
Mihaela Vorvoreanu,
Stephanie Ballard,
Jennifer Wortman Vaughan
Abstract:
Widely deployed large language models (LLMs) can produce convincing yet incorrect outputs, potentially misleading users who may rely on them as if they were correct. To reduce such overreliance, there have been calls for LLMs to communicate their uncertainty to end users. However, there has been little empirical work examining how users perceive and act upon LLMs' expressions of uncertainty. We ex…
▽ More
Widely deployed large language models (LLMs) can produce convincing yet incorrect outputs, potentially misleading users who may rely on them as if they were correct. To reduce such overreliance, there have been calls for LLMs to communicate their uncertainty to end users. However, there has been little empirical work examining how users perceive and act upon LLMs' expressions of uncertainty. We explore this question through a large-scale, pre-registered, human-subject experiment (N=404) in which participants answer medical questions with or without access to responses from a fictional LLM-infused search engine. Using both behavioral and self-reported measures, we examine how different natural language expressions of uncertainty impact participants' reliance, trust, and overall task performance. We find that first-person expressions (e.g., "I'm not sure, but...") decrease participants' confidence in the system and tendency to agree with the system's answers, while increasing participants' accuracy. An exploratory analysis suggests that this increase can be attributed to reduced (but not fully eliminated) overreliance on incorrect answers. While we observe similar effects for uncertainty expressed from a general perspective (e.g., "It's not clear, but..."), these effects are weaker and not statistically significant. Our findings suggest that using natural language expressions of uncertainty may be an effective approach for reducing overreliance on LLMs, but that the precise language used matters. This highlights the importance of user testing before deploying LLMs at scale.
△ Less
Submitted 15 May, 2024; v1 submitted 1 May, 2024;
originally announced May 2024.
-
Allowing humans to interactively guide machines where to look does not always improve human-AI team's classification accuracy
Authors:
Giang Nguyen,
Mohammad Reza Taesiri,
Sunnie S. Y. Kim,
Anh Nguyen
Abstract:
Via thousands of papers in Explainable AI (XAI), attention maps \cite{vaswani2017attention} and feature importance maps \cite{bansal2020sam} have been established as a common means for finding how important each input feature is to an AI's decisions. It is an interesting, unexplored question whether allowing users to edit the feature importance at test time would improve a human-AI team's accuracy…
▽ More
Via thousands of papers in Explainable AI (XAI), attention maps \cite{vaswani2017attention} and feature importance maps \cite{bansal2020sam} have been established as a common means for finding how important each input feature is to an AI's decisions. It is an interesting, unexplored question whether allowing users to edit the feature importance at test time would improve a human-AI team's accuracy on downstream tasks. In this paper, we address this question by leveraging CHM-Corr, a state-of-the-art, ante-hoc explainable classifier \cite{taesiri2022visual} that first predicts patch-wise correspondences between the input and training-set images, and then bases on them to make classification decisions. We build CHM-Corr++, an interactive interface for CHM-Corr, enabling users to edit the feature importance map provided by CHM-Corr and observe updated model decisions. Via CHM-Corr++, users can gain insights into if, when, and how the model changes its outputs, improving their understanding beyond static explanations. However, our study with 18 expert users who performed 1,400 decisions finds no statistical significance that our interactive approach improves user accuracy on CUB-200 bird image classification over static explanations. This challenges the hypothesis that interactivity can boost human-AI team accuracy and raises needs for future research. We open-source CHM-Corr++, an interactive tool for editing image classifier attention (see an interactive demo here: http://137.184.82.109:7080/). We release code and data on github: https://github.com/anguyen8/chm-corr-interactive.
△ Less
Submitted 20 April, 2024; v1 submitted 8 April, 2024;
originally announced April 2024.
-
IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation
Authors:
Yizhi Song,
Zhifei Zhang,
Zhe Lin,
Scott Cohen,
Brian Price,
Jianming Zhang,
Soo Ye Kim,
He Zhang,
Wei Xiong,
Daniel Aliaga
Abstract:
Generative object compositing emerges as a promising new avenue for compositional image editing. However, the requirement of object identity preservation poses a significant challenge, limiting practical usage of most existing methods. In response, this paper introduces IMPRINT, a novel diffusion-based generative model trained with a two-stage learning framework that decouples learning of identity…
▽ More
Generative object compositing emerges as a promising new avenue for compositional image editing. However, the requirement of object identity preservation poses a significant challenge, limiting practical usage of most existing methods. In response, this paper introduces IMPRINT, a novel diffusion-based generative model trained with a two-stage learning framework that decouples learning of identity preservation from that of compositing. The first stage is targeted for context-agnostic, identity-preserving pretraining of the object encoder, enabling the encoder to learn an embedding that is both view-invariant and conducive to enhanced detail preservation. The subsequent stage leverages this representation to learn seamless harmonization of the object composited to the background. In addition, IMPRINT incorporates a shape-guidance mechanism offering user-directed control over the compositing process. Extensive experiments demonstrate that IMPRINT significantly outperforms existing methods and various baselines on identity preservation and composition quality.
△ Less
Submitted 15 March, 2024;
originally announced March 2024.
-
Semi-Supervised Graph Representation Learning with Human-centric Explanation for Predicting Fatty Liver Disease
Authors:
So Yeon Kim,
Sehee Wang,
Eun Kyung Choe
Abstract:
Addressing the challenge of limited labeled data in clinical settings, particularly in the prediction of fatty liver disease, this study explores the potential of graph representation learning within a semi-supervised learning framework. Leveraging graph neural networks (GNNs), our approach constructs a subject similarity graph to identify risk patterns from health checkup data. The effectiveness…
▽ More
Addressing the challenge of limited labeled data in clinical settings, particularly in the prediction of fatty liver disease, this study explores the potential of graph representation learning within a semi-supervised learning framework. Leveraging graph neural networks (GNNs), our approach constructs a subject similarity graph to identify risk patterns from health checkup data. The effectiveness of various GNN approaches in this context is demonstrated, even with minimal labeled samples. Central to our methodology is the inclusion of human-centric explanations through explainable GNNs, providing personalized feature importance scores for enhanced interpretability and clinical relevance, thereby underscoring the potential of our approach in advancing healthcare practices with a keen focus on graph representation learning and human-centric explanation.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
Making a prototype of Seoul historical sites chatbot using Langchain
Authors:
Jae Young Suh,
Minsoo Kwak,
Soo Yong Kim,
Hyoungseo Cho
Abstract:
In this paper, we are going to share a draft of the development of a conversational agent created to disseminate information about historical sites located in the Seoul. The primary objective of the agent is to increase awareness among visitors who are not familiar with Seoul, about the presence and precise locations of valuable cultural heritage sites. It aims to promote a basic understanding of…
▽ More
In this paper, we are going to share a draft of the development of a conversational agent created to disseminate information about historical sites located in the Seoul. The primary objective of the agent is to increase awareness among visitors who are not familiar with Seoul, about the presence and precise locations of valuable cultural heritage sites. It aims to promote a basic understanding of Korea's rich and diverse cultural history. The agent is thoughtfully designed for accessibility in English and utilizes data generously provided by the Seoul Metropolitan Government. Despite the limited data volume, it consistently delivers reliable and accurate responses, seamlessly aligning with the available information. We have meticulously detailed the methodologies employed in creating this agent and provided a comprehensive overview of its underlying structure within the paper. Additionally, we delve into potential improvements to enhance this initial version of the system, with a primary emphasis on expanding the available data through our prompting. In conclusion, we provide an in-depth discussion of our expectations regarding the future impact of this agent in promoting and facilitating the sharing of historical sites.
△ Less
Submitted 10 February, 2024;
originally announced February 2024.
-
Descanning: From Scanned to the Original Images with a Color Correction Diffusion Model
Authors:
Junghun Cha,
Ali Haider,
Seoyun Yang,
Hoeyeong Jin,
Subin Yang,
A. F. M. Shahab Uddin,
Jaehyoung Kim,
Soo Ye Kim,
Sung-Ho Bae
Abstract:
A significant volume of analog information, i.e., documents and images, have been digitized in the form of scanned copies for storing, sharing, and/or analyzing in the digital world. However, the quality of such contents is severely degraded by various distortions caused by printing, storing, and scanning processes in the physical world. Although restoring high-quality content from scanned copies…
▽ More
A significant volume of analog information, i.e., documents and images, have been digitized in the form of scanned copies for storing, sharing, and/or analyzing in the digital world. However, the quality of such contents is severely degraded by various distortions caused by printing, storing, and scanning processes in the physical world. Although restoring high-quality content from scanned copies has become an indispensable task for many products, it has not been systematically explored, and to the best of our knowledge, no public datasets are available. In this paper, we define this problem as Descanning and introduce a new high-quality and large-scale dataset named DESCAN-18K. It contains 18K pairs of original and scanned images collected in the wild containing multiple complex degradations. In order to eliminate such complex degradations, we propose a new image restoration model called DescanDiffusion consisting of a color encoder that corrects the global color degradation and a conditional denoising diffusion probabilistic model (DDPM) that removes local degradations. To further improve the generalization ability of DescanDiffusion, we also design a synthetic data generation scheme by reproducing prominent degradations in scanned images. We demonstrate that our DescanDiffusion outperforms other baselines including commercial restoration products, objectively and subjectively, via comprehensive experiments and analyses.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
WiCV@CVPR2023: The Eleventh Women In Computer Vision Workshop at the Annual CVPR Conference
Authors:
Doris Antensteiner,
Marah Halawa,
Asra Aslam,
Ivaxi Sheth,
Sachini Herath,
Ziqi Huang,
Sunnie S. Y. Kim,
Aparna Akula,
Xin Wang
Abstract:
In this paper, we present the details of Women in Computer Vision Workshop - WiCV 2023, organized alongside the hybrid CVPR 2023 in Vancouver, Canada. WiCV aims to amplify the voices of underrepresented women in the computer vision community, fostering increased visibility in both academia and industry. We believe that such events play a vital role in addressing gender imbalances within the field.…
▽ More
In this paper, we present the details of Women in Computer Vision Workshop - WiCV 2023, organized alongside the hybrid CVPR 2023 in Vancouver, Canada. WiCV aims to amplify the voices of underrepresented women in the computer vision community, fostering increased visibility in both academia and industry. We believe that such events play a vital role in addressing gender imbalances within the field. The annual WiCV@CVPR workshop offers a) opportunity for collaboration between researchers from minority groups, b) mentorship for female junior researchers, c) financial support to presenters to alleviate finanacial burdens and d) a diverse array of role models who can inspire younger researchers at the outset of their careers. In this paper, we present a comprehensive report on the workshop program, historical trends from the past WiCV@CVPR events, and a summary of statistics related to presenters, attendees, and sponsorship for the WiCV 2023 workshop.
△ Less
Submitted 22 September, 2023;
originally announced September 2023.
-
EDGE -- Dark matter or astrophysics? Breaking dark matter heating degeneracies with HI rotation in faint dwarf galaxies
Authors:
Martin P. Rey,
Matthew D. A. Orkney,
Justin I. Read,
Payel Das,
Oscar Agertz,
Andrew Pontzen,
Anastasia A. Ponomareva,
Stacy Y. Kim,
William McClymont
Abstract:
Low-mass dwarf galaxies are expected to reside within dark matter haloes that have a pristine, `cuspy' density profile within their stellar half-light radii. This is because they form too few stars to significantly drive dark matter heating through supernova-driven outflows. Here, we study such simulated faint systems ($10^4 \leq M_{\star} \leq 2\times 10^6 \, M_\mathrm{\odot}$) drawn from high-re…
▽ More
Low-mass dwarf galaxies are expected to reside within dark matter haloes that have a pristine, `cuspy' density profile within their stellar half-light radii. This is because they form too few stars to significantly drive dark matter heating through supernova-driven outflows. Here, we study such simulated faint systems ($10^4 \leq M_{\star} \leq 2\times 10^6 \, M_\mathrm{\odot}$) drawn from high-resolution (3 pc) cosmological simulations from the `Engineering Dwarf Galaxies at the Edge of galaxy formation' (EDGE) project. We confirm that these objects have steep and rising inner dark matter density profiles at $z=0$, little affected by galaxy formation effects. But five dwarf galaxies from the suite also showcase a detectable HI reservoir ($M_{\mathrm{HI}}\approx 10^{5}-10^{6} \, M_\mathrm{\odot}$), analogous to the observed population of faint, HI-bearing dwarf galaxies. These reservoirs exhibit episodes of ordered rotation, opening windows for rotation curve analysis. Within actively star-forming dwarfs, stellar feedback easily disrupts the tenuous HI discs ($v_φ \approx 10\, \mathrm{km} \, \mathrm{s}^{-1}$), making rotation short-lived ($\ll 150 \, \mathrm{Myr}$) and more challenging to interpret for dark matter inferences. In contrast, we highlight a long-lived ($\geq 500 \, \mathrm{Myr}$) and easy-to-interpret HI rotation curve extending to $\approx 2\, r_{1/2, \text{3D}}$ in a quiescent dwarf, that has not formed new stars since $z=4$. This stable gas disc is supported by an oblate dark matter halo shape that drives high-angular momentum gas flows. Our results strongly motivate further searches for HI in rotation curves in the observed population of HI-bearing low-mass dwarfs, that provide a key regime to disentangle the respective roles of dark matter microphysics and galaxy formation effects in driving dark matter heating.
△ Less
Submitted 16 March, 2024; v1 submitted 31 August, 2023;
originally announced September 2023.
-
EDGE: The direct link between mass growth history and the extended stellar haloes of the faintest dwarf galaxies
Authors:
Alex Goater,
Justin I. Read,
Noelia E. D. Noël,
Matthew D. A. Orkney,
Stacy Y. Kim,
Martin P. Rey,
Eric P. Andersson,
Oscar Agertz,
Andrew Pontzen,
Roberta Vieliute,
Dhairya Kataria,
Kiah Jeneway
Abstract:
Ultra-faint dwarf galaxies (UFDs) are commonly found in close proximity to the Milky Way and other massive spiral galaxies. As such, their projected stellar ellipticity and extended light distributions are often thought to owe to tidal forces. In this paper, we study the projected stellar ellipticities and faint stellar outskirts of tidally isolated ultra-faints drawn from the 'Engineering Dwarfs…
▽ More
Ultra-faint dwarf galaxies (UFDs) are commonly found in close proximity to the Milky Way and other massive spiral galaxies. As such, their projected stellar ellipticity and extended light distributions are often thought to owe to tidal forces. In this paper, we study the projected stellar ellipticities and faint stellar outskirts of tidally isolated ultra-faints drawn from the 'Engineering Dwarfs at Galaxy Formation's Edge' (EDGE) cosmological simulation suite. Despite their tidal isolation, our simulated dwarfs exhibit a wide range of projected ellipticities ($0.03 < \varepsilon < 0.85$), with many possessing anisotropic extended stellar haloes that mimic tidal tails, but owe instead to late-time accretion of lower mass companions. Furthermore, we find a strong causal relationship between ellipticity and formation time of an UFD, which is robust to a wide variation in the feedback model. We show that the distribution of projected ellipticities in our suite of simulated EDGE dwarfs matches well with that of 21 Local Group dwarf galaxies. Given the ellipticity in EDGE arises from an ex-situ accretion origin, the agreement in shape indicates the ellipticities of some observed dwarfs may also originate from a similar non-tidal scenario. The orbital parameters of these observed dwarfs further support that they are not currently tidally disrupting. If the baryonic content in these galaxies is still tidally intact, then the same may be true for their dark matter content, making these galaxies in our Local Group pristine laboratories for testing dark matter and galaxy formation models.
△ Less
Submitted 11 July, 2023;
originally announced July 2023.
-
Milky Way satellite velocities reveal the Dark Matter power spectrum at small scales
Authors:
Ivan Esteban,
Annika H. G. Peter,
Stacy Y. Kim
Abstract:
Dark Matter (DM) properties at small scales remain uncertain. Recent theoretical and observational advances have provided the tools to narrow them down. Here, we show for the first time that the correlation between internal velocities and sizes of dwarf galaxies is a sharp probe of small-scale DM properties. We study modified DM power spectra, motivated by DM production during inflation. Using sem…
▽ More
Dark Matter (DM) properties at small scales remain uncertain. Recent theoretical and observational advances have provided the tools to narrow them down. Here, we show for the first time that the correlation between internal velocities and sizes of dwarf galaxies is a sharp probe of small-scale DM properties. We study modified DM power spectra, motivated by DM production during inflation. Using semi-analytic models and scaling relations, we show that such models can change the kinematics and structure of dwarf galaxies without strongly affecting their total abundance. We analyze data from Milky Way classical satellite galaxies and those discovered with the Sloan Digital Sky Survey (SDSS), finding that the DM power spectrum at comoving scales ${4\, \mathrm{Mpc}^{-1} < k < 37\,\mathrm{Mpc}^{-1}}$ cannot deviate by more than a factor of 2 from scale invariance. Our results are robust against baryonic uncertainties such as the stellar mass-halo mass relation, halo occupation fraction, and subhalo tidal disruption; allowing us to independently constrain them. This work thus opens a window to probe both dwarf galaxy formation models and small-scale DM properties.
△ Less
Submitted 7 June, 2023;
originally announced June 2023.
-
Humans, AI, and Context: Understanding End-Users' Trust in a Real-World Computer Vision Application
Authors:
Sunnie S. Y. Kim,
Elizabeth Anne Watkins,
Olga Russakovsky,
Ruth Fong,
Andrés Monroy-Hernández
Abstract:
Trust is an important factor in people's interactions with AI systems. However, there is a lack of empirical studies examining how real end-users trust or distrust the AI system they interact with. Most research investigates one aspect of trust in lab settings with hypothetical end-users. In this paper, we provide a holistic and nuanced understanding of trust in AI through a qualitative case study…
▽ More
Trust is an important factor in people's interactions with AI systems. However, there is a lack of empirical studies examining how real end-users trust or distrust the AI system they interact with. Most research investigates one aspect of trust in lab settings with hypothetical end-users. In this paper, we provide a holistic and nuanced understanding of trust in AI through a qualitative case study of a real-world computer vision application. We report findings from interviews with 20 end-users of a popular, AI-based bird identification app where we inquired about their trust in the app from many angles. We find participants perceived the app as trustworthy and trusted it, but selectively accepted app outputs after engaging in verification behaviors, and decided against app adoption in certain high-stakes scenarios. We also find domain knowledge and context are important factors for trust-related assessment and decision-making. We discuss the implications of our findings and provide recommendations for future research on trust in AI.
△ Less
Submitted 15 May, 2023;
originally announced May 2023.
-
Modernizing Old Photos Using Multiple References via Photorealistic Style Transfer
Authors:
Agus Gunawan,
Soo Ye Kim,
Hyeonjun Sim,
Jae-Ho Lee,
Munchurl Kim
Abstract:
This paper firstly presents old photo modernization using multiple references by performing stylization and enhancement in a unified manner. In order to modernize old photos, we propose a novel multi-reference-based old photo modernization (MROPM) framework consisting of a network MROPM-Net and a novel synthetic data generation scheme. MROPM-Net stylizes old photos using multiple references via ph…
▽ More
This paper firstly presents old photo modernization using multiple references by performing stylization and enhancement in a unified manner. In order to modernize old photos, we propose a novel multi-reference-based old photo modernization (MROPM) framework consisting of a network MROPM-Net and a novel synthetic data generation scheme. MROPM-Net stylizes old photos using multiple references via photorealistic style transfer (PST) and further enhances the results to produce modern-looking images. Meanwhile, the synthetic data generation scheme trains the network to effectively utilize multiple references to perform modernization. To evaluate the performance, we propose a new old photos benchmark dataset (CHD) consisting of diverse natural indoor and outdoor scenes. Extensive experiments show that the proposed method outperforms other baselines in performing modernization on real old photos, even though no old photos were used during training. Moreover, our method can appropriately select styles from multiple references for each semantic region in the old photo to further improve the modernization performance.
△ Less
Submitted 10 April, 2023;
originally announced April 2023.
-
UFO: A unified method for controlling Understandability and Faithfulness Objectives in concept-based explanations for CNNs
Authors:
Vikram V. Ramaswamy,
Sunnie S. Y. Kim,
Ruth Fong,
Olga Russakovsky
Abstract:
Concept-based explanations for convolutional neural networks (CNNs) aim to explain model behavior and outputs using a pre-defined set of semantic concepts (e.g., the model recognizes scene class ``bedroom'' based on the presence of concepts ``bed'' and ``pillow''). However, they often do not faithfully (i.e., accurately) characterize the model's behavior and can be too complex for people to unders…
▽ More
Concept-based explanations for convolutional neural networks (CNNs) aim to explain model behavior and outputs using a pre-defined set of semantic concepts (e.g., the model recognizes scene class ``bedroom'' based on the presence of concepts ``bed'' and ``pillow''). However, they often do not faithfully (i.e., accurately) characterize the model's behavior and can be too complex for people to understand. Further, little is known about how faithful and understandable different explanation methods are, and how to control these two properties. In this work, we propose UFO, a unified method for controlling Understandability and Faithfulness Objectives in concept-based explanations. UFO formalizes understandability and faithfulness as mathematical objectives and unifies most existing concept-based explanations methods for CNNs. Using UFO, we systematically investigate how explanations change as we turn the knobs of faithfulness and understandability. Our experiments demonstrate a faithfulness-vs-understandability tradeoff: increasing understandability reduces faithfulness. We also provide insights into the ``disagreement problem'' in explainable machine learning, by analyzing when and how concept-based explanations disagree with each other.
△ Less
Submitted 27 March, 2023;
originally announced March 2023.
-
Björling problem for zero mean curvature surfaces in the three-dimensional light cone
Authors:
Joseph Cho,
So Young Kim,
Dami Lee,
Wonjoo Lee,
Seong-Deog Yang
Abstract:
We solve the Björling problem for zero mean curvature surfaces in the three-dimensional light cone. As an application, we construct and classify all rotational zero mean curvature surfaces.
We solve the Björling problem for zero mean curvature surfaces in the three-dimensional light cone. As an application, we construct and classify all rotational zero mean curvature surfaces.
△ Less
Submitted 8 March, 2023;
originally announced March 2023.
-
The PAndAS View of the Andromeda Satellite System. IV Global properties
Authors:
Amandine Doliva-Dolinsky,
Nicolas F. Martin,
Zhen Yuan,
Alessandro Savino,
Daniel R. Weisz,
Annette M. N. Ferguson,
Rodrigo A. Ibata,
Stacy Y. Kim,
Geraint F. Lewis,
Alan W. McConnachie,
Guillaume F. Thomas
Abstract:
We build a statistical framework to infer the global properties of the satellite system of the Andromeda galaxy (M31) from the properties of individual dwarf galaxies located in the Pan-Andromeda Archaelogical Survey (PAndAS) and the previously determined completeness of the survey. Using forward modeling, we infer the slope of the luminosity function of the satellite system, the slope of its spat…
▽ More
We build a statistical framework to infer the global properties of the satellite system of the Andromeda galaxy (M31) from the properties of individual dwarf galaxies located in the Pan-Andromeda Archaelogical Survey (PAndAS) and the previously determined completeness of the survey. Using forward modeling, we infer the slope of the luminosity function of the satellite system, the slope of its spatial density distribution, and the size-luminosity relation followed by the dwarf galaxies. We find that the slope of the luminosity function is $β=-1.5\pm0.1$. Combined with the spatial density profile, it implies that, when accounting for survey incompleteness, M31 hosts $92_{-26}^{+19}$ dwarf galaxies with $M_\textrm{V}<-5.5$ and a sky-projected distance from M31 between 30 and 300kpc. We conclude that many faint or distant dwarf galaxies remain to be discovered around Andromeda, especially outside the PAndAS footprint. Finally, we use our model to test if the higher number of satellites situated in the hemisphere facing the Milky Way could be explained simply by the detection limits of dwarf galaxy searches. We rule this out at $>99.9\%$ confidence and conclude that this anisotropy is an intrinsic feature of the M31 satellite system. The statistical framework we present here is a powerful tool to robustly constrain the properties of a satellite system and compare those across hosts, especially considering the upcoming start of the Euclid or Rubin large photometric surveys that are expected to uncover a large number of dwarf galaxies in the Local Volume.
△ Less
Submitted 2 March, 2023;
originally announced March 2023.
-
Lifetime-configurable soft robots via photodegradable silicone elastomer composites
Authors:
Min-Ha Oh,
Young-Hwan Kim,
Seung-Min Lee,
Gyeong-Seok Hwang,
Kyung-Sub Kim,
Jae-Young Bae,
Ju-Young Kim,
Ju-Yong Lee,
Yu-Chan Kim,
Sang Yup Kim,
Seung-Kyun Kang
Abstract:
Developing soft robots that can control their own life-cycle and degrade on-demand while maintaining hyper-elasticity is a significant research challenge. On-demand degradable soft robots, which conserve their original functionality during operation and rapidly degrade under specific external stimulation, present the opportunity to self-direct the disappearance of temporary robots. This study prop…
▽ More
Developing soft robots that can control their own life-cycle and degrade on-demand while maintaining hyper-elasticity is a significant research challenge. On-demand degradable soft robots, which conserve their original functionality during operation and rapidly degrade under specific external stimulation, present the opportunity to self-direct the disappearance of temporary robots. This study proposes soft robots and materials that exhibit excellent mechanical stretchability and can degrade under ultraviolet (UV) light by mixing a fluoride-generating diphenyliodonium hexafluorophosphate (DPI-HFP) with a silicone resin. Spectroscopic analysis revealed the mechanism of Si-O-Si backbone cleavage using fluoride ion (F-), which was generated from UV exposed DPI-HFP. Furthermore, photo-differential scanning calorimetry (DSC) based thermal analysis indicated increased decomposition kinetics at increased temperatures. Additionally, we demonstrated a robotics application of this composite by fabricating a gaiting robot. The integration of soft electronics, including strain sensors, temperature sensors, and photodetectors, expanded the robotic functionalities. This study provides a simple yet novel strategy for designing lifecycle mimicking soft robotics that can be applied to reduce soft robotics waste, explore hazardous areas where retrieval of robots is impossible, and ensure hardware security with on-demand destructive material platforms.
△ Less
Submitted 28 February, 2023;
originally announced February 2023.
-
EDGE: The shape of dark matter haloes in the faintest galaxies
Authors:
Matthew D. A. Orkney,
Ethan Taylor,
Justin I. Read,
Martin P. Rey,
Andrew Pontzen,
Oscar Agertz,
Stacy Y. Kim,
Maxime Delorme
Abstract:
Collisionless Dark Matter Only (DMO) structure formation simulations predict that Dark Matter (DM) haloes are prolate in their centres and triaxial towards their outskirts. The addition of gas condensation transforms the central DM shape to be rounder and more oblate. It is not clear, however, whether such shape transformations occur in `ultra-faint' dwarfs, which have extremely low baryon fractio…
▽ More
Collisionless Dark Matter Only (DMO) structure formation simulations predict that Dark Matter (DM) haloes are prolate in their centres and triaxial towards their outskirts. The addition of gas condensation transforms the central DM shape to be rounder and more oblate. It is not clear, however, whether such shape transformations occur in `ultra-faint' dwarfs, which have extremely low baryon fractions. We present the first study of the shape and velocity anisotropy of ultra-faint dwarf galaxies that have gas mass fractions of $f_{\rm gas}(r<R_{\rm half}) < 0.06$. These dwarfs are drawn from the Engineering Dwarfs at Galaxy formation's Edge (EDGE) project, using high resolution simulations that allow us to resolve DM halo shapes within the half light radius ($\sim 100\,$pc). We show that gas-poor ultra-faints ($M_{\rm 200c} \leqslant 1.5\times10^9\,$M$_\odot$; $f_{\rm gas} < 10^{-5}$) retain their pristine prolate DM halo shape even when gas, star formation and feedback are included. This could provide a new and robust test of DM models. By contrast, gas-rich ultra-faints ($M_{\rm 200c} > 3\times10^9\,$M$_\odot$; $f_{\rm gas} > 10^{-4}$) become rounder and more oblate within $\sim 10$ half light radii. Finally, we find that most of our simulated dwarfs have significant radial velocity anisotropy that rises to $\tildeβ > 0.5$ at $R \gtrsim 3 R_{\rm half}$. The one exception is a dwarf that forms a rotating gas/stellar disc because of a planar, major merger. Such strong anisotropy should be taken into account when building mass models of gas-poor ultra-faints.
△ Less
Submitted 5 September, 2023; v1 submitted 24 February, 2023;
originally announced February 2023.
-
Controllable Mechanical-domain Energy Accumulators
Authors:
Sung Y. Kim,
David J. Braun
Abstract:
Springs are efficient in storing and returning elastic potential energy but are unable to hold the energy they store in the absence of an external load. Lockable springs use clutches to hold elastic potential energy in the absence of an external load, but have not yet been widely adopted in applications, partly because clutches introduce design complexity, reduce energy efficiency, and typically d…
▽ More
Springs are efficient in storing and returning elastic potential energy but are unable to hold the energy they store in the absence of an external load. Lockable springs use clutches to hold elastic potential energy in the absence of an external load, but have not yet been widely adopted in applications, partly because clutches introduce design complexity, reduce energy efficiency, and typically do not afford high fidelity control over the energy stored by the spring. Here, we present the design of a novel lockable compression spring that uses a small capstan clutch to passively lock a mechanical spring. The capstan clutch can lock over 1000 N force at any arbitrary deflection, unlock the spring in less than 10 ms with a control force less than 1 % of the maximal spring force, and provide an 80 % energy storage and return efficiency (comparable to a highly efficient electric motor operated at constant nominal speed). By retaining the form factor of a regular spring while providing high-fidelity locking capability even under large spring forces, the proposed design could facilitate the development of energy-efficient spring-based actuators and robots.
△ Less
Submitted 21 February, 2023; v1 submitted 29 December, 2022;
originally announced December 2022.
-
Dimensional crossover of charge order in IrTe$_2$ with strong interlayer coupling
Authors:
Hyoung Kug Kim,
So Young Kim,
C. J. Won,
Sang-Wook Cheong,
Jonghwan Kim,
Jun Sung Kim,
Tae-Hwan Kim
Abstract:
Tuning dimensionality in van der Waals materials with finite interlayer coupling has introduced various electronic phase transitions by conventional mechanical exfoliation. Particularly when the electronic order is tied to the modulation of the interlayer coupling, such dimensional tunability has a strong impact on its stability and properties, which has rarely been investigated experimentally. He…
▽ More
Tuning dimensionality in van der Waals materials with finite interlayer coupling has introduced various electronic phase transitions by conventional mechanical exfoliation. Particularly when the electronic order is tied to the modulation of the interlayer coupling, such dimensional tunability has a strong impact on its stability and properties, which has rarely been investigated experimentally. Here, we demonstrate a dimensional crossover of charge order in IrTe$_2$ from genuine two- to quasi-three-dimension using low-temperature scanning tunneling microscopy and spectroscopy. Employing atomically thin IrTe$_2$ flakes ranging from monolayer to multilayer, we observe a gradual phase transition of charge order and exponential decay of Coulomb gap with increasing thickness. Moreover, we find a suppression of the density of states emerging at an abrupt lateral interface between two- and three-dimension. These findings are attributed to the interplay between the strongly coupled layers and substrate-driven perturbation, which can provide a new insight into the dimensional crossover of strongly coupled layered materials with hidden electronic phases.
△ Less
Submitted 22 December, 2022;
originally announced December 2022.
-
ObjectStitch: Generative Object Compositing
Authors:
Yizhi Song,
Zhifei Zhang,
Zhe Lin,
Scott Cohen,
Brian Price,
Jianming Zhang,
Soo Ye Kim,
Daniel Aliaga
Abstract:
Object compositing based on 2D images is a challenging problem since it typically involves multiple processing stages such as color harmonization, geometry correction and shadow generation to generate realistic results. Furthermore, annotating training data pairs for compositing requires substantial manual effort from professionals, and is hardly scalable. Thus, with the recent advances in generat…
▽ More
Object compositing based on 2D images is a challenging problem since it typically involves multiple processing stages such as color harmonization, geometry correction and shadow generation to generate realistic results. Furthermore, annotating training data pairs for compositing requires substantial manual effort from professionals, and is hardly scalable. Thus, with the recent advances in generative models, in this work, we propose a self-supervised framework for object compositing by leveraging the power of conditional diffusion models. Our framework can hollistically address the object compositing task in a unified model, transforming the viewpoint, geometry, color and shadow of the generated object while requiring no manual labeling. To preserve the input object's characteristics, we introduce a content adaptor that helps to maintain categorical semantics and object appearance. A data augmentation method is further adopted to improve the fidelity of the generator. Our method outperforms relevant baselines in both realism and faithfulness of the synthesized result images in a user study on various real-world images.
△ Less
Submitted 5 December, 2022; v1 submitted 1 December, 2022;
originally announced December 2022.
-
"Help Me Help the AI": Understanding How Explainability Can Support Human-AI Interaction
Authors:
Sunnie S. Y. Kim,
Elizabeth Anne Watkins,
Olga Russakovsky,
Ruth Fong,
Andrés Monroy-Hernández
Abstract:
Despite the proliferation of explainable AI (XAI) methods, little is understood about end-users' explainability needs and behaviors around XAI explanations. To address this gap and contribute to understanding how explainability can support human-AI interaction, we conducted a mixed-methods study with 20 end-users of a real-world AI application, the Merlin bird identification app, and inquired abou…
▽ More
Despite the proliferation of explainable AI (XAI) methods, little is understood about end-users' explainability needs and behaviors around XAI explanations. To address this gap and contribute to understanding how explainability can support human-AI interaction, we conducted a mixed-methods study with 20 end-users of a real-world AI application, the Merlin bird identification app, and inquired about their XAI needs, uses, and perceptions. We found that participants desire practically useful information that can improve their collaboration with the AI, more so than technical system details. Relatedly, participants intended to use XAI explanations for various purposes beyond understanding the AI's outputs: calibrating trust, improving their task skills, changing their behavior to supply better inputs to the AI, and giving constructive feedback to developers. Finally, among existing XAI approaches, participants preferred part-based explanations that resemble human reasoning and explanations. We discuss the implications of our findings and provide recommendations for future XAI design.
△ Less
Submitted 16 February, 2023; v1 submitted 2 October, 2022;
originally announced October 2022.
-
Negative refraction in hyperbolic hetero-bicrystals
Authors:
A. J. Sternbach,
S. L. Moore,
A. Rikhter,
S. Zhang,
R. Jing,
Y. Shao,
B. S. Y. Kim,
S. Xu,
S. Liu,
J. H. Edgar,
A. Rubio,
C. Dean,
J. Hone,
M. M. Fogler,
D. N. Basov
Abstract:
We visualized negative refraction of phonon polaritons, which occurs at the interface between two natural crystals. The polaritons - hybrids of infrared photons and lattice vibrations - form collimated rays that display negative refraction when passing through a planar interface between the two hyperbolic van der Waals materials: molybdenum oxide ($MoO_3$) and isotopically pure hexagonal boron nit…
▽ More
We visualized negative refraction of phonon polaritons, which occurs at the interface between two natural crystals. The polaritons - hybrids of infrared photons and lattice vibrations - form collimated rays that display negative refraction when passing through a planar interface between the two hyperbolic van der Waals materials: molybdenum oxide ($MoO_3$) and isotopically pure hexagonal boron nitride ($h^{11}BN$). At a special frequency $ω_0$, these rays can circulate along closed diamond-shaped trajectories. We have shown that polariton eigenmodes display regions of both positive and negative dispersion interrupted by multiple gaps that result from polaritonic level repulsion and strong coupling.
△ Less
Submitted 7 July, 2023; v1 submitted 29 September, 2022;
originally announced September 2022.
-
Andromeda XXV -- a dwarf galaxy with a low central dark matter density
Authors:
Emily J. E. Charles,
Michelle L. M. Collins,
R. Michael Rich,
Justin I. Read,
Stacy Y. Kim,
Rodrigo A. Ibata,
Nicolas F. Martin,
Scott C. Chapman,
Eduardo Balbinot,
Daniel R. Weisz
Abstract:
Andromeda (And) XXV has previously been reported as a dwarf spheroidal galaxy (dSph) with little-to-no dark matter. However, the uncertainties on this result were significant. In this study, we double the number of member stars and re-derive the kinematics and mass of And XXV. We find that And XXV has a systemic velocity of $ν_\mathrm{r}=-107.7\pm1.0 \mathrm{~km s}^{-1}$ and a velocity dispersion…
▽ More
Andromeda (And) XXV has previously been reported as a dwarf spheroidal galaxy (dSph) with little-to-no dark matter. However, the uncertainties on this result were significant. In this study, we double the number of member stars and re-derive the kinematics and mass of And XXV. We find that And XXV has a systemic velocity of $ν_\mathrm{r}=-107.7\pm1.0 \mathrm{~km s}^{-1}$ and a velocity dispersion of $σ_ν=4.5\pm1.0\mathrm{~km s}^{-1}$. With this better constrained velocity dispersion, we derive a mass contained within the half-light radius of $M(r< r_\mathrm{h})=6.9^{+3.2}_{-2.8}\times10^6\mathrm{~M}_\odot$. This mass corresponds to a mass-to-light ratio of $\mathrm{[M/L]}_\mathrm{r_\mathrm{h}}=37^{+17}_{-15}\mathrm{~M}_\odot/\mathrm{L}_\odot$, demonstrating, for the first time, that And XXV has an unambiguous dark matter component. We also measure the metallicity of And XXV to be $\mathrm{[Fe/H]}=-1.9\pm0.1$$\mathrm{~}$dex, which is in agreement with previous results. Finally, we extend the analysis of And XXV to include mass modelling using GravSphere. We find that And XXV has a low central dark matter density, $ρ_\mathrm{DM}(150\mathrm{pc})= 2.7^{+1.8}_{-1.6}\times10^7\mathrm{~M}_\odot\mathrm{kpc}^{-3}$, making And XXV a clear outlier when compared to other Local Group (LG) dSphs of the similar stellar mass. In a companion paper, we will explore whether some combination of dark matter cusp-core transformations and/or tides can explain And XXV's low density.
△ Less
Submitted 29 September, 2022;
originally announced September 2022.
-
Streams on FIRE: Populations of Detectable Stellar Streams in the Milky Way and FIRE
Authors:
Nora Shipp,
Nondh Panithanpaisal,
Lina Necib,
Robyn Sanderson,
Denis Erkal,
Ting S. Li,
Isaiah B. Santistevan,
Andrew Wetzel,
Lara R. Cullinane,
Alexander P. Ji,
Sergey E. Koposov,
Kyler Kuehn,
Geraint F. Lewis,
Andrew B. Pace,
Daniel B. Zucker,
Joss Bland-Hawthorn,
Emily C. Cunningham,
Stacy Y. Kim,
Sophia Lilleengen,
Jorge Moreno,
Sanjib Sharma
Abstract:
We present the first detailed study comparing the populations of stellar streams in cosmological simulations to observed Milky Way dwarf galaxy streams. In particular, we compare streams identified around Milky Way analogs in the FIRE-2 simulations to stellar streams observed by the Southern Stellar Stream Spectroscopic Survey (S5). For an accurate comparison between the stream populations, we pro…
▽ More
We present the first detailed study comparing the populations of stellar streams in cosmological simulations to observed Milky Way dwarf galaxy streams. In particular, we compare streams identified around Milky Way analogs in the FIRE-2 simulations to stellar streams observed by the Southern Stellar Stream Spectroscopic Survey (S5). For an accurate comparison between the stream populations, we produce mock Dark Energy Survey (DES) observations of the FIRE streams and estimate the detectability of their tidal tails and progenitors. The number and stellar mass distributions of detectable stellar streams is consistent between observations and simulations. However, there are discrepancies in the distributions of pericenters and apocenters, with the detectable FIRE streams, on average, forming at larger pericenters (out to > 110 kpc) and surviving only at larger apocenters (> 40 kpc) than those observed in the Milky Way. We find that the population of high-stellar mass dwarf galaxy streams in the Milky Way is incomplete. Interestingly, a large fraction of the FIRE streams would only be detected as satellites in DES-like observations, since their tidal tails are too low-surface brightness to be detectable. We thus predict a population of yet-undetected tidal tails around Milky Way satellites, as well as a population of fully undetected low surface brightness stellar streams, and estimate their detectability with the Rubin Observatory. Finally, we discuss the causes and implications of the discrepancies between the stream populations in FIRE and the Milky Way, and explore future avenues for tests of satellite disruption in cosmological simulations.
△ Less
Submitted 3 August, 2022;
originally announced August 2022.
-
Overlooked factors in concept-based explanations: Dataset choice, concept learnability, and human capability
Authors:
Vikram V. Ramaswamy,
Sunnie S. Y. Kim,
Ruth Fong,
Olga Russakovsky
Abstract:
Concept-based interpretability methods aim to explain deep neural network model predictions using a predefined set of semantic concepts. These methods evaluate a trained model on a new, "probe" dataset and correlate model predictions with the visual concepts labeled in that dataset. Despite their popularity, they suffer from limitations that are not well-understood and articulated by the literatur…
▽ More
Concept-based interpretability methods aim to explain deep neural network model predictions using a predefined set of semantic concepts. These methods evaluate a trained model on a new, "probe" dataset and correlate model predictions with the visual concepts labeled in that dataset. Despite their popularity, they suffer from limitations that are not well-understood and articulated by the literature. In this work, we analyze three commonly overlooked factors in concept-based explanations. First, the choice of the probe dataset has a profound impact on the generated explanations. Our analysis reveals that different probe datasets may lead to very different explanations, and suggests that the explanations are not generalizable outside the probe dataset. Second, we find that concepts in the probe dataset are often less salient and harder to learn than the classes they claim to explain, calling into question the correctness of the explanations. We argue that only visually salient concepts should be used in concept-based explanations. Finally, while existing methods use hundreds or even thousands of concepts, our human studies reveal a much stricter upper bound of 32 concepts or less, beyond which the explanations are much less practically useful. We make suggestions for future development and analysis of concept-based interpretability methods. Code for our analysis and user interface can be found at \url{https://github.com/princetonvisualai/OverlookedFactors}
△ Less
Submitted 12 May, 2023; v1 submitted 19 July, 2022;
originally announced July 2022.
-
Spatial Distribution of Solar PV Deployment: An Application of the Region-Based Convolutional Neural Network
Authors:
Serena Y. Kim,
Koushik Ganesan,
Crystal Soderman,
Raven O'Rourke
Abstract:
This paper presents a comprehensive analysis of the social and environmental determinants of solar photovoltaic (PV) deployment rates in Colorado, USA. Using 652,795 satellite imagery and computer vision frameworks based on a convolutional neural network, we estimated the proportion of households with solar PV systems and the roof areas covered by solar panels. At the census block group level, 7%…
▽ More
This paper presents a comprehensive analysis of the social and environmental determinants of solar photovoltaic (PV) deployment rates in Colorado, USA. Using 652,795 satellite imagery and computer vision frameworks based on a convolutional neural network, we estimated the proportion of households with solar PV systems and the roof areas covered by solar panels. At the census block group level, 7% of Coloradan households have a rooftop PV system, and 2.5% of roof areas in Colorado are covered by solar panels as of 2021. Our machine learning models predict solar PV deployment based on 43 natural and social characteristics of neighborhoods. Using four algorithms (Random Forest, CATBoost, LightGBM, XGBoost), we find that the share of Democratic party votes, hail risks, strong wind risks, median home value, and solar PV permitting timelines are the most important predictors of solar PV count per household. In addition to the size of the houses, PV-to-roof area ratio is highly dependent on solar PV permitting timelines, proportion of renters and multifamily housing, and winter weather risks. We also find racial and ethnic disparities in rooftop solar deployment. The average marginal effects of median household income on solar deployment are lower in communities with a greater proportion of African American and Hispanic residents and are higher in communities with a greater proportion of White and Asian residents. In the ongoing energy transition, knowing the key predictors of solar deployment can better inform business and policy decision making for more efficient and equitable grid infrastructure investment and distributed energy resource management.
△ Less
Submitted 17 July, 2022;
originally announced July 2022.
-
Ask Me What You Need: Product Retrieval using Knowledge from GPT-3
Authors:
Su Young Kim,
Hyeonjin Park,
Kyuyong Shin,
Kyung-Min Kim
Abstract:
As online merchandise become more common, many studies focus on embedding-based methods where queries and products are represented in the semantic space. These methods alleviate the problem of vocab mismatch between the language of queries and products. However, past studies usually dealt with queries that precisely describe the product, and there still exists the need to answer imprecise queries…
▽ More
As online merchandise become more common, many studies focus on embedding-based methods where queries and products are represented in the semantic space. These methods alleviate the problem of vocab mismatch between the language of queries and products. However, past studies usually dealt with queries that precisely describe the product, and there still exists the need to answer imprecise queries that may require common sense knowledge, i.e., 'what should I get my mom for Mother's Day.' In this paper, we propose a GPT-3 based product retrieval system that leverages the knowledge-base (KB) of GPT-3 for question answering; users do not need to know the specific illustrative keywords for a product when querying. Our method tunes prompt tokens of GPT-3 to prompt knowledge and render answers that are mapped directly to products without further processing. Our method shows consistent performance improvement on two real-world and one public dataset, compared to the baseline methods. We provide an in-depth discussion on leveraging GPT-3 knowledge into a question answering based retrieval system.
△ Less
Submitted 6 July, 2022;
originally announced July 2022.
-
Mapping charge capture and acceleration in a plasma wakefield of a proton bunch using variable emittance electron beam injection
Authors:
E. Granados,
L. Verra,
A. -M. Bachmann,
E. Chevallay,
S. Doebert,
V. Fedosseev,
F. Friebel,
S. Gessner,
E. Gschwendtner,
S. Y. Kim,
S. Mazzoni,
J. T. Moody,
M. Turner
Abstract:
In the Phase 2 of the AWAKE first experimental run (from May to November 2018), an electron beam was used to probe and test proton-driven wakefield acceleration in a rubidium plasma column. In this work, we analyze the overall charge capture and shot-to-shot reproducibility of the proton-driven plasma wakefield accelerator with various electron bunch injection parameters. The witness electron bunc…
▽ More
In the Phase 2 of the AWAKE first experimental run (from May to November 2018), an electron beam was used to probe and test proton-driven wakefield acceleration in a rubidium plasma column. In this work, we analyze the overall charge capture and shot-to-shot reproducibility of the proton-driven plasma wakefield accelerator with various electron bunch injection parameters. The witness electron bunches were produced using an RF-gun equipped with a Cs2Te photocathode illuminated by a tailorable ultrafast deep ultraviolet (UV) laser pulse. The construction of the UV beam optical system enabled appropriate transverse beam shaping and control of its pulse duration, size, and position on the photocathode, as well as time delay with respect to the ionizing laser pulse that seeds the plasma wakefields in the proton bunches. Variable photocathode illumination provided the required flexibility to produce electron bunches with variable charge, emittance, and injection trajectory into the plasma column. We demonstrate charge capture rates exceeding 15% (40 pC of GeV accelerated charge for a 385 pC injected electron bunch) under optimized electron injection conditions.
△ Less
Submitted 28 June, 2022;
originally announced June 2022.
-
Atomically imprinted graphene plasmonic cavities
Authors:
Brian S. Y. Kim,
Aaron J. Sternbach,
Min Sup Choi,
Zhiyuan Sun,
Francesco L. Ruta,
Yinming Shao,
Alexander S. McLeod,
Lin Xiong,
Yinan Dong,
Anjaly Rajendran,
Song Liu,
Ankur Nipane,
Sang Hoon Chae,
Amirali Zangiabadi,
Xiaodong Xu,
Andrew J. Millis,
P. James Schuck,
Cory. R. Dean,
James C. Hone,
D. N. Basov
Abstract:
Plasmon polaritons in van der Waals (vdW) materials hold promise for next-generation photonics. The ability to deterministically imprint spatial patterns of high carrier density in cavities and circuitry with nanoscale features underlies future progress in nonlinear nanophotonics and strong light-matter interactions. Here, we demonstrate a general strategy to atomically imprint low-loss graphene p…
▽ More
Plasmon polaritons in van der Waals (vdW) materials hold promise for next-generation photonics. The ability to deterministically imprint spatial patterns of high carrier density in cavities and circuitry with nanoscale features underlies future progress in nonlinear nanophotonics and strong light-matter interactions. Here, we demonstrate a general strategy to atomically imprint low-loss graphene plasmonic structures using oxidation-activated charge transfer (OCT). We cover graphene with a monolayer of WSe$_2$, which is subsequently oxidized into high work-function WOx to activate charge transfer. Nano-infrared imaging reveals low-loss plasmon polaritons at the WOx/graphene interface. We insert WSe$_2$ spacers to precisely control the OCT-induced carrier density and achieve a near-intrinsic quality factor of plasmons. Finally, we imprint canonical plasmonic cavities exhibiting laterally abrupt doping profiles with single-digit nanoscale precision via programmable OCT. Specifically, we demonstrate technologically appealing but elusive plasmonic whispering-gallery resonators based on free-standing graphene encapsulated in WOx. Our results open avenues for novel quantum photonic architectures incorporating two-dimensional materials.
△ Less
Submitted 25 June, 2022;
originally announced June 2022.
-
ELUDE: Generating interpretable explanations via a decomposition into labelled and unlabelled features
Authors:
Vikram V. Ramaswamy,
Sunnie S. Y. Kim,
Nicole Meister,
Ruth Fong,
Olga Russakovsky
Abstract:
Deep learning models have achieved remarkable success in different areas of machine learning over the past decade; however, the size and complexity of these models make them difficult to understand. In an effort to make them more interpretable, several recent works focus on explaining parts of a deep neural network through human-interpretable, semantic attributes. However, it may be impossible to…
▽ More
Deep learning models have achieved remarkable success in different areas of machine learning over the past decade; however, the size and complexity of these models make them difficult to understand. In an effort to make them more interpretable, several recent works focus on explaining parts of a deep neural network through human-interpretable, semantic attributes. However, it may be impossible to completely explain complex models using only semantic attributes. In this work, we propose to augment these attributes with a small set of uninterpretable features. Specifically, we develop a novel explanation framework ELUDE (Explanation via Labelled and Unlabelled DEcomposition) that decomposes a model's prediction into two parts: one that is explainable through a linear combination of the semantic attributes, and another that is dependent on the set of uninterpretable features. By identifying the latter, we are able to analyze the "unexplained" portion of the model, obtaining insights into the information used by the model. We show that the set of unlabelled features can generalize to multiple models trained with the same feature space and compare our work to two popular attribute-oriented methods, Interpretable Basis Decomposition and Concept Bottleneck, and discuss the additional insights ELUDE provides.
△ Less
Submitted 16 June, 2022; v1 submitted 15 June, 2022;
originally announced June 2022.
-
Layered Depth Refinement with Mask Guidance
Authors:
Soo Ye Kim,
Jianming Zhang,
Simon Niklaus,
Yifei Fan,
Simon Chen,
Zhe Lin,
Munchurl Kim
Abstract:
Depth maps are used in a wide range of applications from 3D rendering to 2D image effects such as Bokeh. However, those predicted by single image depth estimation (SIDE) models often fail to capture isolated holes in objects and/or have inaccurate boundary regions. Meanwhile, high-quality masks are much easier to obtain, using commercial auto-masking tools or off-the-shelf methods of segmentation…
▽ More
Depth maps are used in a wide range of applications from 3D rendering to 2D image effects such as Bokeh. However, those predicted by single image depth estimation (SIDE) models often fail to capture isolated holes in objects and/or have inaccurate boundary regions. Meanwhile, high-quality masks are much easier to obtain, using commercial auto-masking tools or off-the-shelf methods of segmentation and matting or even by manual editing. Hence, in this paper, we formulate a novel problem of mask-guided depth refinement that utilizes a generic mask to refine the depth prediction of SIDE models. Our framework performs layered refinement and inpainting/outpainting, decomposing the depth map into two separate layers signified by the mask and the inverse mask. As datasets with both depth and mask annotations are scarce, we propose a self-supervised learning scheme that uses arbitrary masks and RGB-D datasets. We empirically show that our method is robust to different types of masks and initial depth predictions, accurately refining depth values in inner and outer mask boundary regions. We further analyze our model with an ablation study and demonstrate results on real applications. More information can be found at https://sooyekim.github.io/MaskDepth/ .
△ Less
Submitted 7 June, 2022;
originally announced June 2022.
-
Infrared Plasmons Propagate through a Hyperbolic Nodal Metal
Authors:
Yinming Shao,
Aaron J. Sternbach,
Brian S. Y. Kim,
Andrey A. Rikhter,
Xinyi Xu,
Umberto De Giovannini,
Ran Jing,
Sang Hoon Chae,
Zhiyuan Sun,
Seng Huat Lee,
Yanglin Zhu,
Zhiqiang Mao,
J. Hone,
Raquel Queiroz,
A. J. Millis,
P. James Schuck,
A. Rubio,
M. M. Fogler,
D. N. Basov
Abstract:
Metals are canonical plasmonic media at infrared and optical wavelengths, allowing one to guide and manipulate light at the nano-scale. A special form of optical waveguiding is afforded by highly anisotropic crystals revealing the opposite signs of the dielectric functions along orthogonal directions. These media are classified as hyperbolic and include crystalline insulators, semiconductors and a…
▽ More
Metals are canonical plasmonic media at infrared and optical wavelengths, allowing one to guide and manipulate light at the nano-scale. A special form of optical waveguiding is afforded by highly anisotropic crystals revealing the opposite signs of the dielectric functions along orthogonal directions. These media are classified as hyperbolic and include crystalline insulators, semiconductors and artificial metamaterials. Layered anisotropic metals are also anticipated to support hyperbolic waveguiding. Yet this behavior remains elusive, primarily because interband losses arrest the propagation of infrared modes. Here, we report on the observation of propagating hyperbolic waves in a prototypical layered nodal-line semimetal ZrSiSe. The observed waveguiding originates from polaritonic hybridization between near-infrared light and nodal-line plasmons. Unique nodal electronic structures simultaneously suppress interband loss and boost the plasmonic response, ultimately enabling the propagation of infrared modes through the bulk of the crystal.
△ Less
Submitted 3 June, 2022;
originally announced June 2022.
-
Large-Area Intercalated 2D-Pb/Graphene Heterostructure as a Platform for Generating Spin-Orbit Torque
Authors:
Alexander Vera,
Boyang Zheng,
Wilson Yanez,
Kaijie Yang,
Seong Yeoul Kim,
Jimmy C. Kotsakidis,
Hesham El-Sherif,
Gopi Krishnan,
Roland J. Koch,
Timothy A. Bowen,
Chengye Dong,
Yuanxi Wang,
Maxwell Wetherington,
Eli Rotenberg,
Nabil Bassim,
Adam L. Friedman,
Robert M. Wallace,
Chaoxing Liu,
Nitin Samarth,
Vincent H. Crespi,
Joshua A. Robinson
Abstract:
A scalable platform to synthesize ultrathin heavy metals may enable high efficiency charge-to-spin conversion for next-generation spintronics. Here we report centimeter-scale synthesis of air-stable, epitaxially registered monolayer Pb underneath bilayer graphene on SiC (0001) by confinement heteroepitaxy (CHet). Diffraction, spectroscopy, and microscopy reveal CHet-based Pb intercalation predomin…
▽ More
A scalable platform to synthesize ultrathin heavy metals may enable high efficiency charge-to-spin conversion for next-generation spintronics. Here we report centimeter-scale synthesis of air-stable, epitaxially registered monolayer Pb underneath bilayer graphene on SiC (0001) by confinement heteroepitaxy (CHet). Diffraction, spectroscopy, and microscopy reveal CHet-based Pb intercalation predominantly exhibits a mottled hexagonal superstructure due to an ordered network of Frenkel-Kontorova-like domain walls. The system's air stability enables ex-situ spin torque ferromagnetic resonance (ST-FMR) measurements that demonstrate charge-to-spin conversion in graphene/Pb/ferromagnet heterostructures with a 1.5x increase in the effective field ratio compared to control samples.
△ Less
Submitted 27 March, 2024; v1 submitted 13 May, 2022;
originally announced May 2022.
-
Measurement of cosmogenic $^9$Li and $^8$He production rates at RENO
Authors:
H. G. Lee,
J. H. Choi,
H. I. Jang,
J. S. Jang,
S. H. Jeon,
K. K. Joo,
D. E. Jung,
J. G. Kim,
J. H. Kim,
J. Y. Kim,
S. B. Kim,
S. Y. Kim,
W. Kim,
E. Kwon,
D. H. Lee,
W. J. Lee,
I. T. Lim,
D. H. Moon,
M. Y. Pac,
J. S. Park,
R. G. Park,
H. Seo,
J. W. Seo,
C. D. Shin,
B. S. Yang
, et al. (4 additional authors not shown)
Abstract:
We report the measured production rates of unstable isotopes $^9$Li and $^8$He produced by cosmic muon spallation on $^{12}$C using two identical detectors of the RENO experiment. Their beta-decays accompanied by a neutron make a significant contribution to backgrounds of reactor antineutrino events in precise determination of the smallest neutrino mixing angle. The mean muon energy of its near (f…
▽ More
We report the measured production rates of unstable isotopes $^9$Li and $^8$He produced by cosmic muon spallation on $^{12}$C using two identical detectors of the RENO experiment. Their beta-decays accompanied by a neutron make a significant contribution to backgrounds of reactor antineutrino events in precise determination of the smallest neutrino mixing angle. The mean muon energy of its near (far) detector with an overburden of 120 (450) m.w.e. is estimated as 33.1 +- 2.3 (73.6 +- 4.4) GeV. Based on roughly 3100 days of data, the cosmogenic production rate of $^9$Li ($^8$He) isotope is measured to be 44.2 +- 3.1 (10.6 +- 7.4) per day at near detector and 10.0 +- 1.1 (2.1 +- 1.5) per day at far detector. This corresponds to yields of $^9$Li ($^8$He), 4.80 +- 0.36 (1.15 +- 0.81) and 9.9 +- 1.1 (2.1 +- 1.5) at near and far detectors, respectively, in a unit of 10$^{-8}$ $μ^{-1}$ g${^-1}$ cm${^2}$. Combining the measured $^9$Li yields with other available underground measurements, an excellent power-law relationship of the yield with respect to the mean muon energy is found to have an exponent of $α$ = 0.75 +- 0.05.
△ Less
Submitted 2 July, 2022; v1 submitted 20 April, 2022;
originally announced April 2022.
-
EDGE: the puzzling ellipticity of Eridanus II's star cluster and its implications for dark matter at the heart of an ultra-faint dwarf
Authors:
Matthew D. A. Orkney,
Justin I. Read,
Oscar Agertz,
Andrew Pontzen,
Martin P. Rey,
Alex Goater,
Ethan Taylor,
Stacy Y. Kim,
Maxime Delorme
Abstract:
The Eridanus II (EriII) 'ultra-faint' dwarf has a large ($15\,\text{pc}$) and low mass ($4.3\times10^3\,\text{M}_\odot$) star cluster (SC) offset from its centre by $23\pm3\,\text{pc}$ in projection. Its size and offset are naturally explained if EriII has a central dark matter core, but such a core may be challenging to explain in a $Λ$CDM cosmology. In this paper, we revisit the survival and evo…
▽ More
The Eridanus II (EriII) 'ultra-faint' dwarf has a large ($15\,\text{pc}$) and low mass ($4.3\times10^3\,\text{M}_\odot$) star cluster (SC) offset from its centre by $23\pm3\,\text{pc}$ in projection. Its size and offset are naturally explained if EriII has a central dark matter core, but such a core may be challenging to explain in a $Λ$CDM cosmology. In this paper, we revisit the survival and evolution of EriII's SC, focussing for the first time on its puzzlingly large ellipticity ($0.31^{+0.05}_{-0.06}$). We perform a suite of 960 direct $N$-body simulations of SCs, orbiting within a range of spherical background potentials fit to ultra-faint dwarf (UFD) galaxy simulations. We find only two scenarios that come close to explaining EriII's SC. In the first, EriII has a low density dark matter core (of size $\sim70\,\text{pc}$ and density $\lesssim2\times10^8\,\text{M}_{\odot}\,\text{kpc}^{-3}$). In this model, the high ellipticity of EriII's SC is set at birth, with the lack of tidal forces in the core allowing its ellipticity to remain frozen in for long times. In the second, EriII's SC orbits in a partial core, with its high ellipticity owing to its imminent tidal destruction. However, this latter model struggles to reproduce the large size of EriII's SC, and it predicts substantial tidal tails around EriII's SC that should have already been seen in the data. This leads us to favour the cored model. We discuss potential caveats to these findings, and the implications of the cored model for galaxy formation and the nature of dark matter.
△ Less
Submitted 1 August, 2022; v1 submitted 31 January, 2022;
originally announced January 2022.
-
Q-effectiveness for holomorphic subelliptic multipliers
Authors:
Dmitri Zaitsev,
Sung Yeon Kim
Abstract:
We provide a solution to the effectiveness problem in Kohn's algorithm for generating holomorphic subelliptic multipliers for $(0,q)$ forms for arbitrary $q$. As an application, we obtain subelliptic estimates for $(0,q)$ forms with effectively controlled order $ε>0$ (the Sobolev exponent) for domains given by sums of squares of holomorphic functions (J.J. Kohn called them "special domains"). Thes…
▽ More
We provide a solution to the effectiveness problem in Kohn's algorithm for generating holomorphic subelliptic multipliers for $(0,q)$ forms for arbitrary $q$. As an application, we obtain subelliptic estimates for $(0,q)$ forms with effectively controlled order $ε>0$ (the Sobolev exponent) for domains given by sums of squares of holomorphic functions (J.J. Kohn called them "special domains"). These domains are of particular interest due to their relation with complex and algebraic geometry. Our methods include triangular resolutions introduced by the authors in their previous work.
△ Less
Submitted 30 December, 2021;
originally announced December 2021.
-
EDGE: What shapes the relationship between HI and stellar observables in faint dwarf galaxies?
Authors:
Martin P. Rey,
Andrew Pontzen,
Oscar Agertz,
Matthew D. A. Orkney,
Justin I. Read,
Amélie Saintonge,
Stacy Y. Kim,
Payel Das
Abstract:
We show how the interplay between feedback and mass-growth histories introduces scatter in the relationship between stellar and neutral gas properties of field faint dwarf galaxies ($M_{\star} \lessapprox 10^{6} M_{\odot}$). Across a suite of cosmological, high-resolution zoomed simulations, we find that dwarf galaxies of stellar masses $10^5 \leq M_{\star} \leq 10^{6} M_{\odot}$ are bimodal in th…
▽ More
We show how the interplay between feedback and mass-growth histories introduces scatter in the relationship between stellar and neutral gas properties of field faint dwarf galaxies ($M_{\star} \lessapprox 10^{6} M_{\odot}$). Across a suite of cosmological, high-resolution zoomed simulations, we find that dwarf galaxies of stellar masses $10^5 \leq M_{\star} \leq 10^{6} M_{\odot}$ are bimodal in their cold gas content, being either HI-rich or HI-deficient. This bimodality is generated through the coupling between (i) the modulation of HI contents by the background of ultraviolet radiation (UVB) at late times and (ii) the significant scatter in the stellar-mass-halo-mass relationship induced by reionization. Furthermore, our HI-rich dwarfs exhibit disturbed and time-variable neutral gas distributions primarily due to stellar feedback. Over the last four billion years, we observe order-of-magnitude changes around the median $M_{HI}$, factor-of-a-few variations in HI spatial extents, and spatial offsets between HI and stellar components regularly exceeding the galaxies' optical sizes. Time variability introduces further scatter in the $M_{\star}-M_{HI}$ relation and affects a galaxy's detectability in HI at any given time. These effects will need to be accounted for when interpreting observations of the population of faint, HI-bearing dwarfs by the combination of optical and radio wide, deep surveys.
△ Less
Submitted 12 March, 2022; v1 submitted 6 December, 2021;
originally announced December 2021.
-
HIVE: Evaluating the Human Interpretability of Visual Explanations
Authors:
Sunnie S. Y. Kim,
Nicole Meister,
Vikram V. Ramaswamy,
Ruth Fong,
Olga Russakovsky
Abstract:
As AI technology is increasingly applied to high-impact, high-risk domains, there have been a number of new methods aimed at making AI models more human interpretable. Despite the recent growth of interpretability work, there is a lack of systematic evaluation of proposed techniques. In this work, we introduce HIVE (Human Interpretability of Visual Explanations), a novel human evaluation framework…
▽ More
As AI technology is increasingly applied to high-impact, high-risk domains, there have been a number of new methods aimed at making AI models more human interpretable. Despite the recent growth of interpretability work, there is a lack of systematic evaluation of proposed techniques. In this work, we introduce HIVE (Human Interpretability of Visual Explanations), a novel human evaluation framework that assesses the utility of explanations to human users in AI-assisted decision making scenarios, and enables falsifiable hypothesis testing, cross-method comparison, and human-centered evaluation of visual interpretability methods. To the best of our knowledge, this is the first work of its kind. Using HIVE, we conduct IRB-approved human studies with nearly 1000 participants and evaluate four methods that represent the diversity of computer vision interpretability works: GradCAM, BagNet, ProtoPNet, and ProtoTree. Our results suggest that explanations engender human trust, even for incorrect predictions, yet are not distinct enough for users to distinguish between correct and incorrect predictions. We open-source HIVE to enable future studies and encourage more human-centered approaches to interpretability research.
△ Less
Submitted 21 July, 2022; v1 submitted 6 December, 2021;
originally announced December 2021.
-
Deep-ultraviolet electroluminescence and photocurrent generation in graphene/hBN/graphene heterostructures
Authors:
Su-Beom Song,
Sangho Yoon,
So Young Kim,
Sera Yang,
Seung-Young Seo,
Soonyoung Cha,
Hyeon-Woo Jeong,
Kenji Watanabe,
Takashi Taniguchi,
Gil-Ho Lee,
Jun Sung Kim,
Moon-Ho Jo,
Jonghwan Kim
Abstract:
Hexagonal boron nitride (hBN) is a van der Waals semiconductor with a wide bandgap of ~ 5.96 eV. Despite the indirect bandgap characteristics of hBN, charge carriers excited by high energy electrons or photons efficiently emit luminescence at deep-ultraviolet (DUV) frequencies via strong electron-phonon interaction, suggesting potential DUV light emitting device applications. However, electrolumin…
▽ More
Hexagonal boron nitride (hBN) is a van der Waals semiconductor with a wide bandgap of ~ 5.96 eV. Despite the indirect bandgap characteristics of hBN, charge carriers excited by high energy electrons or photons efficiently emit luminescence at deep-ultraviolet (DUV) frequencies via strong electron-phonon interaction, suggesting potential DUV light emitting device applications. However, electroluminescence from hBN has not been demonstrated at DUV frequencies so far. In this study, we report DUV electroluminescence and photocurrent generation in graphene/hBN/graphene heterostructures at room temperature. Tunneling carrier injection from graphene electrodes into the band edges of hBN enables prominent electroluminescence at DUV frequencies. On the other hand, under DUV laser illumination and external bias voltage, graphene electrodes efficiently collect photo-excited carriers in hBN, which generates high photocurrent. Laser excitation micro-spectroscopy shows that the radiative recombination and photocarrier excitation processes in the heterostructures mainly originate from the pristine structure and the stacking faults in hBN. Our work provides a pathway toward efficient DUV light emitting and detection devices based on hBN.
△ Less
Submitted 1 December, 2021;
originally announced December 2021.
-
Scaling Law for Recommendation Models: Towards General-purpose User Representations
Authors:
Kyuyong Shin,
Hanock Kwak,
Su Young Kim,
Max Nihlen Ramstrom,
Jisu Jeong,
Jung-Woo Ha,
Kyung-Min Kim
Abstract:
Recent advancement of large-scale pretrained models such as BERT, GPT-3, CLIP, and Gopher, has shown astonishing achievements across various task domains. Unlike vision recognition and language models, studies on general-purpose user representation at scale still remain underexplored. Here we explore the possibility of general-purpose user representation learning by training a universal user encod…
▽ More
Recent advancement of large-scale pretrained models such as BERT, GPT-3, CLIP, and Gopher, has shown astonishing achievements across various task domains. Unlike vision recognition and language models, studies on general-purpose user representation at scale still remain underexplored. Here we explore the possibility of general-purpose user representation learning by training a universal user encoder at large scales. We demonstrate that the scaling law is present in user representation learning areas, where the training error scales as a power-law with the amount of computation. Our Contrastive Learning User Encoder (CLUE), optimizes task-agnostic objectives, and the resulting user embeddings stretch our expectation of what is possible to do in various downstream tasks. CLUE also shows great transferability to other domains and companies, as performances on an online experiment shows significant improvements in Click-Through-Rate (CTR). Furthermore, we also investigate how the model performance is influenced by the scale factors, such as training data size, model capacity, sequence length, and batch size. Finally, we discuss the broader impacts of CLUE in general.
△ Less
Submitted 22 November, 2022; v1 submitted 15 November, 2021;
originally announced November 2021.
-
Characterization of the correlated background for a sterile neutrino search using the first dataset of the JSNS$^2$ experiment
Authors:
Y. Hino,
S. Ajimura,
M. K. Cheoun,
J. H. Choi,
T. Dodo,
H. Furuta,
J. Goh,
K. Haga,
M. Harada,
S. Hasegawa,
T. Hiraiwa,
W. Hwang,
H. I. Jang,
J. S. Jang,
H. Jeon,
S. Jeon,
K. K. Joo,
J. R. Jordan,
D. E. Jung,
S. K. Kang,
Y. Kasugai,
T. Kawasaki,
E. J. Kim,
J. Y. Kim,
S. B. Kim
, et al. (40 additional authors not shown)
Abstract:
JSNS$^2$ (J-PARC Sterile Neutrino Search at J-PARC Spallation Neutron Source) is an experiment that is searching for sterile neutrinos via the observation of $\barν_μ \to \barν_{e}$ appearance oscillations using muon decay-at-rest neutrinos. Before dedicated data taking in the first-half of 2021, we performed a commissioning run for 10 days in June 2020. Using the data obtained in this commissioni…
▽ More
JSNS$^2$ (J-PARC Sterile Neutrino Search at J-PARC Spallation Neutron Source) is an experiment that is searching for sterile neutrinos via the observation of $\barν_μ \to \barν_{e}$ appearance oscillations using muon decay-at-rest neutrinos. Before dedicated data taking in the first-half of 2021, we performed a commissioning run for 10 days in June 2020. Using the data obtained in this commissioning run, in this paper, we present an estimate of the correlated background which imitates the $\barν_{e}$ signal in a sterile neutrino search. In addition, in order to demonstrate future prospects of the JSNS$^2$ experiment, possible pulse shape discrimination improvements towards reducing cosmic ray induced fast neutron background are described.
△ Less
Submitted 11 March, 2022; v1 submitted 14 November, 2021;
originally announced November 2021.
-
Human-Computer Interaction Glow Up: Examining Operational Trust and Intention Towards Mars Autonomous Systems
Authors:
Thomas Chan,
Jeremy Argueta,
Jazlyn Armendariz,
Allison Graham,
Sarah Hwang,
Basak Ramaswamy,
So Young Kim,
Scott Davidoff
Abstract:
Tactful coordination on earth between hundreds of operators from diverse disciplines and backgrounds is needed to ensure that Martian rovers have a high likelihood of achieving their science goals while enduring the harsh environment of the red planet. The operations team includes many individuals, each with independent and overlapping objectives, working to decide what to execute on the Mars surf…
▽ More
Tactful coordination on earth between hundreds of operators from diverse disciplines and backgrounds is needed to ensure that Martian rovers have a high likelihood of achieving their science goals while enduring the harsh environment of the red planet. The operations team includes many individuals, each with independent and overlapping objectives, working to decide what to execute on the Mars surface during the next planning period. The team must work together to understand each other's objectives and constraints within a fixed time period, often requiring frequent revision. This study examines the challenges faced during Mars surface operations, from high-level science objectives to formulating a valid, safe, and optimal activity plan that is ready to be radiated to the rover. Through this examination, we aim to illuminate how planning intent can be formulated and effectively communicated to future spacecrafts that will become more and more autonomous. Our findings reveal the intricate nature of human-to-human interactions that require a large array of soft skills and core competencies to communicate concurrently with science and engineering teams during plan formulation. Additionally, our findings exposed significant challenges in eliciting planning intent from operators, which will intensify in the future, as operators on the ground asynchronously co-operate the rover with the on board autonomy. Building a marvellous robot and landing it onto the Mars surface are remarkable feats -however, ensuring that scientists can get the best out of the mission is an ongoing challenge and will not cease to be a difficult task with increased autonomy.
△ Less
Submitted 28 October, 2021;
originally announced October 2021.
-
The Milky Way satellite velocity function is a sharp probe of small-scale structure problems
Authors:
Stacy Y. Kim,
Annika H. G. Peter
Abstract:
Twenty years ago, the mismatch between the observed number of Milky Way satellite galaxies and the predicted number of cold dark matter (CDM) subhalos was dubbed the ``missing satellites problem". Although mostly framed since in terms of satellite counts in luminosity space, the missing satellites problem was originally posed in velocity space. Importantly, the stellar velocity dispersion function…
▽ More
Twenty years ago, the mismatch between the observed number of Milky Way satellite galaxies and the predicted number of cold dark matter (CDM) subhalos was dubbed the ``missing satellites problem". Although mostly framed since in terms of satellite counts in luminosity space, the missing satellites problem was originally posed in velocity space. Importantly, the stellar velocity dispersion function encodes information about the density profile of satellites as well as their abundance. In this work, we completeness correct the MW satellite stellar velocity dispersion function down to its ultrafaint dwarfs ($L \gtrsim 340$ L$_\odot$). Our most conservative completeness correction is in good agreement with a simple CDM model in which massive, classical satellites (M$_{\rm 200} \gtrsim 5 \times 10^8~$M$_\odot$) have baryon-driven cores, while lower mass, ultrafaint satellites inhabit cuspy halos that are not strongly tidally stripped. Tidal destruction of satellites by the MW's disk must be minimal, otherwise the completeness-corrected velocity function exceeds any plausible CDM prediction -- a ``too many satellites" problem. We rule out non-core-collapsing self-interacting dark matter models with a constant cross section $\gtrsim$ 0.1 cm$^2$/g. Constraints on warm dark matter are stronger than those based on the luminosity function due to its additional sensitivity to subhalo central densities, which suppresses number counts by up to an additional 30%. A thermal relic mass $\gtrsim$ 6 keV is preferred. Reducing uncertainties on stellar velocity dispersion measurements and the amount of tidal stripping experienced by the faintest dwarfs is key to determining the severity of the too many satellites problem.
△ Less
Submitted 4 August, 2022; v1 submitted 16 June, 2021;
originally announced June 2021.
-
Cleaning and Structuring the Label Space of the iMet Collection 2020
Authors:
Vivien Nguyen,
Sunnie S. Y. Kim
Abstract:
The iMet 2020 dataset is a valuable resource in the space of fine-grained art attribution recognition, but we believe it has yet to reach its true potential. We document the unique properties of the dataset and observe that many of the attribute labels are noisy, more than is implied by the dataset description. Oftentimes, there are also semantic relationships between the labels (e.g., identical,…
▽ More
The iMet 2020 dataset is a valuable resource in the space of fine-grained art attribution recognition, but we believe it has yet to reach its true potential. We document the unique properties of the dataset and observe that many of the attribute labels are noisy, more than is implied by the dataset description. Oftentimes, there are also semantic relationships between the labels (e.g., identical, mutual exclusion, subsumption, overlap with uncertainty) which we believe are underutilized. We propose an approach to cleaning and structuring the iMet 2020 labels, and discuss the implications and value of doing so. Further, we demonstrate the benefits of our proposed approach through several experiments. Our code and cleaned labels are available at https://github.com/sunniesuhyoung/iMet2020cleaned.
△ Less
Submitted 1 June, 2021;
originally announced June 2021.
-
[Re] Don't Judge an Object by Its Context: Learning to Overcome Contextual Bias
Authors:
Sunnie S. Y. Kim,
Sharon Zhang,
Nicole Meister,
Olga Russakovsky
Abstract:
Singh et al. (2020) point out the dangers of contextual bias in visual recognition datasets. They propose two methods, CAM-based and feature-split, that better recognize an object or attribute in the absence of its typical context while maintaining competitive within-context accuracy. To verify their performance, we attempted to reproduce all 12 tables in the original paper, including those in the…
▽ More
Singh et al. (2020) point out the dangers of contextual bias in visual recognition datasets. They propose two methods, CAM-based and feature-split, that better recognize an object or attribute in the absence of its typical context while maintaining competitive within-context accuracy. To verify their performance, we attempted to reproduce all 12 tables in the original paper, including those in the appendix. We also conducted additional experiments to better understand the proposed methods, including increasing the regularization in CAM-based and removing the weighted loss in feature-split. As the original code was not made available, we implemented the entire pipeline from scratch in PyTorch 1.7.0. Our implementation is based on the paper and email exchanges with the authors. We found that both proposed methods in the original paper help mitigate contextual bias, although for some methods, we could not completely replicate the quantitative results in the paper even after completing an extensive hyperparameter search. For example, on COCO-Stuff, DeepFashion, and UnRel, our feature-split model achieved an increase in accuracy on out-of-context images over the standard baseline, whereas on AwA, we saw a drop in performance. For the proposed CAM-based method, we were able to reproduce the original paper's results to within 0.5$\%$ mAP. Our implementation can be found at https://github.com/princetonvisualai/ContextualBias.
△ Less
Submitted 28 April, 2021;
originally announced April 2021.
-
Supernova Model Discrimination with Hyper-Kamiokande
Authors:
Hyper-Kamiokande Collaboration,
:,
K. Abe,
P. Adrich,
H. Aihara,
R. Akutsu,
I. Alekseev,
A. Ali,
F. Ameli,
I. Anghel,
L. H. V. Anthony,
M. Antonova,
A. Araya,
Y. Asaoka,
Y. Ashida,
V. Aushev,
F. Ballester,
I. Bandac,
M. Barbi,
G. J. Barker,
G. Barr,
M. Batkiewicz-Kwasniak,
M. Bellato,
V. Berardi,
M. Bergevin
, et al. (478 additional authors not shown)
Abstract:
Core-collapse supernovae are among the most magnificent events in the observable universe. They produce many of the chemical elements necessary for life to exist and their remnants -- neutron stars and black holes -- are interesting astrophysical objects in their own right. However, despite millennia of observations and almost a century of astrophysical study, the explosion mechanism of core-colla…
▽ More
Core-collapse supernovae are among the most magnificent events in the observable universe. They produce many of the chemical elements necessary for life to exist and their remnants -- neutron stars and black holes -- are interesting astrophysical objects in their own right. However, despite millennia of observations and almost a century of astrophysical study, the explosion mechanism of core-collapse supernovae is not yet well understood. Hyper-Kamiokande is a next-generation neutrino detector that will be able to observe the neutrino flux from the next galactic core-collapse supernova in unprecedented detail. We focus on the first 500 ms of the neutrino burst, corresponding to the accretion phase, and use a newly-developed, high-precision supernova event generator to simulate Hyper-Kamiokande's response to five different supernova models. We show that Hyper-Kamiokande will be able to distinguish between these models with high accuracy for a supernova at a distance of up to 100 kpc. Once the next galactic supernova happens, this ability will be a powerful tool for guiding simulations towards a precise reproduction of the explosion mechanism observed in nature.
△ Less
Submitted 20 July, 2021; v1 submitted 13 January, 2021;
originally announced January 2021.