-
Resurgence of superconductivity and the role of $d_{xy}$ hole band in FeSe$_{1-x}$Te$_x$
Authors:
Archie B. Morfoot,
Timur K. Kim,
Matthew D. Watson,
Amir A. Haghighirad,
Shiv J. Singh,
Nick Bultinck,
Amalia I. Coldea
Abstract:
Iron-chalcogenide superconductors display rich phenomena caused by orbital-dependent band shifts and electronic correlations. Additionally, they are potential candidates for topological superconductivity due to the band inversion between the Fe $d$ bands and the chalcogen $p_z$ band. Here we present a detailed study of the electronic structure of the nematic superconductors FeSe$_{1-x}$Te$_x$ (…
▽ More
Iron-chalcogenide superconductors display rich phenomena caused by orbital-dependent band shifts and electronic correlations. Additionally, they are potential candidates for topological superconductivity due to the band inversion between the Fe $d$ bands and the chalcogen $p_z$ band. Here we present a detailed study of the electronic structure of the nematic superconductors FeSe$_{1-x}$Te$_x$ ($0<x<0.4$) using angle-resolved photoemission spectroscopy to understand the role of orbital-dependent band shifts, electronic correlations and the chalcogen band. We assess the changes in the effective masses using a three-band low energy model, and the band renormalization via comparison with DFT band structure calculations. The effective masses decrease for all three-hole bands inside the nematic phase followed by a strong increase for the band with $d_{xy}$ orbital character. Interestingly, this nearly-flat $d_{xy}$ band becomes more correlated as it shifts towards the Fermi level with increasing Te concentrations and as the second superconducting dome emerges. Our findings suggests that the $d_{xy}$ hole band, which is very sensitive to the chalcogen height, could be involved in promoting an additional pairing channel and increasing the density of states to stabilize the second superconducting dome in FeSe$_{1-x}$Te$_x$. This simultaneous shift of the $d_{xy}$ hole band and enhanced superconductivity is in contrast with FeSe$_{1-x}$S$_x$.
△ Less
Submitted 19 January, 2024;
originally announced January 2024.
-
COOL-LAMPS VI: Lens model and New Constraints on the Properties of COOL J1241+2219, a Bright z = 5 Lyman Break Galaxy and its z = 1 Cluster Lens
Authors:
Maxwell Klein,
Keren Sharon,
Kate Napier,
Michael D. Gladders,
Gourav Khullar,
Matthew Bayliss,
Håkon Dahle,
M. Riley Owens,
Antony Stark,
Sasha Brownsberger,
Keunho J. Kim,
Nicole Kuchta,
Guillaume Mahler,
Grace Smith,
Ryan Walker,
Katya Gozman,
Michael N. Martinez,
Owen S. Matthews Acuña,
Kaiya Merz,
Jorge A. Sanchez,
Daniel J. Kavin Stein,
Ezra O. Sukay,
Kiyan Tavangar
Abstract:
We present a strong lensing analysis of COOL J1241+2219, the brightest known gravitationally lensed galaxy at $z \geq 5$, based on new multi-band Hubble Space Telescope (HST) imaging data. The lensed galaxy has a redshift of z=5.043, placing it shortly after the end of the Epoch of Reionization, and an AB magnitude z_AB=20.47 mag (Khullar et al. 2021). As such, it serves as a touchstone for future…
▽ More
We present a strong lensing analysis of COOL J1241+2219, the brightest known gravitationally lensed galaxy at $z \geq 5$, based on new multi-band Hubble Space Telescope (HST) imaging data. The lensed galaxy has a redshift of z=5.043, placing it shortly after the end of the Epoch of Reionization, and an AB magnitude z_AB=20.47 mag (Khullar et al. 2021). As such, it serves as a touchstone for future research of that epoch. The high spatial resolution of HST reveals internal structure in the giant arc, from which we identify 15 constraints and construct a robust lens model. We use the lens model to extract cluster mass and lensing magnification. We find that the mass enclosed within the Einstein radius of the z=1.001 cluster lens is M(<5.77'')=$1.079^{+0.023}_{-0.007}$, significantly lower than other known strong lensing clusters at its redshift. The average magnification of the giant arc is $<μ_{arc}>=76^{+40}_{-20}$, a factor of $2.4^{+1.4}_{-0.7}$ greater than previously estimated from ground-based data; the flux-weighted average magnification is $<μ_{arc}>=92^{+37}_{-31}$ We update the current measurements of the stellar mass and star formation rate (SFR) of the source for the revised magnification, $\log(M_\star/M_{\odot})=9.7\pm0.3$ and ${\rm SFR} = 10.3^{+7.0}_{-4.4}$ $ M_{\odot} $yr$^{-1}$. The powerful lensing magnification acting upon COOL J1241+2219 resolves the source and enables future studies of the properties of its star formation on a clump-by-clump basis. The lensing analysis presented here will support upcoming multiwavelength characterization with HST and JWST data of the stellar mass assembly and physical properties of this high-redshift lensed galaxy.
△ Less
Submitted 18 January, 2024;
originally announced January 2024.
-
Adaptive Self-training Framework for Fine-grained Scene Graph Generation
Authors:
Kibum Kim,
Kanghoon Yoon,
Yeonjun In,
Jinyoung Moon,
Donghyun Kim,
Chanyoung Park
Abstract:
Scene graph generation (SGG) models have suffered from inherent problems regarding the benchmark datasets such as the long-tailed predicate distribution and missing annotation problems. In this work, we aim to alleviate the long-tailed problem of SGG by utilizing unannotated triplets. To this end, we introduce a Self-Training framework for SGG (ST-SGG) that assigns pseudo-labels for unannotated tr…
▽ More
Scene graph generation (SGG) models have suffered from inherent problems regarding the benchmark datasets such as the long-tailed predicate distribution and missing annotation problems. In this work, we aim to alleviate the long-tailed problem of SGG by utilizing unannotated triplets. To this end, we introduce a Self-Training framework for SGG (ST-SGG) that assigns pseudo-labels for unannotated triplets based on which the SGG models are trained. While there has been significant progress in self-training for image recognition, designing a self-training framework for the SGG task is more challenging due to its inherent nature such as the semantic ambiguity and the long-tailed distribution of predicate classes. Hence, we propose a novel pseudo-labeling technique for SGG, called Class-specific Adaptive Thresholding with Momentum (CATM), which is a model-agnostic framework that can be applied to any existing SGG models. Furthermore, we devise a graph structure learner (GSL) that is beneficial when adopting our proposed self-training framework to the state-of-the-art message-passing neural network (MPNN)-based SGG models. Our extensive experiments verify the effectiveness of ST-SGG on various SGG models, particularly in enhancing the performance on fine-grained predicate classes.
△ Less
Submitted 18 January, 2024;
originally announced January 2024.
-
DOO-RE: A dataset of ambient sensors in a meeting room for activity recognition
Authors:
Hyunju Kim,
Geon Kim,
Taehoon Lee,
Kisoo Kim,
Dongman Lee
Abstract:
With the advancement of IoT technology, recognizing user activities with machine learning methods is a promising way to provide various smart services to users. High-quality data with privacy protection is essential for deploying such services in the real world. Data streams from surrounding ambient sensors are well suited to the requirement. Existing ambient sensor datasets only support constrain…
▽ More
With the advancement of IoT technology, recognizing user activities with machine learning methods is a promising way to provide various smart services to users. High-quality data with privacy protection is essential for deploying such services in the real world. Data streams from surrounding ambient sensors are well suited to the requirement. Existing ambient sensor datasets only support constrained private spaces and those for public spaces have yet to be explored despite growing interest in research on them. To meet this need, we build a dataset collected from a meeting room equipped with ambient sensors. The dataset, DOO-RE, includes data streams from various ambient sensor types such as Sound and Projector. Each sensor data stream is segmented into activity units and multiple annotators provide activity labels through a cross-validation annotation process to improve annotation quality. We finally obtain 9 types of activities. To our best knowledge, DOO-RE is the first dataset to support the recognition of both single and group activities in a real meeting room with reliable annotations.
△ Less
Submitted 16 January, 2024;
originally announced January 2024.
-
CFASL: Composite Factor-Aligned Symmetry Learning for Disentanglement in Variational AutoEncoder
Authors:
Hee-Jun Jung,
Jaehyoung Jeong,
Kangil Kim
Abstract:
Symmetries of input and latent vectors have provided valuable insights for disentanglement learning in VAEs.However, only a few works were proposed as an unsupervised method, and even these works require known factor information in training data. We propose a novel method, Composite Factor-Aligned Symmetry Learning (CFASL), which is integrated into VAEs for learning symmetry-based disentanglement…
▽ More
Symmetries of input and latent vectors have provided valuable insights for disentanglement learning in VAEs.However, only a few works were proposed as an unsupervised method, and even these works require known factor information in training data. We propose a novel method, Composite Factor-Aligned Symmetry Learning (CFASL), which is integrated into VAEs for learning symmetry-based disentanglement in unsupervised learning without any knowledge of the dataset factor information.CFASL incorporates three novel features for learning symmetry-based disentanglement: 1) Injecting inductive bias to align latent vector dimensions to factor-aligned symmetries within an explicit learnable symmetry codebook 2) Learning a composite symmetry to express unknown factors change between two random samples by learning factor-aligned symmetries within the codebook 3) Inducing group equivariant encoder and decoder in training VAEs with the two conditions. In addition, we propose an extended evaluation metric for multi-factor changes in comparison to disentanglement evaluation in VAEs. In quantitative and in-depth qualitative analysis, CFASL demonstrates a significant improvement of disentanglement in single-factor change, and multi-factor change conditions compared to state-of-the-art methods.
△ Less
Submitted 18 January, 2024; v1 submitted 16 January, 2024;
originally announced January 2024.
-
Improving ASR Contextual Biasing with Guided Attention
Authors:
Jiyang Tang,
Kwangyoun Kim,
Suwon Shon,
Felix Wu,
Prashant Sridhar,
Shinji Watanabe
Abstract:
In this paper, we propose a Guided Attention (GA) auxiliary training loss, which improves the effectiveness and robustness of automatic speech recognition (ASR) contextual biasing without introducing additional parameters. A common challenge in previous literature is that the word error rate (WER) reduction brought by contextual biasing diminishes as the number of bias phrases increases. To addres…
▽ More
In this paper, we propose a Guided Attention (GA) auxiliary training loss, which improves the effectiveness and robustness of automatic speech recognition (ASR) contextual biasing without introducing additional parameters. A common challenge in previous literature is that the word error rate (WER) reduction brought by contextual biasing diminishes as the number of bias phrases increases. To address this challenge, we employ a GA loss as an additional training objective besides the Transducer loss. The proposed GA loss aims to teach the cross attention how to align bias phrases with text tokens or audio frames. Compared to studies with similar motivations, the proposed loss operates directly on the cross attention weights and is easier to implement. Through extensive experiments based on Conformer Transducer with Contextual Adapter, we demonstrate that the proposed method not only leads to a lower WER but also retains its effectiveness as the number of bias phrases increases. Specifically, the GA loss decreases the WER of rare vocabularies by up to 19.2% on LibriSpeech compared to the contextual biasing baseline, and up to 49.3% compared to a vanilla Transducer.
△ Less
Submitted 16 January, 2024;
originally announced January 2024.
-
Bounded weak solutions for Keller-Segel equations with generalized diffusion and logistic source via an unbalanced Optimal Transport splitting scheme
Authors:
Kyungkeun Kang,
Hwa Kil Kim,
Geuntaek Seo
Abstract:
We consider a parabolic-elliptic type of Keller-Segel equations with generalized diffusion and logistic source under homogeneous Neumann-Neumann boundary conditions. We construct bounded weak solutions globally in time in an unbalanced optimal transport framework, provided that the magnitude of the chemotactic sensitivity can be restricted depending on parameters. In the case of subquadratic degra…
▽ More
We consider a parabolic-elliptic type of Keller-Segel equations with generalized diffusion and logistic source under homogeneous Neumann-Neumann boundary conditions. We construct bounded weak solutions globally in time in an unbalanced optimal transport framework, provided that the magnitude of the chemotactic sensitivity can be restricted depending on parameters. In the case of subquadratic degradation of the logistic source, we quantify the chemotactic sensitivity, in particular, in terms of the power of degradation and the pointwise bound of the initial density.
△ Less
Submitted 2 March, 2024; v1 submitted 16 January, 2024;
originally announced January 2024.
-
Statistics in Survey Sampling
Authors:
Jae Kwang Kim
Abstract:
Survey sampling theory and methods are introduced. Sampling designs and estimation methods are carefully discussed as a textbook for survey sampling. Topics includes Horvitz-Thompson estimation, simple random sampling, stratified sampling, cluster sampling, ratio estimation, regression estimation, variance estimation, two-phase sampling, and nonresponse adjustment methods.
Survey sampling theory and methods are introduced. Sampling designs and estimation methods are carefully discussed as a textbook for survey sampling. Topics includes Horvitz-Thompson estimation, simple random sampling, stratified sampling, cluster sampling, ratio estimation, regression estimation, variance estimation, two-phase sampling, and nonresponse adjustment methods.
△ Less
Submitted 11 June, 2024; v1 submitted 15 January, 2024;
originally announced January 2024.
-
Background study of the AMoRE-pilot experiment
Authors:
A. Agrawal,
V. V. Alenkov,
P. Aryal,
J. Beyer,
B. Bhandari,
R. S. Boiko,
K. Boonin,
O. Buzanov,
C. R. Byeon,
N. Chanthima,
M. K. Cheoun,
J. S. Choe,
Seonho Choi,
S. Choudhury,
J. S. Chung,
F. A. Danevich,
M. Djamal,
D. Drung,
C. Enss,
A. Fleischmann,
A. M. Gangapshev,
L. Gastaldo,
Yu. M. Gavrilyuk,
A. M. Gezhaev,
O. Gileva
, et al. (83 additional authors not shown)
Abstract:
We report a study on the background of the Advanced Molybdenum-Based Rare process Experiment (AMoRE), a search for neutrinoless double beta decay (\znbb) of $^{100}$Mo. The pilot stage of the experiment was conducted using $\sim$1.9 kg of \CAMOO~ crystals at the Yangyang Underground Laboratory, South Korea, from 2015 to 2018. We compared the measured $β/γ$ energy spectra in three experimental conf…
▽ More
We report a study on the background of the Advanced Molybdenum-Based Rare process Experiment (AMoRE), a search for neutrinoless double beta decay (\znbb) of $^{100}$Mo. The pilot stage of the experiment was conducted using $\sim$1.9 kg of \CAMOO~ crystals at the Yangyang Underground Laboratory, South Korea, from 2015 to 2018. We compared the measured $β/γ$ energy spectra in three experimental configurations with the results of Monte Carlo simulations and identified the background sources in each configuration. We replaced several detector components and enhanced the neutron shielding to lower the background level between configurations. A limit on the half-life of $0νββ$ decay of $^{100}$Mo was found at $T_{1/2}^{0ν} \ge 3.0\times 10^{23}$ years at 90\% confidence level, based on the measured background and its modeling. Further reduction of the background rate in the AMoRE-I and AMoRE-II are discussed.
△ Less
Submitted 7 April, 2024; v1 submitted 15 January, 2024;
originally announced January 2024.
-
Nonproportionality of NaI(Tl) Scintillation Detector for Dark Matter Search Experiments
Authors:
S. M. Lee,
G. Adhikari,
N. Carlin,
J. Y. Cho,
J. J. Choi,
S. Choi,
A. C. Ezeribe,
L. E. Fran. a,
C. Ha,
I. S. Hahn,
S. J. Hollick,
E. J. Jeon,
H. W. Joo,
W. G. Kang,
M. Kauer,
B. H. Kim,
H. J. Kim,
J. Kim,
K. W. Kim,
S. H. Kim,
S. K. Kim,
S. W. Kim,
W. K. Kim,
Y. D. Kim,
Y. H. Kim
, et al. (37 additional authors not shown)
Abstract:
We present a comprehensive study of the nonproportionality of NaI(Tl) scintillation detectors within the context of dark matter search experiments. Our investigation, which integrates COSINE-100 data with supplementary $γ$ spectroscopy, measures light yields across diverse energy levels from full-energy $γ$ peaks produced by the decays of various isotopes. These $γ$ peaks of interest were produced…
▽ More
We present a comprehensive study of the nonproportionality of NaI(Tl) scintillation detectors within the context of dark matter search experiments. Our investigation, which integrates COSINE-100 data with supplementary $γ$ spectroscopy, measures light yields across diverse energy levels from full-energy $γ$ peaks produced by the decays of various isotopes. These $γ$ peaks of interest were produced by decays supported by both long and short-lived isotopes. Analyzing peaks from decays supported only by short-lived isotopes presented a unique challenge due to their limited statistics and overlapping energies, which was overcome by long-term data collection and a time-dependent analysis. A key achievement is the direct measurement of the 0.87 keV light yield, resulting from the cascade following electron capture decay of $^{22}$Na from internal contamination. This measurement, previously accessible only indirectly, deepens our understanding of NaI(Tl) scintillator behavior in the region of interest for dark matter searches. This study holds substantial implications for background modeling and the interpretation of dark matter signals in NaI(Tl) experiments.
△ Less
Submitted 10 May, 2024; v1 submitted 14 January, 2024;
originally announced January 2024.
-
Folding potential with modern nuclear density functionals and application to 16O+208Pb reaction
Authors:
Kyoungsu Heo,
Hana Gil,
Ki-Seok Choi,
K. S. Kim,
Chang Ho Hyun,
W. Y. So
Abstract:
Double folding potential is constructed using the M3Y interaction and the matter densities of the projectile and target nuclei obtained from four microscopic energy density functional (EDF) models. The elastic scattering cross sections for the 16O+208Pb system are calculated using the optical model with the double folding potentials of the four EDF models. We focus on the correlation between the m…
▽ More
Double folding potential is constructed using the M3Y interaction and the matter densities of the projectile and target nuclei obtained from four microscopic energy density functional (EDF) models. The elastic scattering cross sections for the 16O+208Pb system are calculated using the optical model with the double folding potentials of the four EDF models. We focus on the correlation between the matter densities and the behavior the double folding potential and the elastic scattering cross sections. First, the matter and charge densities are examined by comparing the results of the four EDF models. There is a slight difference in the density in the internal region, but it is negligible in the outer region. Next, we calculate the double folding potential with the matter densities obtained from the four EDF models. Differences between the models are negligible in the outer region, but the potential depth in the internal region shows model dependence, which can be understood from the behavior of matter densities in the internal region. Another point is that the double folding potential is shown to be weakly dependent on the incident energy. Finally, the elastic scattering cross sections have no significant model dependence except for the slight difference in the backward angle.
△ Less
Submitted 14 January, 2024;
originally announced January 2024.
-
QCQP-Net: Reliably Learning Feasible Alternating Current Optimal Power Flow Solutions Under Constraints
Authors:
Sihan Zeng,
Youngdae Kim,
Yuxuan Ren,
Kibaek Kim
Abstract:
At the heart of power system operations, alternating current optimal power flow (ACOPF) studies the generation of electric power in the most economical way under network-wide load requirement, and can be formulated as a highly structured non-convex quadratically constrained quadratic program (QCQP). Optimization-based solutions to ACOPF (such as ADMM or interior-point method), as the classic appro…
▽ More
At the heart of power system operations, alternating current optimal power flow (ACOPF) studies the generation of electric power in the most economical way under network-wide load requirement, and can be formulated as a highly structured non-convex quadratically constrained quadratic program (QCQP). Optimization-based solutions to ACOPF (such as ADMM or interior-point method), as the classic approach, require large amount of computation and cannot meet the need to repeatedly solve the problem as load requirement frequently changes. On the other hand, learning-based methods that directly predict the ACOPF solution given the load input incur little computational cost but often generates infeasible solutions (i.e. violate the constraints of ACOPF). In this work, we combine the best of both worlds -- we propose an innovated framework for learning ACOPF, where the input load is mapped to the ACOPF solution through a neural network in a computationally efficient and reliable manner. Key to our innovation is a specific-purpose "activation function" defined implicitly by a QCQP and a novel loss, which enforce constraint satisfaction. We show through numerical simulations that our proposed method achieves superior feasibility rate and generation cost in situations where the existing learning-based approaches fail.
△ Less
Submitted 11 January, 2024;
originally announced January 2024.
-
Reinforcement Learning for Optimizing RAG for Domain Chatbots
Authors:
Mandar Kulkarni,
Praveen Tangarajan,
Kyung Kim,
Anusua Trivedi
Abstract:
With the advent of Large Language Models (LLM), conversational assistants have become prevalent for domain use cases. LLMs acquire the ability to contextual question answering through training, and Retrieval Augmented Generation (RAG) further enables the bot to answer domain-specific questions. This paper describes a RAG-based approach for building a chatbot that answers user's queries using Frequ…
▽ More
With the advent of Large Language Models (LLM), conversational assistants have become prevalent for domain use cases. LLMs acquire the ability to contextual question answering through training, and Retrieval Augmented Generation (RAG) further enables the bot to answer domain-specific questions. This paper describes a RAG-based approach for building a chatbot that answers user's queries using Frequently Asked Questions (FAQ) data. We train an in-house retrieval embedding model using infoNCE loss, and experimental results demonstrate that the in-house model works significantly better than the well-known general-purpose public embedding model, both in terms of retrieval accuracy and Out-of-Domain (OOD) query detection. As an LLM, we use an open API-based paid ChatGPT model. We noticed that a previously retrieved-context could be used to generate an answer for specific patterns/sequences of queries (e.g., follow-up queries). Hence, there is a scope to optimize the number of LLM tokens and cost. Assuming a fixed retrieval model and an LLM, we optimize the number of LLM tokens using Reinforcement Learning (RL). Specifically, we propose a policy-based model external to the RAG, which interacts with the RAG pipeline through policy actions and updates the policy to optimize the cost. The policy model can perform two actions: to fetch FAQ context or skip retrieval. We use the open API-based GPT-4 as the reward model. We then train a policy model using policy gradient on multiple training chat sessions. As a policy model, we experimented with a public gpt-2 model and an in-house BERT model. With the proposed RL-based optimization combined with similarity threshold, we are able to achieve significant cost savings while getting a slightly improved accuracy. Though we demonstrate results for the FAQ chatbot, the proposed RL approach is generic and can be experimented with any existing RAG pipeline.
△ Less
Submitted 9 January, 2024;
originally announced January 2024.
-
Search for Baryon-Number-Violating Processes in $B^-$ Decays to the $\barΞ_{c}^{0} \barΛ_{c}^{-}$ Final State
Authors:
Belle Collaboration,
T. Gu,
V. Savinov,
I. Adachi,
H. Aihara,
D. M. Asner,
H. Atmacan,
T. Aushev,
R. Ayad,
Sw. Banerjee,
K. Belous,
J. Bennett,
M. Bessner,
V. Bhardwaj,
B. Bhuyan,
D. Biswas,
A. Bobrov,
D. Bodrov,
J. Borah,
A. Bozek,
M. Bračko,
P. Branchini,
T. E. Browder,
A. Budano,
M. Campajola
, et al. (139 additional authors not shown)
Abstract:
We report the results of the first search for $B^-$ decays to the $\barΞ_{c}^{0} \barΛ_{c}^{-}$ final state using 711~${\rm fb^{-1}}$ of data collected at the $Υ(4S)$ resonance with the Belle detector at the KEKB asymmetric-energy $e^+ e^-$ collider. The results are interpreted in terms of both direct baryon-number-violating $B^-$ decay and $Ξ_{c}^{0}-\barΞ_{c}^{0}$ oscillations which follow the S…
▽ More
We report the results of the first search for $B^-$ decays to the $\barΞ_{c}^{0} \barΛ_{c}^{-}$ final state using 711~${\rm fb^{-1}}$ of data collected at the $Υ(4S)$ resonance with the Belle detector at the KEKB asymmetric-energy $e^+ e^-$ collider. The results are interpreted in terms of both direct baryon-number-violating $B^-$ decay and $Ξ_{c}^{0}-\barΞ_{c}^{0}$ oscillations which follow the Standard Model decay $B^- \to Ξ_{c}^{0} \barΛ_{c}^{-}$. We observe no evidence for baryon number violation and set the 95\% confidence-level upper limits on the ratio of baryon-number-violating and Standard Model branching fractions ${\mathcal{B}(B^- \rightarrow \barΞ_{c}^{0} \barΛ_{c}^{-})}/{\mathcal{B}(B^- \rightarrow Ξ_{c}^{0} \barΛ_{c}^{-})}$ to be $< 2.7\%$ and on the $Ξ_{c}^{0} - \barΞ_{c}^{0}$ oscillation angular frequency $ω$ to be $< 0.76\ \mathrm{ps}^{-1}$ (equivalent to $τ_{\rm mix} > 1.3$~ps).
△ Less
Submitted 11 January, 2024; v1 submitted 9 January, 2024;
originally announced January 2024.
-
Measurements of the branching fraction, polarization, and $CP$ asymmetry for the decay $B^0\rightarrow ωω$
Authors:
Belle Collaboration,
Y. Guan,
A. J. Schwartz,
K. Kinoshita,
I. Adachi,
H. Aihara,
S. Al Said,
D. M. Asner,
H. Atmacan,
R. Ayad,
S. Bahinipati,
Sw. Banerjee,
K. Belous,
J. Bennett,
M. Bessner,
V. Bhardwaj,
B. Bhuyan,
D. Biswas,
A. Bobrov,
D. Bodrov,
J. Borah,
A. Bozek,
M. Bračko,
P. Branchini,
A. Budano
, et al. (145 additional authors not shown)
Abstract:
We present a measurement of $B^{0} \rightarrow ωω$, a charmless decay into two vector mesons, using 772 $\times 10^6$ $B\overline{B}$ pairs collected with the Belle detector at the KEKB $e^+e^-$ collider. The decay is observed with a significance of 7.9 standard deviations. We measure a branching fraction $\mathcal{B} = (1.53 \pm 0.29 \pm 0.17) \times 10^{-6}$, a fraction of longitudinal polarizat…
▽ More
We present a measurement of $B^{0} \rightarrow ωω$, a charmless decay into two vector mesons, using 772 $\times 10^6$ $B\overline{B}$ pairs collected with the Belle detector at the KEKB $e^+e^-$ collider. The decay is observed with a significance of 7.9 standard deviations. We measure a branching fraction $\mathcal{B} = (1.53 \pm 0.29 \pm 0.17) \times 10^{-6}$, a fraction of longitudinal polarization $f_L = 0.87 \pm 0.13 \pm 0.13$, and a time-integrated $CP$ asymmetry $A_{CP}$ = $-0.44 \pm 0.43 \pm 0.11$, where the first uncertainties listed are statistical and the second are systematic. This is the first observation of $B^{0} \rightarrow ωω$, and the first measurements of $f_L$ and $A_{CP}$ for this decay.
△ Less
Submitted 9 January, 2024;
originally announced January 2024.
-
RHOBIN Challenge: Reconstruction of Human Object Interaction
Authors:
Xianghui Xie,
Xi Wang,
Nikos Athanasiou,
Bharat Lal Bhatnagar,
Chun-Hao P. Huang,
Kaichun Mo,
Hao Chen,
Xia Jia,
Zerui Zhang,
Liangxian Cui,
Xiao Lin,
Bingqiao Qian,
Jie Xiao,
Wenfei Yang,
Hyeongjin Nam,
Daniel Sungho Jung,
Kihoon Kim,
Kyoung Mu Lee,
Otmar Hilliges,
Gerard Pons-Moll
Abstract:
Modeling the interaction between humans and objects has been an emerging research direction in recent years. Capturing human-object interaction is however a very challenging task due to heavy occlusion and complex dynamics, which requires understanding not only 3D human pose, and object pose but also the interaction between them. Reconstruction of 3D humans and objects has been two separate resear…
▽ More
Modeling the interaction between humans and objects has been an emerging research direction in recent years. Capturing human-object interaction is however a very challenging task due to heavy occlusion and complex dynamics, which requires understanding not only 3D human pose, and object pose but also the interaction between them. Reconstruction of 3D humans and objects has been two separate research fields in computer vision for a long time. We hence proposed the first RHOBIN challenge: reconstruction of human-object interactions in conjunction with the RHOBIN workshop. It was aimed at bringing the research communities of human and object reconstruction as well as interaction modeling together to discuss techniques and exchange ideas. Our challenge consists of three tracks of 3D reconstruction from monocular RGB images with a focus on dealing with challenging interaction scenarios. Our challenge attracted more than 100 participants with more than 300 submissions, indicating the broad interest in the research communities. This paper describes the settings of our challenge and discusses the winning methods of each track in more detail. We observe that the human reconstruction task is becoming mature even under heavy occlusion settings while object pose estimation and joint reconstruction remain challenging tasks. With the growing interest in interaction modeling, we hope this report can provide useful insights and foster future research in this direction. Our workshop website can be found at \href{https://rhobin-challenge.github.io/}{https://rhobin-challenge.github.io/}.
△ Less
Submitted 7 January, 2024;
originally announced January 2024.
-
Active Diffusion of Self-Propelled Particles in Semi-Flexible Polymer Networks
Authors:
Yeongjin Kim,
Won Kyu Kim,
Jae-Hyung Jeon
Abstract:
Mesh-like structures, such as mucus gel or cytoskeleton networks, are ubiquitous in biological systems. These intricate structures are composed of cross-linked, semi-flexible bio-filaments, crucial to numerous biological processes. In many biological systems, active self-propelled particles like motor proteins or bacteria navigate these intricate polymer networks. In this study, we develop a compu…
▽ More
Mesh-like structures, such as mucus gel or cytoskeleton networks, are ubiquitous in biological systems. These intricate structures are composed of cross-linked, semi-flexible bio-filaments, crucial to numerous biological processes. In many biological systems, active self-propelled particles like motor proteins or bacteria navigate these intricate polymer networks. In this study, we develop a computational model of three-dimensional cubic-topological, swollen polymer networks of semi-flexible filaments. We perform Langevin dynamics simulations to investigate the diffusion of active tracer particles navigating through these networks. By analyzing various physical observables, we investigate the effects of mesh-to-particle size ratio, Péclet number of active particles, and bending stiffness of the polymer networks upon active trapped-and-hopping diffusion of the tracer. When the tracer size is equal to or larger than the mesh size, the polymer stiffness substantially enhances trapping while suppressing the hopping process. Notably, the mean trapped time exhibits an exponential growth law to the bending stiffness with an activity-dependent slope. An analytic theory based on the mean first-passage time of active particles in a harmonic potential is developed. Our findings deepen the comprehension of the intricate interplay between the polymer's bending stiffness, tracer size, and the activity of tracer particles. This knowledge can shed light on important biological processes, such as motor-driven cargo transport or drug delivery, which hinge on the behavior of active particles within biological gels.
△ Less
Submitted 9 January, 2024; v1 submitted 8 January, 2024;
originally announced January 2024.
-
Lyman Continuum Emission from AGN at 2.3$\lesssim$z$\lesssim$3.7 in the UVCANDELS Fields
Authors:
Brent M. Smith,
Rogier A. Windhorst,
Harry Teplitz,
Matthew Hayes,
Marc Rafelski,
Mark Dickinson,
Vihang Mehta,
Nimish P. Hathi,
John MacKenty,
L. Y. Aaron Yung,
Anton M. Koekemoer,
Emmaris Soto,
Christopher J. Conselice,
Ray A. Lucas,
Xin Wang,
Keunho J. Kim,
Anahita Alavi,
Norman A. Grogin,
Ben Sunnquist,
Laura Prichard,
Rolf A. Jansen,
the UVCANDELS team
Abstract:
We present the results of our search for Lyman continuum (LyC) emitting AGN at redshifts 2.3$\lesssim$z$\lesssim$4.9 from HST WFC3 F275W observations in the UVCANDELS fields. We also include LyC emission from AGN using HST WFC3 F225W, F275W, and F336W found in the ERS and HDUV data. We performed exhaustive queries of the Vizier database to locate AGN with high quality spectroscopic redshifts. In t…
▽ More
We present the results of our search for Lyman continuum (LyC) emitting AGN at redshifts 2.3$\lesssim$z$\lesssim$4.9 from HST WFC3 F275W observations in the UVCANDELS fields. We also include LyC emission from AGN using HST WFC3 F225W, F275W, and F336W found in the ERS and HDUV data. We performed exhaustive queries of the Vizier database to locate AGN with high quality spectroscopic redshifts. In total, we found 51 AGN that met our criteria within the UVCANDELS and ERS footprints. Of these 51, we find 12 AGN had $\geq$4$σ$ detected LyC flux in the WFC3/UVIS images. Using space- and ground-based data from X-ray to radio, we fit the multi-wavelength photometric data of each AGN to a CIGALE SED and correlate various SED parameters to the LyC flux. KS-tests of the SED parameter distributions for the LyC-detected and non-detected AGN showed they are likely not distinct samples. However, we find that X-ray luminosity, star-formation onset age, and disk luminosity show strong correlations relative to their emitted LyC flux. We also find strong correlation of the LyC flux to several dust parameters, i.e., polar and toroidal dust emission, 6 $μm$ luminosity, and anti-correlation with metallicity and $A_{FUV}$. We simulate the LyC escape fraction ($f_{esc}$) using the CIGALE and IGM transmission models for the LyC-detected AGN and find an average $f_{esc}$$\simeq$18%, weighted by uncertainties. We stack the LyC flux of subsamples of AGN according to the wavelength continuum region in which they are detected and find no significant distinctions in their LyC emission, although our $sub-mm\ detected$ F336W sample shows the brightest stacked LyC flux. These findings indicate that LyC-production and -escape in AGN is more complicated than the simple assumption of thermal emission and a 100% escape fraction. Further testing of AGN models with larger samples than presented here is needed.
△ Less
Submitted 5 January, 2024;
originally announced January 2024.
-
A test of lepton flavor universality with a measurement of $R(D^{*})$ using hadronic $B$ tagging at the Belle II experiment
Authors:
Belle II Collaboration,
I. Adachi,
K. Adamczyk,
L. Aggarwal,
H. Ahmed,
H. Aihara,
N. Akopov,
A. Aloisio,
N. Anh Ky,
D. M. Asner,
H. Atmacan,
T. Aushev,
V. Aushev,
M. Aversano,
R. Ayad,
V. Babu,
H. Bae,
S. Bahinipati,
P. Bambade,
Sw. Banerjee,
S. Bansal,
M. Barrett,
J. Baudot,
M. Bauer,
A. Baur
, et al. (412 additional authors not shown)
Abstract:
The ratio of branching fractions $R(D^{*}) = \mathcal{B}(\overline{B} \rightarrow D^{*} τ^{-} \overlineν_τ)$/$\mathcal{B} (\overline{B} \rightarrow D^{*} \ell^{-} \overlineν_{\ell})$, where $\ell$ is an electron or muon, is measured using a Belle~II data sample with an integrated luminosity of $189~\mathrm{fb}^{-1}$ at the SuperKEKB asymmetric-energy $e^{+} e^{-}$ collider. Data is collected at th…
▽ More
The ratio of branching fractions $R(D^{*}) = \mathcal{B}(\overline{B} \rightarrow D^{*} τ^{-} \overlineν_τ)$/$\mathcal{B} (\overline{B} \rightarrow D^{*} \ell^{-} \overlineν_{\ell})$, where $\ell$ is an electron or muon, is measured using a Belle~II data sample with an integrated luminosity of $189~\mathrm{fb}^{-1}$ at the SuperKEKB asymmetric-energy $e^{+} e^{-}$ collider. Data is collected at the $Υ(\mathrm{4S})$ resonance, and one $B$ meson in the $Υ(\mathrm{4S})\rightarrow B\overline{B}$ decay is fully reconstructed in hadronic decay modes. The accompanying signal $B$ meson is reconstructed as $\overline{B}\rightarrow D^{*} τ^{-}\overlineν_τ$ using leptonic $τ$ decays. The normalization decay, $\overline{B}\rightarrow D^{*} \ell^{-} \overlineν_{\ell}$, where $\ell$ is an electron or muon, produces the same observable final state particles. The ratio of branching fractions is extracted in a simultaneous fit to two signal-discriminating variables in both channels and yields $R(D^{*}) = 0.262~_{-0.039}^{+0.041}(\mathrm{stat})~_{-0.032}^{+0.035}(\mathrm{syst})$. This result is consistent with the current world average and with standard model predictions.
△ Less
Submitted 5 January, 2024;
originally announced January 2024.
-
GTA: Guided Transfer of Spatial Attention from Object-Centric Representations
Authors:
SeokHyun Seo,
Jinwoo Hong,
JungWoo Chae,
Kyungyul Kim,
Sangheum Hwang
Abstract:
Utilizing well-trained representations in transfer learning often results in superior performance and faster convergence compared to training from scratch. However, even if such good representations are transferred, a model can easily overfit the limited training dataset and lose the valuable properties of the transferred representations. This phenomenon is more severe in ViT due to its low induct…
▽ More
Utilizing well-trained representations in transfer learning often results in superior performance and faster convergence compared to training from scratch. However, even if such good representations are transferred, a model can easily overfit the limited training dataset and lose the valuable properties of the transferred representations. This phenomenon is more severe in ViT due to its low inductive bias. Through experimental analysis using attention maps in ViT, we observe that the rich representations deteriorate when trained on a small dataset. Motivated by this finding, we propose a novel and simple regularization method for ViT called Guided Transfer of spatial Attention (GTA). Our proposed method regularizes the self-attention maps between the source and target models. A target model can fully exploit the knowledge related to object localization properties through this explicit regularization. Our experimental results show that the proposed GTA consistently improves the accuracy across five benchmark datasets especially when the number of training data is small.
△ Less
Submitted 5 January, 2024;
originally announced January 2024.
-
SPT Clusters with DES and HST Weak Lensing. II. Cosmological Constraints from the Abundance of Massive Halos
Authors:
S. Bocquet,
S. Grandis,
L. E. Bleem,
M. Klein,
J. J. Mohr,
T. Schrabback,
T. M. C. Abbott,
P. A. R. Ade,
M. Aguena,
A. Alarcon,
S. Allam,
S. W. Allen,
O. Alves,
A. Amon,
A. J. Anderson,
J. Annis,
B. Ansarinejad,
J. E. Austermann,
S. Avila,
D. Bacon,
M. Bayliss,
J. A. Beall,
K. Bechtol,
M. R. Becker,
A. N. Bender
, et al. (171 additional authors not shown)
Abstract:
We present cosmological constraints from the abundance of galaxy clusters selected via the thermal Sunyaev-Zel'dovich (SZ) effect in South Pole Telescope (SPT) data with a simultaneous mass calibration using weak gravitational lensing data from the Dark Energy Survey (DES) and the Hubble Space Telescope (HST). The cluster sample is constructed from the combined SPT-SZ, SPTpol ECS, and SPTpol 500d…
▽ More
We present cosmological constraints from the abundance of galaxy clusters selected via the thermal Sunyaev-Zel'dovich (SZ) effect in South Pole Telescope (SPT) data with a simultaneous mass calibration using weak gravitational lensing data from the Dark Energy Survey (DES) and the Hubble Space Telescope (HST). The cluster sample is constructed from the combined SPT-SZ, SPTpol ECS, and SPTpol 500d surveys, and comprises 1,005 confirmed clusters in the redshift range $0.25-1.78$ over a total sky area of 5,200 deg$^2$. We use DES Year 3 weak-lensing data for 688 clusters with redshifts $z<0.95$ and HST weak-lensing data for 39 clusters with $0.6<z<1.7$. The weak-lensing measurements enable robust mass measurements of sample clusters and allow us to empirically constrain the SZ observable--mass relation. For a flat $Λ$CDM cosmology, and marginalizing over the sum of massive neutrinos, we measure $Ω_\mathrm{m}=0.286\pm0.032$, $σ_8=0.817\pm0.026$, and the parameter combination $σ_8\,(Ω_\mathrm{m}/0.3)^{0.25}=0.805\pm0.016$. Our measurement of $S_8\equivσ_8\,\sqrt{Ω_\mathrm{m}/0.3}=0.795\pm0.029$ and the constraint from Planck CMB anisotropies (2018 TT,TE,EE+lowE) differ by $1.1σ$. In combination with that Planck dataset, we place a 95% upper limit on the sum of neutrino masses $\sum m_ν<0.18$ eV. When additionally allowing the dark energy equation of state parameter $w$ to vary, we obtain $w=-1.45\pm0.31$ from our cluster-based analysis. In combination with Planck data, we measure $w=-1.34^{+0.22}_{-0.15}$, or a $2.2σ$ difference with a cosmological constant. We use the cluster abundance to measure $σ_8$ in five redshift bins between 0.25 and 1.8, and we find the results to be consistent with structure growth as predicted by the $Λ$CDM model fit to Planck primary CMB data.
△ Less
Submitted 21 June, 2024; v1 submitted 4 January, 2024;
originally announced January 2024.
-
Deep learning bulk spacetime from boundary optical conductivity
Authors:
Byoungjoon Ahn,
Hyun-Sik Jeong,
Keun-Young Kim,
Kwan Yun
Abstract:
We employ a deep learning method to deduce the \textit{bulk} spacetime from \textit{boundary} optical conductivity. We apply the neural ordinary differential equation technique, tailored for continuous functions such as the metric, to the typical class of holographic condensed matter models featuring broken translations: linear-axion models. We successfully extract the bulk metric from the boundar…
▽ More
We employ a deep learning method to deduce the \textit{bulk} spacetime from \textit{boundary} optical conductivity. We apply the neural ordinary differential equation technique, tailored for continuous functions such as the metric, to the typical class of holographic condensed matter models featuring broken translations: linear-axion models. We successfully extract the bulk metric from the boundary holographic optical conductivity. Furthermore, as an example for real material, we use experimental optical conductivity of $\text{UPd}_2\text{Al}_3$, a representative of heavy fermion metals in strongly correlated electron systems, and construct the corresponding bulk metric. To our knowledge, our work is the first illustration of deep learning bulk spacetime from \textit{boundary} holographic or experimental conductivity data.
△ Less
Submitted 1 January, 2024;
originally announced January 2024.
-
Salem/Pisot Numbers in the Weyl Spectrum
Authors:
Kyounghee Kim
Abstract:
In this article, we define the orbit data of birational maps of $\mathbf{P}^2(\mathbb{C})$ and show that the orbit data determine the dynamical degree by providing the minimal polynomials of the dynamical degree in terms of orbit data. Using this, we determine all Salem numbers and Pisot numbers that appear in the Weyl Spectrum of Coxeter group $W_n$ associated to $E_n$ and the Dynamical Spectrum…
▽ More
In this article, we define the orbit data of birational maps of $\mathbf{P}^2(\mathbb{C})$ and show that the orbit data determine the dynamical degree by providing the minimal polynomials of the dynamical degree in terms of orbit data. Using this, we determine all Salem numbers and Pisot numbers that appear in the Weyl Spectrum of Coxeter group $W_n$ associated to $E_n$ and the Dynamical Spectrum of $\mathbf{P}^2(\mathbb{C})$.
△ Less
Submitted 29 December, 2023;
originally announced December 2023.
-
Quantum Scaling Dimension from the Equivalence principle
Authors:
Yoon-Seok Choun,
Ki-Seok Kim,
Sang-Jin Sin
Abstract:
We propose a method to constrain the scaling dimension of the operators of the strongly interacting systems (SIS) using the holographic setup where the (d+1)-dimensional black hole is used to describe the d-dimensional SIS. We demonstrate the method in the holographic superconductor theory where the operator is a scalar. The idea is to consider the inside as well as the outside of the AdS black ho…
▽ More
We propose a method to constrain the scaling dimension of the operators of the strongly interacting systems (SIS) using the holographic setup where the (d+1)-dimensional black hole is used to describe the d-dimensional SIS. We demonstrate the method in the holographic superconductor theory where the operator is a scalar. The idea is to consider the inside as well as the outside of the AdS black hole in which the gap equations has higher order singularities. Then the equivalence principle requests the solution be smoothly connected at the horizon, which turns out to give a quantized values of the scaling dimension of the condensed operator. This is a pleasant surprise because so far one gets the constraints on the scaling dimension only by a hard analysis with bootstrap ansatz.
△ Less
Submitted 1 February, 2024; v1 submitted 27 December, 2023;
originally announced December 2023.
-
iDet3D: Towards Efficient Interactive Object Detection for LiDAR Point Clouds
Authors:
Dongmin Choi,
Wonwoo Cho,
Kangyeol Kim,
Jaegul Choo
Abstract:
Accurately annotating multiple 3D objects in LiDAR scenes is laborious and challenging. While a few previous studies have attempted to leverage semi-automatic methods for cost-effective bounding box annotation, such methods have limitations in efficiently handling numerous multi-class objects. To effectively accelerate 3D annotation pipelines, we propose iDet3D, an efficient interactive 3D object…
▽ More
Accurately annotating multiple 3D objects in LiDAR scenes is laborious and challenging. While a few previous studies have attempted to leverage semi-automatic methods for cost-effective bounding box annotation, such methods have limitations in efficiently handling numerous multi-class objects. To effectively accelerate 3D annotation pipelines, we propose iDet3D, an efficient interactive 3D object detector. Supporting a user-friendly 2D interface, which can ease the cognitive burden of exploring 3D space to provide click interactions, iDet3D enables users to annotate the entire objects in each scene with minimal interactions. Taking the sparse nature of 3D point clouds into account, we design a negative click simulation (NCS) to improve accuracy by reducing false-positive predictions. In addition, iDet3D incorporates two click propagation techniques to take full advantage of user interactions: (1) dense click guidance (DCG) for keeping user-provided information throughout the network and (2) spatial click propagation (SCP) for detecting other instances of the same class based on the user-specified objects. Through our extensive experiments, we present that our method can construct precise annotations in a few clicks, which shows the practicality as an efficient annotation tool for 3D object detection.
△ Less
Submitted 24 December, 2023;
originally announced December 2023.
-
Large-scale Graph Representation Learning of Dynamic Brain Connectome with Transformers
Authors:
Byung-Hoon Kim,
Jungwon Choi,
EungGu Yun,
Kyungsang Kim,
Xiang Li,
Juho Lee
Abstract:
Graph Transformers have recently been successful in various graph representation learning tasks, providing a number of advantages over message-passing Graph Neural Networks. Utilizing Graph Transformers for learning the representation of the brain functional connectivity network is also gaining interest. However, studies to date have underlooked the temporal dynamics of functional connectivity, wh…
▽ More
Graph Transformers have recently been successful in various graph representation learning tasks, providing a number of advantages over message-passing Graph Neural Networks. Utilizing Graph Transformers for learning the representation of the brain functional connectivity network is also gaining interest. However, studies to date have underlooked the temporal dynamics of functional connectivity, which fluctuates over time. Here, we propose a method for learning the representation of dynamic functional connectivity with Graph Transformers. Specifically, we define the connectome embedding, which holds the position, structure, and time information of the functional connectivity graph, and use Transformers to learn its representation across time. We perform experiments with over 50,000 resting-state fMRI samples obtained from three datasets, which is the largest number of fMRI data used in studies by far. The experimental results show that our proposed method outperforms other competitive baselines in gender classification and age regression tasks based on the functional connectivity extracted from the fMRI data.
△ Less
Submitted 4 December, 2023;
originally announced December 2023.
-
Partitioned neural network approximation for partial differential equations enhanced with Lagrange multipliers and localized loss functions
Authors:
Deok-Kyu Jang,
Kyungsoo Kim,
Hyea Hyun Kim
Abstract:
Partitioned neural network functions are used to approximate the solution of partial differential equations. The problem domain is partitioned into non-overlapping subdomains and the partitioned neural network functions are defined on the given non-overlapping subdomains. Each neural network function then approximates the solution in each subdomain. To obtain the convergent neural network solution…
▽ More
Partitioned neural network functions are used to approximate the solution of partial differential equations. The problem domain is partitioned into non-overlapping subdomains and the partitioned neural network functions are defined on the given non-overlapping subdomains. Each neural network function then approximates the solution in each subdomain. To obtain the convergent neural network solution, certain continuity conditions on the partitioned neural network functions across the subdomain interface need to be included in the loss function, that is used to train the parameters in the neural network functions. In our work, by introducing suitable interface values, the loss function is reformulated into a sum of localized loss functions and each localized loss function is used to train the corresponding local neural network parameters. In addition, to accelerate the neural network solution convergence, the localized loss function is enriched with an augmented Lagrangian term, where the interface condition and the boundary condition are enforced as constraints on the local solutions by using Lagrange multipliers. The local neural network parameters and Lagrange multipliers are then found by optimizing the localized loss function. To take the advantage of the localized loss function for the parallel computation, an iterative algorithm is also proposed. For the proposed algorithms, their training performance and convergence are numerically studied for various test examples.
△ Less
Submitted 21 December, 2023;
originally announced December 2023.
-
FedSZ: Leveraging Error-Bounded Lossy Compression for Federated Learning Communications
Authors:
Grant Wilkins,
Sheng Di,
Jon C. Calhoun,
Zilinghan Li,
Kibaek Kim,
Robert Underwood,
Richard Mortier,
Franck Cappello
Abstract:
With the promise of federated learning (FL) to allow for geographically-distributed and highly personalized services, the efficient exchange of model updates between clients and servers becomes crucial. FL, though decentralized, often faces communication bottlenecks, especially in resource-constrained scenarios. Existing data compression techniques like gradient sparsification, quantization, and p…
▽ More
With the promise of federated learning (FL) to allow for geographically-distributed and highly personalized services, the efficient exchange of model updates between clients and servers becomes crucial. FL, though decentralized, often faces communication bottlenecks, especially in resource-constrained scenarios. Existing data compression techniques like gradient sparsification, quantization, and pruning offer some solutions, but may compromise model performance or necessitate expensive retraining. In this paper, we introduce FedSZ, a specialized lossy-compression algorithm designed to minimize the size of client model updates in FL. FedSZ incorporates a comprehensive compression pipeline featuring data partitioning, lossy and lossless compression of model parameters and metadata, and serialization. We evaluate FedSZ using a suite of error-bounded lossy compressors, ultimately finding SZ2 to be the most effective across various model architectures and datasets including AlexNet, MobileNetV2, ResNet50, CIFAR-10, Caltech101, and Fashion-MNIST. Our study reveals that a relative error bound 1E-2 achieves an optimal tradeoff, compressing model states between 5.55-12.61x while maintaining inference accuracy within <0.5% of uncompressed results. Additionally, the runtime overhead of FedSZ is <4.7% or between of the wall-clock communication-round time, a worthwhile trade-off for reducing network transfer times by an order of magnitude for networks bandwidths <500Mbps. Intriguingly, we also find that the error introduced by FedSZ could potentially serve as a source of differentially private noise, opening up new avenues for privacy-preserving FL.
△ Less
Submitted 24 April, 2024; v1 submitted 20 December, 2023;
originally announced December 2023.
-
Search for the $e^+e^-\toη_{b}(1S)ω$ and $e^+e^-\toχ_{b0}(1P)ω$ processes at $\sqrt{s}=10.745\,\mathrm{GeV}$
Authors:
Belle II Collaboration,
I. Adachi,
L. Aggarwal,
H. Ahmed,
H. Aihara,
N. Akopov,
A. Aloisio,
N. Anh Ky,
D. M. Asner,
H. Atmacan,
T. Aushev,
V. Aushev,
M. Aversano,
V. Babu,
H. Bae,
S. Bahinipati,
P. Bambade,
Sw. Banerjee,
M. Barrett,
J. Baudot,
M. Bauer,
A. Baur,
A. Beaubien,
F. Becherer,
J. Becker
, et al. (397 additional authors not shown)
Abstract:
We search for the $e^+e^-\toη_b(1S)ω$ and $e^+e^-\toχ_{b0}(1P)ω$ processes at a center-of-mass energy of 10.745 GeV, which is close to the peak of the $Υ(10753)$ state. We use data collected by the Belle II experiment during a special run, corresponding to an integrated luminosity of $9.8\,\mathrm{fb}^{-1}$. We reconstruct $ω\toπ^+π^-π^0$ decays and use the $ω$ meson's recoil mass to search for th…
▽ More
We search for the $e^+e^-\toη_b(1S)ω$ and $e^+e^-\toχ_{b0}(1P)ω$ processes at a center-of-mass energy of 10.745 GeV, which is close to the peak of the $Υ(10753)$ state. We use data collected by the Belle II experiment during a special run, corresponding to an integrated luminosity of $9.8\,\mathrm{fb}^{-1}$. We reconstruct $ω\toπ^+π^-π^0$ decays and use the $ω$ meson's recoil mass to search for the signals. We do not find evidence for either process, and set upper limits on the corresponding Born-level cross sections of 2.5 pb and 7.8 pb, respectively, at the 90% confidence level. The $χ_{b0}(1P)ω$ limit is the result of a combination of this analysis and a previous search using full reconstruction.
△ Less
Submitted 20 December, 2023;
originally announced December 2023.
-
Singular Hall response from a correlated ferromagnetic flat nodal-line semimetal
Authors:
Woohyun Cho,
Yoon-Gu Kang,
Jaehun Cha,
Dong Hyun David Lee,
Do Hoon Kiem,
Jaewhan Oh,
Jongho Park,
Changyoung Kim,
Yongsoo Yang,
Yeong Kwan Kim,
Myung Joon Han,
Heejun Yang
Abstract:
Topological quantum phases have been largely understood in weakly correlated systems, which have identified various quantum phenomena such as spin Hall effect, protected transport of helical fermions, and topological superconductivity. Robust ferromagnetic order in correlated topological materials particularly attracts attention, as it can provide a versatile platform for novel quantum devices. He…
▽ More
Topological quantum phases have been largely understood in weakly correlated systems, which have identified various quantum phenomena such as spin Hall effect, protected transport of helical fermions, and topological superconductivity. Robust ferromagnetic order in correlated topological materials particularly attracts attention, as it can provide a versatile platform for novel quantum devices. Here, we report singular Hall response arising from a unique band structure of flat topological nodal lines in combination with electron correlation in an itinerant, van der Waals ferromagnetic semimetal, Fe3GaTe2, with a high Curie temperature of Tc=360 K. High anomalous Hall conductivity violating the conventional scaling, resistivity upturn at low temperature, and a large Sommerfeld coefficient are observed in Fe3GaTe2, which implies heavy fermion features in this ferromagnetic topological material. Our circular dichroism in angle-resolved photoemission spectroscopy and theoretical calculations support the original electronic features in the material. Thus, low-dimensional Fe3GaTe2 with electronic correlation, topology, and room-temperature ferromagnetic order appears to be a promising candidate for robust quantum devices.
△ Less
Submitted 20 December, 2023;
originally announced December 2023.
-
The pseudochiral Fermi surface of $α$-RuI$_3$
Authors:
Alex Louat,
Matthew D. Watson,
Timur K. Kim,
Danrui Ni,
Robert J. Cava,
Cephise Cacho
Abstract:
In continuation of research into RuCl$_3$ and RuBr$_3$ as potential quantum spin liquids, a phase with unique magnetic order characterised by long-range quantum entanglement and fractionalised excitations, the compound RuI$_3$ has been recently synthesised. Here, we show RuI$_3$ is a moderately correlated metal with two bands crossing the Fermi level, implying the absence of any quantum spin liqui…
▽ More
In continuation of research into RuCl$_3$ and RuBr$_3$ as potential quantum spin liquids, a phase with unique magnetic order characterised by long-range quantum entanglement and fractionalised excitations, the compound RuI$_3$ has been recently synthesised. Here, we show RuI$_3$ is a moderately correlated metal with two bands crossing the Fermi level, implying the absence of any quantum spin liquids phase. We find that the Fermi surface as measured or calculated for a 2D ($k_\text{x},k_\text{y}$) slice at any $k_\text{z}$ lacks mirror symmetry, i.e. is pseudochiral. We link this phenomenon to the ABC stacking in the R$\bar{3}$ space group of $α$-RuI$_3$, which is achiral but lacks any mirror or glide symmetries. We further provide a formal framework for understanding when such a pseudochiral electronic structure may be observed.
△ Less
Submitted 19 December, 2023;
originally announced December 2023.
-
Origin of chirality in transition-metal dichalcogenides
Authors:
Kwangrae Kim,
Hyun-Woo J. Kim,
Seunghyeok Ha,
Hoon Kim,
Jin-Kwang Kim,
Jaehwon Kim,
Hyunsung Kim,
Junyoung Kwon,
Jihoon Seol,
Saegyeol Jung,
Changyoung Kim,
Ahmet Alatas,
Ayman Said,
Michael Merz,
Matthieu Le Tacon,
Jin Mo Bok,
Ki-Seok Kim,
B. J. Kim
Abstract:
Chirality is a ubiquitous phenomenon in which a symmetry between left- and right-handed objects is broken, examples in nature ranging from subatomic particles and molecules to living organisms. In particle physics, the weak force is responsible for the symmetry breaking and parity violation in beta decay, but in condensed matter systems interactions that lead to chirality remain poorly understood.…
▽ More
Chirality is a ubiquitous phenomenon in which a symmetry between left- and right-handed objects is broken, examples in nature ranging from subatomic particles and molecules to living organisms. In particle physics, the weak force is responsible for the symmetry breaking and parity violation in beta decay, but in condensed matter systems interactions that lead to chirality remain poorly understood. Here, we unravel the mechanism of chiral charge density wave formation in the transition-metal dichalcogenide 1T-TiSe2. Using representation analysis, we show that charge density modulations and ionic displacements, which transform as a continuous scalar field and a vector field on a discrete lattice, respectively, follow different irreducible representations of the space group, despite the fact that they propagate with the same wave-vectors and are strongly coupled to each other. This charge-lattice symmetry frustration is resolved by further breaking of all symmetries not common to both sectors through induced lattice distortions, thus leading to chirality. Our theory is verified using Raman spectroscopy and inelastic x-ray scattering, which reveal that all but translation symmetries are broken at a level not resolved by state-of-the-art diffraction techniques.
△ Less
Submitted 19 December, 2023;
originally announced December 2023.
-
Conceptual Design of a Low-Energy Ion Beam Storage Ring and a Recoil Separator to Study Radiative Neutron Capture by Radioactive Ions
Authors:
Kihong Pak,
Barry Davids,
Yong Kyun Kim
Abstract:
Recently, the TRIUMF Storage Ring (TRISR), a storage ring for the existing Isotope Separator and Accelerator-I (ISAC-I) radioactive ion beam facility at TRIUMF, was proposed. It may be possible to directly measure neutron-induced radiative capture reactions in inverse kinematics by combining the ring with a high-flux neutron generator as the neutron target. Herein, we present the conceptual design…
▽ More
Recently, the TRIUMF Storage Ring (TRISR), a storage ring for the existing Isotope Separator and Accelerator-I (ISAC-I) radioactive ion beam facility at TRIUMF, was proposed. It may be possible to directly measure neutron-induced radiative capture reactions in inverse kinematics by combining the ring with a high-flux neutron generator as the neutron target. Herein, we present the conceptual design of a low-energy ion storage ring as well as a fusion product extraction system with a Wien filter and recoil separator for detecting neutron capture products based on ion optical calculations and particle-tracking simulations.
△ Less
Submitted 19 December, 2023;
originally announced December 2023.
-
Implementing biosensing based user preference visualisation in architectural spaces
Authors:
Mi Kyoung Kim
Abstract:
This study delves into the interplay between architectural spaces and human emotions, leveraging the emergent field of neuroarchitecture. It examines the functional and aesthetic influence of architectural design on individual users, with a focus on biosensing data such as brainwave and eye tracking information to understand user preferences.
This study delves into the interplay between architectural spaces and human emotions, leveraging the emergent field of neuroarchitecture. It examines the functional and aesthetic influence of architectural design on individual users, with a focus on biosensing data such as brainwave and eye tracking information to understand user preferences.
△ Less
Submitted 12 December, 2023;
originally announced December 2023.
-
Analysing user sentiment data for architectural interior spaces
Authors:
Mi Kyoung Kim
Abstract:
This study aims to develop a data driven system to enhance the analysis and improvement of user experiences in interior spaces, acknowledging the significant impact of design on individuals health, productivity, and quality of life.
This study aims to develop a data driven system to enhance the analysis and improvement of user experiences in interior spaces, acknowledging the significant impact of design on individuals health, productivity, and quality of life.
△ Less
Submitted 12 December, 2023;
originally announced December 2023.
-
Generative Context-aware Fine-tuning of Self-supervised Speech Models
Authors:
Suwon Shon,
Kwangyoun Kim,
Prashant Sridhar,
Yi-Te Hsu,
Shinji Watanabe,
Karen Livescu
Abstract:
When performing tasks like automatic speech recognition or spoken language understanding for a given utterance, access to preceding text or audio provides contextual information can improve performance. Considering the recent advances in generative large language models (LLM), we hypothesize that an LLM could generate useful context information using the preceding text. With appropriate prompts, L…
▽ More
When performing tasks like automatic speech recognition or spoken language understanding for a given utterance, access to preceding text or audio provides contextual information can improve performance. Considering the recent advances in generative large language models (LLM), we hypothesize that an LLM could generate useful context information using the preceding text. With appropriate prompts, LLM could generate a prediction of the next sentence or abstractive text like titles or topics. In this paper, we study the use of LLM-generated context information and propose an approach to distill the generated information during fine-tuning of self-supervised speech models, which we refer to as generative context-aware fine-tuning. This approach allows the fine-tuned model to make improved predictions without access to the true surrounding segments or to the LLM at inference time, while requiring only a very small additional context module. We evaluate the proposed approach using the SLUE and Libri-light benchmarks for several downstream tasks: automatic speech recognition, named entity recognition, and sentiment analysis. The results show that generative context-aware fine-tuning outperforms a context injection fine-tuning approach that accesses the ground-truth previous text, and is competitive with a generative context injection fine-tuning approach that requires the LLM at inference time.
△ Less
Submitted 15 December, 2023;
originally announced December 2023.
-
Identified charged-hadron production in $p$$+$Al, $^3$He$+$Au, and Cu$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV and in U$+$U collisions at $\sqrt{s_{_{NN}}}=193$ GeV
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
J. Alexander,
M. Alfred,
V. Andrieux,
K. Aoki,
N. Apadula,
H. Asano,
E. T. Atomssa,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
X. Bai,
N. S. Bandara,
B. Bannier,
K. N. Barish,
S. Bathe,
V. Baublis
, et al. (456 additional authors not shown)
Abstract:
The PHENIX experiment has performed a systematic study of identified charged-hadron ($π^\pm$, $K^\pm$, $p$, $\bar{p}$) production at midrapidity in $p$$+$Al, $^3$He$+$Au, Cu$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV and U$+$U collisions at $\sqrt{s_{_{NN}}}=193$ GeV. Identified charged-hadron invariant transverse-momentum ($p_T$) and transverse-mass ($m_T$) spectra are presented and interprete…
▽ More
The PHENIX experiment has performed a systematic study of identified charged-hadron ($π^\pm$, $K^\pm$, $p$, $\bar{p}$) production at midrapidity in $p$$+$Al, $^3$He$+$Au, Cu$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV and U$+$U collisions at $\sqrt{s_{_{NN}}}=193$ GeV. Identified charged-hadron invariant transverse-momentum ($p_T$) and transverse-mass ($m_T$) spectra are presented and interpreted in terms of radially expanding thermalized systems. The particle ratios of $K/π$ and $p/π$ have been measured in different centrality ranges of large (Cu$+$Au, U$+$U) and small ($p$$+$Al, $^3$He$+$Au) collision systems. The values of $K/π$ ratios measured in all considered collision systems were found to be consistent with those measured in $p$$+$$p$ collisions. However the values of $p/π$ ratios measured in large collision systems reach the values of $\approx0.6$, which is $\approx2$ times larger than in $p$$+$$p$ collisions. These results can be qualitatively understood in terms of the baryon enhancement expected from hadronization by recombination. Identified charged-hadron nuclear-modification factors ($R_{AB}$) are also presented. Enhancement of proton $R_{AB}$ values over meson $R_{AB}$ values was observed in central $^3$He$+$Au, Cu$+$Au, and U$+$U collisions. The proton $R_{AB}$ values measured in $p$$+$Al collision system were found to be consistent with $R_{AB}$ values of $φ$, $π^\pm$, $K^\pm$, and $π^0$ mesons, which may indicate that the size of the system produced in $p$$+$Al collisions is too small for recombination to cause a noticeable increase in proton production.
△ Less
Submitted 22 May, 2024; v1 submitted 14 December, 2023;
originally announced December 2023.
-
A Rydberg-atom approach to the integer factorization problem
Authors:
Juyoung Park,
Seokho Jeong,
Minhyuk Kim,
Kangheun Kim,
Andrew Byun,
Louis Vignoli,
Louis-Paul Henry,
Loïc Henriet,
Jaewook Ahn
Abstract:
The task of factoring integers poses a significant challenge in modern cryptography, and quantum computing holds the potential to efficiently address this problem compared to classical algorithms. Thus, it is crucial to develop quantum computing algorithms to address this problem. This study introduces a quantum approach that utilizes Rydberg atoms to tackle the factorization problem. Experimental…
▽ More
The task of factoring integers poses a significant challenge in modern cryptography, and quantum computing holds the potential to efficiently address this problem compared to classical algorithms. Thus, it is crucial to develop quantum computing algorithms to address this problem. This study introduces a quantum approach that utilizes Rydberg atoms to tackle the factorization problem. Experimental demonstrations are conducted for the factorization of small composite numbers such as $6 = 2 \times 3$, $15 = 3 \times 5$, and $35 = 5 \times 7$. This approach involves employing Rydberg-atom graphs to algorithmically program binary multiplication tables, yielding many-body ground states that represent superpositions of factoring solutions. Subsequently, these states are probed using quantum adiabatic computing. Limitations of this method are discussed, specifically addressing the scalability of current Rydberg quantum computing for the intricate computational problem.
△ Less
Submitted 31 January, 2024; v1 submitted 14 December, 2023;
originally announced December 2023.
-
Enabling End-to-End Secure Federated Learning in Biomedical Research on Heterogeneous Computing Environments with APPFLx
Authors:
Trung-Hieu Hoang,
Jordan Fuhrman,
Ravi Madduri,
Miao Li,
Pranshu Chaturvedi,
Zilinghan Li,
Kibaek Kim,
Minseok Ryu,
Ryan Chard,
E. A. Huerta,
Maryellen Giger
Abstract:
Facilitating large-scale, cross-institutional collaboration in biomedical machine learning projects requires a trustworthy and resilient federated learning (FL) environment to ensure that sensitive information such as protected health information is kept confidential. In this work, we introduce APPFLx, a low-code FL framework that enables the easy setup, configuration, and running of FL experiment…
▽ More
Facilitating large-scale, cross-institutional collaboration in biomedical machine learning projects requires a trustworthy and resilient federated learning (FL) environment to ensure that sensitive information such as protected health information is kept confidential. In this work, we introduce APPFLx, a low-code FL framework that enables the easy setup, configuration, and running of FL experiments across organizational and administrative boundaries while providing secure end-to-end communication, privacy-preserving functionality, and identity management. APPFLx is completely agnostic to the underlying computational infrastructure of participating clients. We demonstrate the capability of APPFLx as an easy-to-use framework for accelerating biomedical studies across institutions and healthcare systems while maintaining the protection of private medical data in two case studies: (1) predicting participant age from electrocardiogram (ECG) waveforms, and (2) detecting COVID-19 disease from chest radiographs. These experiments were performed securely across heterogeneous compute resources, including a mixture of on-premise high-performance computing and cloud computing, and highlight the role of federated learning in improving model generalizability and performance when aggregating data from multiple healthcare systems. Finally, we demonstrate that APPFLx serves as a convenient and easy-to-use framework for accelerating biomedical studies across institutions and healthcare system while maintaining the protection of private medical data.
△ Less
Submitted 14 December, 2023;
originally announced December 2023.
-
Scintillation characteristics of an undoped CsI crystal at low-temperature for dark matter search
Authors:
W. K. Kim,
H. Y. Lee,
K. W. Kim,
Y. J. Ko,
J. A. Jeon,
H. J. Kim,
H. S. Lee
Abstract:
The scintillation characteristics of an undoped CsI crystal with dimensions of 5.8 mm $\times$ 5.9 mm $\times$ 7.0 mm, corresponding to a weight of 1.0 g, were studied by directly coupling two silicon photomultipliers (SiPMs) over a temperature range from room temperature (300 K) to a low temperature of 86 K. The scintillation decay time and light output were measured using x-ray (23 keV) and gamm…
▽ More
The scintillation characteristics of an undoped CsI crystal with dimensions of 5.8 mm $\times$ 5.9 mm $\times$ 7.0 mm, corresponding to a weight of 1.0 g, were studied by directly coupling two silicon photomultipliers (SiPMs) over a temperature range from room temperature (300 K) to a low temperature of 86 K. The scintillation decay time and light output were measured using x-ray (23 keV) and gamma-ray (88 keV) peaks from a $^{109}$Cd radioactive source. An increase in decay time was observed as the temperature decreased from room temperature to 86 K, ranging from 76 ns to 605 ns. Correspondingly, the light output increased as well, reaching 37.9 $\pm$ 1.5 photoelectrons per keV electron-equivalent at 86 K, which is approximately 18 times higher than the light yield at room temperature. Leveraging the significantly enhanced scintillation light output of the undpoed CsI crystal at the low temperature, coupling it with SiPMs makes it a promising candidate for the future dark matter search detector, benefiting from the low threshold owing to the high light output. The odd proton numbers from both cesium and iodine provide an advantage for the WIMP-proton spin-dependent interaction. We evaluated the sensitivity of low-mass dark matter on WIMP-proton spin-dependent interaction with the Migdal process, assuming 200 kg of undoped CsI crystals for the dark matter search. We conclude that undoped CsI crystal detectors exhibit world-competitive sensitivities for low-mass dark matter detection, particularly for the WIMP-proton spin-dependent interaction.
△ Less
Submitted 13 December, 2023;
originally announced December 2023.
-
Construction of Superconducting Dome and Emergence of Quantum Critical Region in Holography
Authors:
Yunseok Seo,
Sejin Kim,
Kyung Kiu Kim
Abstract:
In this work, we investigate an extended model of holographic superconductor by a non-linear electrodynamic interaction coupled to a complex scalar field. This non-linear interaction term can make a quantum phase transition at zero temperature with finite charge carrier density. By solving full equations of motion, we can construct various shapes of the superconducting phase in the phase diagram.…
▽ More
In this work, we investigate an extended model of holographic superconductor by a non-linear electrodynamic interaction coupled to a complex scalar field. This non-linear interaction term can make a quantum phase transition at zero temperature with finite charge carrier density. By solving full equations of motion, we can construct various shapes of the superconducting phase in the phase diagram. With a specific choice of interaction coefficients, we can construct a phase diagram with a superconducting dome. Also, we find a new geometric solution inside the superconducting dome, which turns out to be a Lifshitz-type geometry. This geometry is characterized by a dynamical critical exponent, which plays a crucial role near the quantum critical point. We refer to this region in the phase diagram as a `quantum critical region.'
△ Less
Submitted 20 December, 2023; v1 submitted 11 December, 2023;
originally announced December 2023.
-
Small-ball constants, and exceptional flat points of SPDEs
Authors:
Davar Khoshnevisan,
Kunwoo Kim,
Carl Mueller
Abstract:
We study small-ball probabilities for the stochastic heat equation with multiplicative noise in the moderate-deviations regime. We prove the existence of a small-ball constant and related it to other known quantities in the literature. These small-ball estimates are known to imply Chung-type laws of the iterated logarithm (LIL) at typical spatial points; these points can be thought of as "points o…
▽ More
We study small-ball probabilities for the stochastic heat equation with multiplicative noise in the moderate-deviations regime. We prove the existence of a small-ball constant and related it to other known quantities in the literature. These small-ball estimates are known to imply Chung-type laws of the iterated logarithm (LIL) at typical spatial points; these points can be thought of as "points of flat growth". For this result in a similar context in SPDEs see, for example, the recent work of Chen \cite{Ch2023}. We establish the existence of a new family of exceptional spatial points where the Chung-type LIL fails.
△ Less
Submitted 10 December, 2023;
originally announced December 2023.
-
Enhancing the Electron Pair Approximation with Measurements on Trapped Ion Quantum Computers
Authors:
Luning Zhao,
Joshua Goings,
Qingfeng Wang,
Kyujin Shin,
Woomin Kyoung,
Seunghyo Noh,
Young Min Rhee,
Kyungmin Kim
Abstract:
The electron pair approximation offers a resource efficient variational quantum eigensolver (VQE) approach for quantum chemistry simulations on quantum computers. With the number of entangling gates scaling quadratically with system size and a constant energy measurement overhead, the orbital optimized unitary pair coupled cluster double (oo-upCCD) ansatz strikes a balance between accuracy and eff…
▽ More
The electron pair approximation offers a resource efficient variational quantum eigensolver (VQE) approach for quantum chemistry simulations on quantum computers. With the number of entangling gates scaling quadratically with system size and a constant energy measurement overhead, the orbital optimized unitary pair coupled cluster double (oo-upCCD) ansatz strikes a balance between accuracy and efficiency on today's quantum computers. However, the electron pair approximation makes the method incapable of producing quantitatively accurate energy predictions. In order to improve the accuracy without increasing the circuit depth, we explore the idea of reduced density matrix (RDM) based second order perturbation theory (PT2) as an energetic correction to electron pair approximation. The new approach takes into account of the broken-pair energy contribution that is missing in pair-correlated electron simulations, while maintaining the computational advantages of oo-upCCD ansatz. In dissociations of N$_2$, Li$_2$O, and chemical reactions such as the unimolecular decomposition of CH$_2$OH$^+$ and the \snTwo reaction of CH$_3$I $+$ Br$^-$, the method significantly improves the accuracy of energy prediction. On two generations of the IonQ's trapped ion quantum computers, Aria and Forte, we find that unlike the VQE energy, the PT2 energy correction is highly noise-resilient. By applying a simple error mitigation approach based on post-selection solely on the VQE energies, the predicted VQE-PT2 energy differences between reactants, transition state, and products are in excellent agreement with noise-free simulators.
△ Less
Submitted 8 December, 2023;
originally announced December 2023.
-
Transferable Candidate Proposal with Bounded Uncertainty
Authors:
Kyeongryeol Go,
Kye-Hyeon Kim
Abstract:
From an empirical perspective, the subset chosen through active learning cannot guarantee an advantage over random sampling when transferred to another model. While it underscores the significance of verifying transferability, experimental design from previous works often neglected that the informativeness of a data subset can change over model configurations. To tackle this issue, we introduce a…
▽ More
From an empirical perspective, the subset chosen through active learning cannot guarantee an advantage over random sampling when transferred to another model. While it underscores the significance of verifying transferability, experimental design from previous works often neglected that the informativeness of a data subset can change over model configurations. To tackle this issue, we introduce a new experimental design, coined as Candidate Proposal, to find transferable data candidates from which active learning algorithms choose the informative subset. Correspondingly, a data selection algorithm is proposed, namely Transferable candidate proposal with Bounded Uncertainty (TBU), which constrains the pool of transferable data candidates by filtering out the presumably redundant data points based on uncertainty estimation. We verified the validity of TBU in image classification benchmarks, including CIFAR-10/100 and SVHN. When transferred to different model configurations, TBU consistency improves performance in existing active learning algorithms. Our code is available at https://github.com/gokyeongryeol/TBU.
△ Less
Submitted 7 December, 2023;
originally announced December 2023.
-
Visual Hindsight Self-Imitation Learning for Interactive Navigation
Authors:
Kibeom Kim,
Kisung Shin,
Min Whoo Lee,
Moonhoen Lee,
Minsu Lee,
Byoung-Tak Zhang
Abstract:
Interactive visual navigation tasks, which involve following instructions to reach and interact with specific targets, are challenging not only because successful experiences are very rare but also because the complex visual inputs require a substantial number of samples. Previous methods for these tasks often rely on intricately designed dense rewards or the use of expensive expert data for imita…
▽ More
Interactive visual navigation tasks, which involve following instructions to reach and interact with specific targets, are challenging not only because successful experiences are very rare but also because the complex visual inputs require a substantial number of samples. Previous methods for these tasks often rely on intricately designed dense rewards or the use of expensive expert data for imitation learning. To tackle these challenges, we propose a novel approach, Visual Hindsight Self-Imitation Learning (VHS) for enhancing sample efficiency through hindsight goal re-labeling and self-imitation. We also introduce a prototypical goal embedding method derived from experienced goal observations, that is particularly effective in vision-based and partially observable environments. This embedding technique allows the agent to visually reinterpret its unsuccessful attempts, enabling vision-based goal re-labeling and self-imitation from enhanced successful experiences. Experimental results show that VHS outperforms existing techniques in interactive visual navigation tasks, confirming its superior performance and sample efficiency.
△ Less
Submitted 5 December, 2023;
originally announced December 2023.
-
Seamless monolithic three-dimensional integration of single-crystalline films by growth
Authors:
Ki Seok Kim,
Seunghwan Seo,
Junyoung Kwon,
Doyoon Lee,
Changhyun Kim,
Jung-El Ryu,
Jekyung Kim,
Min-Kyu Song,
Jun Min Suh,
Hang-Gyo Jung,
Youhwan Jo,
Hogeun Ahn,
Sangho Lee,
Kyeongjae Cho,
Jongwook Jeon,
Minsu Seol,
Jin-Hong Park,
Sang Won Kim,
Jeehwan Kim
Abstract:
The demand for the three-dimensional (3D) integration of electronic components is on a steady rise. The through-silicon-via (TSV) technique emerges as the only viable method for integrating single-crystalline device components in a 3D format, despite encountering significant processing challenges. While monolithic 3D (M3D) integration schemes show promise, the seamless connection of single-crystal…
▽ More
The demand for the three-dimensional (3D) integration of electronic components is on a steady rise. The through-silicon-via (TSV) technique emerges as the only viable method for integrating single-crystalline device components in a 3D format, despite encountering significant processing challenges. While monolithic 3D (M3D) integration schemes show promise, the seamless connection of single-crystalline semiconductors without intervening wafers has yet to be demonstrated. This challenge arises from the inherent difficulty of growing single crystals on amorphous or polycrystalline surfaces post the back-end-of-the-line process at low temperatures to preserve the underlying circuitry. Consequently, a practical growth-based solution for M3D of single crystals remains elusive. Here, we present a method for growing single-crystalline channel materials, specifically composed of transition metal dichalcogenides, on amorphous and polycrystalline surfaces at temperatures lower than 400 °C. Building on this developed technique, we demonstrate the seamless monolithic integration of vertical single-crystalline logic transistor arrays. This accomplishment leads to the development of unprecedented vertical CMOS arrays, thereby constructing vertical inverters. Ultimately, this achievement sets the stage to pave the way for M3D integration of various electronic and optoelectronic hardware in the form of single crystals.
△ Less
Submitted 6 December, 2023; v1 submitted 5 December, 2023;
originally announced December 2023.
-
Search for the semileptonic decays $Ξ_c^0 \to Ξ^0\ell^+\ell^-$ at Belle
Authors:
Belle Collaboration,
J. X. Cui,
Y. B. Li,
C. P. Shen,
I. Adachi,
H. Aihara,
S. Al Said,
D. M. Asner,
T. Aushev,
R. Ayad,
V. Babu,
S. Bahinipati,
Sw. Banerjee,
M. Bauer,
P. Behera,
K. Belous,
J. Bennett,
M. Bessner,
B. Bhuyan,
T. Bilka,
D. Biswas,
A. Bobrov,
D. Bodrov,
J. Borah,
M. Bračko
, et al. (141 additional authors not shown)
Abstract:
Using the full data sample of 980 $\mathrm{fb}^{-1}$ collected with the Belle detector at the KEKB asymmetric energy electron-positron collider, we report the results of the first search for the rare semileptonic decays $Ξ_c^0 \to Ξ^0\ell^+\ell^-$ ($\ell=e$ or $μ)$. No significant signals are observed in the $Ξ^0\ell^+\ell^-$ invariant-mass distributions. Taking the decay $Ξ_c^0 \to Ξ^- π^+$ as th…
▽ More
Using the full data sample of 980 $\mathrm{fb}^{-1}$ collected with the Belle detector at the KEKB asymmetric energy electron-positron collider, we report the results of the first search for the rare semileptonic decays $Ξ_c^0 \to Ξ^0\ell^+\ell^-$ ($\ell=e$ or $μ)$. No significant signals are observed in the $Ξ^0\ell^+\ell^-$ invariant-mass distributions. Taking the decay $Ξ_c^0 \to Ξ^- π^+$ as the normalization mode, we report 90\% credibility upper limits on the branching fraction ratios ${\cal{B}} (Ξ_c^0 \to Ξ^0 e^+ e^-) / {\cal{B}}(Ξ_c^0\to Ξ^-π^+) < 6.7 \times 10^{-3}$ and ${\cal{B}} (Ξ_c^0 \to Ξ^0 μ^+ μ^-) / {\cal{B}}(Ξ_c^0\to Ξ^-π^+) < 4.3 \times 10^{-3}$ based on the phase-space assumption for signal decays. The 90\% credibility upper limits on the absolute branching fractions of ${\cal{B}} (Ξ_c^0 \to Ξ^0 e^+ e^-)$ and ${\cal{B}} (Ξ_c^0 \to Ξ^0 μ^+ μ^-)$ are found to be $9.9 \times 10^{-5}$ and $6.5 \times 10^{-5}$, respectively.
△ Less
Submitted 5 December, 2023; v1 submitted 5 December, 2023;
originally announced December 2023.
-
DRAFT: Dense Retrieval Augmented Few-shot Topic classifier Framework
Authors:
Keonwoo Kim,
Younggun Lee
Abstract:
With the growing volume of diverse information, the demand for classifying arbitrary topics has become increasingly critical. To address this challenge, we introduce DRAFT, a simple framework designed to train a classifier for few-shot topic classification. DRAFT uses a few examples of a specific topic as queries to construct Customized dataset with a dense retriever model. Multi-query retrieval (…
▽ More
With the growing volume of diverse information, the demand for classifying arbitrary topics has become increasingly critical. To address this challenge, we introduce DRAFT, a simple framework designed to train a classifier for few-shot topic classification. DRAFT uses a few examples of a specific topic as queries to construct Customized dataset with a dense retriever model. Multi-query retrieval (MQR) algorithm, which effectively handles multiple queries related to a specific topic, is applied to construct the Customized dataset. Subsequently, we fine-tune a classifier using the Customized dataset to identify the topic. To demonstrate the efficacy of our proposed approach, we conduct evaluations on both widely used classification benchmark datasets and manually constructed datasets with 291 diverse topics, which simulate diverse contents encountered in real-world applications. DRAFT shows competitive or superior performance compared to baselines that use in-context learning, such as GPT-3 175B and InstructGPT 175B, on few-shot topic classification tasks despite having 177 times fewer parameters, demonstrating its effectiveness.
△ Less
Submitted 5 December, 2023;
originally announced December 2023.
-
PolyFit: A Peg-in-hole Assembly Framework for Unseen Polygon Shapes via Sim-to-real Adaptation
Authors:
Geonhyup Lee,
Joosoon Lee,
Sangjun Noh,
Minhwan Ko,
Kangmin Kim,
Kyoobin Lee
Abstract:
The study addresses the foundational and challenging task of peg-in-hole assembly in robotics, where misalignments caused by sensor inaccuracies and mechanical errors often result in insertion failures or jamming. This research introduces PolyFit, representing a paradigm shift by transitioning from a reinforcement learning approach to a supervised learning methodology. PolyFit is a Force/Torque (F…
▽ More
The study addresses the foundational and challenging task of peg-in-hole assembly in robotics, where misalignments caused by sensor inaccuracies and mechanical errors often result in insertion failures or jamming. This research introduces PolyFit, representing a paradigm shift by transitioning from a reinforcement learning approach to a supervised learning methodology. PolyFit is a Force/Torque (F/T)-based supervised learning framework designed for 5-DoF peg-in-hole assembly. It utilizes F/T data for accurate extrinsic pose estimation and adjusts the peg pose to rectify misalignments. Extensive training in a simulated environment involves a dataset encompassing a diverse range of peg-hole shapes, extrinsic poses, and their corresponding contact F/T readings. To enhance extrinsic pose estimation, a multi-point contact strategy is integrated into the model input, recognizing that identical F/T readings can indicate different poses. The study proposes a sim-to-real adaptation method for real-world application, using a sim-real paired dataset to enable effective generalization to complex and unseen polygon shapes. PolyFit achieves impressive peg-in-hole success rates of 97.3% and 96.3% for seen and unseen shapes in simulations, respectively. Real-world evaluations further demonstrate substantial success rates of 86.7% and 85.0%, highlighting the robustness and adaptability of the proposed method.
△ Less
Submitted 5 December, 2023;
originally announced December 2023.
-
MEMTO: Memory-guided Transformer for Multivariate Time Series Anomaly Detection
Authors:
Junho Song,
Keonwoo Kim,
Jeonglyul Oh,
Sungzoon Cho
Abstract:
Detecting anomalies in real-world multivariate time series data is challenging due to complex temporal dependencies and inter-variable correlations. Recently, reconstruction-based deep models have been widely used to solve the problem. However, these methods still suffer from an over-generalization issue and fail to deliver consistently high performance. To address this issue, we propose the MEMTO…
▽ More
Detecting anomalies in real-world multivariate time series data is challenging due to complex temporal dependencies and inter-variable correlations. Recently, reconstruction-based deep models have been widely used to solve the problem. However, these methods still suffer from an over-generalization issue and fail to deliver consistently high performance. To address this issue, we propose the MEMTO, a memory-guided Transformer using a reconstruction-based approach. It is designed to incorporate a novel memory module that can learn the degree to which each memory item should be updated in response to the input data. To stabilize the training procedure, we use a two-phase training paradigm which involves using K-means clustering for initializing memory items. Additionally, we introduce a bi-dimensional deviation-based detection criterion that calculates anomaly scores considering both input space and latent space. We evaluate our proposed method on five real-world datasets from diverse domains, and it achieves an average anomaly detection F1-score of 95.74%, significantly outperforming the previous state-of-the-art methods. We also conduct extensive experiments to empirically validate the effectiveness of our proposed model's key components.
△ Less
Submitted 5 December, 2023;
originally announced December 2023.