subscribe to arXiv mailings

Modeling the refractive index profile n(z) of polar ice for ultra-high energy neutrino experiments

Authors: S. Ali, P. Allison, S. Archambault, J. J. Beatty, D. Z. Besson, A. Bishop, P. Chen, Y. C. Chen, B. A. Clark, W. Clay, A. Connolly, K. Couberly, L. Cremonesi, A. Cummings, P. Dasgupta, R. Debolt, S. de Kockere, K. D. de Vries, C. Deaconu, M. A. DuVernois, J. Flaherty, E. Friedman, R. Gaior, P. Giri, J. Hanson , et al. (45 additional authors not shown)

Abstract: We develop an in-situ index of refraction profile using the transit time of radio signals broadcast from an englacial transmitter to 2-5 km distant radio-frequency receivers, deployed at depths up to 200 m. Maxwell's equations generally admit two ray propagation solutions from a given transmitter, corresponding to a direct path (D) and a refracted path (R); the measured D vs. R (dt(D,R)) timing di… ▽ More We develop an in-situ index of refraction profile using the transit time of radio signals broadcast from an englacial transmitter to 2-5 km distant radio-frequency receivers, deployed at depths up to 200 m. Maxwell's equations generally admit two ray propagation solutions from a given transmitter, corresponding to a direct path (D) and a refracted path (R); the measured D vs. R (dt(D,R)) timing differences provide constraints on the index of refraction profile near South Pole, where the Askaryan Radio Array (ARA) neutrino observatory is located. We constrain the refractive index profile by simulating D and R ray paths via ray tracing and comparing those to measured dt(D,R) signals. Using previous ice density data as a proxy for n(z), we demonstrate that our data strongly favors a glaciologically-motivated three-phase densification model rather than a single exponential scale height model. Simulations show that the single exponential model overestimates ARA neutrino sensitivity compared to the three-phase model. △ Less

Submitted 11 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

arXiv:2406.00810 [pdf, other]

Expanding the Attack Scenarios of SAE J1939: A Comprehensive Analysis of Established and Novel Vulnerabilities in Transport Protocol

Authors: Hwejae Lee, Hyosun Lee, Saehee Jun, Huy Kang Kim

Abstract: Following the enactment of the UN Regulation, substantial efforts have been directed toward implementing intrusion detection and prevention systems (IDPSs) and vulnerability analysis in Controller Area Network (CAN). However, Society of Automotive Engineers (SAE) J1939 protocol, despite its extensive application in camping cars and commercial vehicles, has seen limited vulnerability identification… ▽ More Following the enactment of the UN Regulation, substantial efforts have been directed toward implementing intrusion detection and prevention systems (IDPSs) and vulnerability analysis in Controller Area Network (CAN). However, Society of Automotive Engineers (SAE) J1939 protocol, despite its extensive application in camping cars and commercial vehicles, has seen limited vulnerability identification, which raises significant safety concerns in the event of security breaches. In this research, we explore and demonstrate attack techniques specific to SAE J1939 communication protocol. We introduce 14 attack scenarios, enhancing the discourse with seven scenarios recognized in the previous research and unveiling seven novel scenarios through our elaborate study. To verify the feasibility of these scenarios, we leverage a sophisticated testbed that facilitates real-time communication and the simulation of attacks. Our testing confirms the successful execution of 11 scenarios, underscoring their imminent threat to commercial vehicle operations. Some attacks will be difficult to detect because they only inject a single message. These results highlight unique vulnerabilities within SAE J1939 protocol, indicating the automotive cybersecurity community needs to address the identified risks. △ Less

Submitted 2 June, 2024; originally announced June 2024.

Comments: 18 pages, 7 figures, 5 tables; This is the accepted version of ESCAR USA 2024

MSC Class: 68M25 ACM Class: K.6.5

arXiv:2405.20233 [pdf, other]

Grokfast: Accelerated Grokking by Amplifying Slow Gradients

Authors: Jaerin Lee, Bong Gyun Kang, Kihoon Kim, Kyoung Mu Lee

Abstract: One puzzling artifact in machine learning dubbed grokking is where delayed generalization is achieved tenfolds of iterations after near perfect overfitting to the training data. Focusing on the long delay itself on behalf of machine learning practitioners, our goal is to accelerate generalization of a model under grokking phenomenon. By regarding a series of gradients of a parameter over training… ▽ More One puzzling artifact in machine learning dubbed grokking is where delayed generalization is achieved tenfolds of iterations after near perfect overfitting to the training data. Focusing on the long delay itself on behalf of machine learning practitioners, our goal is to accelerate generalization of a model under grokking phenomenon. By regarding a series of gradients of a parameter over training iterations as a random signal over time, we can spectrally decompose the parameter trajectories under gradient descent into two components: the fast-varying, overfitting-yielding component and the slow-varying, generalization-inducing component. This analysis allows us to accelerate the grokking phenomenon more than $\times 50$ with only a few lines of code that amplifies the slow-varying components of gradients. The experiments show that our algorithm applies to diverse tasks involving images, languages, and graphs, enabling practical availability of this peculiar artifact of sudden generalization. Our code is available at https://github.com/ironjr/grokfast. △ Less

Submitted 5 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

Comments: 17 pages, 13 figures. Typo fixed. Project page: https://jaerinlee.com/research/grokfast

arXiv:2405.19771 [pdf, other]

Data Service Maximization in Integrated Terrestrial-Non-Terrestrial 6G Networks: A Deep Reinforcement Learning Approach

Authors: Nway Nway Ei, Kitae Kim, Yan Kyaw Tun, Choong Seon Hong

Abstract: Integrating terrestrial and non-terrestrial networks has emerged as a promising paradigm to fulfill the constantly growing demand for connectivity, low transmission delay, and quality of services (QoS). This integration brings together the strengths of terrestrial and non-terrestrial networks, such as the reliability of terrestrial networks, broad coverage, and service continuity of non-terrestria… ▽ More Integrating terrestrial and non-terrestrial networks has emerged as a promising paradigm to fulfill the constantly growing demand for connectivity, low transmission delay, and quality of services (QoS). This integration brings together the strengths of terrestrial and non-terrestrial networks, such as the reliability of terrestrial networks, broad coverage, and service continuity of non-terrestrial networks like low earth orbit (LEO) satellites. In this work, we study a data service maximization problem in an integrated terrestrial-non-terrestrial network (I-TNT) where the ground base stations (GBSs) and LEO satellites cooperatively serve the coexisting aerial users (AUs) and ground users (GUs). Then, by considering the spectrum scarcity, interference, and QoS requirements of the users, we jointly optimize the user association, AUE's trajectory, and power allocation. To tackle the formulated mixed-integer non-convex problem, we disintegrate it into two subproblems: 1) user association problem and 2) trajectory and power allocation problem. Since the user association problem is a binary integer programming problem, we use the standard convex optimization method to solve it. Meanwhile, the trajectory and power allocation problem is solved by the deep deterministic policy gradient (DDPG) method to cope with the problem's non-convexity and dynamic network environments. Then, the two subproblems are alternately solved by the proposed iterative algorithm. By comparing with the baselines in the existing literature, extensive simulations are conducted to evaluate the performance of the proposed framework. △ Less

Submitted 30 May, 2024; originally announced May 2024.

Comments: 5 pages, 4 figures

arXiv:2405.19734 [pdf, other]

Search for the decay $B^{0}\toγγ$ using Belle and Belle II data

Authors: Belle, Belle II Collaborations, :, I. Adachi, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, S. Al Said, N. Althubiti, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot , et al. (385 additional authors not shown)

Abstract: We report the result of a search for the rare decay $B^{0} \to γγ$ using a combined dataset of $753\times10^{6}$ $B\bar{B}$ pairs collected by the Belle experiment and $387\times10^{6}$ $B\bar{B}$ pairs collected by the Belle II experiment from decays of the $\rm Υ(4S)$ resonance produced in $e^{+}e^{-}$ collisions. A simultaneous fit to the Belle and Belle II data sets yields… ▽ More We report the result of a search for the rare decay $B^{0} \to γγ$ using a combined dataset of $753\times10^{6}$ $B\bar{B}$ pairs collected by the Belle experiment and $387\times10^{6}$ $B\bar{B}$ pairs collected by the Belle II experiment from decays of the $\rm Υ(4S)$ resonance produced in $e^{+}e^{-}$ collisions. A simultaneous fit to the Belle and Belle II data sets yields $11.0^{+6.5}_{-5.5}$ signal events, corresponding to a 2.5$σ$ significance. We determine the branching fraction $\mathcal{B}(B^{0} \to γγ) = (3.7^{+2.2}_{-1.8}(\rm stat)\pm0.5(\rm syst))\times10^{-8}$ and set a 90% credibility level upper limit of $\mathcal{B}(B^{0} \to γγ) < 6.4\times10^{-8}$. △ Less

Submitted 30 May, 2024; originally announced May 2024.

Report number: Belle II Preprint: 2024-017, KEK Preprint: 2024-13

arXiv:2405.18928 [pdf, other]

Measurement of the energy dependence of the $e^+e^- \to B\bar{B}$, $B\bar{B}{}^*$, and $B^*\bar{B}{}^*$ cross sections at Belle~II

Authors: Belle II Collaboration, I. Adachi, L. Aggarwal, H. Ahmed, H. Aihara, N. Akopov, A. Aloisio, N. Althubiti, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, M. Bauer, A. Baur , et al. (444 additional authors not shown)

Abstract: We report measurements of the $e^+e^- \to B\bar{B}$, $B\bar{B}{}^*$, and $B^*\bar{B}{}^*$ cross sections at four energies, 10653, 10701, 10746 and 10805 MeV, using data collected by the Belle~II experiment. We reconstruct one $B$ meson in a large number of hadronic final states and use its momentum to identify the production process. In the first $2-5$ MeV above $B^*\bar{B}{}^*$ threshold, the… ▽ More We report measurements of the $e^+e^- \to B\bar{B}$, $B\bar{B}{}^*$, and $B^*\bar{B}{}^*$ cross sections at four energies, 10653, 10701, 10746 and 10805 MeV, using data collected by the Belle~II experiment. We reconstruct one $B$ meson in a large number of hadronic final states and use its momentum to identify the production process. In the first $2-5$ MeV above $B^*\bar{B}{}^*$ threshold, the $e^+e^- \to B^*\bar{B}{}^*$ cross section increases rapidly. This may indicate the presence of a pole close to the threshold. △ Less

Submitted 29 May, 2024; originally announced May 2024.

Comments: 30 pages, 15 figures, submitted to JHEP

Report number: Belle II Preprint 2024-016, KEK Preprint 2024-12

arXiv:2405.18792 [pdf, other]

Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies

Authors: Haanvid Lee, Tri Wahyu Guntara, Jongmin Lee, Yung-Kyun Noh, Kee-Eung Kim

Abstract: We consider off-policy evaluation (OPE) of deterministic target policies for reinforcement learning (RL) in environments with continuous action spaces. While it is common to use importance sampling for OPE, it suffers from high variance when the behavior policy deviates significantly from the target policy. In order to address this issue, some recent works on OPE proposed in-sample learning with i… ▽ More We consider off-policy evaluation (OPE) of deterministic target policies for reinforcement learning (RL) in environments with continuous action spaces. While it is common to use importance sampling for OPE, it suffers from high variance when the behavior policy deviates significantly from the target policy. In order to address this issue, some recent works on OPE proposed in-sample learning with importance resampling. Yet, these approaches are not applicable to deterministic target policies for continuous action spaces. To address this limitation, we propose to relax the deterministic target policy using a kernel and learn the kernel metrics that minimize the overall mean squared error of the estimated temporal difference update vector of an action value function, where the action value function is used for policy evaluation. We derive the bias and variance of the estimation error due to this relaxation and provide analytic solutions for the optimal kernel metric. In empirical studies using various test domains, we show that the OPE with in-sample learning using the kernel with optimized metric achieves significantly improved accuracy than other baselines. △ Less

Submitted 29 May, 2024; originally announced May 2024.

Comments: 23 pages, 2 figures, Accepted at ICLR 2024 (spotlight)

arXiv:2405.15987 [pdf, other]

Modes of Analyzing Disinformation Narratives With AI/ML/Text Mining to Assist in Mitigating the Weaponization of Social Media

Authors: Andy Skumanich, Han Kyul Kim

Abstract: This paper highlights the developing need for quantitative modes for capturing and monitoring malicious communication in social media. There has been a deliberate "weaponization" of messaging through the use of social networks including by politically oriented entities both state sponsored and privately run. The article identifies a use of AI/ML characterization of generalized "mal-info," a broad… ▽ More This paper highlights the developing need for quantitative modes for capturing and monitoring malicious communication in social media. There has been a deliberate "weaponization" of messaging through the use of social networks including by politically oriented entities both state sponsored and privately run. The article identifies a use of AI/ML characterization of generalized "mal-info," a broad term which includes deliberate malicious narratives similar with hate speech, which adversely impact society. A key point of the discussion is that this mal-info will dramatically increase in volume, and it will become essential for sharable quantifying tools to provide support for human expert intervention. Despite attempts to introduce moderation on major platforms like Facebook and X/Twitter, there are now established alternative social networks that offer completely unmoderated spaces. The paper presents an introduction to these platforms and the initial results of a qualitative and semi-quantitative analysis of characteristic mal-info posts. The authors perform a rudimentary text mining function for a preliminary characterization in order to evaluate the modes for better-automated monitoring. The action examines several inflammatory terms using text analysis and, importantly, discusses the use of generative algorithms by one political agent in particular, providing some examples of the potential risks to society. This latter is of grave concern, and monitoring tools must be established. This paper presents a preliminary step to selecting relevant sources and to setting a foundation for characterizing the mal-info, which must be monitored. The AI/ML methods provide a means for semi-quantitative signature capture. The impending use of "mal-GenAI" is presented. △ Less

Submitted 24 May, 2024; originally announced May 2024.

Comments: Accepted at ICWSM-2024 Workshop on Digital State Sponsored Disinformation and Propaganda: Challenges and Opportunities (DSSDP24)

arXiv:2405.14727 [pdf, other]

Quantized geodesic lengths for Teichmüller spaces: algebraic aspects

Authors: Hyun Kyu Kim

Abstract: In 1980's H Verlinde suggested to construct and use a quantization of Teichmüller spaces to construct spaces of conformal blocks for the Liouville conformal field theory. This suggestion led to a mathematical formulation by Fock in 1990's, called the modular functor conjecture, based on the Chekhov-Fock quantum Teichmüller theory. In 2000's Teschner combined the Chekhov-Fock version and the Kashae… ▽ More In 1980's H Verlinde suggested to construct and use a quantization of Teichmüller spaces to construct spaces of conformal blocks for the Liouville conformal field theory. This suggestion led to a mathematical formulation by Fock in 1990's, called the modular functor conjecture, based on the Chekhov-Fock quantum Teichmüller theory. In 2000's Teschner combined the Chekhov-Fock version and the Kashaev version of quantum Teichmüller theory to construct a solution to a modified form of the conjecture. We embark on a direct approach to the conjecture based on the Chekhov-Fock(-Goncharov) theory. We construct quantized trace-of-monodromy along simple loops via Bonahon and Wong's quantum trace maps developed in 2010's, and investigate algebraic structures of them, which will eventually lead to construction and properties of quantized geodesic length operators. We show that a special recursion relation used by Teschner is satisfied by the quantized trace-of-monodromy, and that the quantized trace-of-monodromy for disjoint loops commute in a certain strong sense. △ Less

Submitted 23 May, 2024; originally announced May 2024.

Comments: 74 pages

MSC Class: 18M20; 57K31; 57K20; 13F60; 81R60; 46L65

arXiv:2405.14625 [pdf, other]

Test of light-lepton universality in $τ$ decays with the Belle II experiment

Authors: Belle II Collaboration, I. Adachi, K. Adamczyk, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, N. Anh Ky, D. M. Asner, H. Atmacan, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien, F. Becherer, J. Becker , et al. (406 additional authors not shown)

Abstract: We present a measurement of the ratio $R_μ= \mathcal{B}(τ^-\to μ^-\barν_μν_τ) / \mathcal{B}(τ^-\to e^-\barν_eν_τ)$ of branching fractions $\mathcal{B}$ of the $τ$ lepton decaying to muons or electrons using data collected with the Belle II detector at the SuperKEKB $e^+e^-$ collider. The sample has an integrated luminosity of 362 fb$^{-1}$ at a centre-of-mass energy of 10.58 GeV. Using an optimise… ▽ More We present a measurement of the ratio $R_μ= \mathcal{B}(τ^-\to μ^-\barν_μν_τ) / \mathcal{B}(τ^-\to e^-\barν_eν_τ)$ of branching fractions $\mathcal{B}$ of the $τ$ lepton decaying to muons or electrons using data collected with the Belle II detector at the SuperKEKB $e^+e^-$ collider. The sample has an integrated luminosity of 362 fb$^{-1}$ at a centre-of-mass energy of 10.58 GeV. Using an optimised event selection, a binned maximum likelihood fit is performed using the momentum spectra of the electron and muon candidates. The result, $R_μ= 0.9675 \pm 0.0007 \pm 0.0036$, where the first uncertainty is statistical and the second is systematic, is the most precise to date. It provides a stringent test of the light-lepton universality, translating to a ratio of the couplings of the muon and electron to the $W$ boson in $τ$ decays of $0.9974 \pm 0.0019$, in agreement with the standard model expectation of unity. △ Less

Submitted 23 May, 2024; originally announced May 2024.

Report number: Belle II Preprint 2024-002, KEK Preprint 2023-49

arXiv:2405.14155 [pdf]

Room-temperature waveguide-integrated photodetector using bolometric effect for mid-infrared spectroscopy applications

Authors: Joonsup Shim, Jinha Lim, Inki Kim, Jaeyong Jeong, Bong Ho Kim, Seong Kwang Kim, Dae-Myeong Geum, SangHyeon Kim

Abstract: Waveguide-integrated mid-infrared (MIR) photodetectors are pivotal components for developing molecular spectroscopy applications, leveraging mature photonic integrated circuit (PIC) technologies. Despite various strategies, critical challenges still remain in achieving broadband photoresponse, cooling-free operation, and large-scale complementary-metal-oxide-semiconductor (CMOS)-compatible manufac… ▽ More Waveguide-integrated mid-infrared (MIR) photodetectors are pivotal components for developing molecular spectroscopy applications, leveraging mature photonic integrated circuit (PIC) technologies. Despite various strategies, critical challenges still remain in achieving broadband photoresponse, cooling-free operation, and large-scale complementary-metal-oxide-semiconductor (CMOS)-compatible manufacturability. To leap beyond these limitations, the bolometric effect - a thermal detection mechanism - is introduced into the waveguide platform. More importantly, we pursue a free-carrier absorption (FCA) process in germanium (Ge) to create an efficient light-absorbing medium, providing a pragmatic solution for full coverage of the MIR spectrum without incorporating exotic materials into CMOS. Here, we present an uncooled waveguide-integrated photodetector based on a Ge-on-insulator (Ge-OI) PIC architecture, exploiting the bolometric effect combined with FCA. Notably, our device exhibits a broadband responsivity of ~12 mA/W across 4030-4360 nm (and potentially beyond), challenging the state of the art, while achieving a noise-equivalent power of 3.4x10^-9 W/Hz^0.5 at 4180 nm. We further demonstrate label-free sensing of carbon dioxide using our integrated photodetector and sensing waveguide on a single chip. This approach to room-temperature waveguide-integrated MIR photodetection, harnessing bolometry with FCA in Ge, not only facilitates the realization of fully integrated lab-on-a-chip systems with wavelength flexibility but also provides a blueprint for MIR PICs with CMOS-foundry-compatibility. △ Less

Submitted 23 May, 2024; originally announced May 2024.

Comments: 6 figures for the main manuscript and 14 figures for the supplementary information

arXiv:2405.12421 [pdf, other]

A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback

Authors: Kihyun Kim, Jiawei Zhang, Asuman Ozdaglar, Pablo A. Parrilo

Abstract: Inverse Reinforcement Learning (IRL) and Reinforcement Learning from Human Feedback (RLHF) are pivotal methodologies in reward learning, which involve inferring and shaping the underlying reward function of sequential decision-making problems based on observed human demonstrations and feedback. Most prior work in reward learning has relied on prior knowledge or assumptions about decision or prefer… ▽ More Inverse Reinforcement Learning (IRL) and Reinforcement Learning from Human Feedback (RLHF) are pivotal methodologies in reward learning, which involve inferring and shaping the underlying reward function of sequential decision-making problems based on observed human demonstrations and feedback. Most prior work in reward learning has relied on prior knowledge or assumptions about decision or preference models, potentially leading to robustness issues. In response, this paper introduces a novel linear programming (LP) framework tailored for offline reward learning. Utilizing pre-collected trajectories without online exploration, this framework estimates a feasible reward set from the primal-dual optimality conditions of a suitably designed LP, and offers an optimality guarantee with provable sample efficiency. Our LP framework also enables aligning the reward functions with human feedback, such as pairwise trajectory comparison data, while maintaining computational tractability and sample efficiency. We demonstrate that our framework potentially achieves better performance compared to the conventional maximum likelihood estimation (MLE) approach through analytical examples and numerical experiments. △ Less

Submitted 3 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

Comments: ICML 2024

arXiv:2405.11905 [pdf, other]

CSTA: CNN-based Spatiotemporal Attention for Video Summarization

Authors: Jaewon Son, Jaehun Park, Kwangsu Kim

Abstract: Video summarization aims to generate a concise representation of a video, capturing its essential content and key moments while reducing its overall length. Although several methods employ attention mechanisms to handle long-term dependencies, they often fail to capture the visual significance inherent in frames. To address this limitation, we propose a CNN-based SpatioTemporal Attention (CSTA) me… ▽ More Video summarization aims to generate a concise representation of a video, capturing its essential content and key moments while reducing its overall length. Although several methods employ attention mechanisms to handle long-term dependencies, they often fail to capture the visual significance inherent in frames. To address this limitation, we propose a CNN-based SpatioTemporal Attention (CSTA) method that stacks each feature of frames from a single video to form image-like frame representations and applies 2D CNN to these frame features. Our methodology relies on CNN to comprehend the inter and intra-frame relations and to find crucial attributes in videos by exploiting its ability to learn absolute positions within images. In contrast to previous work compromising efficiency by designing additional modules to focus on spatial importance, CSTA requires minimal computational overhead as it uses CNN as a sliding window. Extensive experiments on two benchmark datasets (SumMe and TVSum) demonstrate that our proposed approach achieves state-of-the-art performance with fewer MACs compared to previous methods. Codes are available at https://github.com/thswodnjs3/CSTA. △ Less

Submitted 21 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

Comments: Accepted at CVPR 2024

arXiv:2405.11390 [pdf, other]

Search for Two-Body $B$ Meson Decays to $Λ^{0}$ and $Ω^{(*)0}_{c}$

Authors: Belle Collaboration, V. Savinov, I. Adachi, J. K. Ahn, H. Aihara, D. M. Asner, H. Atmacan, R. Ayad, Sw. Banerjee, J. Bennett, M. Bessner, V. Bhardwaj, D. Biswas, A. Bobrov, D. Bodrov, J. Borah, M. Bračko, P. Branchini, T. E. Browder, A. Budano, D. Červenkov, M. -C. Chang, P. Chang, B. G. Cheon, K. Cho , et al. (124 additional authors not shown)

Abstract: We report the results of the first search for Standard Model and baryon-number-violating two-body decays of the neutral $B$ mesons to $Λ^{0}$ and $Ω^{(*)0}_c$ using 711~${\rm fb^{-1}}$ of data collected at the $Υ(4S)$ resonance with the Belle detector at the KEKB asymmetric-energy $e^+ e^-$ collider. We observe no evidence of signal from any such decays and set 95\% confidence-level upper limits o… ▽ More We report the results of the first search for Standard Model and baryon-number-violating two-body decays of the neutral $B$ mesons to $Λ^{0}$ and $Ω^{(*)0}_c$ using 711~${\rm fb^{-1}}$ of data collected at the $Υ(4S)$ resonance with the Belle detector at the KEKB asymmetric-energy $e^+ e^-$ collider. We observe no evidence of signal from any such decays and set 95\% confidence-level upper limits on the products of $B^0$ and $\bar{B}^0$ branching fractions for these two-body decays with $\mathcal{B}(Ω_{c}^{0} \to π^+ Ω^-)$ in the range between 9.5~$\times 10^{-8}$ and 31.2~$\times 10^{-8}$. △ Less

Submitted 18 May, 2024; originally announced May 2024.

Comments: 6 pages, 2 figures, submitted to PRD(L)

Report number: Belle Preprint 2024-04, KEK Preprint 2024-5

arXiv:2405.11254 [pdf, other]

Spread and Spectral Complexity in Quantum Spin Chains: from Integrability to Chaos

Authors: Hugo A. Camargo, Kyoung-Bum Huh, Viktor Jahnke, Hyun-Sik Jeong, Keun-Young Kim, Mitsuhiro Nishida

Abstract: We explore spread and spectral complexity in quantum systems that exhibit a transition from integrability to chaos, namely the mixed-field Ising model and the next-to-nearest-neighbor deformation of the Heisenberg XXZ spin chain. We corroborate the observation that the presence of a peak in spread complexity before its saturation, is a characteristic feature in chaotic systems. We find that, in ge… ▽ More We explore spread and spectral complexity in quantum systems that exhibit a transition from integrability to chaos, namely the mixed-field Ising model and the next-to-nearest-neighbor deformation of the Heisenberg XXZ spin chain. We corroborate the observation that the presence of a peak in spread complexity before its saturation, is a characteristic feature in chaotic systems. We find that, in general, the saturation value of spread complexity post-peak depends not only on the spectral statistics of the Hamiltonian, but also on the specific state. However, there appears to be a maximal universal bound determined by the symmetries and dimension of the Hamiltonian, which is realized by the thermofield double state (TFD) at infinite temperature. We also find that the time scales at which the spread complexity and spectral form factor change their behaviour agree with each other and are independent of the chaotic properties of the systems. In the case of spectral complexity, we identify that the key factor determining its saturation value and timescale in chaotic systems is given by minimum energy difference in the theory's spectrum. This explains observations made in the literature regarding its earlier saturation in chaotic systems compared to their integrable counterparts. We conclude by discussing the properties of the TFD which, we conjecture, make it suitable for probing signatures of chaos in quantum many-body systems. △ Less

Submitted 3 June, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

Comments: v1: 35 pages, 18 figures, v2: references added, minor changes

Report number: IFT-UAM/CSIC-24-65

arXiv:2405.10908 [pdf, other]

UVCANDELS: The role of dust on the stellar mass-size relation of disk galaxies at 0.5 $\leq z \leq$ 3.0

Authors: Kalina V. Nedkova, Marc Rafelski, Harry I. Teplitz, Vihang Mehta, Laura DeGroot, Swara Ravindranath, Anahita Alavi, Alexander Beckett, Norman A. Grogin, Boris Häußler, Anton M. Koekemoer, Grecco A. Oyarzún, Laura Prichard, Mitchell Revalski, Gregory F. Snyder, Ben Sunnquist, Xin Wang, Rogier A. Windhorst, Nima Chartab, Christopher J. Conselice, Yicheng Guo, Nimish Hathi, Matthew J. Hayes, Zhiyuan Ji, Keunho J. Kim , et al. (8 additional authors not shown)

Abstract: We use the Ultraviolet Imaging of the Cosmic Assembly Near-infrared Deep Extragalactic Legacy Survey fields (UVCANDELS) to measure half-light radii in the rest-frame far-UV for $\sim$16,000 disk-like galaxies over $0.5\leq z \leq 3$. We compare these results to rest-frame optical sizes that we measure in a self-consistent way and find that the stellar mass-size relation of disk galaxies is steeper… ▽ More We use the Ultraviolet Imaging of the Cosmic Assembly Near-infrared Deep Extragalactic Legacy Survey fields (UVCANDELS) to measure half-light radii in the rest-frame far-UV for $\sim$16,000 disk-like galaxies over $0.5\leq z \leq 3$. We compare these results to rest-frame optical sizes that we measure in a self-consistent way and find that the stellar mass-size relation of disk galaxies is steeper in the rest-frame UV than in the optical across our entire redshift range. We show that this is mainly driven by massive galaxies ($\gtrsim10^{10}$M$_\odot$), which we find to also be among the most dusty. Our results are consistent with the literature and have commonly been interpreted as evidence of inside-out growth wherein galaxies form their central structures first. However, they could also suggest that the centers of massive galaxies are more heavily attenuated than their outskirts. We distinguish between these scenarios by modeling and selecting galaxies at $z=2$ from the VELA simulation suite in a way that is consistent with UVCANDELS. We show that the effects of dust alone can account for the size differences we measure at $z=2$. This indicates that, at different wavelengths, size differences and the different slopes of the stellar mass-size relation do not constitute evidence for inside-out growth. △ Less

Submitted 28 June, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

Comments: Accepted for publication in ApJ. 22 pages, 12 figures, and 4 tables

arXiv:2405.10123 [pdf, other]

Asynchronous Federated Stochastic Optimization for Heterogeneous Objectives Under Arbitrary Delays

Authors: Charikleia Iakovidou, Kibaek Kim

Abstract: Federated learning (FL) was recently proposed to securely train models with data held over multiple locations ("clients") under the coordination of a central server. Two major challenges hindering the performance of FL algorithms are long training times caused by straggling clients, and a decline in model accuracy under non-iid local data distributions ("client drift"). In this work, we propose an… ▽ More Federated learning (FL) was recently proposed to securely train models with data held over multiple locations ("clients") under the coordination of a central server. Two major challenges hindering the performance of FL algorithms are long training times caused by straggling clients, and a decline in model accuracy under non-iid local data distributions ("client drift"). In this work, we propose and analyze Asynchronous Exact Averaging (AREA), a new stochastic (sub)gradient algorithm that utilizes asynchronous communication to speed up convergence and enhance scalability, and employs client memory to correct the client drift caused by variations in client update frequencies. Moreover, AREA is, to the best of our knowledge, the first method that is guaranteed to converge under arbitrarily long delays, without the use of delay-adaptive stepsizes, and (i) for strongly convex, smooth functions, asymptotically converges to an error neighborhood whose size depends only on the variance of the stochastic gradients used with respect to the number of iterations, and (ii) for convex, non-smooth functions, matches the convergence rate of the centralized stochastic subgradient method up to a constant factor, which depends on the average of the individual client update frequencies instead of their minimum (or maximum). Our numerical results validate our theoretical analysis and indicate AREA outperforms state-of-the-art methods when local data are highly non-iid, especially as the number of clients grows. △ Less

Submitted 28 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

arXiv:2405.09935 [pdf, other]

DEBATE: Devil's Advocate-Based Assessment and Text Evaluation

Authors: Alex Kim, Keonwoo Kim, Sangwon Yoon

Abstract: As natural language generation (NLG) models have become prevalent, systematically assessing the quality of machine-generated texts has become increasingly important. Recent studies introduce LLM-based evaluators that operate as reference-free metrics, demonstrating their capability to adeptly handle novel tasks. However, these models generally rely on a single-agent approach, which, we argue, intr… ▽ More As natural language generation (NLG) models have become prevalent, systematically assessing the quality of machine-generated texts has become increasingly important. Recent studies introduce LLM-based evaluators that operate as reference-free metrics, demonstrating their capability to adeptly handle novel tasks. However, these models generally rely on a single-agent approach, which, we argue, introduces an inherent limit to their performance. This is because there exist biases in LLM agent's responses, including preferences for certain text structure or content. In this work, we propose DEBATE, an NLG evaluation framework based on multi-agent scoring system augmented with a concept of Devil's Advocate. Within the framework, one agent is instructed to criticize other agents' arguments, potentially resolving the bias in LLM agent's answers. DEBATE substantially outperforms the previous state-of-the-art methods in two meta-evaluation benchmarks in NLG evaluation, SummEval and TopicalChat. We also show that the extensiveness of debates among agents and the persona of an agent can influence the performance of evaluators. △ Less

Submitted 23 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

arXiv:2405.09765 [pdf, other]

doi 10.1109/ICASSP48485.2024.10446698

Unsupervised Extractive Dialogue Summarization in Hyperdimensional Space

Authors: Seongmin Park, Kyungho Kim, Jaejin Seo, Jihwa Lee

Abstract: We present HyperSum, an extractive summarization framework that captures both the efficiency of traditional lexical summarization and the accuracy of contemporary neural approaches. HyperSum exploits the pseudo-orthogonality that emerges when randomly initializing vectors at extremely high dimensions ("blessing of dimensionality") to construct representative and efficient sentence embeddings. Simp… ▽ More We present HyperSum, an extractive summarization framework that captures both the efficiency of traditional lexical summarization and the accuracy of contemporary neural approaches. HyperSum exploits the pseudo-orthogonality that emerges when randomly initializing vectors at extremely high dimensions ("blessing of dimensionality") to construct representative and efficient sentence embeddings. Simply clustering the obtained embeddings and extracting their medoids yields competitive summaries. HyperSum often outperforms state-of-the-art summarizers -- in terms of both summary accuracy and faithfulness -- while being 10 to 100 times faster. We open-source HyperSum as a strong baseline for unsupervised extractive summarization. △ Less

Submitted 15 May, 2024; originally announced May 2024.

Comments: ICASSP 2024

arXiv:2405.08311 [pdf, ps, other]

A Decoupling and Aggregating Framework for Joint Extraction of Entities and Relations

Authors: Yao Wang, Xin Liu, Weikun Kong, Hai-Tao Yu, Teeradaj Racharak, Kyoung-Sook Kim, Minh Le Nguyen

Abstract: Named Entity Recognition and Relation Extraction are two crucial and challenging subtasks in the field of Information Extraction. Despite the successes achieved by the traditional approaches, fundamental research questions remain open. First, most recent studies use parameter sharing for a single subtask or shared features for both two subtasks, ignoring their semantic differences. Second, informa… ▽ More Named Entity Recognition and Relation Extraction are two crucial and challenging subtasks in the field of Information Extraction. Despite the successes achieved by the traditional approaches, fundamental research questions remain open. First, most recent studies use parameter sharing for a single subtask or shared features for both two subtasks, ignoring their semantic differences. Second, information interaction mainly focuses on the two subtasks, leaving the fine-grained informtion interaction among the subtask-specific features of encoding subjects, relations, and objects unexplored. Motivated by the aforementioned limitations, we propose a novel model to jointly extract entities and relations. The main novelties are as follows: (1) We propose to decouple the feature encoding process into three parts, namely encoding subjects, encoding objects, and encoding relations. Thanks to this, we are able to use fine-grained subtask-specific features. (2) We propose novel inter-aggregation and intra-aggregation strategies to enhance the information interaction and construct individual fine-grained subtask-specific features, respectively. The experimental results demonstrate that our model outperforms several previous state-of-the-art models. Extensive additional experiments further confirm the effectiveness of our model. △ Less

Submitted 14 May, 2024; originally announced May 2024.

arXiv:2405.07386 [pdf, other]

Search for lepton-flavor-violating $τ^- \to μ^-μ^+μ^-$ decays at Belle II

Authors: Belle II Collaboration, I. Adachi, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, N. Althubiti, N. Anh Ky, D. M. Asner, H. Atmacan, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien, F. Becherer, J. Becker , et al. (407 additional authors not shown)

Abstract: We present the result of a search for the charged-lepton-flavor violating decay $τ^- \to μ^-μ^+μ^-$ using a $424fb^{-1}$ sample of data recorded by the Belle II experiment at the SuperKEKB $e^{-}e^{+}$ collider. The selection of $e^{-}e^{+}\toτ^+τ^-$ events is based on an inclusive reconstruction of the non-signal tau decay, and on a boosted decision tree to suppress background. We observe one sig… ▽ More We present the result of a search for the charged-lepton-flavor violating decay $τ^- \to μ^-μ^+μ^-$ using a $424fb^{-1}$ sample of data recorded by the Belle II experiment at the SuperKEKB $e^{-}e^{+}$ collider. The selection of $e^{-}e^{+}\toτ^+τ^-$ events is based on an inclusive reconstruction of the non-signal tau decay, and on a boosted decision tree to suppress background. We observe one signal candidate, which is compatible with the expectation from background processes. We set a $90\%$ confidence level upper limit of $1.9 \times 10^{-8}$ on the branching fraction of the \taumu decay, which is the most stringent bound to date. △ Less

Submitted 12 May, 2024; originally announced May 2024.

Report number: Belle II Preprint 2024-012 KEK Preprint 2024-6

arXiv:2405.06953 [pdf, other]

The Sunburst Arc with JWST: II. Observations of an Eta Carinae Analog at $z=2.37$

Authors: S. Choe, T. Emil Rivera-Thorsen, H. Dahle, K. Sharon, M. Riley Owens, J. R. Rigby, M. B. Bayliss, M. J. Hayes, T. Hutchison, B. Welch, J. Chisholm, M. D. Gladders, G. Khullar, K. Kim

Abstract: "Godzilla" is a peculiar object within the gravitationally lensed Sunburst Arc at $z=2.37$. Despite being very bright, it appears in only one of the twelve lensed images of the source galaxy, and shows exotic spectroscopic properties not found elsewhere in the galaxy. We use JWST's unique combination of spatial resolution and spectroscopic sensitivity to provide a unified, coherent explanation of… ▽ More "Godzilla" is a peculiar object within the gravitationally lensed Sunburst Arc at $z=2.37$. Despite being very bright, it appears in only one of the twelve lensed images of the source galaxy, and shows exotic spectroscopic properties not found elsewhere in the galaxy. We use JWST's unique combination of spatial resolution and spectroscopic sensitivity to provide a unified, coherent explanation of the physical nature of Godzilla. We measure fluxes and kinematic properties of rest-optical emission lines in Godzilla and surrounding regions. Using standard line ratio-based diagnostic methods in combination with NIRCam imaging and ground based rest-UV spectra, we characterize Godzilla and its surroundings. We find that Godzilla is most likely an extremely magnified, non-erupting LBV star with dense gas condensations in close proximity. Among around 60 detected lines, we find a cascade of strong O I lines pumped by intense Ly$β$ emission, as well as Ly$α$-pumped rest-optical Fe II lines, reminiscent of the Weigelt blobs in the local LBV star Eta Carinae. Godzilla is surrounded by dusty, inhomogeneous gas common to massive, evolved stars. Spectra and images of Godzilla and adjacent objects and the detection of a low-surface brightness foreground galaxy in the NIRCam data support the interpretation that Godzilla is a stellar-scale object extremely magnified by alignment with lensing caustics. To explain the dusty surroundings, strong [Ne III] and line kinematics simultaneously, we argue that Godzilla is a post-eruption LBV accompanied by a hotter companion and/or gas condensations exposed to more intense radiation compared to the Weigelt blobs. We expect periodic spectroscopic variations if Godzilla is a binary system. If Godzilla is confirmed to be an LBV star, it expands the distance to the furthest known LBV from a dozen Mpc to several Gpc. △ Less

Submitted 11 May, 2024; originally announced May 2024.

Comments: 18 pages, 16 figures. Submitted to A&A

arXiv:2405.06631 [pdf, other]

The Sunburst Arc with JWST: III. An Abundance of Direct Chemical Abundances

Authors: Brian Welch, T. Emil Rivera-Thorsen, Jane Rigby, Taylor Hutchison, Grace M. Olivier, Danielle A. Berg, Keren Sharon, Hakon Dahle, M. Riley Owens, Matthew B. Bayliss, Gourav Khullar, John Chisholm, Matthew Hayes, Keunho J. Kim

Abstract: We measure the gas-phase abundances of the elements He, N, O, Ne, S, Ar, and Fe in the Lyman-continuum emitting region of the Sunburst Arc, a highly magnified galaxy at redshift $z=2.37$. We detect the temperature-sensitive auroral lines [SII]$λ\lambda4069,4076$, [OII]$λ\lambda7320,7330$, [SIII]$\lambda6312$, [OIII]$\lambda4363$, and [NeIII]$\lambda3343$ in a stacked spectrum of 5 multiple images… ▽ More We measure the gas-phase abundances of the elements He, N, O, Ne, S, Ar, and Fe in the Lyman-continuum emitting region of the Sunburst Arc, a highly magnified galaxy at redshift $z=2.37$. We detect the temperature-sensitive auroral lines [SII]$λ\lambda4069,4076$, [OII]$λ\lambda7320,7330$, [SIII]$\lambda6312$, [OIII]$\lambda4363$, and [NeIII]$\lambda3343$ in a stacked spectrum of 5 multiple images of the Lyman-continuum emitter (LCE), from which we directly measure the electron temperature in the low, intermediate, and high ionization zones. We also detect the density-sensitive doublets of [OII]$λ\lambda3727,3729$, [SII]$λ\lambda6717,6731$, and [ArIV]$λ\lambda4713,4741$, which constrain the density in both the low- and high-ionization gas. With these temperature and density measurements, we measure gas-phase abundances with similar rigor as studies of local galaxies. We measure a gas-phase metallicity for the LCE of $12+\log(\textrm{O}/\textrm{H}) = 7.97 \pm 0.05$, and find an enhanced nitrogen abundance $\log(\textrm{N}/\textrm{O}) = -0.65^{+0.16}_{-0.25}$. This nitrogen abundance is consistent with enrichment from a population of Wolf-Rayet stars, additional signatures of which are reported in a companion paper. Abundances of sulfur, argon, neon, and iron are consistent with local low-metallicity HII regions and low-redshift galaxies. This study represents the most complete chemical abundance analysis of a galaxy at Cosmic Noon to date, which enables direct comparisons between local HII regions and those in the distant universe. △ Less

Submitted 14 May, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

Comments: 15 pages, 4 figures, 3 tables. Submitted to ApJ

arXiv:2405.05787 [pdf, other]

Autonomous Robotic Ultrasound System for Liver Follow-up Diagnosis: Pilot Phantom Study

Authors: Tianpeng Zhang, Sekeun Kim, Jerome Charton, Haitong Ma, Kyungsang Kim, Na Li, Quanzheng Li

Abstract: The paper introduces a novel autonomous robot ultrasound (US) system targeting liver follow-up scans for outpatients in local communities. Given a computed tomography (CT) image with specific target regions of interest, the proposed system carries out the autonomous follow-up scan in three steps: (i) initial robot contact to surface, (ii) coordinate mapping between CT image and robot, and (iii) ta… ▽ More The paper introduces a novel autonomous robot ultrasound (US) system targeting liver follow-up scans for outpatients in local communities. Given a computed tomography (CT) image with specific target regions of interest, the proposed system carries out the autonomous follow-up scan in three steps: (i) initial robot contact to surface, (ii) coordinate mapping between CT image and robot, and (iii) target US scan. Utilizing 3D US-CT registration and deep learning-based segmentation networks, we can achieve precise imaging of 3D hepatic veins, facilitating accurate coordinate mapping between CT and the robot. This enables the automatic localization of follow-up targets within the CT image, allowing the robot to navigate precisely to the target's surface. Evaluation of the ultrasound phantom confirms the quality of the US-CT registration and shows the robot reliably locates the targets in repeated trials. The proposed framework holds the potential to significantly reduce time and costs for healthcare providers, clinicians, and follow-up patients, thereby addressing the increasing healthcare burden associated with chronic disease in local communities. △ Less

Submitted 9 May, 2024; originally announced May 2024.

arXiv:2405.03905 [pdf, other]

A 65nm 36nJ/Decision Bio-inspired Temporal-Sparsity-Aware Digital Keyword Spotting IC with 0.6V Near-Threshold SRAM

Authors: Qinyu Chen, Kwantae Kim, Chang Gao, Sheng Zhou, Taekwang Jang, Tobi Delbruck, Shih-Chii Liu

Abstract: This paper introduces, to the best of the authors' knowledge, the first fine-grained temporal sparsity-aware keyword spotting (KWS) IC leveraging temporal similarities between neighboring feature vectors extracted from input frames and network hidden states, eliminating unnecessary operations and memory accesses. This KWS IC, featuring a bio-inspired delta-gated recurrent neural network (ΔRNN) cla… ▽ More This paper introduces, to the best of the authors' knowledge, the first fine-grained temporal sparsity-aware keyword spotting (KWS) IC leveraging temporal similarities between neighboring feature vectors extracted from input frames and network hidden states, eliminating unnecessary operations and memory accesses. This KWS IC, featuring a bio-inspired delta-gated recurrent neural network (ΔRNN) classifier, achieves an 11-class Google Speech Command Dataset (GSCD) KWS accuracy of 90.5% and energy consumption of 36nJ/decision. At 87% temporal sparsity, computing latency and energy per inference are reduced by 2.4$\times$/3.4$\times$, respectively. The 65nm design occupies 0.78mm$^2$ and features two additional blocks, a compact 0.084mm$^2$ digital infinite-impulse-response (IIR)-based band-pass filter (BPF) audio feature extractor (FEx) and a 24kB 0.6V near-Vth weight SRAM with 6.6$\times$ lower read power compared to the standard SRAM. △ Less

Submitted 6 May, 2024; originally announced May 2024.

arXiv:2405.03083 [pdf, other]

Causal K-Means Clustering

Authors: Kwangho Kim, Jisu Kim, Edward H. Kennedy

Abstract: Causal effects are often characterized with population summaries. These might provide an incomplete picture when there are heterogeneous treatment effects across subgroups. Since the subgroup structure is typically unknown, it is more challenging to identify and evaluate subgroup effects than population effects. We propose a new solution to this problem: Causal k-Means Clustering, which harnesses… ▽ More Causal effects are often characterized with population summaries. These might provide an incomplete picture when there are heterogeneous treatment effects across subgroups. Since the subgroup structure is typically unknown, it is more challenging to identify and evaluate subgroup effects than population effects. We propose a new solution to this problem: Causal k-Means Clustering, which harnesses the widely-used k-means clustering algorithm to uncover the unknown subgroup structure. Our problem differs significantly from the conventional clustering setup since the variables to be clustered are unknown counterfactual functions. We present a plug-in estimator which is simple and readily implementable using off-the-shelf algorithms, and study its rate of convergence. We also develop a new bias-corrected estimator based on nonparametric efficiency theory and double machine learning, and show that this estimator achieves fast root-n rates and asymptotic normality in large nonparametric models. Our proposed methods are especially useful for modern outcome-wide studies with multiple treatment levels. Further, our framework is extensible to clustering with generic pseudo-outcomes, such as partially observed outcomes or otherwise unknown functions. Finally, we explore finite sample properties via simulation, and illustrate the proposed methods in a study of treatment programs for adolescent substance abuse. △ Less

Submitted 29 June, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

arXiv:2405.02367 [pdf, other]

Enhancing Social Media Post Popularity Prediction with Visual Content

Authors: Dahyun Jeong, Hyelim Son, Yunjin Choi, Keunwoo Kim

Abstract: Our study presents a framework for predicting image-based social media content popularity that focuses on addressing complex image information and a hierarchical data structure. We utilize the Google Cloud Vision API to effectively extract key image and color information from users' postings, achieving 6.8% higher accuracy compared to using non-image covariates alone. For prediction, we explore a… ▽ More Our study presents a framework for predicting image-based social media content popularity that focuses on addressing complex image information and a hierarchical data structure. We utilize the Google Cloud Vision API to effectively extract key image and color information from users' postings, achieving 6.8% higher accuracy compared to using non-image covariates alone. For prediction, we explore a wide range of prediction models, including Linear Mixed Model, Support Vector Regression, Multi-layer Perceptron, Random Forest, and XGBoost, with linear regression as the benchmark. Our comparative study demonstrates that models that are capable of capturing the underlying nonlinear interactions between covariates outperform other methods. △ Less

Submitted 8 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

Report number: Report-no: JKSS-D-23-00299R1

arXiv:2405.00873 [pdf, other]

Implementing a synthetic magnetic vector potential in a 2D superconducting qubit array

Authors: Ilan T. Rosen, Sarah Muschinske, Cora N. Barrett, Arkya Chatterjee, Max Hays, Michael DeMarco, Amir Karamlou, David Rower, Rabindra Das, David K. Kim, Bethany M. Niedzielski, Meghan Schuldt, Kyle Serniak, Mollie E. Schwartz, Jonilyn L. Yoder, Jeffrey A. Grover, William D. Oliver

Abstract: Superconducting quantum processors are a compelling platform for analog quantum simulation due to the precision control, fast operation, and site-resolved readout inherent to the hardware. Arrays of coupled superconducting qubits natively emulate the dynamics of interacting particles according to the Bose-Hubbard model. However, many interesting condensed-matter phenomena emerge only in the presen… ▽ More Superconducting quantum processors are a compelling platform for analog quantum simulation due to the precision control, fast operation, and site-resolved readout inherent to the hardware. Arrays of coupled superconducting qubits natively emulate the dynamics of interacting particles according to the Bose-Hubbard model. However, many interesting condensed-matter phenomena emerge only in the presence of electromagnetic fields. Here, we emulate the dynamics of charged particles in an electromagnetic field using a superconducting quantum simulator. We realize a broadly adjustable synthetic magnetic vector potential by applying continuous modulation tones to all qubits. We verify that the synthetic vector potential obeys requisite properties of electromagnetism: a spatially-varying vector potential breaks time-reversal symmetry and generates a gauge-invariant synthetic magnetic field, and a temporally-varying vector potential produces a synthetic electric field. We demonstrate that the Hall effect--the transverse deflection of a charged particle propagating in an electromagnetic field--exists in the presence of the synthetic electromagnetic field. △ Less

Submitted 1 May, 2024; originally announced May 2024.

Comments: 9 pages, 5 figures, and Supplementary Information

arXiv:2405.00493 [pdf, other]

A study of Galactic Plane Planck Galactic Cold Clumps observed by SCOPE and the JCMT Plane Survey

Authors: D. J. Eden, Tie Liu, T. J. T. Moore, J. Di Francesco, G. Fuller, Kee-Tae Kim, Di Li, S. -Y. Liu, R. Plume, Ken'ichi Tatematsu, M. A. Thompson, Y. Wu, L. Bronfman, H. M. Butner, M. J. Currie, G. Garay, P. F. Goldsmith, N. Hirano, D. Johnstone, M. Juvela, S. -P. Lai, C. W. Lee, E. E. Mannfors, F. Olguin, K. Pattle , et al. (10 additional authors not shown)

Abstract: We have investigated the physical properties of Planck Galactic Cold Clumps (PGCCs) located in the Galactic Plane, using the JCMT Plane Survey (JPS) and the SCUBA-2 Continuum Observations of Pre-protostellar Evolution (SCOPE) survey. By utilising a suite of molecular-line surveys, velocities and distances were assigned to the compact sources within the PGCCs, placing them in a Galactic context. Th… ▽ More We have investigated the physical properties of Planck Galactic Cold Clumps (PGCCs) located in the Galactic Plane, using the JCMT Plane Survey (JPS) and the SCUBA-2 Continuum Observations of Pre-protostellar Evolution (SCOPE) survey. By utilising a suite of molecular-line surveys, velocities and distances were assigned to the compact sources within the PGCCs, placing them in a Galactic context. The properties of these compact sources show no large-scale variations with Galactic environment. Investigating the star-forming content of the sample, we find that the luminosity-to-mass ratio (L/M) is an order of magnitude lower than in other Galactic studies, indicating that these objects are hosting lower levels of star formation. Finally, by comparing ATLASGAL sources that are associated or are not associated with PGCCs, we find that those associated with PGCCs are typically colder, denser, and have a lower L/M ratio, hinting that PGCCs are a distinct population of Galactic Plane sources. △ Less

Submitted 1 May, 2024; originally announced May 2024.

Comments: 18 pages, 14 figures, 7 tables. Accepted for publication in MNRAS

arXiv:2404.19535 [pdf, other]

Ferroelectrically-enhanced Schottky barrier transistors for Logic-in-Memory applications

Authors: Daniele Nazzari, Lukas Wind, Masiar Sistani, Dominik Mayr, Kihye Kim, Walter M. Weber

Abstract: Artificial neural networks (ANNs) have had an enormous impact on a multitude of sectors, from research to industry, generating an unprecedented demand for tailor-suited hardware platforms. Their training and execution is highly memory-intensive, clearly evidencing the limitations affecting the currently available hardware based on the von Neumann architecture, which requires frequent data shuttlin… ▽ More Artificial neural networks (ANNs) have had an enormous impact on a multitude of sectors, from research to industry, generating an unprecedented demand for tailor-suited hardware platforms. Their training and execution is highly memory-intensive, clearly evidencing the limitations affecting the currently available hardware based on the von Neumann architecture, which requires frequent data shuttling due to the physical separation of logic and memory units. This does not only limit the achievable performances but also greatly increases the energy consumption, hindering the integration of ANNs into low-power platforms. New Logic in Memory (LiM) architectures, able to unify memory and logic functionalities into a single component, are highly promising for overcoming these limitations, by drastically reducing the need of data transfers. Recently, it has been shown that a very flexible platform for logic applications can be realized recurring to a multi-gated Schottky-Barrier Field Effect Transistor (SBFET). If equipped with memory capabilities, this architecture could represent an ideal building block for versatile LiM hardware. To reach this goal, here we investigate the integration of a ferroelectric Hf$_{0.5}$Zr$_{0.5}$O$_2$ (HZO) layer onto Dual Top Gated SBFETs. We demonstrate that HZO polarization charges can be successfully employed to tune the height of the two Schottky barriers, influencing the injection behavior, thus defining the transistor mode, switching it between n and p-type transport. The modulation strength is strongly dependent on the polarization pulse height, allowing for the selection of multiple current levels. All these achievable states can be well retained over time, thanks to the HZO stability. The presented result show how ferroelectric-enhanced SBFETs are promising for the realization of novel LiM hardware, enabling low-power circuits for ANNs execution. △ Less

Submitted 30 April, 2024; originally announced April 2024.

arXiv:2404.18306 [pdf, other]

Tunable Ultrafast Dynamics of Antiferromagnetic Vortices in Nanoscale Dots

Authors: Ji Zou, Even Thingstad, Se Kwon Kim, Jelena Klinovaja, Daniel Loss

Abstract: Topological vortex textures in magnetic disks have garnered great attention due to their interesting physics and diverse applications. However, up to now, the vortex state has mainly been studied in microsize ferromagnetic disks, which have oscillation frequencies confined to the GHz range. Here, we propose an experimentally feasible ultrasmall and ultrafast vortex state in an antiferromagnetic na… ▽ More Topological vortex textures in magnetic disks have garnered great attention due to their interesting physics and diverse applications. However, up to now, the vortex state has mainly been studied in microsize ferromagnetic disks, which have oscillation frequencies confined to the GHz range. Here, we propose an experimentally feasible ultrasmall and ultrafast vortex state in an antiferromagnetic nanodot surrounded by a heavy metal, which is further harnessed to construct a highly tunable vortex network. We theoretically demonstrate that, interestingly, the interfacial Dzyaloshinskii-Moriya interaction (iDMI) induced by the heavy metal at the boundary of the dot acts as an effective chemical potential for the vortices in the interior. Mimicking the creation of a superfluid vortex by rotation, we show that a magnetic vortex state can be stabilized by this iDMI. Subjecting the system to an electric current can trigger vortex oscillations via spin-transfer torque, which reside in the THz regime and can be further modulated by external magnetic fields. Furthermore, we show that coherent coupling between vortices in different nanodisks can be achieved via an antiferromagnetic link. Remarkably, this interaction depends on the vortex polarity and topological charge and is also exceptionally tunable through the vortex resonance frequency. This opens up the possibility for controllable interconnected networks of antiferromagnetic vortices. Our proposal therefore introduces a new avenue for developing high-density memory, ultrafast logic devices, and THz signal generators, which are ideal for compact integration into microchips. △ Less

Submitted 28 April, 2024; originally announced April 2024.

Comments: 11 pages including supplemental material; 4 figures

arXiv:2404.12817 [pdf, other]

Determination of the CKM angle $φ_{3}$ from a combination of Belle and Belle II results

Authors: Belle, Belle II Collaborations, :, I. Adachi, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, S. Al Said, N. Anh Ky, D. M. Asner, H. Atmacan, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien , et al. (377 additional authors not shown)

Abstract: We report a determination of the CKM angle $φ_{3}$, also known as $γ$, from a combination of measurements using samples of up to 711~fb$^{-1}$ from the Belle experiment and up to 362~fb$^{-1}$ from the Belle II experiment. We combine results from analyses of $B^+\to DK^+, B^+\to Dπ^+$, and $B^+ \to D^{*}K^+$ decays, where $D$ is an admixture of $D^0$ and $\overline{D}{}^{0}$ mesons, in a likelihoo… ▽ More We report a determination of the CKM angle $φ_{3}$, also known as $γ$, from a combination of measurements using samples of up to 711~fb$^{-1}$ from the Belle experiment and up to 362~fb$^{-1}$ from the Belle II experiment. We combine results from analyses of $B^+\to DK^+, B^+\to Dπ^+$, and $B^+ \to D^{*}K^+$ decays, where $D$ is an admixture of $D^0$ and $\overline{D}{}^{0}$ mesons, in a likelihood fit to obtain $φ_{3} = (78.6^{+7.2}_{-7.3})^{\circ}$. We also briefly discuss the interpretation of this result. △ Less

Submitted 19 April, 2024; originally announced April 2024.

Comments: 31 pages, 4 figures

Report number: Belle II Preprint 2023-015, KEK Preprint 2023-31

arXiv:2404.10874 [pdf, other]

doi 10.1103/PhysRevD.109.L111103

Measurement of the branching fraction of the decay $B^- \to D^0 ρ(770)^-$ at Belle II

Authors: Belle II Collaboration, I. Adachi, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, N. Anh Ky, D. M. Asner, H. Atmacan, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien, F. Becherer, J. Becker, J. V. Bennett , et al. (367 additional authors not shown)

Abstract: We measure the branching fraction of the decay $B^- \to D^0 ρ(770)^-$ using data collected with the Belle II detector. The data contain 387 million $B\overline{B}$ pairs produced in $e^+e^-$ collisions at the $Υ(4S)$ resonance. We reconstruct $8360\pm 180$ decays from an analysis of the distributions of the $B^-$ energy and the $ρ(770)^-$ helicity angle. We determine the branching fraction to be… ▽ More We measure the branching fraction of the decay $B^- \to D^0 ρ(770)^-$ using data collected with the Belle II detector. The data contain 387 million $B\overline{B}$ pairs produced in $e^+e^-$ collisions at the $Υ(4S)$ resonance. We reconstruct $8360\pm 180$ decays from an analysis of the distributions of the $B^-$ energy and the $ρ(770)^-$ helicity angle. We determine the branching fraction to be $(0.939 \pm 0.021\mathrm{(stat)} \pm 0.050\mathrm{(syst)})\%$, in agreement with previous results. Our measurement improves the relative precision of the world average by more than a factor of two. △ Less

Submitted 27 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

Report number: Belle II Preprint 2024-011, KEK Preprint 2024-4

Journal ref: PRD 109, 111103 (2024)

arXiv:2404.10294 [pdf, other]

Topological Fukaya category of tagged arcs

Authors: Cheol-Hyun Cho, Kyoungmo Kim

Abstract: A tagged arc on a surface is introduced by Fomin, Shapiro, and Thurston to study cluster theory on marked surfaces. Given a tagged arc system on a graded marked surface, we define its $\mathbb{Z}$-graded $\mathcal{A}_\infty$-category, generalizing the construction of Haiden, Katzarkov, and Kontsevich for arc systems. When a tagged arc system arises from a non-trivial involution on a marked surface… ▽ More A tagged arc on a surface is introduced by Fomin, Shapiro, and Thurston to study cluster theory on marked surfaces. Given a tagged arc system on a graded marked surface, we define its $\mathbb{Z}$-graded $\mathcal{A}_\infty$-category, generalizing the construction of Haiden, Katzarkov, and Kontsevich for arc systems. When a tagged arc system arises from a non-trivial involution on a marked surface, we show that this $\mathcal{A}_\infty$-category is quasi-isomorphic to the invariant part of the topological Fukaya category under the involution. In particular, this identifies tagged arcs with non-geometric idempotents of Fukaya category. △ Less

Submitted 16 April, 2024; originally announced April 2024.

Comments: 57 pages, 18 figures

MSC Class: 53D37; 16E35

arXiv:2404.09603 [pdf, ps, other]

Construction of smooth chiral finite-time blow-up solutions to Calogero--Moser derivative nonlinear Schrödinger equation

Authors: Kihyun Kim, Taegyu Kim, Soonsik Kwon

Abstract: We consider the Calogero--Moser derivative nonlinear Schrödinger equation (CM-DNLS), which is an $L^{2}$-critical nonlinear Schrödinger equation with explicit solitons, self-duality, and pseudo-conformal symmetry. More importantly, this equation is known to be completely integrable in the Hardy space $L_{+}^{2}$ and the solutions in this class are referred to as \emph{chiral} solutions. A rigorous… ▽ More We consider the Calogero--Moser derivative nonlinear Schrödinger equation (CM-DNLS), which is an $L^{2}$-critical nonlinear Schrödinger equation with explicit solitons, self-duality, and pseudo-conformal symmetry. More importantly, this equation is known to be completely integrable in the Hardy space $L_{+}^{2}$ and the solutions in this class are referred to as \emph{chiral} solutions. A rigorous PDE analysis of this equation with complete integrability was recently initiated by Gérard and Lenzmann. Our main result constructs smooth, chiral, and finite energy finite-time blow-up solutions with mass arbitrarily close to that of soliton, answering the global regularity question for chiral solutions raised by Gérard and Lenzmann. The blow-up rate obtained for these solutions is different from the pseudo-conformal rate. Our proof also gives a construction of a codimension one set of smooth finite energy initial data (but without addressing chirality) leading to the same blow-up dynamics. Our blow-up construction in the Hardy space might also be contrasted with the global well-posedness of the derivative nonlinear Schrödinger equation (DNLS), which is another integrable $L^{2}$-critical Schrödinger equation. The overall scheme of our proof is the forward construction of blow-up dynamics with modulation analysis and is not reliant on complete integrability. We begin with developing a linear theory for the near soliton dynamics. We discover a nontrivial conjugation identity, which unveils a surprising connection from the linearized (CM-DNLS) to the 1D free Schrödinger equation, which is a crucial ingredient for overcoming the difficulties from the non-local nonlinearity. Another principal challenge in this work, the slow decay of soliton, is overcome by introducing a trick of decomposing solutions depending on topologies, which we believe is of independent interest. △ Less

Submitted 15 April, 2024; originally announced April 2024.

Comments: 99 pages

MSC Class: 35B44 (primary); 35Q55; 37K10; 37K40

arXiv:2404.09482 [pdf, other]

Binary microlensing by high eccentric stellar-mass black hole binaries

Authors: Kyungmin Kim, Yeong-Bok Bae, Yoon-Hyun Ryu

Abstract: Microlensing is one of the most promising tools for discovering stellar-mass black holes (BHs) in the Milky Way because it allows us to probe dark or faint celestial compact objects. While the existence of stellar-mass BHs has been confirmed through observation of X-ray binaries within our galaxy and gravitational waves from extragalactic BH binaries, a conclusive observation of microlensing event… ▽ More Microlensing is one of the most promising tools for discovering stellar-mass black holes (BHs) in the Milky Way because it allows us to probe dark or faint celestial compact objects. While the existence of stellar-mass BHs has been confirmed through observation of X-ray binaries within our galaxy and gravitational waves from extragalactic BH binaries, a conclusive observation of microlensing events caused by Galactic BH binaries has yet to be achieved. In this study, we focus on those with high eccentricity, including unbound orbits, which can dynamically form in star clusters and could potentially increase the observation rate. We demonstrate parameter estimation for simulated light curves supposing various orbital configurations of BH binary lenses. We employ a model-based fitting using the Nelder-Mead method and Bayesian inference based on the Markov chain Monte Carlo method for the demonstration. The results show that we can retrieve true values of the parameters of high eccentric BH binary lenses within the 1$σ$ uncertainty of inferred values. We conclude it is feasible to find high eccentric Galactic BH binaries from the observation of binary microlensing events. △ Less

Submitted 15 April, 2024; originally announced April 2024.

Comments: 12 pages, 9 figures, 4 tables

arXiv:2404.09122 [pdf, other]

Monotonicity of RG flow in emergent dual holography of worldsheet nonlinear $σ$ model

Authors: Ki-Seok Kim, Arpita Mitra, Debangshu Mukherjee, Shinsei Ryu

Abstract: Based on the renormalization group (RG) flow of worldsheet bosonic string theory, we construct an effective holographic dual description of the target space theory identifying the RG scale with the emergent extra dimension. This results in an effective dilaton-gravity theory, analogous to the low-energy description of bosonic M-theory. We argue that this holographic dual effective field theory is… ▽ More Based on the renormalization group (RG) flow of worldsheet bosonic string theory, we construct an effective holographic dual description of the target space theory identifying the RG scale with the emergent extra dimension. This results in an effective dilaton-gravity theory, analogous to the low-energy description of bosonic M-theory. We argue that this holographic dual effective field theory is non-perturbative in the $α'$ expansion. To investigate the monotonicity of RG flow of the metric in the emergent spacetime, we consider entropy production along the RG flow. We construct a microscopic entropy functional based on the probability distribution function of the holographic dual effective field theory, regarded as Gibbs or Shannon entropy. Given that the Ricci flow represents the 1-loop RG flow equation of the target space metric for the 2D non-linear sigma model, and motivated by Perelman's proof of the monotonicity of Ricci flow, we propose a Perelman's entropy functional for the holographic dual effective field theory. Furthermore, utilizing the equivalence between the Hamilton-Jacobi equation and the local RG equation, we show that the RG flow of holographic Perelman's entropy functional is the Weyl anomaly. This eventually verify the monotonicity of RG flow for the emergent target spacetime. Interestingly, we find that the microscopic entropy production rate can be determined by integrating the rate of change of the holographic Perelman's entropy functional over all possible metric configurations along the flow. △ Less

Submitted 5 July, 2024; v1 submitted 13 April, 2024; originally announced April 2024.

Comments: The manuscript has been rewritten

arXiv:2404.08884 [pdf, other]

The Sunburst Arc with JWST: Detection of Wolf-Rayet stars injecting nitrogen into a low-metallicity, $z=2.37$ proto-globular cluster leaking ionizing photons

Authors: T. Emil Rivera-Thorsen, J. Chisholm, B. Welch, J. R. Rigby, T. Hutchison, M. Florian, K. Sharon, S. Choe, H. Dahle, M. B. Bayliss, G. Khullar, M. Gladders, M. Hayes, A. Adamo, M. R. Owens, K. Kim

Abstract: We report the detection of a population of Wolf-Rayet (WR) stars in the Sunburst Arc, a strongly gravitationally lensed galaxy at redshift $z=2.37$. As the brightest known lensed galaxy, the Sunburst Arc has become an important cosmic laboratory for studying star and cluster formation, Lyman $α$ radiative transfer, and Lyman Continuum (LyC) escape. Here, we present the first results of JWST/NIRCam… ▽ More We report the detection of a population of Wolf-Rayet (WR) stars in the Sunburst Arc, a strongly gravitationally lensed galaxy at redshift $z=2.37$. As the brightest known lensed galaxy, the Sunburst Arc has become an important cosmic laboratory for studying star and cluster formation, Lyman $α$ radiative transfer, and Lyman Continuum (LyC) escape. Here, we present the first results of JWST/NIRCam imaging and NIRSpec IFU observations of the Sunburst Arc, focusing on a stacked spectrum of the 12-fold imaged LyC-emitting (Sunburst LCE) cluster. In agreement with previous studies, we find that the cluster is massive and compact, with $M_{\text{dyn}} = (9\pm1) \times 10^{6} M_{\odot}$, Our age estimate of 4.2--4.5 Myr is much larger than the crossing time of $t_{\text{cross}} = 183 \pm 9 $ kyr, indicating that the cluster is dynamically evolved and consistent with being gravitationally bound. We find a significant nitrogen enhancement of the low ionization state ISM, with $\log(N/O) = -0.74 \pm 0.09$, which is $\approx 0.8$ dex above typical values for H II regions of similar metallicity in the local Universe. We find broad stellar emission complexes around He II$λ4686$ and C IV$λ5808$ with associated nitrogen emission -- this is the first time WR signatures have been directly observed at redshifts above $\sim 0.5$. The strength of the WR signatures cannot be reproduced by stellar population models that only include single-star evolution. While models with binary evolution better match the WR features, they still struggle to reproduce the nitrogen-enhanced WR features. JWST reveals the Sunburst LCE to be a highly ionized, proto-globular cluster with low oxygen abundance and extreme nitrogen enhancement that hosts a population of Wolf-Rayet stars, and possibly Very Massive stars (VMSs), which are rapidly enriching the surrounding medium. △ Less

Submitted 12 April, 2024; originally announced April 2024.

Comments: 8 pages, 4 figures, 3 tables. Submitted to A&A

arXiv:2404.08672 [pdf, other]

Taxonomy and Analysis of Sensitive User Queries in Generative AI Search

Authors: Hwiyeol Jo, Taiwoo Park, Nayoung Choi, Changbong Kim, Ohjoon Kwon, Donghyeon Jeon, Hyunwoo Lee, Eui-Hyeon Lee, Kyoungho Shin, Sun Suk Lim, Kyungmi Kim, Jihye Lee, Sun Kim

Abstract: Although there has been a growing interest among industries to integrate generative LLMs into their services, limited experiences and scarcity of resources acts as a barrier in launching and servicing large-scale LLM-based conversational services. In this paper, we share our experiences in developing and operating generative AI models within a national-scale search engine, with a specific focus on… ▽ More Although there has been a growing interest among industries to integrate generative LLMs into their services, limited experiences and scarcity of resources acts as a barrier in launching and servicing large-scale LLM-based conversational services. In this paper, we share our experiences in developing and operating generative AI models within a national-scale search engine, with a specific focus on the sensitiveness of user queries. We propose a taxonomy for sensitive search queries, outline our approaches, and present a comprehensive analysis report on sensitive queries from actual users. △ Less

Submitted 5 April, 2024; originally announced April 2024.

arXiv:2404.08133 [pdf, other]

Search for rare $b \to d\ell^+\ell^-$ transitions at Belle

Authors: Belle, Belle II Collaborations, :, I. Adachi, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, S. Al Said, D. M. Asner, H. Atmacan, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Beaubien, F. Becherer, J. Becker , et al. (371 additional authors not shown)

Abstract: We present the results of a search for the $b \to d\ell^+\ell^-$ flavor-changing neutral-current rare decays $B^{+, 0} \to (η, ω, π^{+,0}, ρ^{+, 0}) e^+e^-$ and $B^{+, 0} \to (η, ω, π^{0}, ρ^{+}) μ^+μ^-$ using a $711$ fb$^{-1}$ data sample that contains $772 \times 10^{6}$ $B\overline{B}$ events. The data were collected at the $Υ(4S)$ resonance with the Belle detector at the KEKB asymmetric-energy… ▽ More We present the results of a search for the $b \to d\ell^+\ell^-$ flavor-changing neutral-current rare decays $B^{+, 0} \to (η, ω, π^{+,0}, ρ^{+, 0}) e^+e^-$ and $B^{+, 0} \to (η, ω, π^{0}, ρ^{+}) μ^+μ^-$ using a $711$ fb$^{-1}$ data sample that contains $772 \times 10^{6}$ $B\overline{B}$ events. The data were collected at the $Υ(4S)$ resonance with the Belle detector at the KEKB asymmetric-energy $e^+e^-$ collider. We find no evidence for signal and set upper limits on branching fractions at the $90\%$ confidence level in the range $(3.8 - 47) \times 10^{-8}$ depending on the decay channel. The obtained limits are the world's best results. This is the first search for the channels $B^{+, 0} \to (ω, ρ^{+,0}) e^+e^-$ and $B^{+, 0} \to (ω, ρ^{+})μ^+μ^-$. △ Less

Submitted 11 April, 2024; originally announced April 2024.

Comments: 7 pages, 12 figures

Report number: Belle II Preprint 2024-005, KEK Preprint 2023-52

arXiv:2404.07947 [pdf, other]

ExeGPT: Constraint-Aware Resource Scheduling for LLM Inference

Authors: Hyungjun Oh, Kihong Kim, Jaemin Kim, Sungkyun Kim, Junyeol Lee, Du-seong Chang, Jiwon Seo

Abstract: This paper presents ExeGPT, a distributed system designed for constraint-aware LLM inference. ExeGPT finds and runs with an optimal execution schedule to maximize inference throughput while satisfying a given latency constraint. By leveraging the distribution of input and output sequences, it effectively allocates resources and determines optimal execution configurations, including batch sizes and… ▽ More This paper presents ExeGPT, a distributed system designed for constraint-aware LLM inference. ExeGPT finds and runs with an optimal execution schedule to maximize inference throughput while satisfying a given latency constraint. By leveraging the distribution of input and output sequences, it effectively allocates resources and determines optimal execution configurations, including batch sizes and partial tensor parallelism. We also introduce two scheduling strategies based on Round-Robin Allocation and Workload-Aware Allocation policies, suitable for different NLP workloads. We evaluate ExeGPT on six LLM instances of T5, OPT, and GPT-3 and five NLP tasks, each with four distinct latency constraints. Compared to FasterTransformer, ExeGPT achieves up to 15.2x improvements in throughput and 6x improvements in latency. Overall, ExeGPT achieves an average throughput gain of 2.9x across twenty evaluation scenarios. Moreover, when adapting to changing sequence distributions, the cost of adjusting the schedule in ExeGPT is reasonably modest. ExeGPT proves to be an effective solution for optimizing and executing LLM inference for diverse NLP workload and serving conditions. △ Less

Submitted 15 March, 2024; originally announced April 2024.

Comments: Accepted to ASPLOS 2024 (summer cycle)

Journal ref: 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems(ASPLOS 24 summer cycle), Volume 2, Nov 15, 2023 (Notification Date)

arXiv:2404.07021 [pdf, other]

A 4x32Gb/s 1.8pJ/bit Collaborative Baud-Rate CDR with Background Eye-Climbing Algorithm and Low-Power Global Clock Distribution

Authors: Jihee Kim, Jia Park, Jiwon Shin, Hanseok Kim, Kahyun Kim, Haengbeom Shin, Ha-Jung Park, Woo-Seok Choi

Abstract: This paper presents design techniques for an energy-efficient multi-lane receiver (RX) with baud-rate clock and data recovery (CDR), which is essential for high-throughput low-latency communication in high-performance computing systems. The proposed low-power global clock distribution not only significantly reduces power consumption across multi-lane RXs but is capable of compensating for the freq… ▽ More This paper presents design techniques for an energy-efficient multi-lane receiver (RX) with baud-rate clock and data recovery (CDR), which is essential for high-throughput low-latency communication in high-performance computing systems. The proposed low-power global clock distribution not only significantly reduces power consumption across multi-lane RXs but is capable of compensating for the frequency offset without any phase interpolators. To this end, a fractional divider controlled by CDR is placed close to the global phase locked loop. Moreover, in order to address the sub-optimal lock point of conventional baud-rate phase detectors, the proposed CDR employs a background eye-climbing algorithm, which optimizes the sampling phase and maximizes the vertical eye margin (VEM). Fabricated in a 28nm CMOS process, the proposed 4x32Gb/s RX shows a low integrated fractional spur of -40.4dBc at a 2500ppm frequency offset. Furthermore, it improves bit-error-rate performance by increasing the VEM by 17%. The entire RX achieves the energy efficiency of 1.8pJ/bit with the aggregate data rate of 128Gb/s. △ Less

Submitted 22 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

arXiv:2404.06731 [pdf]

Accuracy of a Large Language Model in Distinguishing Anti- And Pro-vaccination Messages on Social Media: The Case of Human Papillomavirus Vaccination

Authors: Soojong Kim, Kwanho Kim, Claire Wonjeong Jo

Abstract: Objective. Vaccination has engendered a spectrum of public opinions, with social media acting as a crucial platform for health-related discussions. The emergence of artificial intelligence technologies, such as large language models (LLMs), offers a novel opportunity to efficiently investigate public discourses. This research assesses the accuracy of ChatGPT, a widely used and freely available ser… ▽ More Objective. Vaccination has engendered a spectrum of public opinions, with social media acting as a crucial platform for health-related discussions. The emergence of artificial intelligence technologies, such as large language models (LLMs), offers a novel opportunity to efficiently investigate public discourses. This research assesses the accuracy of ChatGPT, a widely used and freely available service built upon an LLM, for sentiment analysis to discern different stances toward Human Papillomavirus (HPV) vaccination. Methods. Messages related to HPV vaccination were collected from social media supporting different message formats: Facebook (long format) and Twitter (short format). A selection of 1,000 human-evaluated messages was input into the LLM, which generated multiple response instances containing its classification results. Accuracy was measured for each message as the level of concurrence between human and machine decisions, ranging between 0 and 1. Results. Average accuracy was notably high when 20 response instances were used to determine the machine decision of each message: .882 (SE = .021) and .750 (SE = .029) for anti- and pro-vaccination long-form; .773 (SE = .027) and .723 (SE = .029) for anti- and pro-vaccination short-form, respectively. Using only three or even one instance did not lead to a severe decrease in accuracy. However, for long-form messages, the language model exhibited significantly lower accuracy in categorizing pro-vaccination messages than anti-vaccination ones. Conclusions. ChatGPT shows potential in analyzing public opinions on HPV vaccination using social media content. However, understanding the characteristics and limitations of a language model within specific public health contexts remains imperative. △ Less

Submitted 10 April, 2024; originally announced April 2024.

Comments: Forthcoming in Preventive Medicine Reports

arXiv:2404.06324 [pdf, other]

Dynamic D2D-Assisted Federated Learning over O-RAN: Performance Analysis, MAC Scheduler, and Asymmetric User Selection

Authors: Payam Abdisarabshali, Kwang Taik Kim, Michael Langberg, Weifeng Su, Seyyedali Hosseinalipour

Abstract: Existing studies on federated learning (FL) are mostly focused on system orchestration for static snapshots of the network and making static control decisions (e.g., spectrum allocation). However, real-world wireless networks are susceptible to temporal variations of wireless channel capacity and users' datasets. In this paper, we incorporate multi-granular system dynamics (MSDs) into FL, includin… ▽ More Existing studies on federated learning (FL) are mostly focused on system orchestration for static snapshots of the network and making static control decisions (e.g., spectrum allocation). However, real-world wireless networks are susceptible to temporal variations of wireless channel capacity and users' datasets. In this paper, we incorporate multi-granular system dynamics (MSDs) into FL, including (M1) dynamic wireless channel capacity, captured by a set of discrete-time events, called $\mathscr{D}$-Events, and (M2) dynamic datasets of users. The latter is characterized by (M2-a) modeling the dynamics of user's dataset size via an ordinary differential equation and (M2-b) introducing dynamic model drift}, formulated via a partial differential inequality} drawing concrete analytical connections between the dynamics of users' datasets and FL accuracy. We then conduct FL orchestration under MSDs by introducing dynamic cooperative FL with dedicated MAC schedulers (DCLM), exploiting the unique features of open radio access network (O-RAN). DCLM proposes (i) a hierarchical device-to-device (D2D)-assisted model training, (ii) dynamic control decisions through dedicated O-RAN MAC schedulers, and (iii) asymmetric user selection. We provide extensive theoretical analysis to study the convergence of DCLM. We then optimize the degrees of freedom (e.g., user selection and spectrum allocation) in DCLM through a highly non-convex optimization problem. We develop a systematic approach to obtain the solution for this problem, opening the door to solving a broad variety of network-aware FL optimization problems. We show the efficiency of DCLM via numerical simulations and provide a series of future directions. △ Less

Submitted 9 April, 2024; originally announced April 2024.

Comments: 120 pages, 13 figures

arXiv:2404.05916 [pdf, other]

Prompt-driven Universal Model for View-Agnostic Echocardiography Analysis

Authors: Sekeun Kim, Hui Ren, Peng Guo, Abder-Rahman Ali, Patrick Zhang, Kyungsang Kim, Xiang Li, Quanzheng Li

Abstract: Echocardiography segmentation for cardiac analysis is time-consuming and resource-intensive due to the variability in image quality and the necessity to process scans from various standard views. While current automated segmentation methods in echocardiography show promising performance, they are trained on specific scan views to analyze corresponding data. However, this solution has a limitation… ▽ More Echocardiography segmentation for cardiac analysis is time-consuming and resource-intensive due to the variability in image quality and the necessity to process scans from various standard views. While current automated segmentation methods in echocardiography show promising performance, they are trained on specific scan views to analyze corresponding data. However, this solution has a limitation as the number of required models increases with the number of standard views. To address this, in this paper, we present a prompt-driven universal method for view-agnostic echocardiography analysis. Considering the domain shift between standard views, we first introduce a method called prompt matching, aimed at learning prompts specific to different views by matching prompts and querying input embeddings using a pre-trained vision model. Then, we utilized a pre-trained medical language model to align textual information with pixel data for accurate segmentation. Extensive experiments on three standard views showed that our approach significantly outperforms the state-of-the-art universal methods and achieves comparable or even better performances over the segmentation model trained and tested on same views. △ Less

Submitted 8 April, 2024; originally announced April 2024.

arXiv:2404.04915 [pdf, other]

Measurement of the $e^+e^- \to π^+π^-π^0$ cross section in the energy range 0.62-3.50 GeV at Belle II

Authors: Belle II Collaboration, I. Adachi, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, N. Anh Ky, D. M. Asner, H. Atmacan, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien, F. Becherer, J. Becker, J. V. Bennett , et al. (338 additional authors not shown)

Abstract: We report a measurement of the $e^+e^- \to π^+π^-π^0$ cross section in the energy range from 0.62 to 3.50 GeV using an initial-state radiation technique. We use an $e^+e^-$ data sample corresponding to 191 $\text{fb}^{-1}$ of integrated luminosity, collected at a center-of-mass energy at or near the $Υ{(4S)}$ resonance with the Belle II detector at the SuperKEKB collider. Signal yields are extract… ▽ More We report a measurement of the $e^+e^- \to π^+π^-π^0$ cross section in the energy range from 0.62 to 3.50 GeV using an initial-state radiation technique. We use an $e^+e^-$ data sample corresponding to 191 $\text{fb}^{-1}$ of integrated luminosity, collected at a center-of-mass energy at or near the $Υ{(4S)}$ resonance with the Belle II detector at the SuperKEKB collider. Signal yields are extracted by fitting the two-photon mass distribution in $e^+e^- \to π^+π^-π^0γ$ events, which involve a $π^0 \to γγ$ decay and an energetic photon radiated from the initial state. Signal efficiency corrections with an accuracy of 1.6% are obtained from several control data samples. The uncertainty on the cross section at the $ω$ and $φ$ resonances is dominated by the systematic uncertainty of 2.2%. The resulting cross sections in the 0.62-1.80 GeV energy range yield $ a_μ^{3π} = [48.91 \pm 0.23~(\mathrm{stat}) \pm 1.07~(\mathrm{syst})] \times 10^{-10} $ for the leading-order hadronic vacuum polarization contribution to the muon anomalous magnetic moment. This result differs by $2.5$ standard deviations from the most precise current determination. △ Less

Submitted 7 April, 2024; originally announced April 2024.

Comments: 23 pages, 24 figures, submitted to PRD

Report number: KEK Preprint 2023-51, Belle II Preprint 2024-004

arXiv:2404.04247 [pdf, ps, other]

On classification of global dynamics for energy-critical equivariant harmonic map heat flows and radial nonlinear heat equation

Authors: Kihyun Kim, Frank Merle

Abstract: We consider the global dynamics of finite energy solutions to energy-critical equivariant harmonic map heat flow (HMHF) and radial nonlinear heat equation (NLH). It is known that any finite energy equivariant solutions to (HMHF) decompose into finitely many harmonic maps (bubbles) separated by scales and a body map, as approaching to the maximal time of existence. Our main result for (HMHF) gives… ▽ More We consider the global dynamics of finite energy solutions to energy-critical equivariant harmonic map heat flow (HMHF) and radial nonlinear heat equation (NLH). It is known that any finite energy equivariant solutions to (HMHF) decompose into finitely many harmonic maps (bubbles) separated by scales and a body map, as approaching to the maximal time of existence. Our main result for (HMHF) gives a complete classification of their dynamics for equivariance indices $D\geq3$; (i) they exist globally in time, (ii) the number of bubbles and signs are determined by the energy class of the initial data, and (iii) the scales of bubbles are asymptotically given by a universal sequence of rates up to scaling symmetry. In parallel, we also obtain a complete classification of $\dot{H}^{1}$-bounded radial solutions to (NLH) in dimensions $N\geq7$, building upon soliton resolution for such solutions. To our knowledge, this provides the first rigorous classification of bubble tree dynamics within symmetry. We introduce a new approach based on the energy method that does not rely on maximum principle. The key ingredient of the proof is a monotonicity estimate near any bubble tree configurations, which in turn requires a delicate construction of modified multi-bubble profiles also. △ Less

Submitted 5 April, 2024; originally announced April 2024.

Comments: 44 pages

MSC Class: 35K58 (primary); 35B40; 37K40; 58E20

arXiv:2404.04096 [pdf, other]

Machine Learning-Aided Cooperative Localization under Dense Urban Environment

Authors: Hoon Lee, Hong Ki Kim, Seung Hyun Oh, Sang Hyun Lee

Abstract: Future wireless network technology provides automobiles with the connectivity feature to consolidate the concept of vehicular networks that collaborate on conducting cooperative driving tasks. The full potential of connected vehicles, which promises road safety and quality driving experience, can be leveraged if machine learning models guarantee the robustness in performing core functions includin… ▽ More Future wireless network technology provides automobiles with the connectivity feature to consolidate the concept of vehicular networks that collaborate on conducting cooperative driving tasks. The full potential of connected vehicles, which promises road safety and quality driving experience, can be leveraged if machine learning models guarantee the robustness in performing core functions including localization and controls. Location awareness, in particular, lends itself to the deployment of location-specific services and the improvement of the operation performance. The localization entails direct communication to the network infrastructure, and the resulting centralized positioning solutions readily become intractable as the network scales up. As an alternative to the centralized solutions, this article addresses decentralized principle of vehicular localization reinforced by machine learning techniques in dense urban environments with frequent inaccessibility to reliable measurement. As such, the collaboration of multiple vehicles enhances the positioning performance of machine learning approaches. A virtual testbed is developed to validate this machine learning model for real-map vehicular networks. Numerical results demonstrate universal feasibility of cooperative localization, in particular, for dense urban area configurations. △ Less

Submitted 5 April, 2024; originally announced April 2024.

arXiv:2404.03691 [pdf, other]

Upgrade of NaI(Tl) crystal encapsulation for the NEON experiment

Authors: J. J. Choi, E. J. Jeon, J. Y. Kim, K. W. Kim, S. H. Kim, S. K. Kim, Y. D. Kim, Y. J. Ko, B. C. Koh, C. Ha, B. J. Park, S. H. Lee, I. S. Lee, H. Lee, H. S. Lee, J. Lee, Y. M. Oh

Abstract: The Neutrino Elastic-scattering Observation with NaI(Tl) experiment (NEON) aims to detect coherent elastic neutrino-nucleus scattering~(\cenns) in a NaI(Tl) crystal using reactor anti-electron neutrinos at the Hanbit nuclear power plant complex. A total of 13.3 kg of NaI(Tl) crystals were initially installed in December 2020 at the tendon gallery, 23.7$\pm$0.3\,m away from the reactor core, which… ▽ More The Neutrino Elastic-scattering Observation with NaI(Tl) experiment (NEON) aims to detect coherent elastic neutrino-nucleus scattering~(\cenns) in a NaI(Tl) crystal using reactor anti-electron neutrinos at the Hanbit nuclear power plant complex. A total of 13.3 kg of NaI(Tl) crystals were initially installed in December 2020 at the tendon gallery, 23.7$\pm$0.3\,m away from the reactor core, which operates at a thermal power of 2.8\,GW. Initial engineering operation was performed from May 2021 to March 2022 and observed unexpected photomultiplier-induced noise and a decreased light yield that were caused by leakage of liquid scintillator into the detector due to weakness of detector encapsulation. We upgraded the detector encapsulation design to prevent the leakage of the liquid scintillator. Meanwhile two small-sized detectors were replaced with larger ones resulting in a total mass of 16.7\,kg. With this new design implementation, the detector system has been operating stably since April 2022 for over a year without detector gain drop. In this paper, we present an improved crystal encapsulation design and stability of the NEON experiment. △ Less

Submitted 28 June, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

arXiv:2404.01954 [pdf, other]

HyperCLOVA X Technical Report

Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment to responsible AI. The model is evaluated across various benchmarks, including comprehensive reasoning, knowledge, commonsense, factuality, coding, math, chatting, instruction-following, and harmlessness, in both Korean and English. HyperCLOVA X exhibits strong reasoning capabilities in Korean backed by a deep understanding of the language and cultural nuances. Further analysis of the inherent bilingual nature and its extension to multilingualism highlights the model's cross-lingual proficiency and strong generalization ability to untargeted languages, including machine translation between several language pairs and cross-lingual inference tasks. We believe that HyperCLOVA X can provide helpful guidance for regions or countries in developing their sovereign LLMs. △ Less

Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

Comments: 44 pages; updated authors list and fixed author names

Showing 51–100 of 4,231 results for author: Kim, K