-
Spin-polarized DFT calculations for physical properties of novel KVSb half-Heusler compound for spintronic and thermodynamic applicability
Authors:
Ashwani Kumar,
Anupam,
Shyam L. Gupta,
Sumit Kumar,
Vipan Kumar,
Diwaker
Abstract:
In the reported study we have investigated the robust phase stability, elasto-mechanical, thermophysical and magnetic properties of KVSb half Heusler compound by implementing density functional theory models in Wien2k simulation package. The dynamic phase stability is computed in phase type I, II & III phase configurations by optimising their energy. It is observed that given compound is more stab…
▽ More
In the reported study we have investigated the robust phase stability, elasto-mechanical, thermophysical and magnetic properties of KVSb half Heusler compound by implementing density functional theory models in Wien2k simulation package. The dynamic phase stability is computed in phase type I, II & III phase configurations by optimising their energy. It is observed that given compound is more stable in spin-polarised state of phase type I. To explore the electronic band structure, we apply the generalised gradient approximation. The electronic band profile of the Heusler alloy display a half-metallic nature. Moreover, the calculated second-order elastic parameters divulge the ductile nature. To understand the thermodynamical and thermoelectric stability of the alloy at various temperature and pressures ranges we have utilised the Quasi-Harmonic Debye model. The computed value of magnetic moment found in good agreement with Slater-Pauling rule. Our findings confirms that the predicted half Heusler alloy can be used in various spintronics and thermoelectric applications.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
Pragmatic Formal Verification Methodology for Clock Domain Crossing (CDC)
Authors:
Aman Kumar,
Muhammad Ul Haque Khan,
Bijitendra Mittra
Abstract:
Modern System-on-Chip (SoC) designs are becoming more and more complex due to the technology upscaling. SoC designs often operate on multiple asynchronous clock domains, further adding to the complexity of the overall design. To make the devices power efficient, designers take a Globally-Asynchronous Locally-Synchronous (GALS) approach that creates multiple asynchronous domains. These Clock Domain…
▽ More
Modern System-on-Chip (SoC) designs are becoming more and more complex due to the technology upscaling. SoC designs often operate on multiple asynchronous clock domains, further adding to the complexity of the overall design. To make the devices power efficient, designers take a Globally-Asynchronous Locally-Synchronous (GALS) approach that creates multiple asynchronous domains. These Clock Domain Crossings (CDC) are prone to metastability effects, and functional verification of such CDC is very important to ensure that no bug escapes. Conventional verification methods, such as register transfer level (RTL) simulations and static timing analysis, are not enough to address these CDC issues, which may lead to verification gaps. Additionally, identifying these CDC-related bugs is very time-consuming and is one of the most common reasons for costly silicon re-spins. This paper is focused on the development of a pragmatic formal verification methodology to minimize the CDC issues by exercising Metastability Injection (MSI) in different CDC paths.
△ Less
Submitted 20 April, 2024;
originally announced June 2024.
-
Merlin: A Vision Language Foundation Model for 3D Computed Tomography
Authors:
Louis Blankemeier,
Joseph Paul Cohen,
Ashwin Kumar,
Dave Van Veen,
Syed Jamal Safdar Gardezi,
Magdalini Paschali,
Zhihong Chen,
Jean-Benoit Delbrouck,
Eduardo Reis,
Cesar Truyts,
Christian Bluethgen,
Malte Engmann Kjeldskov Jensen,
Sophie Ostmeier,
Maya Varma,
Jeya Maria Jose Valanarasu,
Zhongnan Fang,
Zepeng Huo,
Zaid Nabulsi,
Diego Ardila,
Wei-Hung Weng,
Edson Amaro Junior,
Neera Ahuja,
Jason Fries,
Nigam H. Shah,
Andrew Johnston
, et al. (6 additional authors not shown)
Abstract:
Over 85 million computed tomography (CT) scans are performed annually in the US, of which approximately one quarter focus on the abdomen. Given the current radiologist shortage, there is a large impetus to use artificial intelligence to alleviate the burden of interpreting these complex imaging studies. Prior state-of-the-art approaches for automated medical image interpretation leverage vision la…
▽ More
Over 85 million computed tomography (CT) scans are performed annually in the US, of which approximately one quarter focus on the abdomen. Given the current radiologist shortage, there is a large impetus to use artificial intelligence to alleviate the burden of interpreting these complex imaging studies. Prior state-of-the-art approaches for automated medical image interpretation leverage vision language models (VLMs). However, current medical VLMs are generally limited to 2D images and short reports, and do not leverage electronic health record (EHR) data for supervision. We introduce Merlin - a 3D VLM that we train using paired CT scans (6+ million images from 15,331 CTs), EHR diagnosis codes (1.8+ million codes), and radiology reports (6+ million tokens). We evaluate Merlin on 6 task types and 752 individual tasks. The non-adapted (off-the-shelf) tasks include zero-shot findings classification (31 findings), phenotype classification (692 phenotypes), and zero-shot cross-modal retrieval (image to findings and image to impressions), while model adapted tasks include 5-year disease prediction (6 diseases), radiology report generation, and 3D semantic segmentation (20 organs). We perform internal validation on a test set of 5,137 CTs, and external validation on 7,000 clinical CTs and on two public CT datasets (VerSe, TotalSegmentator). Beyond these clinically-relevant evaluations, we assess the efficacy of various network architectures and training strategies to depict that Merlin has favorable performance to existing task-specific baselines. We derive data scaling laws to empirically assess training data needs for requisite downstream task performance. Furthermore, unlike conventional VLMs that require hundreds of GPUs for training, we perform all training on a single GPU.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
Predicting edge-localized monovacancy defects in zigzag graphene nanoribbons from Floquet quasienergy spectrum
Authors:
Gulshan Kumar,
Shashikant Kumar,
Ajay Kumar,
Prakash Parida
Abstract:
In this work, we prescribe a theoretical framework aiming at predicting the position of monovacancy defects at the edges of zigzag graphene nanoribbons (ZGNRs) using Floquet-Bloch formalism, which can be experimentally observed through time- and angle-resolved photoemission spectroscopy (tr-ARPES). Our methodology involves an in-depth investigation of the Floquet quasienergy band spectrum influenc…
▽ More
In this work, we prescribe a theoretical framework aiming at predicting the position of monovacancy defects at the edges of zigzag graphene nanoribbons (ZGNRs) using Floquet-Bloch formalism, which can be experimentally observed through time- and angle-resolved photoemission spectroscopy (tr-ARPES). Our methodology involves an in-depth investigation of the Floquet quasienergy band spectrum influenced by light with varying polarization across a range of frequencies. Particularly under the influence of circularly polarized light with a frequency comparable to the bandwidth of the system, our findings suggest a promising approach for locating monovacancy defects at either edge, a challenge that proves intricate to predict from the ARPES spectrum of ZGNRs with monovacancy defects. This has been achieved by analyzing the orientation of the Floquet edge state and the appearance of new Dirac points in the vicinity of the Fermi level. The real-world applications of these captivating characteristics underscore the importance and pertinence of our theoretical framework, paving the way for additional exploration and practical use. Our approach, employing the Floquet formalism, is not limited to monovacancy-type defects; rather, it can be expanded to encompass various types of vacancy defects.
△ Less
Submitted 9 June, 2024;
originally announced June 2024.
-
Focusing of concentric free-surface waves
Authors:
Lohit Kayal,
Vatsal Sanjay,
Nikhil Yewale,
Anil Kumar,
Ratul Dasgupta
Abstract:
Gravito-capillary waves at free-surfaces are ubiquitous in several natural and industrial processes involving quiescent liquid pools bounded by cylindrical walls. These waves emanate from the relaxation of initial interface distortions, which often take the form of a cavity (depression) centred on the symmetry axis of the container. These surface waves reflect from the container walls leading to a…
▽ More
Gravito-capillary waves at free-surfaces are ubiquitous in several natural and industrial processes involving quiescent liquid pools bounded by cylindrical walls. These waves emanate from the relaxation of initial interface distortions, which often take the form of a cavity (depression) centred on the symmetry axis of the container. These surface waves reflect from the container walls leading to a radially inward propagating wave-train converging (focussing) onto the symmetry axis. Under the inviscid approximation and for sufficiently shallow cavities, the relaxation is well-described by the linearised potential-flow equations. Naturally, adding viscosity to such a system introduces viscous dissipation that enervates energy and dampens the oscillations at the symmetry axis.
However, for viscous liquids and deeper cavities, these equations are qualitatively inaccurate. In this study, we elucidate a modal approach to study the initial-value problem for concentric gravito-capillary waves generated on a free-surface for inviscid as well as viscous liquids. For a sufficiently deep cavity, the inward focusing of waves results in large interfacial oscillations at the axis, necessitating a second-order nonlinear theory. We demonstrate that this theory effectively models the interfacial behavior and highlights the crucial role of nonlinearity near the symmetry axis. Contrary to expectations, the addition of slight viscosity further intensifies the oscillations at the symmetry axis. This finding underscores the limitations of the potential flow model and suggests avenues for more accurate modelling of such complex free-surface flows.
△ Less
Submitted 8 June, 2024;
originally announced June 2024.
-
CRAG -- Comprehensive RAG Benchmark
Authors:
Xiao Yang,
Kai Sun,
Hao Xin,
Yushi Sun,
Nikita Bhalla,
Xiangsen Chen,
Sajal Choudhary,
Rongze Daniel Gui,
Ziran Will Jiang,
Ziyu Jiang,
Lingkun Kong,
Brian Moran,
Jiaqi Wang,
Yifan Ethan Xu,
An Yan,
Chenyu Yang,
Eting Yuan,
Hanwen Zha,
Nan Tang,
Lei Chen,
Nicolas Scheffer,
Yue Liu,
Nirav Shah,
Rakesh Wanga,
Anuj Kumar
, et al. (2 additional authors not shown)
Abstract:
Retrieval-Augmented Generation (RAG) has recently emerged as a promising solution to alleviate Large Language Model (LLM)'s deficiency in lack of knowledge. Existing RAG datasets, however, do not adequately represent the diverse and dynamic nature of real-world Question Answering (QA) tasks. To bridge this gap, we introduce the Comprehensive RAG Benchmark (CRAG), a factual question answering bench…
▽ More
Retrieval-Augmented Generation (RAG) has recently emerged as a promising solution to alleviate Large Language Model (LLM)'s deficiency in lack of knowledge. Existing RAG datasets, however, do not adequately represent the diverse and dynamic nature of real-world Question Answering (QA) tasks. To bridge this gap, we introduce the Comprehensive RAG Benchmark (CRAG), a factual question answering benchmark of 4,409 question-answer pairs and mock APIs to simulate web and Knowledge Graph (KG) search. CRAG is designed to encapsulate a diverse array of questions across five domains and eight question categories, reflecting varied entity popularity from popular to long-tail, and temporal dynamisms ranging from years to seconds. Our evaluation on this benchmark highlights the gap to fully trustworthy QA. Whereas most advanced LLMs achieve <=34% accuracy on CRAG, adding RAG in a straightforward manner improves the accuracy only to 44%. State-of-the-art industry RAG solutions only answer 63% questions without any hallucination. CRAG also reveals much lower accuracy in answering questions regarding facts with higher dynamism, lower popularity, or higher complexity, suggesting future research directions. The CRAG benchmark laid the groundwork for a KDD Cup 2024 challenge, attracting thousands of participants and submissions within the first 50 days of the competition. We commit to maintaining CRAG to serve research communities in advancing RAG solutions and general QA solutions.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
Mechanism of instability in non-uniform dusty channel flow
Authors:
Anup Kumar,
Rama Govindarajan
Abstract:
Particles in pressure-driven channel flow are often inhomogeneously distributed. Two modes of low-Reynolds number instability, absent in Poiseuille flow of clean fluid, are created by inhomogeneous particle loading, and their mechanism is worked out here. Two distinct classes of behaviour are seen: when the critical layer of the dominant perturbation overlaps with variations in particle concentrat…
▽ More
Particles in pressure-driven channel flow are often inhomogeneously distributed. Two modes of low-Reynolds number instability, absent in Poiseuille flow of clean fluid, are created by inhomogeneous particle loading, and their mechanism is worked out here. Two distinct classes of behaviour are seen: when the critical layer of the dominant perturbation overlaps with variations in particle concentration, the new instabilities arise, which we term overlap modes. But when the layers are distinct, only the traditional Tollmien-Schlichting mode of instability occurs. We derive the dominant critical layer balance equations in this flow along the lines done classically for clean fluid. These reveal how concentration variations within the critical layer cause two the particle-driven instabilities. As a result of these variations, disturbance kinetic energy production is qualitatively and majorly altered. Surprisingly the two overlap modes, though completely different in the symmetry of the eigenstructure and regime of exponential growth, show practically identical energy budgets, highlighting the relevance of variations within the critical layer. The wall layer is shown to be unimportant. We derive a minimal composite theory comprising all terms in the complete equation which are dominant somewhere in the flow, and show that it contains the essential physics.
When particles are infinitely dense relative to the fluid, the volume fraction is negligible. But for finite density ratios, the volume fraction of particles causes a profile of effective viscosity. This is shown to be uniformly stabilizing in the present flow. Gravity is neglected here, and will be important to study in future. So will transient growth of perturbations due to non-normality of the stability operator, in a quest for the mechanism of transition to turbulence.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement
Authors:
Wangyou Zhang,
Robin Scheibler,
Kohei Saijo,
Samuele Cornell,
Chenda Li,
Zhaoheng Ni,
Anurag Kumar,
Jan Pirklbauer,
Marvin Sach,
Shinji Watanabe,
Tim Fingscheidt,
Yanmin Qian
Abstract:
The last decade has witnessed significant advancements in deep learning-based speech enhancement (SE). However, most existing SE research has limitations on the coverage of SE sub-tasks, data diversity and amount, and evaluation metrics. To fill this gap and promote research toward universal SE, we establish a new SE challenge, named URGENT, to focus on the universality, robustness, and generaliza…
▽ More
The last decade has witnessed significant advancements in deep learning-based speech enhancement (SE). However, most existing SE research has limitations on the coverage of SE sub-tasks, data diversity and amount, and evaluation metrics. To fill this gap and promote research toward universal SE, we establish a new SE challenge, named URGENT, to focus on the universality, robustness, and generalizability of SE. We aim to extend the SE definition to cover different sub-tasks to explore the limits of SE models, starting from denoising, dereverberation, bandwidth extension, and declipping. A novel framework is proposed to unify all these sub-tasks in a single model, allowing the use of all existing SE approaches. We collected public speech and noise data from different domains to construct diverse evaluation data. Finally, we discuss the insights gained from our preliminary baseline experiments based on both generative and discriminative SE methods with 12 curated metrics.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
Ultrafast Optical Control of Rashba Interactions in a TMDC Heterostructure
Authors:
Henry Mittenzwey,
Abhijeet Kumar,
Raghav Dhingra,
Kenji Watanabe,
Takashi Taniguchi,
Cornelius Gahl,
Kirill I. Bolotin,
Malte Selig,
Andreas Knorr
Abstract:
We investigate spin relaxation dynamics of interlayer excitons in a MoSe2/MoS2 heterostructure induced by the Rashba effect. In such a system, Rashba interactions arise from an out-of-plane electric field due to photo-generated interlayer excitons inducing a phonon-assisted intravalley spin relaxation. We develop a theoretical description based on a microscopic approach to quantify the magnitude o…
▽ More
We investigate spin relaxation dynamics of interlayer excitons in a MoSe2/MoS2 heterostructure induced by the Rashba effect. In such a system, Rashba interactions arise from an out-of-plane electric field due to photo-generated interlayer excitons inducing a phonon-assisted intravalley spin relaxation. We develop a theoretical description based on a microscopic approach to quantify the magnitude of Rashba interactions and test these predictions via time-resolved Kerr rotation measurements. In agreement with the calculations, we find that the Rashba-induced intravalley spin mixing becomes the dominating spin relaxation channel above T = 50 K. Our work identifies a previously unexplored spin-depolarization channel in heterostructures which can be used for ultrafast spin manipulation.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning
Authors:
Amandeep Kumar,
Muhammad Awais,
Sanath Narayan,
Hisham Cholakkal,
Salman Khan,
Rao Muhammad Anwer
Abstract:
Drawing upon StyleGAN's expressivity and disentangled latent space, existing 2D approaches employ textual prompting to edit facial images with different attributes. In contrast, 3D-aware approaches that generate faces at different target poses require attribute-specific classifiers, learning separate model weights for each attribute, and are not scalable for novel attributes. In this work, we prop…
▽ More
Drawing upon StyleGAN's expressivity and disentangled latent space, existing 2D approaches employ textual prompting to edit facial images with different attributes. In contrast, 3D-aware approaches that generate faces at different target poses require attribute-specific classifiers, learning separate model weights for each attribute, and are not scalable for novel attributes. In this work, we propose an efficient, plug-and-play, 3D-aware face editing framework based on attribute-specific prompt learning, enabling the generation of facial images with controllable attributes across various target poses. To this end, we introduce a text-driven learnable style token-based latent attribute editor (LAE). The LAE harnesses a pre-trained vision-language model to find text-guided attribute-specific editing direction in the latent space of any pre-trained 3D-aware GAN. It utilizes learnable style tokens and style mappers to learn and transform this editing direction to 3D latent space. To train LAE with multiple attributes, we use directional contrastive loss and style token loss. Furthermore, to ensure view consistency and identity preservation across different poses and attributes, we employ several 3D-aware identity and pose preservation losses. Our experiments show that our proposed framework generates high-quality images with 3D awareness and view consistency while maintaining attribute-specific features. We demonstrate the effectiveness of our method on different facial attributes, including hair color and style, expression, and others. Code: https://github.com/VIROBO-15/Efficient-3D-Aware-Facial-Image-Editing.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Instance Segmentation and Teeth Classification in Panoramic X-rays
Authors:
Devichand Budagam,
Ayush Kumar,
Sayan Ghosh,
Anuj Shrivastav,
Azamat Zhanatuly Imanbayev,
Iskander Rafailovich Akhmetov,
Dmitrii Kaplun,
Sergey Antonov,
Artem Rychenkov,
Gleb Cyganov,
Aleksandr Sinitca
Abstract:
Teeth segmentation and recognition are critical in various dental applications and dental diagnosis. Automatic and accurate segmentation approaches have been made possible by integrating deep learning models. Although teeth segmentation has been studied in the past, only some techniques were able to effectively classify and segment teeth simultaneously. This article offers a pipeline of two deep l…
▽ More
Teeth segmentation and recognition are critical in various dental applications and dental diagnosis. Automatic and accurate segmentation approaches have been made possible by integrating deep learning models. Although teeth segmentation has been studied in the past, only some techniques were able to effectively classify and segment teeth simultaneously. This article offers a pipeline of two deep learning models, U-Net and YOLOv8, which results in BB-UNet, a new architecture for the classification and segmentation of teeth on panoramic X-rays that is efficient and reliable. We have improved the quality and reliability of teeth segmentation by utilising the YOLOv8 and U-Net capabilities. The proposed networks have been evaluated using the mean average precision (mAP) and dice coefficient for YOLOv8 and BB-UNet, respectively. We have achieved a 3\% increase in mAP score for teeth classification compared to existing methods, and a 10-15\% increase in dice coefficient for teeth segmentation compared to U-Net across different categories of teeth. A new Dental dataset was created based on UFBA-UESC dataset with Bounding-Box and Polygon annotations of 425 dental panoramic X-rays. The findings of this research pave the way for a wider adoption of object detection models in the field of dental diagnosis.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
$\textit{Kilonova Seekers}$: the GOTO project for real-time citizen science in time-domain astrophysics
Authors:
T. L. Killestein,
L. Kelsey,
E. Wickens,
L. Nuttall,
J. Lyman,
C. Krawczyk,
K. Ackley,
M. J. Dyer,
F. Jiménez-Ibarra,
K. Ulaczyk,
D. O'Neill,
A. Kumar,
D. Steeghs,
D. K. Galloway,
V. S. Dhillon,
P. O'Brien,
G. Ramsay,
K. Noysena,
R. Kotak,
R. P. Breton,
E. Pallé,
D. Pollacco,
S. Awiphan,
S. Belkin,
P. Chote
, et al. (29 additional authors not shown)
Abstract:
Time-domain astrophysics continues to grow rapidly, with the inception of new surveys drastically increasing data volumes. Democratised, distributed approaches to training sets for machine learning classifiers are crucial to make the most of this torrent of discovery -- with citizen science approaches proving effective at meeting these requirements. In this paper, we describe the creation of and t…
▽ More
Time-domain astrophysics continues to grow rapidly, with the inception of new surveys drastically increasing data volumes. Democratised, distributed approaches to training sets for machine learning classifiers are crucial to make the most of this torrent of discovery -- with citizen science approaches proving effective at meeting these requirements. In this paper, we describe the creation of and the initial results from the $\textit{Kilonova Seekers}$ citizen science project, built to find transient phenomena from the GOTO telescopes in near real-time. $\textit{Kilonova Seekers}$ launched in July 2023 and received over 600,000 classifications from approximately 2,000 volunteers over the course of the LIGO-Virgo-KAGRA O4a observing run. During this time, the project has yielded 20 discoveries, generated a `gold-standard' training set of 17,682 detections for augmenting deep-learned classifiers, and measured the performance and biases of Zooniverse volunteers on real-bogus classification. This project will continue throughout the lifetime of GOTO, pushing candidates at ever-greater cadence, and directly facilitate the next-generation classification algorithms currently in development.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
Tensor square and isoclinic extensions of multiplicative Lie algebras
Authors:
Dev Karan Singh,
Amit Kumar,
Sumit Kumar Upadhyay,
Shiv Datt Kumar
Abstract:
In this paper, we discuss the capable and isoclinic properties of the tensor square in the context of multiplicative Lie algebras. We also developed the concept of isoclinic extensions and proved several results for multiplicative Lie algebras. Consequently, we demonstrate that covers of a multiplicative Lie algebra are mutually isoclinic.
In this paper, we discuss the capable and isoclinic properties of the tensor square in the context of multiplicative Lie algebras. We also developed the concept of isoclinic extensions and proved several results for multiplicative Lie algebras. Consequently, we demonstrate that covers of a multiplicative Lie algebra are mutually isoclinic.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
Optical heterodyne microscopy of operating spin Hall nano-oscillator arrays
Authors:
A. Alemán,
A. A. Awad,
S. Muralidhar,
R. Khymyn,
A. Kumar,
A. Houshang,
D. Hanstorp,
J. Åkerman
Abstract:
Optical heterodyne detection is a powerful technique for characterizing a wide range of physical excitations. Here, we use two types of optical heterodyne detection techniques (fundamental and parametric pumping) to microscopically characterize the high-frequency auto-oscillations of single and multiple nano-constriction spin Hall nano-oscillators (SHNOs). To validate the technique and demonstrate…
▽ More
Optical heterodyne detection is a powerful technique for characterizing a wide range of physical excitations. Here, we use two types of optical heterodyne detection techniques (fundamental and parametric pumping) to microscopically characterize the high-frequency auto-oscillations of single and multiple nano-constriction spin Hall nano-oscillators (SHNOs). To validate the technique and demonstrate its robustness, we study SHNOs made from two different material stacks, NiFe/Pt and W/CoFeB/MgO, and investigate the influence of both the RF injection power and the laser power on the measurements, comparing the optical results to conventional electrical measurements. To demonstrate the key features of direct, non-invasive, submicron, spatial, and phase-resolved characterization of the SHNO magnetodynamics, we map out the auto-oscillation magnitude and phase of two phase-binarized SHNOs used in Ising Machines. This proof-of-concept platform establishes a strong foundation for further extensions, contributing to the ongoing development of crucial characterization techniques for emerging computing technologies based on spintronics devices
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Investigating a Device Independence Quantum Random Number Generation
Authors:
Vardaan Mongia,
Abhishek Kumar,
Shashi Prabhakar,
Anindya Banerji,
R. P. Singh
Abstract:
Quantum random number generation (QRNG) is a resource that is a necessity in the field of cryptography. However, its certification has been challenging. In this article, we certify randomness with the aid of quantum entanglement in a device independent setting, where we choose two-photon interference for source characterisation. The CHSH inequality violation and quantum state tomography are used a…
▽ More
Quantum random number generation (QRNG) is a resource that is a necessity in the field of cryptography. However, its certification has been challenging. In this article, we certify randomness with the aid of quantum entanglement in a device independent setting, where we choose two-photon interference for source characterisation. The CHSH inequality violation and quantum state tomography are used as independent checks on the measurement devices. These measures ensure the unpredictability of quantum random number generation. This work can be easily extended to faster randomness expansion protocols.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Seasonal variation in nighttime NO radiative cooling as observed by TIMED/SABER in lower thermosphere during solar maximum and solar minimum
Authors:
Alok Kumar Ranjan,
MV Sunil Krishna,
Akash Kumar,
Dayakrishna Nailwal,
Sumanta Sarkhel
Abstract:
Both composition and temperature play a crucial role in determining the NO radiative cooling in lower thermosphere as observed by TIMED/SABER. In this work, we present a detailed investigation of seasonal variation in thermospheric NO radiative cooling. We have carried forward the investigation of \cite{li2018} regarding the variations in local nighttime peak NO radiative cooling and its altitude…
▽ More
Both composition and temperature play a crucial role in determining the NO radiative cooling in lower thermosphere as observed by TIMED/SABER. In this work, we present a detailed investigation of seasonal variation in thermospheric NO radiative cooling. We have carried forward the investigation of \cite{li2018} regarding the variations in local nighttime peak NO radiative cooling and its altitude during solar maximum and solar minimum conditions. By analyzing latitudinal changes over quiet times for each month in year 2018, it is evident that both the investigative parameters exhibit summer-winter variability. The qualitative contribution of different species (i.e., NO, and O), and temperatures in determining the vertical profile of NO radiative cooling for different latitudes is investigated by utilizing the NRLMSISE-00 estimated parameters, and SNOE observed NO density. The temperature, NO density, meridional wind, and associated compositional variations due to asymmetrical solar heating in both the hemispheres during solar minimum conditions seem to be the dominating factor in controlling the NO radiative cooling during different seasons. The altitudes at which maximum cooling by NO occurs exhibits an inverse correlation with the amount of radiative cooling. The region of enhanced NO densities (polar and summer hemispheric low-mid latitude regions) have larger NO radiative cooling with lower peak altitudes in comparison to other regions (equatorial to winter hemispheric low-mid latitude regions), where NO radiative cooling is low with higher peak altitude values.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Exploration of mass splitting and muon/tau mixing parameters for an eV-scale sterile neutrino with IceCube
Authors:
R. Abbasi,
M. Ackermann,
J. Adams,
S. K. Agarwalla,
J. A. Aguilar,
M. Ahlers,
J. M. Alameddine,
N. M. Amin,
K. Andeen,
C. Argüelles,
Y. Ashida,
S. Athanasiadou,
L. Ausborm,
S. N. Axani,
X. Bai,
A. Balagopal V.,
M. Baricevic,
S. W. Barwick,
S. Bash,
V. Basu,
R. Bay,
J. J. Beatty,
J. Becker Tjus,
J. Beise,
C. Bellenghi
, et al. (400 additional authors not shown)
Abstract:
We present the first three-parameter fit to a 3+1 sterile neutrino model using 7.634 years of data from the IceCube Neutrino Observatory on $ν_μ+\overlineν_μ$ charged-current interactions in the energy range 500-9976 GeV. Our analysis is sensitive to the mass-squared splitting between the heaviest and lightest mass state ($Δm_{41}^2$), the mixing matrix element connecting muon flavor to the fourth…
▽ More
We present the first three-parameter fit to a 3+1 sterile neutrino model using 7.634 years of data from the IceCube Neutrino Observatory on $ν_μ+\overlineν_μ$ charged-current interactions in the energy range 500-9976 GeV. Our analysis is sensitive to the mass-squared splitting between the heaviest and lightest mass state ($Δm_{41}^2$), the mixing matrix element connecting muon flavor to the fourth mass state ($|U_{\mu4}|^2$), and the element connecting tau flavor to the fourth mass state ($|U_{\tau4}|^2$). Predicted propagation effects in matter enhance the signature through a resonance as atmospheric neutrinos from the Northern Hemisphere traverse the Earth to the IceCube detector at the South Pole. The result is consistent with the no-sterile neutrino hypothesis with a probability of 4.3 %. Profiling the likelihood of each parameter yields the 90 % confidence levels: $ 2.4\,\mathrm{eV}^{2} < Δm_{41}^2 <9.6\,\mathrm{eV}^{2} $ , $0.0081 < |U_{\mu4}|^2 < 0.10$ , and $|U_{\tau4}|^2< 0.035$, which narrows the allowed parameter-space for $|U_{\tau4}|^2$. However, the primary result of this analysis is the first map of the 3+1 parameter space exploring the interdependence of $Δm_{41}^2$, $|U_{\mu4}|^2$, and $|U_{\tau4}|^2$.
△ Less
Submitted 2 June, 2024;
originally announced June 2024.
-
Exploring Child-Robot Interaction in Individual and Group settings in India
Authors:
Gayathri Manikutty,
Sai Ankith Potapragada,
Devasena Pasupuleti,
Mahesh S. Unnithan,
Arjun Venugopal,
Pranav Prabha,
Arunav H.,
Vyshnavi Anil Kumar,
Rthuraj P. R.,
Rao R Bhavani
Abstract:
This study evaluates the effectiveness of child-robot interactions with the HaKsh-E social robot in India, examining both individual and group interaction settings. The research centers on game-based interactions designed to teach hand hygiene to children aged 7-11. Utilizing video analysis, rubric assessments, and post-study questionnaires, the study gathered data from 36 participants. Findings i…
▽ More
This study evaluates the effectiveness of child-robot interactions with the HaKsh-E social robot in India, examining both individual and group interaction settings. The research centers on game-based interactions designed to teach hand hygiene to children aged 7-11. Utilizing video analysis, rubric assessments, and post-study questionnaires, the study gathered data from 36 participants. Findings indicate that children in both settings developed positive perceptions of the robot in terms of the robot's trustworthiness, closeness, and social support. The significant difference in the interaction level scores presented in the study suggests that group settings foster higher levels of interaction, potentially due to peer influence and collaborative dynamics. While both settings showed significant improvements in learning outcomes, the individual setting had more pronounced learning gains. This suggests that personal interactions with the robot might lead to deeper or more effective learning experiences. Consequently, this study concludes that individual interaction settings are more conducive for focused learning gains, while group settings enhance interaction and engagement.
△ Less
Submitted 4 June, 2024; v1 submitted 2 June, 2024;
originally announced June 2024.
-
Optimizing Photometric Light Curve Analysis: Evaluating Scipy's Minimize Function for Eclipse Mapping of Cataclysmic Variables
Authors:
Anoop Kumar,
Madan Mohan Tito Ayyalasomayajula,
Dheerendra Panwar,
Yeshwanth Vasa
Abstract:
With a particular focus on Scipy's minimize function the eclipse mapping method is thoroughly researched and implemented utilizing Python and essential libraries. Many optimization techniques are used, including Sequential Least Squares Programming (SLSQP), Nelder-Mead, and Conjugate Gradient (CG). However, for the purpose of examining photometric light curves these methods seek to solve the maxim…
▽ More
With a particular focus on Scipy's minimize function the eclipse mapping method is thoroughly researched and implemented utilizing Python and essential libraries. Many optimization techniques are used, including Sequential Least Squares Programming (SLSQP), Nelder-Mead, and Conjugate Gradient (CG). However, for the purpose of examining photometric light curves these methods seek to solve the maximum entropy equation under a chi-squared constraint. Therefore, these techniques are first evaluated on two-dimensional Gaussian data without a chi-squared restriction, and then they are used to map the accretion disc and uncover the Gaussian structure of the Cataclysmic Variable KIC 201325107. Critical analysis is performed on the code structure to find possible faults and design problems. Additionally, the analysis shows how several factors impacting computing time and image quality are included including the variance in Gaussian weighting, disc image resolution, number of data points in the light curve, and degree of constraint.
△ Less
Submitted 30 May, 2024;
originally announced June 2024.
-
EnterpriseEM: Fine-tuned Embeddings for Enterprise Semantic Search
Authors:
Kamalkumar Rathinasamy,
Jayarama Nettar,
Amit Kumar,
Vishal Manchanda,
Arun Vijayakumar,
Ayush Kataria,
Venkateshprasanna Manjunath,
Chidambaram GS,
Jaskirat Singh Sodhi,
Shoeb Shaikh,
Wasim Akhtar Khan,
Prashant Singh,
Tanishq Dattatray Ige,
Vipin Tiwari,
Rajab Ali Mondal,
Harshini K,
S Reka,
Chetana Amancharla,
Faiz ur Rahman,
Harikrishnan P A,
Indraneel Saha,
Bhavya Tiwary,
Navin Shankar Patel,
Pradeep T S,
Balaji A J
, et al. (2 additional authors not shown)
Abstract:
Enterprises grapple with the significant challenge of managing proprietary unstructured data, hindering efficient information retrieval. This has led to the emergence of AI-driven information retrieval solutions, designed to adeptly extract relevant insights to address employee inquiries. These solutions often leverage pre-trained embedding models and generative models as foundational components.…
▽ More
Enterprises grapple with the significant challenge of managing proprietary unstructured data, hindering efficient information retrieval. This has led to the emergence of AI-driven information retrieval solutions, designed to adeptly extract relevant insights to address employee inquiries. These solutions often leverage pre-trained embedding models and generative models as foundational components. While pre-trained embeddings may exhibit proximity or disparity based on their original training objectives, they might not fully align with the unique characteristics of enterprise-specific data, leading to suboptimal alignment with the retrieval goals of enterprise environments. In this paper, we propose a methodology to fine-tune pre-trained embedding models specifically for enterprise environments. By adapting the embeddings to better suit the retrieval tasks prevalent in enterprises, we aim to enhance the performance of information retrieval solutions. We discuss the process of fine-tuning, its effect on retrieval accuracy, and the potential benefits for enterprise information management. Our findings demonstrate the efficacy of fine-tuned embedding models in improving the precision and relevance of search results in enterprise settings.
△ Less
Submitted 18 May, 2024;
originally announced June 2024.
-
Unravelling the asphericities in the explosion and multi-faceted circumstellar matter of SN 2023ixf
Authors:
Avinash Singh,
R. S. Teja,
T. J. Moriya,
K. Maeda,
K. S. Kawabata,
M. Tanaka,
R. Imazawa,
T. Nakaoka,
A. Gangopadhyay,
M. Yamanaka,
V. Swain,
D. K. Sahu,
G. C. Anupama,
B. Kumar,
R. M. Anche,
Y. Sano,
A. Raj,
V. K. Agnihotri,
V. Bhalerao,
D. Bisht,
M. S. Bisht,
K. Belwal,
S. K. Chakrabarti,
M. Fujii,
T. Nagayama
, et al. (11 additional authors not shown)
Abstract:
We present a detailed investigation of photometric, spectroscopic, and polarimetric observations of the Type II SN 2023ixf. The early detection of highly-ionized flash features, rapid ascent in ultraviolet flux coupled with the blueward shift in near-ultraviolet colors and temperature provides compelling evidence for a delayed shock breakout from a confined dense circumstellar matter (CSM) envelop…
▽ More
We present a detailed investigation of photometric, spectroscopic, and polarimetric observations of the Type II SN 2023ixf. The early detection of highly-ionized flash features, rapid ascent in ultraviolet flux coupled with the blueward shift in near-ultraviolet colors and temperature provides compelling evidence for a delayed shock breakout from a confined dense circumstellar matter (CSM) enveloping the progenitor star. The temporal evolution of polarization in the SN 2023ixf phase revealed three distinct peaks in polarization evolution at 1.4 d, 6.4 d, and 79.2 d, indicating an asymmetric dense CSM, an aspherical shock front and clumpiness in the low-density extended CSM, and an aspherical inner ejecta/He-core. SN 2023ixf displayed two dominant axes, one along the CSM-outer ejecta and the other along the inner ejecta/He-core, showcasing the independent origin of asymmetry in the early and late evolution. The argument for an aspherical shock front is further strengthened by the presence of a high-velocity broad absorption feature in the blue wing of the Balmer features in addition to the P-Cygni absorption post 16 d. Hydrodynamical light curve modeling indicated a progenitor mass of 10 solar mass with a radius of 470 solar radius, explosion energy of 2e51 erg, and 0.06 solar mass of 56Ni. The modeling also indicated a two-zone CSM: a confined dense CSM extending up to 5e14 cm, with a mass-loss rate of 1e-2 solar mass per year, and an extended CSM spanning from 5e14 cm to 1e16 cm with a mass-loss rate of 1e-4 solar mass per year. The early nebular phase observations display an axisymmetric line profile of [OI] and red-ward attenuation of the emission of Halpha post 125 days, marking the onset of dust formation.
△ Less
Submitted 31 May, 2024;
originally announced May 2024.
-
Improving code-mixed hate detection by native sample mixing: A case study for Hindi-English code-mixed scenario
Authors:
Debajyoti Mazumder,
Aakash Kumar,
Jasabanta Patro
Abstract:
Hate detection has long been a challenging task for the NLP community. The task becomes complex in a code-mixed environment because the models must understand the context and the hate expressed through language alteration. Compared to the monolingual setup, we see very less work on code-mixed hate as large-scale annotated hate corpora are unavailable to make the study. To overcome this bottleneck,…
▽ More
Hate detection has long been a challenging task for the NLP community. The task becomes complex in a code-mixed environment because the models must understand the context and the hate expressed through language alteration. Compared to the monolingual setup, we see very less work on code-mixed hate as large-scale annotated hate corpora are unavailable to make the study. To overcome this bottleneck, we propose using native language hate samples. We hypothesise that in the era of multilingual language models (MLMs), hate in code-mixed settings can be detected by majorly relying on the native language samples. Even though the NLP literature reports the effectiveness of MLMs on hate detection in many cross-lingual settings, their extensive evaluation in a code-mixed scenario is yet to be done. This paper attempts to fill this gap through rigorous empirical experiments. We considered the Hindi-English code-mixed setup as a case study as we have the linguistic expertise for the same. Some of the interesting observations we got are: (i) adding native hate samples in the code-mixed training set, even in small quantity, improved the performance of MLMs for code-mixed hate detection, (ii) MLMs trained with native samples alone observed to be detecting code-mixed hate to a large extent, (iii) The visualisation of attention scores revealed that, when native samples were included in training, MLMs could better focus on the hate emitting words in the code-mixed context, and (iv) finally, when hate is subjective or sarcastic, naively mixing native samples doesn't help much to detect code-mixed hate. We will release the data and code repository to reproduce the reported results.
△ Less
Submitted 31 May, 2024;
originally announced May 2024.
-
Cross-Talk Reduction
Authors:
Zhong-Qiu Wang,
Anurag Kumar,
Shinji Watanabe
Abstract:
While far-field multi-talker mixtures are recorded, each speaker can wear a close-talk microphone so that close-talk mixtures can be recorded at the same time. Although each close-talk mixture has a high signal-to-noise ratio (SNR) of the wearer, it has a very limited range of applications, as it also contains significant cross-talk speech by other speakers and is not clean enough. In this context…
▽ More
While far-field multi-talker mixtures are recorded, each speaker can wear a close-talk microphone so that close-talk mixtures can be recorded at the same time. Although each close-talk mixture has a high signal-to-noise ratio (SNR) of the wearer, it has a very limited range of applications, as it also contains significant cross-talk speech by other speakers and is not clean enough. In this context, we propose a novel task named cross-talk reduction (CTR) which aims at reducing cross-talk speech, and a novel solution named CTRnet which is based on unsupervised or weakly-supervised neural speech separation. In unsupervised CTRnet, close-talk and far-field mixtures are stacked as input for a DNN to estimate the close-talk speech of each speaker. It is trained in an unsupervised, discriminative way such that the DNN estimate for each speaker can be linearly filtered to cancel out the speaker's cross-talk speech captured at other microphones. In weakly-supervised CTRnet, we assume the availability of each speaker's activity timestamps during training, and leverage them to improve the training of unsupervised CTRnet. Evaluation results on a simulated two-speaker CTR task and on a real-recorded conversational speech separation and recognition task show the effectiveness and potential of CTRnet.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Adsorption of Mo and O at S-vacancy on ReS2 surface of ReS2/MoTe2 vdW heterointerface
Authors:
Puneet Kumar Shaw,
Jehan Taraporewalla,
Sohaib Raza,
Akash Kumar,
Rimisha Duttagupta,
Hafizur Rahaman,
Dipankar Saha
Abstract:
Applications like high density information storage, neuromorphic computing, nanophotonics, etc. require ultra-thin electronic devices which can be controlled with applied electric field. Of late, atomically thin two-dimensional (2D) materials and van der Waals (vdW) heterointerface of those have emerged as suitable candidates for such ultra-low power nanoelectric devices. In this work, employing d…
▽ More
Applications like high density information storage, neuromorphic computing, nanophotonics, etc. require ultra-thin electronic devices which can be controlled with applied electric field. Of late, atomically thin two-dimensional (2D) materials and van der Waals (vdW) heterointerface of those have emerged as suitable candidates for such ultra-low power nanoelectric devices. In this work, employing density functional theory (DFT), the monolayer ReS2 / monolayer MoTe2 vdW heterostructure with Sulphur vacancy is studied to examine various ground state electronic properties. Changes in effective band gap owing to defect-induced states and modulation of the energy gap value with Molybdenum (Mo) and Oxygen (O) adsorption at the defect site are examined. Since two-dimensional (2D) material based nanoscaled devices exhibit promising switching between non-conducting and conducting states, determining the role of defect-induced states and the adsorption of atoms/molecules on surfaces is crucial. Here, a detailed theoretical study to determine surface properties and relative energetic stability of the vdW heterostructures is carried out. The charge re-distribution between the constituent layers is also analyzed by obtaining Electron Difference Density (EDD) for different heterointerfaces. Nonetheless, the efficacy of switching between non-conducting and conducting states is assessed based on adsorption energy of adatoms binding at the defect site.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Efficient Stimuli Generation using Reinforcement Learning in Design Verification
Authors:
Deepak Narayan Gadde,
Thomas Nalapat,
Aman Kumar,
Djones Lettnin,
Wolfgang Kunz,
Sebastian Simon
Abstract:
The increasing design complexity of System-on-Chips (SoCs) has led to significant verification challenges, particularly in meeting coverage targets within a timely manner. At present, coverage closure is heavily dependent on constrained random and coverage driven verification methodologies where the randomized stimuli are bounded to verify certain scenarios and to reach coverage goals. This proces…
▽ More
The increasing design complexity of System-on-Chips (SoCs) has led to significant verification challenges, particularly in meeting coverage targets within a timely manner. At present, coverage closure is heavily dependent on constrained random and coverage driven verification methodologies where the randomized stimuli are bounded to verify certain scenarios and to reach coverage goals. This process is said to be exhaustive and to consume a lot of project time. In this paper, a novel methodology is proposed to generate efficient stimuli with the help of Reinforcement Learning (RL) to reach the maximum code coverage of the Design Under Verification (DUV). Additionally, an automated framework is created using metamodeling to generate a SystemVerilog testbench and an RL environment for any given design. The proposed approach is applied to various designs and the produced results proves that the RL agent provides effective stimuli to achieve code coverage faster in comparison with baseline random simulations. Furthermore, various RL agents and reward schemes are analyzed in our work.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Classification analysis of transition-metal chalcogenides and oxides using quantum machine learning
Authors:
Kurudi V Vedavyasa,
Ashok Kumar
Abstract:
Quantum machine learning (QML) leverages the potential from machine learning to explore the subtle patterns in huge datasets of complex nature with quantum advantages. This exponentially reduces the time and resources necessary for computations. QML accelerates materials research with active screening of chemical space, identifying novel materials for practical applications and classifying structu…
▽ More
Quantum machine learning (QML) leverages the potential from machine learning to explore the subtle patterns in huge datasets of complex nature with quantum advantages. This exponentially reduces the time and resources necessary for computations. QML accelerates materials research with active screening of chemical space, identifying novel materials for practical applications and classifying structurally diverse materials given their measured properties. This study analyzes the performance of three efficient quantum machine learning algorithms viz., variational quantum eigen solver (VQE), quantum support vector machine (QSVM) and quantum neural networks (QNN) for the classification of transition metal chalcogenides and oxides (TMCs &TMOs). The analysis is performed on three datasets of different sizes containing 102, 192 and 350 materials with TMCs and TMOs labelled as +1 and -1 respectively. By employing feature selection, classical machine learning achieves 100% accuracy whereas QML achieves the highest performance of 99% and 98% for test and train data respectively on QSVC. This study establishes the competence of QML models in materials classification and explores the quantum circuits in terms of over-fitting using the circuit descriptors expressibility and entangling capability. In addition, the perspectives on QML in materials research with noisy intermediate scale quantum (NISQ) devices is given.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Multi-modal Generation via Cross-Modal In-Context Learning
Authors:
Amandeep Kumar,
Muzammal Naseer,
Sanath Narayan,
Rao Muhammad Anwer,
Salman Khan,
Hisham Cholakkal
Abstract:
In this work, we study the problem of generating novel images from complex multimodal prompt sequences. While existing methods achieve promising results for text-to-image generation, they often struggle to capture fine-grained details from lengthy prompts and maintain contextual coherence within prompt sequences. Moreover, they often result in misaligned image generation for prompt sequences featu…
▽ More
In this work, we study the problem of generating novel images from complex multimodal prompt sequences. While existing methods achieve promising results for text-to-image generation, they often struggle to capture fine-grained details from lengthy prompts and maintain contextual coherence within prompt sequences. Moreover, they often result in misaligned image generation for prompt sequences featuring multiple objects. To address this, we propose a Multi-modal Generation via Cross-Modal In-Context Learning (MGCC) method that generates novel images from complex multimodal prompt sequences by leveraging the combined capabilities of large language models (LLMs) and diffusion models. Our MGCC comprises a novel Cross-Modal Refinement module to explicitly learn cross-modal dependencies between the text and image in the LLM embedding space, and a contextual object grounding module to generate object bounding boxes specifically targeting scenes with multiple objects. Our MGCC demonstrates a diverse range of multimodal capabilities, like novel image generation, the facilitation of multimodal dialogue, and generation of texts. Experimental evaluations on two benchmark datasets, demonstrate the effectiveness of our method. On Visual Story Generation (VIST) dataset with multimodal inputs, our MGCC achieves a CLIP Similarity score of $0.652$ compared to SOTA GILL $0.641$. Similarly, on Visual Dialogue Context (VisDial) having lengthy dialogue sequences, our MGCC achieves an impressive CLIP score of $0.660$, largely outperforming existing SOTA method scoring $0.645$. Code: https://github.com/VIROBO-15/MGCC
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control
Authors:
Litu Rout,
Yujia Chen,
Nataniel Ruiz,
Abhishek Kumar,
Constantine Caramanis,
Sanjay Shakkottai,
Wen-Sheng Chu
Abstract:
We propose Reference-Based Modulation (RB-Modulation), a new plug-and-play solution for training-free personalization of diffusion models. Existing training-free approaches exhibit difficulties in (a) style extraction from reference images in the absence of additional style or content text descriptions, (b) unwanted content leakage from reference style images, and (c) effective composition of styl…
▽ More
We propose Reference-Based Modulation (RB-Modulation), a new plug-and-play solution for training-free personalization of diffusion models. Existing training-free approaches exhibit difficulties in (a) style extraction from reference images in the absence of additional style or content text descriptions, (b) unwanted content leakage from reference style images, and (c) effective composition of style and content. RB-Modulation is built on a novel stochastic optimal controller where a style descriptor encodes the desired attributes through a terminal cost. The resulting drift not only overcomes the difficulties above, but also ensures high fidelity to the reference style and adheres to the given text prompt. We also introduce a cross-attention-based feature aggregation scheme that allows RB-Modulation to decouple content and style from the reference image. With theoretical justification and empirical evidence, our framework demonstrates precise extraction and control of content and style in a training-free manner. Further, our method allows a seamless composition of content and style, which marks a departure from the dependency on external adapters or ControlNets.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Confidence Under the Hood: An Investigation into the Confidence-Probability Alignment in Large Language Models
Authors:
Abhishek Kumar,
Robert Morabito,
Sanzhar Umbet,
Jad Kabbara,
Ali Emami
Abstract:
As the use of Large Language Models (LLMs) becomes more widespread, understanding their self-evaluation of confidence in generated responses becomes increasingly important as it is integral to the reliability of the output of these models. We introduce the concept of Confidence-Probability Alignment, that connects an LLM's internal confidence, quantified by token probabilities, to the confidence c…
▽ More
As the use of Large Language Models (LLMs) becomes more widespread, understanding their self-evaluation of confidence in generated responses becomes increasingly important as it is integral to the reliability of the output of these models. We introduce the concept of Confidence-Probability Alignment, that connects an LLM's internal confidence, quantified by token probabilities, to the confidence conveyed in the model's response when explicitly asked about its certainty. Using various datasets and prompting techniques that encourage model introspection, we probe the alignment between models' internal and expressed confidence. These techniques encompass using structured evaluation scales to rate confidence, including answer options when prompting, and eliciting the model's confidence level for outputs it does not recognize as its own. Notably, among the models analyzed, OpenAI's GPT-4 showed the strongest confidence-probability alignment, with an average Spearman's $\hatρ$ of 0.42, across a wide range of tasks. Our work contributes to the ongoing efforts to facilitate risk assessment in the application of LLMs and to further our understanding of model trustworthiness.
△ Less
Submitted 15 June, 2024; v1 submitted 25 May, 2024;
originally announced May 2024.
-
Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models
Authors:
Abhishek Kumar,
Sarfaroz Yunusov,
Ali Emami
Abstract:
Research on Large Language Models (LLMs) has often neglected subtle biases that, although less apparent, can significantly influence the models' outputs toward particular social narratives. This study addresses two such biases within LLMs: representative bias, which denotes a tendency of LLMs to generate outputs that mirror the experiences of certain identity groups, and affinity bias, reflecting…
▽ More
Research on Large Language Models (LLMs) has often neglected subtle biases that, although less apparent, can significantly influence the models' outputs toward particular social narratives. This study addresses two such biases within LLMs: representative bias, which denotes a tendency of LLMs to generate outputs that mirror the experiences of certain identity groups, and affinity bias, reflecting the models' evaluative preferences for specific narratives or viewpoints. We introduce two novel metrics to measure these biases: the Representative Bias Score (RBS) and the Affinity Bias Score (ABS), and present the Creativity-Oriented Generation Suite (CoGS), a collection of open-ended tasks such as short story writing and poetry composition, designed with customized rubrics to detect these subtle biases. Our analysis uncovers marked representative biases in prominent LLMs, with a preference for identities associated with being white, straight, and men. Furthermore, our investigation of affinity bias reveals distinctive evaluative patterns within each model, akin to `bias fingerprints'. This trend is also seen in human evaluators, highlighting a complex interplay between human and machine bias perceptions.
△ Less
Submitted 3 June, 2024; v1 submitted 23 May, 2024;
originally announced May 2024.
-
In-beam $γ-$spectroscopy of the transitional nucleus $^{217}$Ac
Authors:
Dhananjaya Sahoo,
A. Y. Deo,
Madhu,
Khamosh Yadav,
S. S. Tiwary,
P. C. Srivastava,
R. Palit,
S. K. Tandel,
Anil Kumar,
P. Dey,
Biswajit Das,
Vishal Malik,
A. Kundu,
A. Sindhu,
S. V. Jadhav,
B. S. Naidu,
A. V. Thomas
Abstract:
High-spin states in the transitional $^{217}$Ac nucleus are established up to 3.8 MeV excitation energy and $I^π =$ 41/2$^+$ with the addition of around 20 new transitions. The structure of the yrast and near-yrast states below the 29/2$^+$ isomer is revisited. The inconsistencies in the level schemes reported earlier are resolved. The level structure above the 29/2$^+$ isomer is established for t…
▽ More
High-spin states in the transitional $^{217}$Ac nucleus are established up to 3.8 MeV excitation energy and $I^π =$ 41/2$^+$ with the addition of around 20 new transitions. The structure of the yrast and near-yrast states below the 29/2$^+$ isomer is revisited. The inconsistencies in the level schemes reported earlier are resolved. The level structure above the 29/2$^+$ isomer is established for the first time. Large-basis shell-model calculations with the KHPE interaction are performed to compare the experimentally observed level energies with the theoretical predictions. A comparison with the systematics of the N = 128 isotones suggests that the yrast structures result from a weak coupling of the odd proton to the even-even 216Ra core, which is consistent with the shell-model configurations. Furthermore, alpha decay of the 29/2$^+$ isomer is revisited and the decay scheme established from this work is discussed in the framework of the shell model.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Prediction of Cryptocurrency Prices through a Path Dependent Monte Carlo Simulation
Authors:
Ayush Singh,
Anshu K. Jha,
Amit N. Kumar
Abstract:
In this paper, our focus lies on the Merton's jump diffusion model, employing jump processes characterized by the compound Poisson process. Our primary objective is to forecast the drift and volatility of the model using a variety of methodologies. We adopt an approach that involves implementing different drift, volatility, and jump terms within the model through various machine learning technique…
▽ More
In this paper, our focus lies on the Merton's jump diffusion model, employing jump processes characterized by the compound Poisson process. Our primary objective is to forecast the drift and volatility of the model using a variety of methodologies. We adopt an approach that involves implementing different drift, volatility, and jump terms within the model through various machine learning techniques, traditional methods, and statistical methods on price-volume data. Additionally, we introduce a path-dependent Monte Carlo simulation to model cryptocurrency prices, taking into account the volatility and unexpected jumps in prices.
△ Less
Submitted 10 April, 2024;
originally announced May 2024.
-
Elucidating the role of electron transfer in the photoluminescence of $\mathrm{MoS_{2}}$ quantum dots synthesized by fs-pulse ablation
Authors:
Anubhab Sahoo,
Tejendra Dixit,
K. V. Anil Kumar,
K. Lakshmi Ganapathi,
Pramoda K. Nayak,
M. S. Ramachandra Rao,
Sivarama Krishnan
Abstract:
Herein, $\mathrm{MoS_{2}}$ quantum dot (QDs) with controlled optical, structural, and electronic properties are synthesized using the femtosecond pulsed laser ablation in liquid (fs-PLAL) technique by varying pulse-width, ablation power, and ablation time to harness the potential for next-generation optoelectronics and quantum technology. Furthermore, this work elucidates key aspects of the mechan…
▽ More
Herein, $\mathrm{MoS_{2}}$ quantum dot (QDs) with controlled optical, structural, and electronic properties are synthesized using the femtosecond pulsed laser ablation in liquid (fs-PLAL) technique by varying pulse-width, ablation power, and ablation time to harness the potential for next-generation optoelectronics and quantum technology. Furthermore, this work elucidates key aspects of the mechanisms underlying the near-UV and blue emission, the accompanying large Stokes-shift, and the consequent change in sample color with laser exposure parameters pertaining to $\mathrm{MoS_{2}}$ QDs. Through spectroscopic analysis, including UV-visible absorption, photoluminescence, and Raman spectroscopy, we successfully unravelled the mechanisms for the change in optoelectronic properties of $\mathrm{MoS_{2}}$ QDs with laser parameters. We realize that the occurrence of a secondary phase, specifically $\mathrm{MoO_{3-x}}$, is responsible for the significant Stokes-shift and blue emission observed in this QDs system. The primary factor influencing these activities is the electron transfer observed between these two phases, as validated by excitation dependent photoluminescence, XPS and Raman spectroscopies.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
A Graph-Theoretical Framework to Analyse Zero Discord Quantum States
Authors:
Anoopa Joshi,
Parvinder Singh,
Atul Kumar
Abstract:
This article comprehensively explores matrices and their prerequisites for achieving positive semidefiniteness. The study delves into a series of theorems concerning pure quantum states in the context of weighted graphs. The main objective of this study is to establish a graph-theoretic framework for the study of quantum discord and to identify the necessary and sufficient conditions for zero quan…
▽ More
This article comprehensively explores matrices and their prerequisites for achieving positive semidefiniteness. The study delves into a series of theorems concerning pure quantum states in the context of weighted graphs. The main objective of this study is to establish a graph-theoretic framework for the study of quantum discord and to identify the necessary and sufficient conditions for zero quantum discord states using unitary operators. This research aims to advance the understanding of quantum discord and its implications for quantum information theory with a graph-theoretic framework.
△ Less
Submitted 17 May, 2024; v1 submitted 16 May, 2024;
originally announced May 2024.
-
NH3 gas sensing over 2D Phosphorene sheet: A First-Principles Study
Authors:
Naresh Kumar,
Yogendra K. Gautam,
Soni Mishra,
Anuj Kumar,
Abhishek Kumar Mishra
Abstract:
First-principles based calculations were executed to investigate the sensing properties of ammonia gas molecules on two-dimensional pristine black phosphorene towards its application as a gas sensor and related applications. We discuss in detail, the interaction of ammonia gas molecules on the phosphorene single sheet through the structural change analysis, electronic band gap, Bader charge transf…
▽ More
First-principles based calculations were executed to investigate the sensing properties of ammonia gas molecules on two-dimensional pristine black phosphorene towards its application as a gas sensor and related applications. We discuss in detail, the interaction of ammonia gas molecules on the phosphorene single sheet through the structural change analysis, electronic band gap, Bader charge transfer, and density-of-states calculations. Our calculations indicate that the phosphorene could be used as a detector of ammonia, where good sensitivity and very short recovery time at room temperature have confirmed the potential use of phosphorene in the detection of ammonia.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
DeCoDEx: Confounder Detector Guidance for Improved Diffusion-based Counterfactual Explanations
Authors:
Nima Fathi,
Amar Kumar,
Brennan Nichyporuk,
Mohammad Havaei,
Tal Arbel
Abstract:
Deep learning classifiers are prone to latching onto dominant confounders present in a dataset rather than on the causal markers associated with the target class, leading to poor generalization and biased predictions. Although explainability via counterfactual image generation has been successful at exposing the problem, bias mitigation strategies that permit accurate explainability in the presenc…
▽ More
Deep learning classifiers are prone to latching onto dominant confounders present in a dataset rather than on the causal markers associated with the target class, leading to poor generalization and biased predictions. Although explainability via counterfactual image generation has been successful at exposing the problem, bias mitigation strategies that permit accurate explainability in the presence of dominant and diverse artifacts remain unsolved. In this work, we propose the DeCoDEx framework and show how an external, pre-trained binary artifact detector can be leveraged during inference to guide a diffusion-based counterfactual image generator towards accurate explainability. Experiments on the CheXpert dataset, using both synthetic artifacts and real visual artifacts (support devices), show that the proposed method successfully synthesizes the counterfactual images that change the causal pathology markers associated with Pleural Effusion while preserving or ignoring the visual artifacts. Augmentation of ERM and Group-DRO classifiers with the DeCoDEx generated images substantially improves the results across underrepresented groups that are out of distribution for each class. The code is made publicly available at https://github.com/NimaFathi/DeCoDEx.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
Methods and stability tests associated with the sterile neutrino search using improved high-energy $ν_μ$ event reconstruction in IceCube
Authors:
IceCube Collaboration,
R. Abbasi,
M. Ackermann,
J. Adams,
S. K. Agarwalla,
J. A. Aguilar,
M. Ahlers,
J. M. Alameddine,
N. M. Amin,
K. Andeen,
C. Argüelles,
Y. Ashida,
S. Athanasiadou,
L. Ausborm,
S. N. Axani,
X. Bai,
A. Balagopal V.,
M. Baricevic,
S. W. Barwick,
S. Bash,
V. Basu,
R. Bay,
J. J. Beatty,
J. Becker Tjus,
J. Beise
, et al. (398 additional authors not shown)
Abstract:
We provide supporting details for the search for a 3+1 sterile neutrino using data collected over eleven years at the IceCube Neutrino Observatory. The analysis uses atmospheric muon-flavored neutrinos from 0.5 to 100\, TeV that traverse the Earth to reach the IceCube detector, and finds a best-fit point at $\sin^2(2θ_{24}) = 0.16$ and $Δm^{2}_{41} = 3.5$ eV$^2$ with a goodness-of-fit p-value of 1…
▽ More
We provide supporting details for the search for a 3+1 sterile neutrino using data collected over eleven years at the IceCube Neutrino Observatory. The analysis uses atmospheric muon-flavored neutrinos from 0.5 to 100\, TeV that traverse the Earth to reach the IceCube detector, and finds a best-fit point at $\sin^2(2θ_{24}) = 0.16$ and $Δm^{2}_{41} = 3.5$ eV$^2$ with a goodness-of-fit p-value of 12\% and consistency with the null hypothesis of no oscillations to sterile neutrinos with a p-value of 3.1\%. Several improvements were made over past analyses, which are reviewed in this article, including upgrades to the reconstruction and the study of sources of systematic uncertainty. We provide details of the fit quality and discuss stability tests that split the data for separate samples, comparing results. We find that the fits are consistent between split data sets.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
A search for an eV-scale sterile neutrino using improved high-energy $ν_μ$ event reconstruction in IceCube
Authors:
IceCube Collaboration,
R. Abbasi,
M. Ackermann,
J. Adams,
S. K. Agarwalla,
J. A. Aguilar,
M. Ahlers,
J. M. Alameddine,
N. M. Amin,
K. Andeen,
C. Argüelles,
Y. Ashida,
S. Athanasiadou,
L. Ausborm,
S. N. Axani,
X. Bai,
A. Balagopal V.,
M. Baricevic,
S. W. Barwick,
S. Bash,
V. Basu,
R. Bay,
J. J. Beatty,
J. Becker Tjus,
J. Beise
, et al. (398 additional authors not shown)
Abstract:
This Letter presents the result of a 3+1 sterile neutrino search using 10.7 years of IceCube data. We analyze atmospheric muon neutrinos that traverse the Earth with energies ranging from 0.5 to 100 TeV, incorporating significant improvements in modeling neutrino flux and detector response compared to earlier studies. Notably, for the first time, we categorize data into starting and through-going…
▽ More
This Letter presents the result of a 3+1 sterile neutrino search using 10.7 years of IceCube data. We analyze atmospheric muon neutrinos that traverse the Earth with energies ranging from 0.5 to 100 TeV, incorporating significant improvements in modeling neutrino flux and detector response compared to earlier studies. Notably, for the first time, we categorize data into starting and through-going events, distinguishing neutrino interactions with vertices inside or outside the instrumented volume, to improve energy resolution. The best-fit point for a 3+1 model is found to be at $\sin^2(2θ_{24}) = 0.16$ and $Δm^{2}_{41} = 3.5$ eV$^2$, which agrees with previous iterations of this study. The result is consistent with the null hypothesis of no sterile neutrinos with a p-value of 3.1\%.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
A Methodology-Oriented Study of Catastrophic Forgetting in Incremental Deep Neural Networks
Authors:
Ashutosh Kumar,
Sonali Agarwal,
D Jude Hemanth
Abstract:
Human being and different species of animals having the skills to gather, transferring knowledge, processing, fine-tune and generating information throughout their lifetime. The ability of learning throughout their lifespan is referred as continuous learning which is using neurocognition mechanism. Consequently, in real world computational system of incremental learning autonomous agents also need…
▽ More
Human being and different species of animals having the skills to gather, transferring knowledge, processing, fine-tune and generating information throughout their lifetime. The ability of learning throughout their lifespan is referred as continuous learning which is using neurocognition mechanism. Consequently, in real world computational system of incremental learning autonomous agents also needs such continuous learning mechanism which provide retrieval of information and long-term memory consolidation. However, the main challenge in artificial intelligence is that the incremental learning of the autonomous agent when new data confronted. In such scenarios, the main concern is catastrophic forgetting(CF), i.e., while learning the sequentially, neural network underfits the old data when it confronted with new data. To tackle this CF problem many numerous studied have been proposed, however it is very difficult to compare their performance due to dissimilarity in their evaluation mechanism. Here we focus on the comparison of all algorithms which are having similar type of evaluation mechanism. Here we are comparing three types of incremental learning methods: (1) Exemplar based methods, (2) Memory based methods, and (3) Network based method. In this survey paper, methodology oriented study for catastrophic forgetting in incremental deep neural network is addressed. Furthermore, it contains the mathematical overview of impact-full methods which can be help researchers to deal with CF.
△ Less
Submitted 11 May, 2024;
originally announced May 2024.
-
Phase separation in a binary mixture of sticky spheres
Authors:
D. C. Thakur,
Jalim Singh,
A. V. Anil Kumar
Abstract:
We numerically investigate the dependence of range of attractive potential on the phase separation of 2-D binary systems. Through extensive simulations and analysis, we show that when the range of attractive interactions approaches the sticky sphere limit, the system undergoes a phase separation at lower temperature. Further reduction in temperature causes the system to mix again. These mixing-dem…
▽ More
We numerically investigate the dependence of range of attractive potential on the phase separation of 2-D binary systems. Through extensive simulations and analysis, we show that when the range of attractive interactions approaches the sticky sphere limit, the system undergoes a phase separation at lower temperature. Further reduction in temperature causes the system to mix again. These mixing-demixing-mixing transitions are of first order. Such phase separation is not observed for systems with larger interaction range. In the phase separated region of the phase diagram, one of the components of the mixture chooses to be in crystalline configuration, while other being in disordered state
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Entanglement Dynamics in Quantum Continuous-Variable States
Authors:
Ankit Kumar
Abstract:
Due to the weakness of gravitational coupling, all quantum experiments up to date in which gravity plays a role utilized the field of the Earth. Since this field undergoes practically undetectable back-action from quantum particles, it effectively admits a classical description as a fixed background Newtonian field or spacetime. This argument strongly motivates theoretical and experimental researc…
▽ More
Due to the weakness of gravitational coupling, all quantum experiments up to date in which gravity plays a role utilized the field of the Earth. Since this field undergoes practically undetectable back-action from quantum particles, it effectively admits a classical description as a fixed background Newtonian field or spacetime. This argument strongly motivates theoretical and experimental research towards a demonstration of gravitation between two quantum masses, as this is one of the most straightforward scenarios where quantum features of gravity could be observed. Several proposals studied the possibility of generating entanglement between two massive objects. Along the same lines, with a particular focus on gravity, this thesis introduces general tools to tackle interaction-mediated entanglement and applies them to two particles prepared in continuous-variable states.
△ Less
Submitted 15 May, 2024; v1 submitted 12 May, 2024;
originally announced May 2024.
-
Host-Based Allocators for Device Memory
Authors:
Oren Bell,
Ashwin Kumar,
Chris Gill
Abstract:
Memory allocation is a fairly mature field of computer science. However, we challenge a prevailing assumption in the literature over the last 50 years which, if reconsidered, necessitates a fundamental reevaluation of many classical memory management algorithms. We pose a model where the allocation algorithm runs on host memory but allocates device memory and so incur the following constraint: the…
▽ More
Memory allocation is a fairly mature field of computer science. However, we challenge a prevailing assumption in the literature over the last 50 years which, if reconsidered, necessitates a fundamental reevaluation of many classical memory management algorithms. We pose a model where the allocation algorithm runs on host memory but allocates device memory and so incur the following constraint: the allocator can't read the memory it is allocating.
This means we are unable to use boundary tags, which is a concept that has been ubiquitous in nearly every allocation algorithm. In this paper, we propose alternate algorithms to work around this constraint, and discuss in general the implications of this system model.
△ Less
Submitted 11 May, 2024;
originally announced May 2024.
-
Multiple magnetic interactions and large inverse magnetocaloric effect in TbSi and TbSi$_{0.6}$Ge$_{0.4}$
Authors:
Ajay Kumar,
Prashant Singh,
Andrew Doyle,
Deborah L. Schlagel,
Yaroslav Mudryk
Abstract:
We present a comprehensive investigation of the electronic structure, magnetization, specific heat, and crystallography of TbSi (FeB structure type) and TbSi$_{0.6}$Ge$_{0.4}$ (CrB structure type) compounds. Both TbSi and TbSi$_{0.6}$Ge$_{0.4}$ exhibit two antiferromagnetic (AFM) transitions at T$_{\rm N1}\approx$ 58~K and 57~K, and T$_{\rm N2}\approx$ 36~K and 44~K, respectively, along with an on…
▽ More
We present a comprehensive investigation of the electronic structure, magnetization, specific heat, and crystallography of TbSi (FeB structure type) and TbSi$_{0.6}$Ge$_{0.4}$ (CrB structure type) compounds. Both TbSi and TbSi$_{0.6}$Ge$_{0.4}$ exhibit two antiferromagnetic (AFM) transitions at T$_{\rm N1}\approx$ 58~K and 57~K, and T$_{\rm N2}\approx$ 36~K and 44~K, respectively, along with an onset of weak metamagnetic-like transition around 6~T between T$_{\rm N1}$ and T$_{\rm N2}$. High-resolution specific heat (C$_{\rm P}$) measurements show the second- and first-order nature of the magnetic transition at T$_{\rm N1}$ and T$_{\rm N2}$, respectively, for both samples. However, in the case of TbSi, the low-temperature (LT) AFM to high-temperature (HT) AFM transition takes place via an additional AFM phase at the intermediate temperature (IT), where both LT to IT AFM and IT to HT AFM phase transitions exhibit a first-order nature. Both TbSi and TbSi$_{0.6}$Ge$_{0.4}$ manifest significant magnetic entropy changes ($ΔS_{\rm M}$) of 9.6 and 11.6~J/kg-K, respectively, for $Δμ_0H$=7~T, at T$_{\rm N2}$. The HT AFM phase of TbSi$_{0.6}$Ge$_{0.4}$ is found to be more susceptible to the external magnetic field, causing a significant broadening in the peaks of $ΔS_{\rm M}$ curves at higher magnetic fields. Temperature and field-dependent specific heat data have been utilized to construct the complex H-T phase diagram of these compounds. Furthermore, temperature-dependent x-ray diffraction measurements demonstrate substantial magnetostriction and anisotropic thermal expansion of the unit cell in both samples.
△ Less
Submitted 10 May, 2024;
originally announced May 2024.
-
Deep learning-based variational autoencoder for classification of quantum and classical states of light
Authors:
Mahesh Bhupati,
Abhishek Mall,
Anshuman Kumar,
Pankaj K. Jha
Abstract:
Advancements in optical quantum technologies have been enabled by the generation, manipulation, and characterization of light, with identification based on its photon statistics. However, characterizing light and its sources through single photon measurements often requires efficient detectors and longer measurement times to obtain high-quality photon statistics. Here we introduce a deep learning-…
▽ More
Advancements in optical quantum technologies have been enabled by the generation, manipulation, and characterization of light, with identification based on its photon statistics. However, characterizing light and its sources through single photon measurements often requires efficient detectors and longer measurement times to obtain high-quality photon statistics. Here we introduce a deep learning-based variational autoencoder (VAE) method for classifying single photon added coherent state (SPACS), single photon added thermal state (SPACS), mixed states between coherent/SPACS and thermal/SPATS of light. Our semisupervised learning-based VAE efficiently maps the photon statistics features of light to a lower dimension, enabling quasi-instantaneous classification with low average photon counts. The proposed VAE method is robust and maintains classification accuracy in the presence of losses inherent in an experiment, such as finite collection efficiency, non-unity quantum efficiency, finite number of detectors, etc. Additionally, leveraging the transfer learning capabilities of VAE enables successful classification of data of any quality using a single trained model. We envision that such a deep learning methodology will enable better classification of quantum light and light sources even in the presence of poor detection quality.
△ Less
Submitted 8 May, 2024;
originally announced May 2024.
-
Constraining the core radius and density jumps inside Earth using atmospheric neutrino oscillations
Authors:
Anuj Kumar Upadhyay,
Anil Kumar,
Sanjib Kumar Agarwalla,
Amol Dighe
Abstract:
Atmospheric neutrinos can act as a tool to probe the interior of Earth using weak interactions, and can provide information complementary to that obtained from gravitational and seismic measurements. While passing through Earth, multi-GeV neutrinos encounter Earth matter effects due to the coherent forward scattering with the ambient electrons, which alter the neutrino oscillation probabilities. T…
▽ More
Atmospheric neutrinos can act as a tool to probe the interior of Earth using weak interactions, and can provide information complementary to that obtained from gravitational and seismic measurements. While passing through Earth, multi-GeV neutrinos encounter Earth matter effects due to the coherent forward scattering with the ambient electrons, which alter the neutrino oscillation probabilities. These matter effects depend upon the density distribution of electrons inside Earth, and hence, can be used to determine the internal structure of Earth. In this work, we employ a five-layered model of Earth where the layer densities and radii are modified, keeping the mass and moment of inertia of Earth unchanged and respecting the hydrostatic equilibrium condition. We use the proposed INO-ICAL detector as an example of an atmospheric neutrino experiment that can distinguish between neutrinos and antineutrinos efficiently in the multi-GeV energy range. Our analysis demonstrates the role such an experiment can play in simultaneously constraining the density jumps inside Earth and the location of the core-mantle boundary.
△ Less
Submitted 8 May, 2024;
originally announced May 2024.
-
Legendrian knots and multi-crossings
Authors:
Amit Kumar,
Jake Murphy,
Brian Naff
Abstract:
It was shown in arXiv:1208.5742 that any smooth knot can be represented by an übercrossing projection, i.e. a knot projection with no crossings aside from a single multi-crossing. We extend this idea to Legendrian knots and investigate übercrossing and petal projections in the front and Lagrangian projections. We show that any Legendrian knot with an übercrossing projection in the front projection…
▽ More
It was shown in arXiv:1208.5742 that any smooth knot can be represented by an übercrossing projection, i.e. a knot projection with no crossings aside from a single multi-crossing. We extend this idea to Legendrian knots and investigate übercrossing and petal projections in the front and Lagrangian projections. We show that any Legendrian knot with an übercrossing projection in the front projection is smoothly isotopic to the unknot and we demonstrate how to compute the $tb$ and rotation numbers for petal projections in the Lagrangian projection.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
The Fault in Our Recommendations: On the Perils of Optimizing the Measurable
Authors:
Omar Besbes,
Yash Kanoria,
Akshit Kumar
Abstract:
Recommendation systems are widespread, and through customized recommendations, promise to match users with options they will like. To that end, data on engagement is collected and used. Most recommendation systems are ranking-based, where they rank and recommend items based on their predicted engagement. However, the engagement signals are often only a crude proxy for utility, as data on the latte…
▽ More
Recommendation systems are widespread, and through customized recommendations, promise to match users with options they will like. To that end, data on engagement is collected and used. Most recommendation systems are ranking-based, where they rank and recommend items based on their predicted engagement. However, the engagement signals are often only a crude proxy for utility, as data on the latter is rarely collected or available. This paper explores the following question: By optimizing for measurable proxies, are recommendation systems at risk of significantly under-delivering on utility? If so, how can one improve utility which is seldom measured? To study these questions, we introduce a model of repeated user consumption in which, at each interaction, users select between an outside option and the best option from a recommendation set. Our model accounts for user heterogeneity, with the majority preferring ``popular'' content, and a minority favoring ``niche'' content. The system initially lacks knowledge of individual user preferences but can learn them through observations of users' choices over time. Our theoretical and numerical analysis demonstrate that optimizing for engagement can lead to significant utility losses. Instead, we propose a utility-aware policy that initially recommends a mix of popular and niche content. As the platform becomes more forward-looking, our utility-aware policy achieves the best of both worlds: near-optimal utility and near-optimal engagement simultaneously. Our study elucidates an important feature of recommendation systems; given the ability to suggest multiple items, one can perform significant exploration without incurring significant reductions in engagement. By recommending high-risk, high-reward items alongside popular items, systems can enhance discovery of high utility items without significantly affecting engagement.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
Search for joint multimessenger signals from potential Galactic PeVatrons with HAWC and IceCube
Authors:
R. Alfaro,
C. Alvarez,
J. C. Arteaga-Velázquez,
D. Avila Rojas,
H. A. Ayala Solares,
R. Babu,
E. Belmont-Moreno,
K. S. Caballero-Mora,
T. Capistrán,
A. Carramiñana,
S. Casanova,
U. Cotti,
J. Cotzomi,
S. Coutiño de León,
E. De la Fuente,
D. Depaoli,
N. Di Lalla,
R. Diaz Hernandez,
J. C. Díaz-Vélez,
K. Engel,
T. Ergin,
K. L. Fan,
K. Fang,
N. Fraija,
S. Fraija
, et al. (469 additional authors not shown)
Abstract:
Galactic PeVatrons are sources that can accelerate cosmic rays to PeV energies. The high-energy cosmic rays are expected to interact with the surrounding ambient material or radiation, resulting in the production of gamma rays and neutrinos. To optimize for the detection of such associated production of gamma rays and neutrinos for a given source morphology and spectrum, a multi-messenger analysis…
▽ More
Galactic PeVatrons are sources that can accelerate cosmic rays to PeV energies. The high-energy cosmic rays are expected to interact with the surrounding ambient material or radiation, resulting in the production of gamma rays and neutrinos. To optimize for the detection of such associated production of gamma rays and neutrinos for a given source morphology and spectrum, a multi-messenger analysis that combines gamma rays and neutrinos is required. In this study, we use the Multi-Mission Maximum Likelihood framework (3ML) with IceCube Maximum Likelihood Analysis software (i3mla) and HAWC Accelerated Likelihood (HAL) to search for a correlation between 22 known gamma-ray sources from the third HAWC gamma-ray catalog and 14 years of IceCube track-like data. No significant neutrino emission from the direction of the HAWC sources was found. We report the best-fit gamma-ray model and 90% CL neutrino flux limit from the 22 sources. From the neutrino flux limit, we conclude that the gamma-ray emission from five of the sources can not be produced purely from hadronic interactions. We report the limit for the fraction of gamma rays produced by hadronic interactions for these five sources.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
Safe Reinforcement Learning with Learned Non-Markovian Safety Constraints
Authors:
Siow Meng Low,
Akshat Kumar
Abstract:
In safe Reinforcement Learning (RL), safety cost is typically defined as a function dependent on the immediate state and actions. In practice, safety constraints can often be non-Markovian due to the insufficient fidelity of state representation, and safety cost may not be known. We therefore address a general setting where safety labels (e.g., safe or unsafe) are associated with state-action traj…
▽ More
In safe Reinforcement Learning (RL), safety cost is typically defined as a function dependent on the immediate state and actions. In practice, safety constraints can often be non-Markovian due to the insufficient fidelity of state representation, and safety cost may not be known. We therefore address a general setting where safety labels (e.g., safe or unsafe) are associated with state-action trajectories. Our key contributions are: first, we design a safety model that specifically performs credit assignment to assess contributions of partial state-action trajectories on safety. This safety model is trained using a labeled safety dataset. Second, using RL-as-inference strategy we derive an effective algorithm for optimizing a safe policy using the learned safety model. Finally, we devise a method to dynamically adapt the tradeoff coefficient between reward maximization and safety compliance. We rewrite the constrained optimization problem into its dual problem and derive a gradient-based method to dynamically adjust the tradeoff coefficient during training. Our empirical results demonstrate that this approach is highly scalable and able to satisfy sophisticated non-Markovian safety constraints.
△ Less
Submitted 5 May, 2024;
originally announced May 2024.
-
Deep Learning of ab initio Hessians for Transition State Optimization
Authors:
Eric C. -Y. Yuan,
Anup Kumar,
Xingyi Guan,
Eric D. Hermes,
Andrew S. Rosen,
Judit Zádor,
Teresa Head-Gordon,
Samuel M. Blau
Abstract:
Identifying transition states -- saddle points on the potential energy surface connecting reactant and product minima -- is central to predicting kinetic barriers and understanding chemical reaction mechanisms. In this work, we train an equivariant neural network potential, NewtonNet, on an ab initio dataset of thousands of organic reactions from which we derive the analytical Hessians from the fu…
▽ More
Identifying transition states -- saddle points on the potential energy surface connecting reactant and product minima -- is central to predicting kinetic barriers and understanding chemical reaction mechanisms. In this work, we train an equivariant neural network potential, NewtonNet, on an ab initio dataset of thousands of organic reactions from which we derive the analytical Hessians from the fully differentiable machine learning (ML) model. By reducing the computational cost by several orders of magnitude relative to the Density Functional Theory (DFT) ab initio source, we can afford to use the learned Hessians at every step for the saddle point optimizations. We have implemented our ML Hessian algorithm in Sella, an open source software package designed to optimize atomic systems to find saddle point structures, in order to compare transition state optimization against quasi-Newton Hessian updates using DFT or the ML model. We show that the full ML Hessian robustly finds the transition states of 240 unseen organic reactions, even when the quality of the initial guess structures are degraded, while reducing the number of optimization steps to convergence by 2--3$\times$ compared to the quasi-Newton DFT and ML methods. All data generation, NewtonNet model, and ML transition state finding methods are available in an automated workflow.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.