arXiv:2406.06656 [pdf]

Spin-polarized DFT calculations for physical properties of novel KVSb half-Heusler compound for spintronic and thermodynamic applicability

Authors: Ashwani Kumar, Anupam, Shyam L. Gupta, Sumit Kumar, Vipan Kumar, Diwaker

Abstract: In the reported study we have investigated the robust phase stability, elasto-mechanical, thermophysical and magnetic properties of the KVSb half-Heusler compound by implementing density functional theory models in the Wien2k simulation package. The dynamic phase stability is computed in type I, II and III phase configurations by optimising their energy. It is observed that the given compound is more stable in the spin-polarised state of phase type I. To explore the electronic band structure, we apply the generalised gradient approximation. The electronic band profile of the Heusler alloy displays a half-metallic nature. Moreover, the calculated second-order elastic parameters reveal its ductile nature. To understand the thermodynamical and thermoelectric stability of the alloy at various temperature and pressure ranges we have utilised the Quasi-Harmonic Debye model. The computed value of the magnetic moment is found to be in good agreement with the Slater-Pauling rule. Our findings confirm that the predicted half-Heusler alloy can be used in various spintronics and thermoelectric applications.

Submitted 10 June, 2024; originally announced June 2024.

arXiv:2406.06533 [pdf, other]

Pragmatic Formal Verification Methodology for Clock Domain Crossing (CDC)

Authors: Aman Kumar, Muhammad Ul Haque Khan, Bijitendra Mittra

Abstract: Modern System-on-Chip (SoC) designs are becoming more and more complex due to the technology upscaling. SoC designs often operate on multiple asynchronous clock domains, further adding to the complexity of the overall design. To make the devices power efficient, designers take a Globally-Asynchronous Locally-Synchronous (GALS) approach that creates multiple asynchronous domains. These Clock Domain Crossings (CDC) are prone to metastability effects, and functional verification of such CDC is very important to ensure that no bug escapes. Conventional verification methods, such as register transfer level (RTL) simulations and static timing analysis, are not enough to address these CDC issues, which may lead to verification gaps. Additionally, identifying these CDC-related bugs is very time-consuming and is one of the most common reasons for costly silicon re-spins. This paper is focused on the development of a pragmatic formal verification methodology to minimize the CDC issues by exercising Metastability Injection (MSI) in different CDC paths.

Submitted 20 April, 2024; originally announced June 2024.

Comments: Published in DVCon Europe 2023

arXiv:2406.06512 [pdf, other]

Merlin: A Vision Language Foundation Model for 3D Computed Tomography

Authors: Louis Blankemeier, Joseph Paul Cohen, Ashwin Kumar, Dave Van Veen, Syed Jamal Safdar Gardezi, Magdalini Paschali, Zhihong Chen, Jean-Benoit Delbrouck, Eduardo Reis, Cesar Truyts, Christian Bluethgen, Malte Engmann Kjeldskov Jensen, Sophie Ostmeier, Maya Varma, Jeya Maria Jose Valanarasu, Zhongnan Fang, Zepeng Huo, Zaid Nabulsi, Diego Ardila, Wei-Hung Weng, Edson Amaro Junior, Neera Ahuja, Jason Fries, Nigam H. Shah, Andrew Johnston , et al. (6 additional authors not shown)

Abstract: Over 85 million computed tomography (CT) scans are performed annually in the US, of which approximately one quarter focus on the abdomen. Given the current radiologist shortage, there is a large impetus to use artificial intelligence to alleviate the burden of interpreting these complex imaging studies. Prior state-of-the-art approaches for automated medical image interpretation leverage vision language models (VLMs). However, current medical VLMs are generally limited to 2D images and short reports, and do not leverage electronic health record (EHR) data for supervision. We introduce Merlin - a 3D VLM that we train using paired CT scans (6+ million images from 15,331 CTs), EHR diagnosis codes (1.8+ million codes), and radiology reports (6+ million tokens). We evaluate Merlin on 6 task types and 752 individual tasks. The non-adapted (off-the-shelf) tasks include zero-shot findings classification (31 findings), phenotype classification (692 phenotypes), and zero-shot cross-modal retrieval (image to findings and image to impressions), while model adapted tasks include 5-year disease prediction (6 diseases), radiology report generation, and 3D semantic segmentation (20 organs). We perform internal validation on a test set of 5,137 CTs, and external validation on 7,000 clinical CTs and on two public CT datasets (VerSe, TotalSegmentator). Beyond these clinically-relevant evaluations, we assess the efficacy of various network architectures and training strategies to show that Merlin performs favorably relative to existing task-specific baselines. We derive data scaling laws to empirically assess training data needs for requisite downstream task performance. Furthermore, unlike conventional VLMs that require hundreds of GPUs for training, we perform all training on a single GPU.

Submitted 10 June, 2024; originally announced June 2024.

Comments: 18 pages, 7 figures

arXiv:2406.05643 [pdf, other]

doi 10.1103/PhysRevB.109.235401

Predicting edge-localized monovacancy defects in zigzag graphene nanoribbons from Floquet quasienergy spectrum

Authors: Gulshan Kumar, Shashikant Kumar, Ajay Kumar, Prakash Parida

Abstract: In this work, we prescribe a theoretical framework aiming at predicting the position of monovacancy defects at the edges of zigzag graphene nanoribbons (ZGNRs) using Floquet-Bloch formalism, which can be experimentally observed through time- and angle-resolved photoemission spectroscopy (tr-ARPES). Our methodology involves an in-depth investigation of the Floquet quasienergy band spectrum influenced by light with varying polarization across a range of frequencies. Particularly under the influence of circularly polarized light with a frequency comparable to the bandwidth of the system, our findings suggest a promising approach for locating monovacancy defects at either edge, a challenge that proves intricate to predict from the ARPES spectrum of ZGNRs with monovacancy defects. This has been achieved by analyzing the orientation of the Floquet edge state and the appearance of new Dirac points in the vicinity of the Fermi level. The real-world applications of these captivating characteristics underscore the importance and pertinence of our theoretical framework, paving the way for additional exploration and practical use. Our approach, employing the Floquet formalism, is not limited to monovacancy-type defects; rather, it can be expanded to encompass various types of vacancy defects.

Submitted 9 June, 2024; originally announced June 2024.

Comments: Total number of 10 pages and 12 figures

Journal ref: Physical Review B 109, 235401 (2024)

arXiv:2406.05416 [pdf, other]

Focusing of concentric free-surface waves

Authors: Lohit Kayal, Vatsal Sanjay, Nikhil Yewale, Anil Kumar, Ratul Dasgupta

Abstract: Gravito-capillary waves at free-surfaces are ubiquitous in several natural and industrial processes involving quiescent liquid pools bounded by cylindrical walls. These waves emanate from the relaxation of initial interface distortions, which often take the form of a cavity (depression) centred on the symmetry axis of the container. These surface waves reflect from the container walls leading to a radially inward propagating wave-train converging (focussing) onto the symmetry axis. Under the inviscid approximation and for sufficiently shallow cavities, the relaxation is well-described by the linearised potential-flow equations. Naturally, adding viscosity to such a system introduces viscous dissipation that enervates energy and dampens the oscillations at the symmetry axis. However, for viscous liquids and deeper cavities, these equations are qualitatively inaccurate. In this study, we elucidate a modal approach to study the initial-value problem for concentric gravito-capillary waves generated on a free-surface for inviscid as well as viscous liquids. For a sufficiently deep cavity, the inward focusing of waves results in large interfacial oscillations at the axis, necessitating a second-order nonlinear theory. We demonstrate that this theory effectively models the interfacial behavior and highlights the crucial role of nonlinearity near the symmetry axis. Contrary to expectations, the addition of slight viscosity further intensifies the oscillations at the symmetry axis. This finding underscores the limitations of the potential flow model and suggests avenues for more accurate modelling of such complex free-surface flows.

Submitted 8 June, 2024; originally announced June 2024.

arXiv:2406.04744 [pdf, other]

CRAG -- Comprehensive RAG Benchmark

Authors: Xiao Yang, Kai Sun, Hao Xin, Yushi Sun, Nikita Bhalla, Xiangsen Chen, Sajal Choudhary, Rongze Daniel Gui, Ziran Will Jiang, Ziyu Jiang, Lingkun Kong, Brian Moran, Jiaqi Wang, Yifan Ethan Xu, An Yan, Chenyu Yang, Eting Yuan, Hanwen Zha, Nan Tang, Lei Chen, Nicolas Scheffer, Yue Liu, Nirav Shah, Rakesh Wanga, Anuj Kumar , et al. (2 additional authors not shown)

Abstract: Retrieval-Augmented Generation (RAG) has recently emerged as a promising solution to alleviate the knowledge deficiencies of Large Language Models (LLMs). Existing RAG datasets, however, do not adequately represent the diverse and dynamic nature of real-world Question Answering (QA) tasks. To bridge this gap, we introduce the Comprehensive RAG Benchmark (CRAG), a factual question answering benchmark of 4,409 question-answer pairs and mock APIs to simulate web and Knowledge Graph (KG) search. CRAG is designed to encapsulate a diverse array of questions across five domains and eight question categories, reflecting varied entity popularity from popular to long-tail, and temporal dynamisms ranging from years to seconds. Our evaluation on this benchmark highlights the gap to fully trustworthy QA. Whereas most advanced LLMs achieve <=34% accuracy on CRAG, adding RAG in a straightforward manner improves the accuracy only to 44%. State-of-the-art industry RAG solutions answer only 63% of questions without any hallucination. CRAG also reveals much lower accuracy in answering questions regarding facts with higher dynamism, lower popularity, or higher complexity, suggesting future research directions. The CRAG benchmark laid the groundwork for a KDD Cup 2024 challenge, attracting thousands of participants and submissions within the first 50 days of the competition. We commit to maintaining CRAG to serve research communities in advancing RAG solutions and general QA solutions.

Submitted 7 June, 2024; originally announced June 2024.

arXiv:2406.04697 [pdf, other]

Mechanism of instability in non-uniform dusty channel flow

Authors: Anup Kumar, Rama Govindarajan

Abstract: Particles in pressure-driven channel flow are often inhomogeneously distributed. Two modes of low-Reynolds number instability, absent in Poiseuille flow of clean fluid, are created by inhomogeneous particle loading, and their mechanism is worked out here. Two distinct classes of behaviour are seen: when the critical layer of the dominant perturbation overlaps with variations in particle concentration, the new instabilities arise, which we term overlap modes. But when the layers are distinct, only the traditional Tollmien-Schlichting mode of instability occurs. We derive the dominant critical layer balance equations in this flow along the lines done classically for clean fluid. These reveal how concentration variations within the critical layer cause the two particle-driven instabilities. As a result of these variations, disturbance kinetic energy production is qualitatively and majorly altered. Surprisingly, the two overlap modes, though completely different in the symmetry of the eigenstructure and regime of exponential growth, show practically identical energy budgets, highlighting the relevance of variations within the critical layer. The wall layer is shown to be unimportant. We derive a minimal composite theory comprising all terms in the complete equation which are dominant somewhere in the flow, and show that it contains the essential physics. When particles are infinitely dense relative to the fluid, the volume fraction is negligible. But for finite density ratios, the volume fraction of particles causes a profile of effective viscosity. This is shown to be uniformly stabilizing in the present flow. Gravity is neglected here, and will be important to study in future. So will transient growth of perturbations due to non-normality of the stability operator, in a quest for the mechanism of transition to turbulence.

Submitted 7 June, 2024; originally announced June 2024.

arXiv:2406.04660 [pdf, other]

URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement

Authors: Wangyou Zhang, Robin Scheibler, Kohei Saijo, Samuele Cornell, Chenda Li, Zhaoheng Ni, Anurag Kumar, Jan Pirklbauer, Marvin Sach, Shinji Watanabe, Tim Fingscheidt, Yanmin Qian

Abstract: The last decade has witnessed significant advancements in deep learning-based speech enhancement (SE). However, most existing SE research has limitations on the coverage of SE sub-tasks, data diversity and amount, and evaluation metrics. To fill this gap and promote research toward universal SE, we establish a new SE challenge, named URGENT, to focus on the universality, robustness, and generalizability of SE. We aim to extend the SE definition to cover different sub-tasks to explore the limits of SE models, starting from denoising, dereverberation, bandwidth extension, and declipping. A novel framework is proposed to unify all these sub-tasks in a single model, allowing the use of all existing SE approaches. We collected public speech and noise data from different domains to construct diverse evaluation data. Finally, we discuss the insights gained from our preliminary baseline experiments based on both generative and discriminative SE methods with 12 curated metrics.

Submitted 7 June, 2024; originally announced June 2024.

Comments: 6 pages, 3 figures, 3 tables. Accepted by Interspeech 2024. An extended version of the accepted manuscript with appendix

arXiv:2406.04454 [pdf, other]

Ultrafast Optical Control of Rashba Interactions in a TMDC Heterostructure

Authors: Henry Mittenzwey, Abhijeet Kumar, Raghav Dhingra, Kenji Watanabe, Takashi Taniguchi, Cornelius Gahl, Kirill I. Bolotin, Malte Selig, Andreas Knorr

Abstract: We investigate spin relaxation dynamics of interlayer excitons in a MoSe2/MoS2 heterostructure induced by the Rashba effect. In such a system, Rashba interactions arise from an out-of-plane electric field due to photo-generated interlayer excitons inducing a phonon-assisted intravalley spin relaxation. We develop a theoretical description based on a microscopic approach to quantify the magnitude of Rashba interactions and test these predictions via time-resolved Kerr rotation measurements. In agreement with the calculations, we find that the Rashba-induced intravalley spin mixing becomes the dominating spin relaxation channel above T = 50 K. Our work identifies a previously unexplored spin-depolarization channel in heterostructures which can be used for ultrafast spin manipulation.

Submitted 6 June, 2024; originally announced June 2024.

arXiv:2406.04413 [pdf, other]

Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning

Authors: Amandeep Kumar, Muhammad Awais, Sanath Narayan, Hisham Cholakkal, Salman Khan, Rao Muhammad Anwer

Abstract: Drawing upon StyleGAN's expressivity and disentangled latent space, existing 2D approaches employ textual prompting to edit facial images with different attributes. In contrast, 3D-aware approaches that generate faces at different target poses require attribute-specific classifiers, learning separate model weights for each attribute, and are not scalable for novel attributes. In this work, we propose an efficient, plug-and-play, 3D-aware face editing framework based on attribute-specific prompt learning, enabling the generation of facial images with controllable attributes across various target poses. To this end, we introduce a text-driven learnable style token-based latent attribute editor (LAE). The LAE harnesses a pre-trained vision-language model to find text-guided attribute-specific editing direction in the latent space of any pre-trained 3D-aware GAN. It utilizes learnable style tokens and style mappers to learn and transform this editing direction to 3D latent space. To train LAE with multiple attributes, we use directional contrastive loss and style token loss. Furthermore, to ensure view consistency and identity preservation across different poses and attributes, we employ several 3D-aware identity and pose preservation losses. Our experiments show that our proposed framework generates high-quality images with 3D awareness and view consistency while maintaining attribute-specific features. We demonstrate the effectiveness of our method on different facial attributes, including hair color and style, expression, and others. Code: https://github.com/VIROBO-15/Efficient-3D-Aware-Facial-Image-Editing.

Submitted 6 June, 2024; originally announced June 2024.

arXiv:2406.03747 [pdf, other]

Instance Segmentation and Teeth Classification in Panoramic X-rays

Authors: Devichand Budagam, Ayush Kumar, Sayan Ghosh, Anuj Shrivastav, Azamat Zhanatuly Imanbayev, Iskander Rafailovich Akhmetov, Dmitrii Kaplun, Sergey Antonov, Artem Rychenkov, Gleb Cyganov, Aleksandr Sinitca

Abstract: Teeth segmentation and recognition are critical in various dental applications and dental diagnosis. Automatic and accurate segmentation approaches have been made possible by integrating deep learning models. Although teeth segmentation has been studied in the past, only some techniques were able to effectively classify and segment teeth simultaneously. This article offers a pipeline of two deep learning models, U-Net and YOLOv8, which results in BB-UNet, a new architecture for the classification and segmentation of teeth on panoramic X-rays that is efficient and reliable. We have improved the quality and reliability of teeth segmentation by utilising the YOLOv8 and U-Net capabilities. The proposed networks have been evaluated using the mean average precision (mAP) and dice coefficient for YOLOv8 and BB-UNet, respectively. We have achieved a 3% increase in mAP score for teeth classification compared to existing methods, and a 10-15% increase in dice coefficient for teeth segmentation compared to U-Net across different categories of teeth. A new dental dataset was created based on the UFBA-UESC dataset with bounding-box and polygon annotations of 425 dental panoramic X-rays. The findings of this research pave the way for a wider adoption of object detection models in the field of dental diagnosis.

Submitted 6 June, 2024; originally announced June 2024.

Comments: Submitted to Expert Systems with Applications Journal

arXiv:2406.02334 [pdf, other]

$\textit{Kilonova Seekers}$: the GOTO project for real-time citizen science in time-domain astrophysics

Authors: T. L. Killestein, L. Kelsey, E. Wickens, L. Nuttall, J. Lyman, C. Krawczyk, K. Ackley, M. J. Dyer, F. Jiménez-Ibarra, K. Ulaczyk, D. O'Neill, A. Kumar, D. Steeghs, D. K. Galloway, V. S. Dhillon, P. O'Brien, G. Ramsay, K. Noysena, R. Kotak, R. P. Breton, E. Pallé, D. Pollacco, S. Awiphan, S. Belkin, P. Chote , et al. (29 additional authors not shown)

Abstract: Time-domain astrophysics continues to grow rapidly, with the inception of new surveys drastically increasing data volumes. Democratised, distributed approaches to training sets for machine learning classifiers are crucial to make the most of this torrent of discovery -- with citizen science approaches proving effective at meeting these requirements. In this paper, we describe the creation of and the initial results from the $\textit{Kilonova Seekers}$ citizen science project, built to find transient phenomena from the GOTO telescopes in near real-time. $\textit{Kilonova Seekers}$ launched in July 2023 and received over 600,000 classifications from approximately 2,000 volunteers over the course of the LIGO-Virgo-KAGRA O4a observing run. During this time, the project has yielded 20 discoveries, generated a `gold-standard' training set of 17,682 detections for augmenting deep-learned classifiers, and measured the performance and biases of Zooniverse volunteers on real-bogus classification. This project will continue throughout the lifetime of GOTO, pushing candidates at ever-greater cadence, and directly facilitate the next-generation classification algorithms currently in development.

Submitted 4 June, 2024; originally announced June 2024.

Comments: 20 pages, 15 figures. Submitted to MNRAS

arXiv:2406.02246 [pdf, ps, other]

Tensor square and isoclinic extensions of multiplicative Lie algebras

Authors: Dev Karan Singh, Amit Kumar, Sumit Kumar Upadhyay, Shiv Datt Kumar

Abstract: In this paper, we discuss the capable and isoclinic properties of the tensor square in the context of multiplicative Lie algebras. We also developed the concept of isoclinic extensions and proved several results for multiplicative Lie algebras. Consequently, we demonstrate that covers of a multiplicative Lie algebra are mutually isoclinic.

Submitted 4 June, 2024; originally announced June 2024.

arXiv:2406.01155 [pdf, other]

Optical heterodyne microscopy of operating spin Hall nano-oscillator arrays

Authors: A. Alemán, A. A. Awad, S. Muralidhar, R. Khymyn, A. Kumar, A. Houshang, D. Hanstorp, J. Åkerman

Abstract: Optical heterodyne detection is a powerful technique for characterizing a wide range of physical excitations. Here, we use two types of optical heterodyne detection techniques (fundamental and parametric pumping) to microscopically characterize the high-frequency auto-oscillations of single and multiple nano-constriction spin Hall nano-oscillators (SHNOs). To validate the technique and demonstrate its robustness, we study SHNOs made from two different material stacks, NiFe/Pt and W/CoFeB/MgO, and investigate the influence of both the RF injection power and the laser power on the measurements, comparing the optical results to conventional electrical measurements. To demonstrate the key features of direct, non-invasive, submicron, spatial, and phase-resolved characterization of the SHNO magnetodynamics, we map out the auto-oscillation magnitude and phase of two phase-binarized SHNOs used in Ising Machines. This proof-of-concept platform establishes a strong foundation for further extensions, contributing to the ongoing development of crucial characterization techniques for emerging computing technologies based on spintronics devices.

Submitted 3 June, 2024; originally announced June 2024.

Comments: 7 pages

arXiv:2406.01132 [pdf, other]

Investigating a Device Independence Quantum Random Number Generation

Authors: Vardaan Mongia, Abhishek Kumar, Shashi Prabhakar, Anindya Banerji, R. P. Singh

Abstract: Quantum random number generation (QRNG) is a resource that is a necessity in the field of cryptography. However, its certification has been challenging. In this article, we certify randomness with the aid of quantum entanglement in a device independent setting, where we choose two-photon interference for source characterisation. The CHSH inequality violation and quantum state tomography are used as independent checks on the measurement devices. These measures ensure the unpredictability of quantum random number generation. This work can be easily extended to faster randomness expansion protocols.

Submitted 3 June, 2024; originally announced June 2024.

Comments: Comments and suggestions are welcomed

arXiv:2406.01000 [pdf, other]

doi 10.1016/j.asr.2024.06.040

Seasonal variation in nighttime NO radiative cooling as observed by TIMED/SABER in lower thermosphere during solar maximum and solar minimum

Authors: Alok Kumar Ranjan, MV Sunil Krishna, Akash Kumar, Dayakrishna Nailwal, Sumanta Sarkhel

Abstract: Both composition and temperature play a crucial role in determining the NO radiative cooling in the lower thermosphere as observed by TIMED/SABER. In this work, we present a detailed investigation of seasonal variation in thermospheric NO radiative cooling. We have carried forward the investigation of \cite{li2018} regarding the variations in local nighttime peak NO radiative cooling and its altitude during solar maximum and solar minimum conditions. By analyzing latitudinal changes over quiet times for each month in year 2018, it is evident that both the investigative parameters exhibit summer-winter variability. The qualitative contribution of different species (i.e., NO, and O), and temperatures in determining the vertical profile of NO radiative cooling for different latitudes is investigated by utilizing the NRLMSISE-00 estimated parameters, and SNOE observed NO density. The temperature, NO density, meridional wind, and associated compositional variations due to asymmetrical solar heating in both the hemispheres during solar minimum conditions seem to be the dominating factors in controlling the NO radiative cooling during different seasons. The altitudes at which maximum cooling by NO occurs exhibit an inverse correlation with the amount of radiative cooling. The regions of enhanced NO densities (polar and summer hemispheric low-mid latitude regions) have larger NO radiative cooling with lower peak altitudes in comparison to other regions (equatorial to winter hemispheric low-mid latitude regions), where NO radiative cooling is low with higher peak altitude values.

Submitted 3 June, 2024; originally announced June 2024.

Comments: 19 pages, 10 figures

arXiv:2406.00905 [pdf, other]

Exploration of mass splitting and muon/tau mixing parameters for an eV-scale sterile neutrino with IceCube

Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi, et al. (400 additional authors not shown)

Abstract: We present the first three-parameter fit to a 3+1 sterile neutrino model using 7.634 years of data from the IceCube Neutrino Observatory on $\nu_\mu+\overline{\nu}_\mu$ charged-current interactions in the energy range 500-9976 GeV. Our analysis is sensitive to the mass-squared splitting between the heaviest and lightest mass state ($\Delta m_{41}^2$), the mixing matrix element connecting muon flavor to the fourth mass state ($|U_{\mu4}|^2$), and the element connecting tau flavor to the fourth mass state ($|U_{\tau4}|^2$). Predicted propagation effects in matter enhance the signature through a resonance as atmospheric neutrinos from the Northern Hemisphere traverse the Earth to the IceCube detector at the South Pole. The result is consistent with the no-sterile neutrino hypothesis with a probability of 4.3 %. Profiling the likelihood of each parameter yields the 90 % confidence levels: $2.4\,\mathrm{eV}^{2} < \Delta m_{41}^2 < 9.6\,\mathrm{eV}^{2}$, $0.0081 < |U_{\mu4}|^2 < 0.10$, and $|U_{\tau4}|^2 < 0.035$, which narrows the allowed parameter-space for $|U_{\tau4}|^2$. However, the primary result of this analysis is the first map of the 3+1 parameter space exploring the interdependence of $\Delta m_{41}^2$, $|U_{\mu4}|^2$, and $|U_{\tau4}|^2$.

Submitted 2 June, 2024; originally announced June 2024.

arXiv:2406.00724 [pdf, other]

Exploring Child-Robot Interaction in Individual and Group settings in India

Authors: Gayathri Manikutty, Sai Ankith Potapragada, Devasena Pasupuleti, Mahesh S. Unnithan, Arjun Venugopal, Pranav Prabha, Arunav H., Vyshnavi Anil Kumar, Rthuraj P. R., Rao R Bhavani

Abstract: This study evaluates the effectiveness of child-robot interactions with the HaKsh-E social robot in India, examining both individual and group interaction settings. The research centers on game-based interactions designed to teach hand hygiene to children aged 7-11. Utilizing video analysis, rubric assessments, and post-study questionnaires, the study gathered data from 36 participants. Findings indicate that children in both settings developed positive perceptions of the robot in terms of the robot's trustworthiness, closeness, and social support. The significant difference in the interaction level scores presented in the study suggests that group settings foster higher levels of interaction, potentially due to peer influence and collaborative dynamics. While both settings showed significant improvements in learning outcomes, the individual setting had more pronounced learning gains. This suggests that personal interactions with the robot might lead to deeper or more effective learning experiences. Consequently, this study concludes that individual interaction settings are more conducive for focused learning gains, while group settings enhance interaction and engagement.

Submitted 4 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

Comments: 6 pages, 6 figures, Accepted for presentation at ICRAS 2024 (https://www.icras.org/)

arXiv:2406.00071 [pdf]

doi 10.52783/jes.4079

Optimizing Photometric Light Curve Analysis: Evaluating Scipy's Minimize Function for Eclipse Mapping of Cataclysmic Variables

Authors: Anoop Kumar, Madan Mohan Tito Ayyalasomayajula, Dheerendra Panwar, Yeshwanth Vasa

Abstract: With a particular focus on Scipy's minimize function, the eclipse mapping method is thoroughly researched and implemented utilizing Python and essential libraries. Many optimization techniques are used, including Sequential Least Squares Programming (SLSQP), Nelder-Mead, and Conjugate Gradient (CG). For the purpose of examining photometric light curves, these methods seek to solve the maximum entropy equation under a chi-squared constraint. Therefore, these techniques are first evaluated on two-dimensional Gaussian data without a chi-squared restriction, and then they are used to map the accretion disc and uncover the Gaussian structure of the Cataclysmic Variable KIC 201325107. Critical analysis is performed on the code structure to find possible faults and design problems. Additionally, the analysis shows how several factors affect computing time and image quality, including the variance in Gaussian weighting, disc image resolution, number of data points in the light curve, and degree of constraint.

Submitted 30 May, 2024; originally announced June 2024.
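
As an illustration of the first evaluation step described in this abstract (running the three named methods on two-dimensional Gaussian data with no chi-squared constraint), here is a minimal sketch. It is not the authors' code: the grid, the true parameters, the starting guess, and the least-squares objective are all assumptions chosen for the example, and the maximum entropy term is left out entirely.

```python
import numpy as np
from scipy.optimize import minimize

# Hypothetical 21x21 pixel grid standing in for a small image.
grid = np.linspace(-5.0, 5.0, 21)
xx, yy = np.meshgrid(grid, grid)


def gaussian_2d(params):
    """Circular 2D Gaussian with centre (x0, y0) and width sigma."""
    x0, y0, sigma = params
    return np.exp(-((xx - x0) ** 2 + (yy - y0) ** 2) / (2.0 * sigma ** 2))


# Noise-free synthetic data generated from assumed "true" parameters.
true_params = np.array([1.0, -0.5, 1.5])
data = gaussian_2d(true_params)


def sum_squared_residuals(params):
    """Unconstrained objective: squared misfit between model and data."""
    return float(np.sum((gaussian_2d(params) - data) ** 2))


start = np.array([0.5, 0.0, 2.0])  # assumed starting guess

# Run the same objective through the three methods named in the abstract.
fits = {
    method: minimize(sum_squared_residuals, start, method=method)
    for method in ("Nelder-Mead", "CG", "SLSQP")
}

for method, result in fits.items():
    print(method, result.x, result.fun, result.nfev)
```

Comparing `result.nfev` and `result.fun` across the three entries gives a rough feel for the cost and accuracy trade-off between a derivative-free simplex search (Nelder-Mead) and the two gradient-based methods, which here fall back on finite-difference gradients because no `jac` is supplied.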

arXiv:2406.00010 [pdf, other]

EnterpriseEM: Fine-tuned Embeddings for Enterprise Semantic Search

Authors: Kamalkumar Rathinasamy, Jayarama Nettar, Amit Kumar, Vishal Manchanda, Arun Vijayakumar, Ayush Kataria, Venkateshprasanna Manjunath, Chidambaram GS, Jaskirat Singh Sodhi, Shoeb Shaikh, Wasim Akhtar Khan, Prashant Singh, Tanishq Dattatray Ige, Vipin Tiwari, Rajab Ali Mondal, Harshini K, S Reka, Chetana Amancharla, Faiz ur Rahman, Harikrishnan P A, Indraneel Saha, Bhavya Tiwary, Navin Shankar Patel, Pradeep T S, Balaji A J, et al. (2 additional authors not shown)

Abstract: Enterprises grapple with the significant challenge of managing proprietary unstructured data, hindering efficient information retrieval. This has led to the emergence of AI-driven information retrieval solutions, designed to adeptly extract relevant insights to address employee inquiries. These solutions often leverage pre-trained embedding models and generative models as foundational components. While pre-trained embeddings may exhibit proximity or disparity based on their original training objectives, they might not fully align with the unique characteristics of enterprise-specific data, leading to suboptimal alignment with the retrieval goals of enterprise environments. In this paper, we propose a methodology to fine-tune pre-trained embedding models specifically for enterprise environments. By adapting the embeddings to better suit the retrieval tasks prevalent in enterprises, we aim to enhance the performance of information retrieval solutions. We discuss the process of fine-tuning, its effect on retrieval accuracy, and the potential benefits for enterprise information management. Our findings demonstrate the efficacy of fine-tuned embedding models in improving the precision and relevance of search results in enterprise settings.

Submitted 18 May, 2024; originally announced June 2024.

ACM Class: I.2.7

arXiv:2405.20989 [pdf, other]

Unravelling the asphericities in the explosion and multi-faceted circumstellar matter of SN 2023ixf

Authors: Avinash Singh, R. S. Teja, T. J. Moriya, K. Maeda, K. S. Kawabata, M. Tanaka, R. Imazawa, T. Nakaoka, A. Gangopadhyay, M. Yamanaka, V. Swain, D. K. Sahu, G. C. Anupama, B. Kumar, R. M. Anche, Y. Sano, A. Raj, V. K. Agnihotri, V. Bhalerao, D. Bisht, M. S. Bisht, K. Belwal, S. K. Chakrabarti, M. Fujii, T. Nagayama, et al. (11 additional authors not shown)

Abstract: We present a detailed investigation of photometric, spectroscopic, and polarimetric observations of the Type II SN 2023ixf. The early detection of highly-ionized flash features, rapid ascent in ultraviolet flux coupled with the blueward shift in near-ultraviolet colors and temperature provides compelling evidence for a delayed shock breakout from a confined dense circumstellar matter (CSM) enveloping the progenitor star. The temporal evolution of polarization in the SN 2023ixf phase revealed three distinct peaks in polarization evolution at 1.4 d, 6.4 d, and 79.2 d, indicating an asymmetric dense CSM, an aspherical shock front and clumpiness in the low-density extended CSM, and an aspherical inner ejecta/He-core. SN 2023ixf displayed two dominant axes, one along the CSM-outer ejecta and the other along the inner ejecta/He-core, showcasing the independent origin of asymmetry in the early and late evolution. The argument for an aspherical shock front is further strengthened by the presence of a high-velocity broad absorption feature in the blue wing of the Balmer features in addition to the P-Cygni absorption post 16 d. Hydrodynamical light curve modeling indicated a progenitor mass of 10 solar masses with a radius of 470 solar radii, explosion energy of 2e51 erg, and 0.06 solar masses of 56Ni. The modeling also indicated a two-zone CSM: a confined dense CSM extending up to 5e14 cm, with a mass-loss rate of 1e-2 solar masses per year, and an extended CSM spanning from 5e14 cm to 1e16 cm with a mass-loss rate of 1e-4 solar masses per year. The early nebular phase observations display an axisymmetric line profile of [OI] and red-ward attenuation of the emission of Halpha post 125 days, marking the onset of dust formation.

Submitted 31 May, 2024; originally announced May 2024.

Comments: 30 pages, 14 figures, 1 Table, Submitted to AAS Journals

arXiv:2405.20755 [pdf]

Improving code-mixed hate detection by native sample mixing: A case study for Hindi-English code-mixed scenario

Authors: Debajyoti Mazumder, Aakash Kumar, Jasabanta Patro

Abstract: Hate detection has long been a challenging task for the NLP community. The task becomes complex in a code-mixed environment because the models must understand the context and the hate expressed through language alteration. Compared to the monolingual setup, we see very little work on code-mixed hate, as large-scale annotated hate corpora are unavailable for such studies. To overcome this bottleneck, we propose using native language hate samples. We hypothesise that in the era of multilingual language models (MLMs), hate in code-mixed settings can be detected by relying mainly on the native language samples. Even though the NLP literature reports the effectiveness of MLMs on hate detection in many cross-lingual settings, their extensive evaluation in a code-mixed scenario is yet to be done. This paper attempts to fill this gap through rigorous empirical experiments. We considered the Hindi-English code-mixed setup as a case study as we have the linguistic expertise for the same. Some of the interesting observations we got are: (i) adding native hate samples to the code-mixed training set, even in small quantity, improved the performance of MLMs for code-mixed hate detection, (ii) MLMs trained with native samples alone were observed to detect code-mixed hate to a large extent, (iii) the visualisation of attention scores revealed that, when native samples were included in training, MLMs could better focus on the hate-emitting words in the code-mixed context, and (iv) finally, when hate is subjective or sarcastic, naively mixing native samples doesn't help much to detect code-mixed hate. We will release the data and code repository to reproduce the reported results.

Submitted 31 May, 2024; originally announced May 2024.

Comments: Generated from XeLaTeX

arXiv:2405.20402 [pdf, other]

Cross-Talk Reduction

Authors: Zhong-Qiu Wang, Anurag Kumar, Shinji Watanabe

Abstract: While far-field multi-talker mixtures are recorded, each speaker can wear a close-talk microphone so that close-talk mixtures can be recorded at the same time. Although each close-talk mixture has a high signal-to-noise ratio (SNR) of the wearer, it has a very limited range of applications, as it also contains significant cross-talk speech by other speakers and is not clean enough. In this context, we propose a novel task named cross-talk reduction (CTR) which aims at reducing cross-talk speech, and a novel solution named CTRnet which is based on unsupervised or weakly-supervised neural speech separation. In unsupervised CTRnet, close-talk and far-field mixtures are stacked as input for a DNN to estimate the close-talk speech of each speaker. It is trained in an unsupervised, discriminative way such that the DNN estimate for each speaker can be linearly filtered to cancel out the speaker's cross-talk speech captured at other microphones. In weakly-supervised CTRnet, we assume the availability of each speaker's activity timestamps during training, and leverage them to improve the training of unsupervised CTRnet. Evaluation results on a simulated two-speaker CTR task and on a real-recorded conversational speech separation and recognition task show the effectiveness and potential of CTRnet.

Submitted 30 May, 2024; originally announced May 2024.

Comments: in International Joint Conference on Artificial Intelligence (IJCAI), 2024

arXiv:2405.19927 [pdf, ps, other]

Adsorption of Mo and O at S-vacancy on ReS2 surface of ReS2/MoTe2 vdW heterointerface

Authors: Puneet Kumar Shaw, Jehan Taraporewalla, Sohaib Raza, Akash Kumar, Rimisha Duttagupta, Hafizur Rahaman, Dipankar Saha

Abstract: Applications like high density information storage, neuromorphic computing, nanophotonics, etc. require ultra-thin electronic devices which can be controlled with applied electric field. Of late, atomically thin two-dimensional (2D) materials and van der Waals (vdW) heterointerface of those have emerged as suitable candidates for such ultra-low power nanoelectric devices. In this work, employing density functional theory (DFT), the monolayer ReS2 / monolayer MoTe2 vdW heterostructure with Sulphur vacancy is studied to examine various ground state electronic properties. Changes in effective band gap owing to defect-induced states and modulation of the energy gap value with Molybdenum (Mo) and Oxygen (O) adsorption at the defect site are examined. Since two-dimensional (2D) material based nanoscaled devices exhibit promising switching between non-conducting and conducting states, determining the role of defect-induced states and the adsorption of atoms/molecules on surfaces is crucial. Here, a detailed theoretical study to determine surface properties and relative energetic stability of the vdW heterostructures is carried out. The charge re-distribution between the constituent layers is also analyzed by obtaining Electron Difference Density (EDD) for different heterointerfaces. Nonetheless, the efficacy of switching between non-conducting and conducting states is assessed based on adsorption energy of adatoms binding at the defect site.

Submitted 30 May, 2024; originally announced May 2024.

Comments: 21 pages | 10 figures

arXiv:2405.19815 [pdf, other]

Efficient Stimuli Generation using Reinforcement Learning in Design Verification

Authors: Deepak Narayan Gadde, Thomas Nalapat, Aman Kumar, Djones Lettnin, Wolfgang Kunz, Sebastian Simon

Abstract: The increasing design complexity of System-on-Chips (SoCs) has led to significant verification challenges, particularly in meeting coverage targets within a timely manner. At present, coverage closure is heavily dependent on constrained random and coverage driven verification methodologies where the randomized stimuli are bounded to verify certain scenarios and to reach coverage goals. This process is said to be exhaustive and to consume a lot of project time. In this paper, a novel methodology is proposed to generate efficient stimuli with the help of Reinforcement Learning (RL) to reach the maximum code coverage of the Design Under Verification (DUV). Additionally, an automated framework is created using metamodeling to generate a SystemVerilog testbench and an RL environment for any given design. The proposed approach is applied to various designs and the produced results prove that the RL agent provides effective stimuli to achieve code coverage faster in comparison with baseline random simulations. Furthermore, various RL agents and reward schemes are analyzed in our work.

Submitted 30 May, 2024; originally announced May 2024.

Comments: Accepted for publication at the 20th International Conference on Synthesis, Modeling, Analysis and Simulation Methods, and Applications to Circuit Design (SMACD'24), Jul 2-5 2024, Volos, Greece

arXiv:2405.18989 [pdf]

Classification analysis of transition-metal chalcogenides and oxides using quantum machine learning

Authors: Kurudi V Vedavyasa, Ashok Kumar

Abstract: Quantum machine learning (QML) leverages the potential from machine learning to explore the subtle patterns in huge datasets of complex nature with quantum advantages. This exponentially reduces the time and resources necessary for computations. QML accelerates materials research with active screening of chemical space, identifying novel materials for practical applications and classifying structurally diverse materials given their measured properties. This study analyzes the performance of three efficient quantum machine learning algorithms, viz. variational quantum eigensolver (VQE), quantum support vector machine (QSVM) and quantum neural networks (QNN), for the classification of transition metal chalcogenides and oxides (TMCs & TMOs). The analysis is performed on three datasets of different sizes containing 102, 192 and 350 materials with TMCs and TMOs labelled as +1 and -1 respectively. By employing feature selection, classical machine learning achieves 100% accuracy whereas QML achieves the highest performance of 99% and 98% for test and train data respectively on QSVC. This study establishes the competence of QML models in materials classification and explores the quantum circuits in terms of over-fitting using the circuit descriptors expressibility and entangling capability. In addition, perspectives on QML in materials research with noisy intermediate scale quantum (NISQ) devices are given.

Submitted 29 May, 2024; originally announced May 2024.

Comments: 29 pages, 5 figures, 1 table

arXiv:2405.18304 [pdf, other]

Multi-modal Generation via Cross-Modal In-Context Learning

Authors: Amandeep Kumar, Muzammal Naseer, Sanath Narayan, Rao Muhammad Anwer, Salman Khan, Hisham Cholakkal

Abstract: In this work, we study the problem of generating novel images from complex multimodal prompt sequences. While existing methods achieve promising results for text-to-image generation, they often struggle to capture fine-grained details from lengthy prompts and maintain contextual coherence within prompt sequences. Moreover, they often result in misaligned image generation for prompt sequences featuring multiple objects. To address this, we propose a Multi-modal Generation via Cross-Modal In-Context Learning (MGCC) method that generates novel images from complex multimodal prompt sequences by leveraging the combined capabilities of large language models (LLMs) and diffusion models. Our MGCC comprises a novel Cross-Modal Refinement module to explicitly learn cross-modal dependencies between the text and image in the LLM embedding space, and a contextual object grounding module to generate object bounding boxes specifically targeting scenes with multiple objects. Our MGCC demonstrates a diverse range of multimodal capabilities, like novel image generation, the facilitation of multimodal dialogue, and generation of texts. Experimental evaluations on two benchmark datasets demonstrate the effectiveness of our method. On Visual Story Generation (VIST) dataset with multimodal inputs, our MGCC achieves a CLIP Similarity score of $0.652$ compared to SOTA GILL $0.641$. Similarly, on Visual Dialogue Context (VisDial) having lengthy dialogue sequences, our MGCC achieves an impressive CLIP score of $0.660$, largely outperforming existing SOTA method scoring $0.645$.
Code: https://github.com/VIROBO-15/MGCC

Submitted 28 May, 2024; originally announced May 2024.

Comments: Technical Report

arXiv:2405.17401 [pdf, other]

RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control

Authors: Litu Rout, Yujia Chen, Nataniel Ruiz, Abhishek Kumar, Constantine Caramanis, Sanjay Shakkottai, Wen-Sheng Chu

Abstract: We propose Reference-Based Modulation (RB-Modulation), a new plug-and-play solution for training-free personalization of diffusion models. Existing training-free approaches exhibit difficulties in (a) style extraction from reference images in the absence of additional style or content text descriptions, (b) unwanted content leakage from reference style images, and (c) effective composition of style and content. RB-Modulation is built on a novel stochastic optimal controller where a style descriptor encodes the desired attributes through a terminal cost. The resulting drift not only overcomes the difficulties above, but also ensures high fidelity to the reference style and adheres to the given text prompt. We also introduce a cross-attention-based feature aggregation scheme that allows RB-Modulation to decouple content and style from the reference image. With theoretical justification and empirical evidence, our framework demonstrates precise extraction and control of content and style in a training-free manner. Further, our method allows a seamless composition of content and style, which marks a departure from the dependency on external adapters or ControlNets.

Submitted 27 May, 2024; originally announced May 2024.

Comments: Preprint. Under review

arXiv:2405.16282 [pdf, other]

Confidence Under the Hood: An Investigation into the Confidence-Probability Alignment in Large Language Models

Authors: Abhishek Kumar, Robert Morabito, Sanzhar Umbet, Jad Kabbara, Ali Emami

Abstract: As the use of Large Language Models (LLMs) becomes more widespread, understanding their self-evaluation of confidence in generated responses becomes increasingly important as it is integral to the reliability of the output of these models. We introduce the concept of Confidence-Probability Alignment, that connects an LLM's internal confidence, quantified by token probabilities, to the confidence conveyed in the model's response when explicitly asked about its certainty. Using various datasets and prompting techniques that encourage model introspection, we probe the alignment between models' internal and expressed confidence. These techniques encompass using structured evaluation scales to rate confidence, including answer options when prompting, and eliciting the model's confidence level for outputs it does not recognize as its own. Notably, among the models analyzed, OpenAI's GPT-4 showed the strongest confidence-probability alignment, with an average Spearman's $\hat{\rho}$ of 0.42, across a wide range of tasks. Our work contributes to the ongoing efforts to facilitate risk assessment in the application of LLMs and to further our understanding of model trustworthiness.

Submitted 15 June, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

Comments: 9 pages (excluding references), accepted to ACL 2024 Main Conference
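
The alignment measure quoted in this abstract is a Spearman rank correlation between two per-question quantities: the probability the model assigns to its answer and the confidence it states when asked. The sketch below shows how such a coefficient can be computed from scratch. The function names and the sample numbers are hypothetical, and nothing here reproduces the paper's prompts, scales, or data.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def spearman_rho(x, y):
    """Spearman's rho: Pearson correlation of the rank-transformed data."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical example: answer-token probabilities vs. stated confidence (1-5).
token_probabilities = [0.91, 0.55, 0.78, 0.30, 0.62]
stated_confidence = [5, 4, 4, 2, 3]
print(spearman_rho(token_probabilities, stated_confidence))
```

Because the coefficient depends only on ranks, it is insensitive to the fact that the two quantities live on different scales (a probability in [0, 1] versus a discrete rating).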

arXiv:2405.14555 [pdf, other]

Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models

Authors: Abhishek Kumar, Sarfaroz Yunusov, Ali Emami

Abstract: Research on Large Language Models (LLMs) has often neglected subtle biases that, although less apparent, can significantly influence the models' outputs toward particular social narratives. This study addresses two such biases within LLMs: representative bias, which denotes a tendency of LLMs to generate outputs that mirror the experiences of certain identity groups, and affinity bias, reflecting the models' evaluative preferences for specific narratives or viewpoints. We introduce two novel metrics to measure these biases: the Representative Bias Score (RBS) and the Affinity Bias Score (ABS), and present the Creativity-Oriented Generation Suite (CoGS), a collection of open-ended tasks such as short story writing and poetry composition, designed with customized rubrics to detect these subtle biases. Our analysis uncovers marked representative biases in prominent LLMs, with a preference for identities associated with being white, straight, and men. Furthermore, our investigation of affinity bias reveals distinctive evaluative patterns within each model, akin to 'bias fingerprints'. This trend is also seen in human evaluators, highlighting a complex interplay between human and machine bias perceptions.

Submitted 3 June, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

Comments: 9 pages (excluding references), accepted to ACL 2024 Main Conference

arXiv:2405.13856 [pdf, ps, other]

In-beam $\gamma$-spectroscopy of the transitional nucleus $^{217}$Ac

Authors: Dhananjaya Sahoo, A. Y. Deo, Madhu, Khamosh Yadav, S. S. Tiwary, P. C. Srivastava, R. Palit, S. K. Tandel, Anil Kumar, P. Dey, Biswajit Das, Vishal Malik, A. Kundu, A. Sindhu, S. V. Jadhav, B. S. Naidu, A. V. Thomas

Abstract: High-spin states in the transitional $^{217}$Ac nucleus are established up to 3.8 MeV excitation energy and $I^\pi = 41/2^+$ with the addition of around 20 new transitions. The structure of the yrast and near-yrast states below the $29/2^+$ isomer is revisited. The inconsistencies in the level schemes reported earlier are resolved. The level structure above the $29/2^+$ isomer is established for the first time. Large-basis shell-model calculations with the KHPE interaction are performed to compare the experimentally observed level energies with the theoretical predictions. A comparison with the systematics of the $N = 128$ isotones suggests that the yrast structures result from a weak coupling of the odd proton to the even-even $^{216}$Ra core, which is consistent with the shell-model configurations. Furthermore, alpha decay of the $29/2^+$ isomer is revisited and the decay scheme established from this work is discussed in the framework of the shell model.

Submitted 22 May, 2024; originally announced May 2024.

Comments: 11 pages, 9 figures

arXiv:2405.12988 [pdf, other]

Prediction of Cryptocurrency Prices through a Path Dependent Monte Carlo Simulation

Authors: Ayush Singh, Anshu K. Jha, Amit N. Kumar

Abstract: In this paper, our focus lies on Merton's jump diffusion model, employing jump processes characterized by the compound Poisson process. Our primary objective is to forecast the drift and volatility of the model using a variety of methodologies. We adopt an approach that involves implementing different drift, volatility, and jump terms within the model through various machine learning techniques, traditional methods, and statistical methods on price-volume data. Additionally, we introduce a path-dependent Monte Carlo simulation to model cryptocurrency prices, taking into account the volatility and unexpected jumps in prices.

Submitted 10 April, 2024; originally announced May 2024.

Comments: 21 pages

arXiv:2405.11934 [pdf, other]

doi 10.1021/acs.jpclett.4c00215

Elucidating the role of electron transfer in the photoluminescence of $\mathrm{MoS_{2}}$ quantum dots synthesized by fs-pulse ablation

Authors: Anubhab Sahoo, Tejendra Dixit, K. V. Anil Kumar, K. Lakshmi Ganapathi, Pramoda K. Nayak, M. S. Ramachandra Rao, Sivarama Krishnan

Abstract: Herein, $\mathrm{MoS_{2}}$ quantum dots (QDs) with controlled optical, structural, and electronic properties are synthesized using the femtosecond pulsed laser ablation in liquid (fs-PLAL) technique by varying pulse-width, ablation power, and ablation time to harness the potential for next-generation optoelectronics and quantum technology. Furthermore, this work elucidates key aspects of the mechanisms underlying the near-UV and blue emission, the accompanying large Stokes-shift, and the consequent change in sample color with laser exposure parameters pertaining to $\mathrm{MoS_{2}}$ QDs. Through spectroscopic analysis, including UV-visible absorption, photoluminescence, and Raman spectroscopy, we successfully unravelled the mechanisms for the change in optoelectronic properties of $\mathrm{MoS_{2}}$ QDs with laser parameters. We realize that the occurrence of a secondary phase, specifically $\mathrm{MoO_{3-x}}$, is responsible for the significant Stokes-shift and blue emission observed in this QD system. The primary factor influencing these activities is the electron transfer observed between these two phases, as validated by excitation-dependent photoluminescence, XPS and Raman spectroscopies.

Submitted 20 May, 2024; originally announced May 2024.

arXiv:2405.10245 [pdf, ps, other]

A Graph-Theoretical Framework to Analyse Zero Discord Quantum States

Authors: Anoopa Joshi, Parvinder Singh, Atul Kumar

Abstract: This article comprehensively explores matrices and their prerequisites for achieving positive semidefiniteness. The study delves into a series of theorems concerning pure quantum states in the context of weighted graphs. The main objective of this study is to establish a graph-theoretic framework for the study of quantum discord and to identify the necessary and sufficient conditions for zero quantum discord states using unitary operators. This research aims to advance the understanding of quantum discord and its implications for quantum information theory with a graph-theoretic framework.

Submitted 17 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

arXiv:2405.09898 [pdf]

NH3 gas sensing over 2D Phosphorene sheet: A First-Principles Study

Authors: Naresh Kumar, Yogendra K. Gautam, Soni Mishra, Anuj Kumar, Abhishek Kumar Mishra

Abstract: First-principles based calculations were executed to investigate the sensing properties of ammonia gas molecules on two-dimensional pristine black phosphorene towards its application as a gas sensor and related applications. We discuss in detail the interaction of ammonia gas molecules on the phosphorene single sheet through the structural change analysis, electronic band gap, Bader charge transfer, and density-of-states calculations. Our calculations indicate that the phosphorene could be used as a detector of ammonia, where good sensitivity and very short recovery time at room temperature have confirmed the potential use of phosphorene in the detection of ammonia.

Submitted 16 May, 2024; originally announced May 2024.

Comments: 21 pages, 8 figures

arXiv:2405.09288 [pdf, other]

DeCoDEx: Confounder Detector Guidance for Improved Diffusion-based Counterfactual Explanations

Authors: Nima Fathi, Amar Kumar, Brennan Nichyporuk, Mohammad Havaei, Tal Arbel

Abstract: Deep learning classifiers are prone to latching onto dominant confounders present in a dataset rather than on the causal markers associated with the target class, leading to poor generalization and biased predictions. Although explainability via counterfactual image generation has been successful at exposing the problem, bias mitigation strategies that permit accurate explainability in the presence of dominant and diverse artifacts remain unsolved. In this work, we propose the DeCoDEx framework and show how an external, pre-trained binary artifact detector can be leveraged during inference to guide a diffusion-based counterfactual image generator towards accurate explainability. Experiments on the CheXpert dataset, using both synthetic artifacts and real visual artifacts (support devices), show that the proposed method successfully synthesizes the counterfactual images that change the causal pathology markers associated with Pleural Effusion while preserving or ignoring the visual artifacts. Augmentation of ERM and Group-DRO classifiers with the DeCoDEx generated images substantially improves the results across underrepresented groups that are out of distribution for each class. The code is made publicly available at https://github.com/NimaFathi/DeCoDEx.

Submitted 15 May, 2024; originally announced May 2024.

Comments: Accepted to Medical Imaging with Deep Learning (MIDL) 2024

arXiv:2405.08077 [pdf, other]

Methods and stability tests associated with the sterile neutrino search using improved high-energy $ν_μ$ event reconstruction in IceCube

Authors: IceCube Collaboration, R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise , et al. (398 additional authors not shown)

Abstract: We provide supporting details for the search for a 3+1 sterile neutrino using data collected over eleven years at the IceCube Neutrino Observatory. The analysis uses atmospheric muon-flavored neutrinos from 0.5 to 100\, TeV that traverse the Earth to reach the IceCube detector, and finds a best-fit point at $\sin^2(2θ_{24}) = 0.16$ and $Δm^{2}_{41} = 3.5$ eV$^2$ with a goodness-of-fit p-value of 12\% and consistency with the null hypothesis of no oscillations to sterile neutrinos with a p-value of 3.1\%. Several improvements were made over past analyses, which are reviewed in this article, including upgrades to the reconstruction and the study of sources of systematic uncertainty. We provide details of the fit quality and discuss stability tests that split the data for separate samples, comparing results. We find that the fits are consistent between split data sets.

Submitted 13 May, 2024; originally announced May 2024.

Comments: 18 pages, 17 figures, 2 tables. This long-form paper is a companion to the letter "A search for an eV-scale sterile neutrino using improved high-energy νμ event reconstruction in IceCube."

arXiv:2405.08070 [pdf, other]

A search for an eV-scale sterile neutrino using improved high-energy $ν_μ$ event reconstruction in IceCube

Authors: IceCube Collaboration, R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise , et al. (398 additional authors not shown)

Abstract: This Letter presents the result of a 3+1 sterile neutrino search using 10.7 years of IceCube data. We analyze atmospheric muon neutrinos that traverse the Earth with energies ranging from 0.5 to 100 TeV, incorporating significant improvements in modeling neutrino flux and detector response compared to earlier studies. Notably, for the first time, we categorize data into starting and through-going events, distinguishing neutrino interactions with vertices inside or outside the instrumented volume, to improve energy resolution. The best-fit point for a 3+1 model is found to be at $\sin^2(2θ_{24}) = 0.16$ and $Δm^{2}_{41} = 3.5$ eV$^2$, which agrees with previous iterations of this study. The result is consistent with the null hypothesis of no sterile neutrinos with a p-value of 3.1\%.

Submitted 13 May, 2024; originally announced May 2024.

Comments: 9 pages, 3 figures. This letter is supported by the long-form paper "Methods and stability tests associated with the sterile neutrino search using improved high-energy $ν_μ$ event reconstruction in IceCube," also appearing on arXiv

arXiv:2405.08015 [pdf, other]

A Methodology-Oriented Study of Catastrophic Forgetting in Incremental Deep Neural Networks

Authors: Ashutosh Kumar, Sonali Agarwal, D Jude Hemanth

Abstract: Human beings and different species of animals have the skills to gather and transfer knowledge, and to process, fine-tune, and generate information throughout their lifetime. The ability to learn throughout their lifespan is referred to as continuous learning, which uses neurocognitive mechanisms. Consequently, in real-world computational systems, incrementally learning autonomous agents also need such a continuous learning mechanism, which provides retrieval of information and long-term memory consolidation. However, the main challenge in artificial intelligence is the incremental learning of the autonomous agent when it is confronted with new data. In such scenarios, the main concern is catastrophic forgetting (CF), i.e., while learning sequentially, a neural network underfits the old data when it is confronted with new data. To tackle this CF problem, numerous studies have been proposed; however, it is very difficult to compare their performance due to dissimilarity in their evaluation mechanisms. Here we focus on the comparison of algorithms that have a similar type of evaluation mechanism. We compare three types of incremental learning methods: (1) exemplar-based methods, (2) memory-based methods, and (3) network-based methods. In this survey paper, a methodology-oriented study of catastrophic forgetting in incremental deep neural networks is addressed. Furthermore, it contains a mathematical overview of impactful methods, which can help researchers deal with CF.

Submitted 11 May, 2024; originally announced May 2024.

arXiv:2405.07701 [pdf, ps, other]

Phase separation in a binary mixture of sticky spheres

Authors: D. C. Thakur, Jalim Singh, A. V. Anil Kumar

Abstract: We numerically investigate the dependence of the phase separation of 2-D binary systems on the range of the attractive potential. Through extensive simulations and analysis, we show that when the range of attractive interactions approaches the sticky sphere limit, the system undergoes a phase separation at lower temperature. Further reduction in temperature causes the system to mix again. These mixing-demixing-mixing transitions are of first order. Such phase separation is not observed for systems with larger interaction range. In the phase-separated region of the phase diagram, one of the components of the mixture is in a crystalline configuration, while the other is in a disordered state.

Submitted 13 May, 2024; originally announced May 2024.

Comments: 7 pages, 7 figures

arXiv:2405.07362 [pdf, other]

Entanglement Dynamics in Quantum Continuous-Variable States

Authors: Ankit Kumar

Abstract: Due to the weakness of gravitational coupling, all quantum experiments to date in which gravity plays a role have utilized the field of the Earth. Since this field undergoes practically undetectable back-action from quantum particles, it effectively admits a classical description as a fixed background Newtonian field or spacetime. This argument strongly motivates theoretical and experimental research towards a demonstration of gravitation between two quantum masses, as this is one of the most straightforward scenarios where quantum features of gravity could be observed. Several proposals studied the possibility of generating entanglement between two massive objects. Along the same lines, with a particular focus on gravity, this thesis introduces general tools to tackle interaction-mediated entanglement and applies them to two particles prepared in continuous-variable states.

Submitted 15 May, 2024; v1 submitted 12 May, 2024; originally announced May 2024.

Comments: PhD Thesis. jointly supervised by: Prof. P. Arumugam (IIT Roorkee, India) and Prof. Tomasz Paterek (Uni. of Gdansk, Poland)

arXiv:2405.07079 [pdf, other]

Host-Based Allocators for Device Memory

Authors: Oren Bell, Ashwin Kumar, Chris Gill

Abstract: Memory allocation is a fairly mature field of computer science. However, we challenge a prevailing assumption in the literature over the last 50 years which, if reconsidered, necessitates a fundamental reevaluation of many classical memory management algorithms. We pose a model where the allocation algorithm runs on host memory but allocates device memory and so incurs the following constraint: the allocator can't read the memory it is allocating. This means we are unable to use boundary tags, which is a concept that has been ubiquitous in nearly every allocation algorithm. In this paper, we propose alternate algorithms to work around this constraint, and discuss in general the implications of this system model.

Submitted 11 May, 2024; originally announced May 2024.

Comments: 9 pages, 4 figures

arXiv:2405.06777 [pdf, other]

Multiple magnetic interactions and large inverse magnetocaloric effect in TbSi and TbSi$_{0.6}$Ge$_{0.4}$

Authors: Ajay Kumar, Prashant Singh, Andrew Doyle, Deborah L. Schlagel, Yaroslav Mudryk

Abstract: We present a comprehensive investigation of the electronic structure, magnetization, specific heat, and crystallography of TbSi (FeB structure type) and TbSi$_{0.6}$Ge$_{0.4}$ (CrB structure type) compounds. Both TbSi and TbSi$_{0.6}$Ge$_{0.4}$ exhibit two antiferromagnetic (AFM) transitions at T$_{\rm N1}\approx$ 58~K and 57~K, and T$_{\rm N2}\approx$ 36~K and 44~K, respectively, along with an onset of weak metamagnetic-like transition around 6~T between T$_{\rm N1}$ and T$_{\rm N2}$. High-resolution specific heat (C$_{\rm P}$) measurements show the second- and first-order nature of the magnetic transition at T$_{\rm N1}$ and T$_{\rm N2}$, respectively, for both samples. However, in the case of TbSi, the low-temperature (LT) AFM to high-temperature (HT) AFM transition takes place via an additional AFM phase at the intermediate temperature (IT), where both LT to IT AFM and IT to HT AFM phase transitions exhibit a first-order nature. Both TbSi and TbSi$_{0.6}$Ge$_{0.4}$ manifest significant magnetic entropy changes ($ΔS_{\rm M}$) of 9.6 and 11.6~J/kg-K, respectively, for $Δμ_0H$=7~T, at T$_{\rm N2}$. The HT AFM phase of TbSi$_{0.6}$Ge$_{0.4}$ is found to be more susceptible to the external magnetic field, causing a significant broadening in the peaks of $ΔS_{\rm M}$ curves at higher magnetic fields. Temperature and field-dependent specific heat data have been utilized to construct the complex H-T phase diagram of these compounds.
Furthermore, temperature-dependent x-ray diffraction measurements demonstrate substantial magnetostriction and anisotropic thermal expansion of the unit cell in both samples.

Submitted 10 May, 2024; originally announced May 2024.

Comments: Submitted on 10 May 2024

arXiv:2405.05243 [pdf, other]

Deep learning-based variational autoencoder for classification of quantum and classical states of light

Authors: Mahesh Bhupati, Abhishek Mall, Anshuman Kumar, Pankaj K. Jha

Abstract: Advancements in optical quantum technologies have been enabled by the generation, manipulation, and characterization of light, with identification based on its photon statistics. However, characterizing light and its sources through single photon measurements often requires efficient detectors and longer measurement times to obtain high-quality photon statistics. Here we introduce a deep learning-based variational autoencoder (VAE) method for classifying single photon added coherent state (SPACS), single photon added thermal state (SPATS), and mixed states between coherent/SPACS and thermal/SPATS of light. Our semisupervised learning-based VAE efficiently maps the photon statistics features of light to a lower dimension, enabling quasi-instantaneous classification with low average photon counts. The proposed VAE method is robust and maintains classification accuracy in the presence of losses inherent in an experiment, such as finite collection efficiency, non-unity quantum efficiency, finite number of detectors, etc. Additionally, leveraging the transfer learning capabilities of VAE enables successful classification of data of any quality using a single trained model. We envision that such a deep learning methodology will enable better classification of quantum light and light sources even in the presence of poor detection quality.

Submitted 8 May, 2024; originally announced May 2024.

arXiv:2405.04986 [pdf, other]

Constraining the core radius and density jumps inside Earth using atmospheric neutrino oscillations

Authors: Anuj Kumar Upadhyay, Anil Kumar, Sanjib Kumar Agarwalla, Amol Dighe

Abstract: Atmospheric neutrinos can act as a tool to probe the interior of Earth using weak interactions, and can provide information complementary to that obtained from gravitational and seismic measurements. While passing through Earth, multi-GeV neutrinos encounter Earth matter effects due to the coherent forward scattering with the ambient electrons, which alter the neutrino oscillation probabilities. These matter effects depend upon the density distribution of electrons inside Earth, and hence, can be used to determine the internal structure of Earth. In this work, we employ a five-layered model of Earth where the layer densities and radii are modified, keeping the mass and moment of inertia of Earth unchanged and respecting the hydrostatic equilibrium condition. We use the proposed INO-ICAL detector as an example of an atmospheric neutrino experiment that can distinguish between neutrinos and antineutrinos efficiently in the multi-GeV energy range. Our analysis demonstrates the role such an experiment can play in simultaneously constraining the density jumps inside Earth and the location of the core-mantle boundary.

Submitted 8 May, 2024; originally announced May 2024.

Comments: 34 pages, 10 figures, and 3 tables. Comments are welcome

Report number: IP/BBSR/2024-03, TIFR/TH/24-05

arXiv:2405.04724 [pdf, other]

Legendrian knots and multi-crossings

Authors: Amit Kumar, Jake Murphy, Brian Naff

Abstract: It was shown in arXiv:1208.5742 that any smooth knot can be represented by an übercrossing projection, i.e. a knot projection with no crossings aside from a single multi-crossing. We extend this idea to Legendrian knots and investigate übercrossing and petal projections in the front and Lagrangian projections. We show that any Legendrian knot with an übercrossing projection in the front projection is smoothly isotopic to the unknot and we demonstrate how to compute the $tb$ and rotation numbers for petal projections in the Lagrangian projection.

Submitted 7 May, 2024; originally announced May 2024.

Comments: 9 pages, 6 figures

MSC Class: 57M27; 53D10

arXiv:2405.03948 [pdf, other]

The Fault in Our Recommendations: On the Perils of Optimizing the Measurable

Authors: Omar Besbes, Yash Kanoria, Akshit Kumar

Abstract: Recommendation systems are widespread, and through customized recommendations, promise to match users with options they will like. To that end, data on engagement is collected and used. Most recommendation systems are ranking-based, where they rank and recommend items based on their predicted engagement. However, the engagement signals are often only a crude proxy for utility, as data on the latter is rarely collected or available. This paper explores the following question: By optimizing for measurable proxies, are recommendation systems at risk of significantly under-delivering on utility? If so, how can one improve utility which is seldom measured? To study these questions, we introduce a model of repeated user consumption in which, at each interaction, users select between an outside option and the best option from a recommendation set. Our model accounts for user heterogeneity, with the majority preferring ``popular'' content, and a minority favoring ``niche'' content. The system initially lacks knowledge of individual user preferences but can learn them through observations of users' choices over time. Our theoretical and numerical analysis demonstrate that optimizing for engagement can lead to significant utility losses. Instead, we propose a utility-aware policy that initially recommends a mix of popular and niche content. As the platform becomes more forward-looking, our utility-aware policy achieves the best of both worlds: near-optimal utility and near-optimal engagement simultaneously.
Our study elucidates an important feature of recommendation systems; given the ability to suggest multiple items, one can perform significant exploration without incurring significant reductions in engagement. By recommending high-risk, high-reward items alongside popular items, systems can enhance discovery of high utility items without significantly affecting engagement.

Submitted 6 May, 2024; originally announced May 2024.

arXiv:2405.03817 [pdf, other]

Search for joint multimessenger signals from potential Galactic PeVatrons with HAWC and IceCube

Authors: R. Alfaro, C. Alvarez, J. C. Arteaga-Velázquez, D. Avila Rojas, H. A. Ayala Solares, R. Babu, E. Belmont-Moreno, K. S. Caballero-Mora, T. Capistrán, A. Carramiñana, S. Casanova, U. Cotti, J. Cotzomi, S. Coutiño de León, E. De la Fuente, D. Depaoli, N. Di Lalla, R. Diaz Hernandez, J. C. Díaz-Vélez, K. Engel, T. Ergin, K. L. Fan, K. Fang, N. Fraija, S. Fraija , et al. (469 additional authors not shown)

Abstract: Galactic PeVatrons are sources that can accelerate cosmic rays to PeV energies. The high-energy cosmic rays are expected to interact with the surrounding ambient material or radiation, resulting in the production of gamma rays and neutrinos. To optimize for the detection of such associated production of gamma rays and neutrinos for a given source morphology and spectrum, a multi-messenger analysis that combines gamma rays and neutrinos is required. In this study, we use the Multi-Mission Maximum Likelihood framework (3ML) with IceCube Maximum Likelihood Analysis software (i3mla) and HAWC Accelerated Likelihood (HAL) to search for a correlation between 22 known gamma-ray sources from the third HAWC gamma-ray catalog and 14 years of IceCube track-like data. No significant neutrino emission from the direction of the HAWC sources was found. We report the best-fit gamma-ray model and 90% CL neutrino flux limit from the 22 sources. From the neutrino flux limit, we conclude that the gamma-ray emission from five of the sources cannot be produced purely from hadronic interactions. We report the limit for the fraction of gamma rays produced by hadronic interactions for these five sources.

Submitted 6 May, 2024; originally announced May 2024.

arXiv:2405.03005 [pdf, other]

Safe Reinforcement Learning with Learned Non-Markovian Safety Constraints

Authors: Siow Meng Low, Akshat Kumar

Abstract: In safe Reinforcement Learning (RL), safety cost is typically defined as a function dependent on the immediate state and actions. In practice, safety constraints can often be non-Markovian due to the insufficient fidelity of state representation, and safety cost may not be known. We therefore address a general setting where safety labels (e.g., safe or unsafe) are associated with state-action trajectories. Our key contributions are: first, we design a safety model that specifically performs credit assignment to assess contributions of partial state-action trajectories on safety. This safety model is trained using a labeled safety dataset. Second, using RL-as-inference strategy we derive an effective algorithm for optimizing a safe policy using the learned safety model. Finally, we devise a method to dynamically adapt the tradeoff coefficient between reward maximization and safety compliance. We rewrite the constrained optimization problem into its dual problem and derive a gradient-based method to dynamically adjust the tradeoff coefficient during training. Our empirical results demonstrate that this approach is highly scalable and able to satisfy sophisticated non-Markovian safety constraints.

Submitted 5 May, 2024; originally announced May 2024.

arXiv:2405.02247 [pdf, other]

Deep Learning of ab initio Hessians for Transition State Optimization

Authors: Eric C. -Y. Yuan, Anup Kumar, Xingyi Guan, Eric D. Hermes, Andrew S. Rosen, Judit Zádor, Teresa Head-Gordon, Samuel M. Blau

Abstract: Identifying transition states -- saddle points on the potential energy surface connecting reactant and product minima -- is central to predicting kinetic barriers and understanding chemical reaction mechanisms. In this work, we train an equivariant neural network potential, NewtonNet, on an ab initio dataset of thousands of organic reactions from which we derive the analytical Hessians from the fully differentiable machine learning (ML) model. By reducing the computational cost by several orders of magnitude relative to the Density Functional Theory (DFT) ab initio source, we can afford to use the learned Hessians at every step for the saddle point optimizations. We have implemented our ML Hessian algorithm in Sella, an open source software package designed to optimize atomic systems to find saddle point structures, in order to compare transition state optimization against quasi-Newton Hessian updates using DFT or the ML model. We show that the full ML Hessian robustly finds the transition states of 240 unseen organic reactions, even when the quality of the initial guess structures are degraded, while reducing the number of optimization steps to convergence by 2--3$\times$ compared to the quasi-Newton DFT and ML methods. All data generation, NewtonNet model, and ML transition state finding methods are available in an automated workflow.

Submitted 3 May, 2024; originally announced May 2024.

Showing 51–100 of 2,910 results for author: Kumar, A