Skip to main content

Showing 1–50 of 233 results for author: Dash, S

  1. arXiv:2407.03211  [pdf, other

    cs.CL cs.LG

    How Does Quantization Affect Multilingual LLMs?

    Authors: Kelly Marchisio, Saurabh Dash, Hongyu Chen, Dennis Aumiller, Ahmet Üstün, Sara Hooker, Sebastian Ruder

    Abstract: Quantization techniques are widely used to improve inference speed and deployment of large language models. While a wide body of work examines the impact of quantized LLMs on English tasks, none have examined the effect of quantization across languages. We conduct a thorough analysis of quantized multilingual LLMs, focusing on their performance across languages and at varying scales. We use automa… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2406.17812  [pdf, other

    cs.LG cs.AI cs.DC

    Scalable Artificial Intelligence for Science: Perspectives, Methods and Exemplars

    Authors: Wesley Brewer, Aditya Kashi, Sajal Dash, Aristeidis Tsaris, Junqi Yin, Mallikarjun Shankar, Feiyi Wang

    Abstract: In a post-ChatGPT world, this paper explores the potential of leveraging scalable artificial intelligence for scientific discovery. We propose that scaling up artificial intelligence on high-performance computing platforms is essential to address such complex problems. This perspective focuses on scientific use cases like cognitive simulations, large language models for scientific inquiry, medical… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 17 pages, 5 figures

  3. arXiv:2405.20835  [pdf, other

    cs.LG cs.AI cs.CL

    Outliers and Calibration Sets have Diminishing Effect on Quantization of Modern LLMs

    Authors: Davide Paglieri, Saurabh Dash, Tim Rocktäschel, Jack Parker-Holder

    Abstract: Post-Training Quantization (PTQ) enhances the efficiency of Large Language Models (LLMs) by enabling faster operation and compatibility with more accessible hardware through reduced memory usage, at the cost of small performance drops. We explore the role of calibration sets in PTQ, specifically their effect on hidden activations in various notable open-source LLMs. Calibration sets are crucial fo… ▽ More

    Submitted 5 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

  4. arXiv:2405.16431  [pdf, other

    nucl-ex hep-ex hep-ph

    Monte-Carlo Study Of Higher-Order Cumulants of Net-Particle Distributions in $p+p$ Collisions at $\sqrt{s}$ = 13 TeV

    Authors: Abdussamad M, Rahul Verma, Nirbhay Kumar Behera, Sadhana Dash, Basanta Kumar Nandi

    Abstract: Measurement of higher order cumulants of the distributions of conserved quantities, like net-charge, net-baryon and net-strangeness in heavy-ion collisions, is proposed as a sensitive tool to determine the freeze-out parameters and the nature of phase transitions at the LHC energies. Baseline measurements for heavy-ion collisions are essential to understand the experimental measurements. Recently,… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 18 Pages, 12 figures

  5. arXiv:2405.15032  [pdf, other

    cs.CL

    Aya 23: Open Weight Releases to Further Multilingual Progress

    Authors: Viraat Aryabumi, John Dang, Dwarak Talupuru, Saurabh Dash, David Cairuz, Hangyu Lin, Bharat Venkitesh, Madeline Smith, Jon Ander Campos, Yi Chern Tan, Kelly Marchisio, Max Bartolo, Sebastian Ruder, Acyr Locatelli, Julia Kreutzer, Nick Frosst, Aidan Gomez, Phil Blunsom, Marzieh Fadaee, Ahmet Üstün, Sara Hooker

    Abstract: This technical report introduces Aya 23, a family of multilingual language models. Aya 23 builds on the recent release of the Aya model (Üstün et al., 2024), focusing on pairing a highly performant pre-trained model with the recently released Aya collection (Singh et al., 2024). The result is a powerful multilingual large language model serving 23 languages, expanding state-of-art language modelin… ▽ More

    Submitted 31 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  6. arXiv:2404.08919  [pdf, ps, other

    hep-ph hep-th nucl-th

    Study of transport properties of a hot and dense QCD matter using a novel approximation method

    Authors: Anowar Shaikh, Shubhalaxmi Rath, Sadhana Dash, Binata Panda

    Abstract: We have studied the charge and heat transport properties of a hot and dense QCD matter using a novel approximation method within the quasiparticle model. Utilizing a novel collision integral for both the relaxation time approximation (RTA) and the Bhatnagar-Gross-Krook (BGK) models, we have solved the relativistic Boltzmann transport equation to estimate the electrical conductivity and the thermal… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: 29 pages, 6 figures

  7. arXiv:2404.06648  [pdf, other

    astro-ph.EP

    Constraints on atmospheric water abundance and cloud deck pressure in the warm Neptune GJ 3470 b via CARMENES transmission spectroscopy

    Authors: Spandan Dash, Matteo Brogi, Siddharth Gandhi, Marina Lafarga, Annabella Meech, Aaron Bello-Arufe, Peter J. Wheatley

    Abstract: Observations of cooler atmospheres of super-Earths and Neptune sized objects often show flat transmission spectra. The most likely cause of this trend is the presence of aerosols (i.e. clouds and hazes) in the atmospheres of such objects. High-resolution spectroscopy provides an opportunity to test this hypothesis by targeting molecular species whose spectral line cores extend above the level of s… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 18 pages, 8 figures, Accepted for publication in Monthly Notices of the Royal Astronomical Society Main Journal on April 9, 2024

  8. arXiv:2403.01833  [pdf, ps, other

    hep-ph

    Charged particle multiplicity fluctuation in $A-A$ collisions at RHIC and LHC energies using Angantyr model

    Authors: Pritindra Bhowmick, Sadhana Dash, Basanta Nandi, Claude Pruneau

    Abstract: Event-by-event fluctuations of the charged particle multiplicity are studied for a wide range of centralities for Au$-$Au collisions at $\sqrt{s_{NN}}$ = 200 GeV, Pb$-$Pb collisions at $\sqrt{s_{NN}}$ = 2.76 TeV and 5.02 TeV using the Pythia 8 Angantyr model. The centrality dependence of $ω_{ch}$ observable, which quantifies the fluctuations in terms of scaled variance is studied for different p… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  9. arXiv:2403.01240  [pdf, ps, other

    hep-ph hep-th nucl-th

    Analyzing the transport coefficients and observables of a rotating QGP medium in kinetic theory framework with a novel approach to the collision integral

    Authors: Shubhalaxmi Rath, Sadhana Dash

    Abstract: In the present work, we have studied how the rotation of the QGP medium affects the transport coefficients and observables in heavy ion collisions. For the noncentral collisions, although most of the angular momentum gets carried away by the spectators, there still remains a finite angular momentum with a finite range of angular velocity, which thus incites rotation in the produced matter. As a re… ▽ More

    Submitted 23 March, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

    Comments: 31 pages, 8 figures

  10. arXiv:2403.01224  [pdf, other

    hep-ph nucl-th

    Study of identified particle production as a function of transverse event activity classifier, $S_{T}$ in p$-$p collisions

    Authors: Rahul Verma, Vishu Saini, Basanta Nandi, Sadhana Dash

    Abstract: A new observable, $S_{T}$, is introduced in terms of the sum of the transverse momentum of charged particles ($\sum_{i} p_{T_{i}}$ ) produced in proton proton (p$-$p) collisions at LHC energies to probe the underlying events (UE). The UE are defined as those aspects of proton-proton collisions that are not attributed to the primary hard scattering process, but rather to the accompanying interactio… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  11. arXiv:2402.01565  [pdf, other

    quant-ph cond-mat.str-el

    Efficiency of neural quantum states in light of the quantum geometric tensor

    Authors: Sidhartha Dash, Filippo Vicentini, Michel Ferrero, Antoine Georges

    Abstract: Neural quantum state (NQS) ansätze have shown promise in variational Monte Carlo algorithms by their theoretical capability of representing any quantum state. However, the reason behind the practical improvement in their performance with an increase in the number of parameters is not fully understood. In this work, we systematically study the efficiency of restricted Boltzmann Machines (RBMs) to r… ▽ More

    Submitted 3 March, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  12. arXiv:2401.14262  [pdf, other

    cond-mat.mes-hall

    Room temperature nonlocal detection of charge-spin interconversion in a topological insulator

    Authors: Anamul Md. Hoque, Lars Sjöström, Dmitrii Khokhriakov, Bing Zhao, Saroj P. Dash

    Abstract: Topological insulators (TIs) are emerging materials for next-generation low-power nanoelectronic and spintronic device applications. TIs possess non-trivial spin-momentum locking features in the topological surface states in addition to the spin-Hall effect (SHE), and Rashba states due to high spin-orbit coupling (SOC) properties. These phenomena are vital for observing the charge-spin conversion… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 12 pages, 4 figures

  13. arXiv:2312.12705  [pdf, other

    cs.DC cs.AI

    Optimizing Distributed Training on Frontier for Large Language Models

    Authors: Sajal Dash, Isaac Lyngaas, Junqi Yin, Xiao Wang, Romain Egele, Guojing Cong, Feiyi Wang, Prasanna Balaprakash

    Abstract: Large language models (LLMs) have demonstrated remarkable success as foundational models, benefiting various downstream applications through fine-tuning. Recent studies on loss scaling have demonstrated the superior performance of larger LLMs compared to their smaller counterparts. Nevertheless, training LLMs with billions of parameters poses significant challenges and requires considerable comput… ▽ More

    Submitted 21 December, 2023; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Edited the abstract to better communicate the scope of the work

  14. arXiv:2312.10223  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Signature of pressure-induced topological phase transition in ZrTe$_5$

    Authors: Zoltán Kovács-Krausz, Dániel Nagy, Albin Márffy, Bogdan Karpiak, Zoltán Tajkov, László Oroszlány, János Koltai, Péter Nemes-Incze, Saroj P. Dash, Péter Makk, Szabolcs Csonka, Endre Tóvári

    Abstract: The layered van der Waals material ZrTe$_5$ is known as a candidate topological insulator (TI), however its topological phase and the relation with other properties such as an apparent Dirac semimetallic state is still a subject of debate. We employ a semiclassical multicarrier transport (MCT) model to analyze the magnetotransport of ZrTe$_5$ nanodevices at hydrostatic pressures up to 2 GPa. The t… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Main Text: 10 pages, 5 figures; Supporting Information: 12 pages, 8 figures

  15. Large Non-Volatile Frequency Tuning of Spin Hall Nano-Oscillators using Circular Memristive Nano-Gates

    Authors: Maha Khademi, Akash Kumar, Mona Rajabali, Saroj P. Dash, Johan Åkerman

    Abstract: Spin Hall nano oscillators (SHNOs) are promising candidates for neuromorphic computing due to their miniaturized dimensions, non-linearity, fast dynamics, and ability to synchronize in long chains and arrays. However, tuning the individual SHNOs in large chains/arrays, which is key to implementing synaptic control, has remained a challenge. Here, we demonstrate circular memristive nano-gates, both… ▽ More

    Submitted 18 January, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Marie Sklodowska-Curie Actions, H2020-MSCA-ITN-2020; Project Acronym SPEAR; Grant Agreement No. 955671

  16. arXiv:2311.14154  [pdf

    cond-mat.supr-con cond-mat.mtrl-sci

    Structural transitions in superconducting NbTiN thin films

    Authors: Siddhesh Sanjay Yeram, Sonam Bhakat, Subhashree S. Dash, Avradeep Pal

    Abstract: Superconducting NbTiN thin films have garnered extensive interest due to their use in Superconducting Nanowire Single-Photon Detectors (SNSPDs) and other low-temperature applications for potential use in quantum computing and nanoelectronics. This study examines structural phase transitions observed in NbTiN thin films by analyzing the grazing angle x-ray diffraction patterns of a set of reactive… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: 3 figures, 3 tables

  17. arXiv:2311.08145  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.other physics.app-ph

    Ultra-low-current-density single-layer magnetic Weyl semimetal spin Hall nano-oscillators

    Authors: Lakhan Bainsla, Yuya Sakuraba, Avinash Kumar Chaurasiya, Akash Kumar, Keisuke Masuda, Ahmad A. Awad, Nilamani Behera, Roman Khymyn, Saroj Prasad Dash, Johan Åkerman

    Abstract: Topological quantum materials can exhibit unconventional surface states and anomalous transport properties. Still, their applications in spintronic devices are restricted as they require the growth of high-quality thin films with bulk-like properties. Here, we study 10--30 nm thick epitaxial ferromagnetic Co$_{\rm 2}$MnGa films with high structural order and very high values of the anomalous Hall… ▽ More

    Submitted 19 April, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: 19 pages

  18. arXiv:2311.05278  [pdf, other

    cond-mat.str-el

    Evidence of electron correlation induced kink in Dirac bands in a non-symmorphic Kondo lattice system, CeAgSb2

    Authors: Sawani Datta, Khadiza Ali, Rahul Verma, Bahadur Singh, Saroj P. Dash, A. Thamizhavel, Kalobaran Maiti

    Abstract: We study the behavior of Dirac fermions in the presence of electron correlation in a nonsymmorphic Kondo lattice system, CeAgSb2 employing high-resolution angle-resolved photoemission spectroscopy and first-principles calculations. Experiments reveal crossings of highly dispersive linear bands at the Brillouin zone boundary due to non-symmorphic symmetry. In addition, anisotropic Dirac cones are o… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: 4 figures

  19. arXiv:2311.02382  [pdf, other

    cs.DC cs.AI

    Ultra-Long Sequence Distributed Transformer

    Authors: Xiao Wang, Isaac Lyngaas, Aristeidis Tsaris, Peng Chen, Sajal Dash, Mayanka Chandra Shekar, Tao Luo, Hong-Jun Yoon, Mohamed Wahib, John Gouley

    Abstract: Transformer models trained on long sequences often achieve higher accuracy than short sequences. Unfortunately, conventional transformers struggle with long sequence training due to the overwhelming computation and memory requirements. Existing methods for long sequence training offer limited speedup and memory reduction, and may compromise accuracy. This paper presents a novel and efficient distr… ▽ More

    Submitted 8 November, 2023; v1 submitted 4 November, 2023; originally announced November 2023.

  20. arXiv:2311.01994  [pdf, other

    stat.ML cs.AI cs.LG math.OC

    Obtaining Explainable Classification Models using Distributionally Robust Optimization

    Authors: Sanjeeb Dash, Soumyadip Ghosh, Joao Goncalves, Mark S. Squillante

    Abstract: Model explainability is crucial for human users to be able to interpret how a proposed classifier assigns labels to data based on its feature values. We study generalized linear models constructed using sets of feature value rules, which can capture nonlinear dependencies and interactions. An inherent trade-off exists between rule set sparsity and its prediction accuracy. It is computationally exp… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  21. arXiv:2310.19618  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.other physics.app-ph

    Strong in-plane magnetic anisotropy (Co0.15Fe0.85)5GeTe2/graphene van der Waals heterostructure spin-valve at room temperature

    Authors: Roselle Ngaloy, Bing Zhao, Soheil Ershadrad, Rahul Gupta, Masoumeh Davoudiniya, Lakhan Bainsla, Lars Sjöström, Anamul M. Hoque, Alexei Kalaboukhov, Peter Svedlindh, Biplab Sanyal, Saroj P. Dash

    Abstract: Van der Waals (vdW) magnets are promising owing to their tunable magnetic properties with doping or alloy composition, where the strength of magnetic interactions, their symmetry, and magnetic anisotropy can be tuned according to the desired application. However, most of the vdW magnet based spintronic devices are so far limited to cryogenic temperatures with magnetic anisotropies favouring out-of… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  22. arXiv:2310.16371  [pdf, other

    cs.IT cs.NI

    Synergizing Airborne Non-Terrestrial Networks and Reconfigurable Intelligent Surfaces-Aided 6G IoT

    Authors: Muhammad Ali Jamshed, Aryan Kaushik, Mesut Toka, Wonjae Shin, Muhammad Zeeshan Shakir, Soumya P. Dash, Davide Dardari

    Abstract: On the one hand, Reconfigurable Intelligent Surfaces (RISs) emerge as a promising solution to meet the demand for higher data rates, improved coverage, and efficient spectrum utilization. On the other hand, Non-Terrestrial Networks (NTNs) offer unprecedented possibilities for global connectivity. Moreover, the NTN can also support the upsurge in the number of Internet of Things (IoT) devices by pr… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 15 pages, 5 figures

  23. arXiv:2310.06521  [pdf, other

    cond-mat.str-el

    Skyrmions and magnetic bubbles in spin-orbit coupled metallic magnets

    Authors: Deepti Rana, Soumyaranjan Dash, Monika Bhakar, Rajeshwari Roy Chowdhury, Ravi Prakash Singh, Sanjeev Kumar, Goutam Sheet

    Abstract: Motivated by the observation of Skyrmion-like magnetic textures in 2D itinerant ferromagnets Fe$_n$GeTe$_2$ ($n \geq3$), we develop a microscopic model combining itinerant magnetism and spin-orbit coupling on a triangular lattice. The ground state of the model in the absence of magnetic field consists of filamentary magnetic domain walls revealing a striking similarity with our magnetic force micr… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  24. arXiv:2310.06395  [pdf

    cond-mat.mes-hall

    Large out-of-plane spin-orbit torque in topological Weyl semimetal candidate TaIrTe4

    Authors: Lakhan Bainsla, Bing Zhao, Anamul Md. Hoque, Lars Sjöström, Nilamani Behera, Mahmoud Abdel-Hafiez, Johan Åkerman, Saroj P. Dash

    Abstract: Topological quantum materials, with novel spin textures and broken crystal symmetries are suitable candidates for spintronic memory technologies. Their unique electronic properties, such as protected surface states and exotic quasiparticles, can provide an out-of-plane spin polarized current needed for external field free magnetization switching of magnets with perpendicular magnetic anisotropy. C… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  25. arXiv:2310.04610  [pdf, other

    cs.AI cs.LG

    DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies

    Authors: Shuaiwen Leon Song, Bonnie Kruft, Minjia Zhang, Conglong Li, Shiyang Chen, Chengming Zhang, Masahiro Tanaka, Xiaoxia Wu, Jeff Rasley, Ammar Ahmad Awan, Connor Holmes, Martin Cai, Adam Ghanem, Zhongzhu Zhou, Yuxiong He, Pete Luferenko, Divya Kumar, Jonathan Weyn, Ruixiong Zhang, Sylwester Klocek, Volodymyr Vragov, Mohammed AlQuraishi, Gustaf Ahdritz, Christina Floristean, Cristina Negri , et al. (67 additional authors not shown)

    Abstract: In the upcoming decade, deep learning may revolutionize the natural sciences, enhancing our capacity to model and predict natural occurrences. This could herald a new era of scientific exploration, bringing significant advancements across sectors from drug development to renewable energy. To answer this call, we present DeepSpeed4Science initiative (deepspeed4science.ai) which aims to build unique… ▽ More

    Submitted 11 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

  26. arXiv:2308.13408  [pdf

    cond-mat.mes-hall

    Coexistence of non-trivial van der Waals magnetic orders enable field-free spin-orbit torque switching at room temperature

    Authors: Bing Zhao, Lakhan Bainsla, Roselle Ngaloy, Peter Svedlindh, Saroj P. Dash

    Abstract: The discovery of van der Waals (vdW) materials exhibiting non-trivial and tunable magnetic interactions at room temperature can give rise to exotic magnetic states, which are not readily attainable with conventional materials. Such vdW magnets can provide a unique platform for studying new magnetic phenomena and realising magnetization dynamics for energy-efficient and non-volatile spintronic memo… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

  27. arXiv:2308.09474  [pdf, other

    cs.AI cs.SC math.OC

    Evolving Scientific Discovery by Unifying Data and Background Knowledge with AI Hilbert

    Authors: Ryan Cory-Wright, Cristina Cornelio, Sanjeeb Dash, Bachir El Khadir, Lior Horesh

    Abstract: The discovery of scientific formulae that parsimoniously explain natural phenomena and align with existing background theory is a key goal in science. Historically, scientists have derived natural laws by manipulating equations based on existing knowledge, forming new equations, and verifying them experimentally. In recent years, data-driven scientific discovery has emerged as a viable competitor… ▽ More

    Submitted 29 April, 2024; v1 submitted 18 August, 2023; originally announced August 2023.

    Comments: Revised version, including a significant number of new experiments+supplementary material in appendix, and a title change

  28. arXiv:2308.06007  [pdf, ps, other

    cs.IT eess.SP

    RIS-Assisted 6G Wireless Communications: A Novel Statistical Framework in the Presence of Direct Channel

    Authors: Soumya P. Dash, Aryan Kaushik

    Abstract: A RIS-assisted wireless communication system in the presence of a direct communication path between the transceiver pair is considered in this paper. The transmitter-RIS and the RIS-receiver channels follow independent Nakagami-m distributions, and the direct channel between the transceiver pair follows a Rayleigh distribution. Considering this system model, the statistics of the composite channel… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: 5 pages

  29. arXiv:2307.12002  [pdf, ps, other

    hep-ph nucl-th

    Nonextensive effects on the viscous properties of hot and magnetized QCD matter

    Authors: Shubhalaxmi Rath, Sadhana Dash

    Abstract: We have studied the effect of the nonextensive Tsallis mechanism on the viscous properties of hot QCD matter in the presence of a strong magnetic field. The results are compared to the case of absence of magnetic field. The viscous coefficients, such as the shear viscosity ($η$) and the bulk viscosity ($ζ$) are determined in the similar environment by utilizing the nonextensive Tsallis mechanism w… ▽ More

    Submitted 6 February, 2024; v1 submitted 22 July, 2023; originally announced July 2023.

    Comments: 34 pages, 7 figures

  30. arXiv:2307.09417  [pdf, ps, other

    cs.IT eess.SP

    RIS-Aided Index Modulation with Greedy Detection over Rician Fading Channels

    Authors: Aritra Basu, Soumya P. Dash, Aryan Kaushik, Debasish Ghose, Marco Di Renzo, Yonina C. Eldar

    Abstract: Index modulation schemes for reconfigurable intelligent surfaces (RIS)-assisted systems are envisioned as promising technologies for fifth-generation-advanced and sixth-generation (6G) wireless communication systems to enhance various system capabilities such as coverage area and network capacity. In this paper, we consider a receive diversity RIS-assisted wireless communication system employing I… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: 30 pages, 7 figures

  31. arXiv:2307.08593  [pdf, other

    physics.acc-ph cs.LG hep-ex nucl-ex nucl-th

    Artificial Intelligence for the Electron Ion Collider (AI4EIC)

    Authors: C. Allaire, R. Ammendola, E. -C. Aschenauer, M. Balandat, M. Battaglieri, J. Bernauer, M. Bondì, N. Branson, T. Britton, A. Butter, I. Chahrour, P. Chatagnon, E. Cisbani, E. W. Cline, S. Dash, C. Dean, W. Deconinck, A. Deshpande, M. Diefenthaler, R. Ent, C. Fanelli, M. Finger, M. Finger, Jr., E. Fol, S. Furletov , et al. (70 additional authors not shown)

    Abstract: The Electron-Ion Collider (EIC), a state-of-the-art facility for studying the strong force, is expected to begin commissioning its first experiments in 2028. This is an opportune time for artificial intelligence (AI) to be included from the start at this facility and in all phases that lead up to the experiments. The second annual workshop organized by the AI4EIC working group, which recently took… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 27 pages, 11 figures, AI4EIC workshop, tutorials and hackathon

  32. arXiv:2307.03842  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Localization and interaction of interlayer excitons in MoSe$_2$/WSe$_2$ heterobilayers

    Authors: Hanlin Fang, Qiaoling Lin, Yi Zhang, Joshua Thompson, Sanshui Xiao, Zhipei Sun, Ermin Malic, Saroj Dash, Witlef Wieczorek

    Abstract: Transition metal dichalcogenide (TMD) heterobilayers provide a versatile platform to explore unique excitonic physics via properties of the constituent TMDs and external stimuli. Interlayer excitons (IXs) can form in TMD heterobilayers as delocalized or localized states. However, the localization of IX in different types of potential traps, the emergence of biexcitons in the high-excitation regime… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: 18 pages, 15 figures incl supplemental material

    Journal ref: Nature Communications 14, 6910 (2023)

  33. arXiv:2306.03524  [pdf, other

    hep-ph nucl-ex

    Multiplicity and Transverse Spherocity dependence of $\langle p_{\rm T} \rangle$ fluctuations of charged particles in p$-$p collisions at $\sqrt{s}$ = 7 and 13 TeV

    Authors: Subhadeep Roy, Tulika Tripathy, Sadhana Dash

    Abstract: The multiplicity dependence of event-by-event fluctuations in mean transverse momentum, $\langle p_{\rm T} \rangle$, of charged particles has been studied in p$-$p collisions at $\sqrt{s}$ = 7 TeV and 13 TeV using the PYTHIA 8 event generator. The charged particles were selected in kinematic range of $0.15 < p_{\rm T}<2$ GeV$/c$ and $|η| < 0.8$. The dynamical fluctuations would indicate towards th… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: 8 pages, 16 figures

  34. arXiv:2305.19268  [pdf, other

    cs.LG cs.AI

    Intriguing Properties of Quantization at Scale

    Authors: Arash Ahmadian, Saurabh Dash, Hongyu Chen, Bharat Venkitesh, Stephen Gou, Phil Blunsom, Ahmet Üstün, Sara Hooker

    Abstract: Emergent properties have been widely adopted as a term to describe behavior not present in smaller models but observed in larger models. Recent work suggests that the trade-off incurred by quantization is also an emergent property, with sharp drops in performance in models over 6B parameters. In this work, we ask "are quantization cliffs in performance solely a factor of scale?" Against a backdrop… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: 32 pages, 14 figures

  35. arXiv:2305.18183  [pdf, other

    cs.LG cs.CV stat.ML

    On Counterfactual Data Augmentation Under Confounding

    Authors: Abbavaram Gowtham Reddy, Saketh Bachu, Saloni Dash, Charchit Sharma, Amit Sharma, Vineeth N Balasubramanian

    Abstract: Counterfactual data augmentation has recently emerged as a method to mitigate confounding biases in the training data. These biases, such as spurious correlations, arise due to various observed and unobserved confounding variables in the data generation process. In this paper, we formally analyze how confounding biases impact downstream classifiers and present a causal viewpoint to the solutions b… ▽ More

    Submitted 21 November, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

  36. arXiv:2305.03881  [pdf, other

    cs.IR cs.CL cs.CV

    Fairness in Image Search: A Study of Occupational Stereotyping in Image Retrieval and its Debiasing

    Authors: Swagatika Dash

    Abstract: Multi-modal search engines have experienced significant growth and widespread use in recent years, making them the second most common internet use. While search engine systems offer a range of services, the image search field has recently become a focal point in the information retrieval community, as the adage goes, "a picture is worth a thousand words". Although popular search engines like Googl… ▽ More

    Submitted 22 August, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: 20 Pages, Work uses Proprietary Search Systems from the year 2021

  37. arXiv:2305.02259  [pdf

    cond-mat.mes-hall

    Thermally-driven Multilevel Non-volatile Memory with Monolayer MoS2 for Neuro-inspired Artificial Learning

    Authors: Sameer Kumar Mallik, Roshan Padhan, Mousam Charan Sahu, Suman Roy, Gopal K Pradhan, Prasana Kumar Sahoo, Saroj Prasad Dash, Satyaprakash Sahoo

    Abstract: The demands of modern electronic components require advanced computing platforms for efficient information processing to realize in-memory operations with a high density of data storage capabilities towards developing alternatives to von Neumann architectures. Herein, we demonstrate the multifunctionality of monolayer MoS2 mem-transistors which can be used as a high-geared intrinsic transistor at… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Journal ref: ACS Applied Materials & Interfaces 2023

  38. arXiv:2303.03071  [pdf, ps, other

    hep-ph hep-th nucl-th

    Impact of nonextensivity on the transport coefficients of a magnetized hot and dense QCD matter

    Authors: Shubhalaxmi Rath, Sadhana Dash

    Abstract: We have studied the impact of the nonextensivity on the transport coefficients related to charge and heat in thermal QCD. For this purpose, the electrical ($σ_{\rm el}$), Hall ($σ_{\rm H}$), thermal ($κ$) and Hall-type thermal ($κ_{\rm H}$) conductivities are determined using the kinetic theory approach in association with the nonextensive Tsallis statistical mechanism. The effect of nonextensivit… ▽ More

    Submitted 25 September, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: 24 pages, 7 figures

  39. arXiv:2302.02235  [pdf, other

    hep-ph nucl-th

    Viscous QCD medium effects on the bottom quark transport coefficients

    Authors: Adiba Shaikh, Sadhana Dash, Basanta K. Nandi

    Abstract: The bottom quark transport coefficients, i.e., drag and diffusion coefficients, have been studied for the collisional and soft gluon radiative processes within the viscous QCD medium. The thermal medium effects are incorporated using the effective fugacity quasiparticle model (EQPM). Both the shear and bulk viscous effects at leading order are embedded through the near-equilibrium distribution fun… ▽ More

    Submitted 4 February, 2023; originally announced February 2023.

    Comments: 9 pages, 4 figures

  40. arXiv:2301.11837  [pdf, other

    cond-mat.mtrl-sci

    A ferromagnetic Eu-Pt surface compound grown below hexagonal boron nitride

    Authors: Alaa Mohammed Idris Bakhit, Khadiza Ali, Anna A. Makarova, Igor Píš, Federica Bondino, Roberto Sant, Saroj P. Dash, Rodrigo Castrillo, Yuri Hasegawa, J. Enrique Ortega, Laura Fernandez, Frederik Schiller

    Abstract: One of the fundamental applications for monolayer-thick 2D materials is their use as protective layers of metal surfaces and in-situ intercalated reactive materials in ambient conditions. Here we investigate the structural, electronic, and magnetic properties, as well as the chemical stability in air of a very reactive metal, Europium, after intercalation between a hexagonal boron nitride (hBN) la… ▽ More

    Submitted 21 June, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  41. Bottom-up growth of monolayer honeycomb SiC

    Authors: C. M. Polley, H. Fedderwitz, T. Balasubramanian, A. A. Zakharov, R. Yakimova, O. Bäcke, J. Ekman, S. P. Dash, S. Kubatkin, S. Lara-Avila

    Abstract: The long theorized two-dimensional allotrope of SiC has remained elusive amid the exploration of graphenelike honeycomb structured monolayers. It is anticipated to possess a large direct band gap (2.5 eV), ambient stability, and chemical versatility. While $sp^{2}$ bonding between silicon and carbon is energetically favorable, only disordered nanoflakes have been reported to date. Here we demonstr… ▽ More

    Submitted 16 January, 2023; originally announced January 2023.

  42. arXiv:2301.01348  [pdf, other

    cs.LG cs.AI cs.RO

    DADAgger: Disagreement-Augmented Dataset Aggregation

    Authors: Akash Haridas, Karim Hamadeh, Samarendra Chandan Bindu Dash

    Abstract: DAgger is an imitation algorithm that aggregates its original datasets by querying the expert on all samples encountered during training. In order to reduce the number of samples queried, we propose a modification to DAgger, known as DADAgger, which only queries the expert for state-action pairs that are out of distribution (OOD). OOD states are identified by measuring the variance of the action p… ▽ More

    Submitted 3 January, 2023; originally announced January 2023.

    Comments: Imitation Learning for Robotics

  43. arXiv:2212.07250  [pdf, other

    cs.PL

    Affine Monads and Lazy Structures for Bayesian Programming

    Authors: Swaraj Dash, Younesse Kaddar, Hugo Paquet, Sam Staton

    Abstract: We show that streams and lazy data structures are a natural idiom for programming with infinite-dimensional Bayesian methods such as Poisson processes, Gaussian processes, jump processes, Dirichlet processes, and Beta processes. The crucial semantic idea, inspired by developments in synthetic probability theory, is to work with two separate monads: an affine monad of probability, which supports la… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: Accepted for POPL 2023

  44. arXiv:2211.15860  [pdf, other

    cs.LG stat.CO

    Bayesian Experimental Design for Symbolic Discovery

    Authors: Kenneth L. Clarkson, Cristina Cornelio, Sanjeeb Dash, Joao Goncalves, Lior Horesh, Nimrod Megiddo

    Abstract: This study concerns the formulation and application of Bayesian optimal experimental design to symbolic discovery, which is the inference from observational data of predictive models taking general functional forms. We apply constrained first-order methods to optimize an appropriate selection criterion, using Hamiltonian Monte Carlo to sample from the prior. A step for computing the predictive dis… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  45. arXiv:2210.15388  [pdf, ps, other

    hep-ph hep-th nucl-th

    Flow of charge and heat in thermal QCD within the weak magnetic field limit: A BGK model approach

    Authors: Anowar Shaikh, Shubhalaxmi Rath, Sadhana Dash, Binata Panda

    Abstract: We have computed the charge and heat transport coefficients of hot QCD matter by solving the relativistic Boltzmann transport equation using the BGK model approximation with a modified collision integral in the weak magnetic field regime. This modified collision integral enhances both charge and heat transport phenomena which can be understood by the large values of the above-mentioned coefficient… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: 41 pages, 26 figures

  46. arXiv:2210.12368  [pdf, other

    cs.LG cs.AI

    Counterfactual Generation Under Confounding

    Authors: Abbavaram Gowtham Reddy, Saloni Dash, Amit Sharma, Vineeth N Balasubramanian

    Abstract: A machine learning model, under the influence of observed or unobserved confounders in the training data, can learn spurious correlations and fail to generalize when deployed. For image classifiers, augmenting a training dataset using counterfactual examples has been empirically shown to break spurious correlations. However, the counterfactual generation task itself becomes more difficult as the l… ▽ More

    Submitted 10 December, 2022; v1 submitted 22 October, 2022; originally announced October 2022.

  47. arXiv:2210.09048  [pdf, other

    physics.ins-det hep-ex nucl-ex

    ATHENA Detector Proposal -- A Totally Hermetic Electron Nucleus Apparatus proposed for IP6 at the Electron-Ion Collider

    Authors: ATHENA Collaboration, J. Adam, L. Adamczyk, N. Agrawal, C. Aidala, W. Akers, M. Alekseev, M. M. Allen, F. Ameli, A. Angerami, P. Antonioli, N. J. Apadula, A. Aprahamian, W. Armstrong, M. Arratia, J. R. Arrington, A. Asaturyan, E. C. Aschenauer, K. Augsten, S. Aune, K. Bailey, C. Baldanza, M. Bansal, F. Barbosa, L. Barion , et al. (415 additional authors not shown)

    Abstract: ATHENA has been designed as a general purpose detector capable of delivering the full scientific scope of the Electron-Ion Collider. Careful technology choices provide fine tracking and momentum resolution, high performance electromagnetic and hadronic calorimetry, hadron identification over a wide kinematic range, and near-complete hermeticity. This article describes the detector design and its e… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Journal ref: JINST 17 (2022) 10, P10019

  48. arXiv:2209.09207  [pdf, other

    cs.CV

    Table Detection in the Wild: A Novel Diverse Table Detection Dataset and Method

    Authors: Mrinal Haloi, Shashank Shekhar, Nikhil Fande, Siddhant Swaroop Dash, Sanjay G

    Abstract: Recent deep learning approaches in table detection achieved outstanding performance and proved to be effective in identifying document layouts. Currently, available table detection benchmarks have many limitations, including the lack of samples diversity, simple table structure, the lack of training cases, and samples quality. In this paper, we introduce a diverse large-scale dataset for table det… ▽ More

    Submitted 30 November, 2023; v1 submitted 31 August, 2022; originally announced September 2022.

    Comments: Open source Table detection dataset and baseline results

    MSC Class: 68T45

  49. arXiv:2209.08784  [pdf, other

    hep-ex

    Study of multiplicity dependence of heavy flavor production in p-p collisions using rope hadronization mechanism

    Authors: Tulika Tripathy, Bharati Naik, Ranjit Nayak, Nirbhay Behera, Basanta K. Nandi, Sadhana Dash

    Abstract: The multiplicity dependence of the production of the charm mesons in p$-$p collisions at $\sqrt{s} = 7$ TeV and 13 TeV as measured by ALICE experiment has been investigated using Pythia 8 event generator by studying the effect of various processes at partonic level such as the effect of different modes of color reconnections and rope hadronization. The relative yields (… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

  50. arXiv:2209.06797  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Revealing the band structure of ZrTe$_5$ using Multicarrier Transport

    Authors: Zoltán Kovács-Krausz, Endre Tóvári, Dániel Nagy, Albin Márffy, Bogdan Karpiak, Zoltán Tajkov, László Oroszlány, János Koltai, Péter Nemes-Incze, Saroj Dash, Péter Makk, Szabolcs Csonka

    Abstract: The layered material ZrTe$_5$ appears to exhibit several exotic behaviors which resulted in significant interest recently, although the exact properties are still highly debated. Among these we find a Dirac/Weyl semimetallic behavior, nontrivial spin textures revealed by low temperature transport, and a potential weak or strong topological phase. The anomalous behavior of resistivity has been rece… ▽ More

    Submitted 19 January, 2023; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: Main Text: 10 pages, 5 figures; Supporting Information: 10 pages, 7 figures

    Journal ref: Phys. Rev. B 2023, 107, 7, 075152