Showing 1–50 of 556 results for author: Gupta, N

  1. arXiv:2407.05887  [pdf, other]

    cs.CL cs.AI cs.LG

    Generation and De-Identification of Indian Clinical Discharge Summaries using LLMs

    Authors: Sanjeet Singh, Shreya Gupta, Niralee Gupta, Naimish Sharma, Lokesh Srivastava, Vibhu Agarwal, Ashutosh Modi

    Abstract: The consequences of a healthcare data breach can be devastating for the patients, providers, and payers. The average financial impact of a data breach in recent months has been estimated to be close to USD 10 million. This is especially significant for healthcare organizations in India that are managing rapid digitization while still establishing data governance procedures that align with the lett…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted at BioNLP Workshop at ACL 2024; 21 pages (9 pages main content)

  2. arXiv:2407.04727  [pdf]

    eess.SP

    Dynamical Embedding of Single Channel Electroencephalogram for Artifact Subspace Reconstruction

    Authors: Doli Hazarika, Vishnu KN, Ramdas Ransing, Cota Navin Gupta

    Abstract: This study introduces a novel framework to apply the Artifact Subspace Reconstruction (ASR) algorithm to single-channel Electroencephalogram (EEG) data. ASR is renowned for its automated capability to effectively eliminate various artifacts, such as eye blinks and eye movements, from EEG signals. Importantly, it has been implemented on Android smartphones, but relied on multiple channels for principal compone…

    Submitted 28 June, 2024; originally announced July 2024.

  3. arXiv:2406.14670  [pdf, other]

    cs.CL cs.AI cs.LG

    Exploring Design Choices for Building Language-Specific LLMs

    Authors: Atula Tejaswi, Nilesh Gupta, Eunsol Choi

    Abstract: Despite rapid progress in large language models (LLMs), their performance on a vast majority of languages remains unsatisfactory. In this paper, we study building language-specific LLMs by adapting monolingual and multilingual LLMs. We conduct systematic experiments on how design choices (base model selection, vocabulary extension, and continued fine-tuning) impact the adapted LLM, both in terms of…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 15 pages, 6 figures, 11 tables

  4. arXiv:2406.10005  [pdf, ps, other]

    math.ST

    Optimal Rates for Functional Linear Regression with General Regularization

    Authors: Naveen Gupta, S. Sivananthan, Bharath K. Sriperumbudur

    Abstract: Functional linear regression is one of the fundamental and well-studied methods in functional data analysis. In this work, we investigate the functional linear regression model within the context of reproducing kernel Hilbert space by employing general spectral regularization to approximate the slope function with certain smoothness assumptions. We establish optimal convergence rates for estimatio…

    Submitted 14 June, 2024; originally announced June 2024.

  5. arXiv:2406.05834  [pdf, other]

    math.ST math.PR stat.AP

    Stochastic ordering of series and parallel systems lifetime in Archimedean copula under random shock

    Authors: Sarikul Islam, Nitin Gupta

    Abstract: In this manuscript, we studied the stochastic ordering behavior of series as well as parallel systems lifetimes comprising dependent and heterogeneous components, experiencing random shocks, and exhibiting distinct dependency structures. We establish certain conditions for the lifetime of individual components, the dependency among components defined by Archimedean copulas, and the impact of rando…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 18 pages, 4 figures

    Report number: AP-2024-19419

    MSC Class: Primary: 60E15. Secondary: 90B25; 62G30

  6. arXiv:2405.20244  [pdf, ps, other]

    hep-th gr-qc

    Chiral $Λ$-$\mathfrak{bms}_4$ symmetry of 3d conformal gravity

    Authors: Nishant Gupta, Nemani V. Suryanarayana

    Abstract: We propose mixed boundary conditions for 3d conformal gravity consistent with variational principle in its second-order formalism that admit the chiral $Λ$-$\mathfrak{bms}_4$ algebra as their asymptotic symmetry algebra. This algebra is one of the four chiral $\mathcal W$-algebra extensions of $\mathfrak{so}(2,3)$ and is a generalisation of the chiral $\mathfrak{bms}_4$ algebra responsible for sof…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 22 pages

  7. arXiv:2405.19261  [pdf, other]

    cs.CL cs.AI cs.LG

    Faster Cascades via Speculative Decoding

    Authors: Harikrishna Narasimhan, Wittawat Jitkrittum, Ankit Singh Rawat, Seungyeon Kim, Neha Gupta, Aditya Krishna Menon, Sanjiv Kumar

    Abstract: Cascades and speculative decoding are two common approaches to improving language models' inference efficiency. Both approaches involve interleaving models of different sizes, but via fundamentally distinct mechanisms: cascades employ a deferral rule that invokes the larger model only for "hard" inputs, while speculative decoding uses speculative execution to primarily invoke the larger model in p…

    Submitted 29 May, 2024; originally announced May 2024.

  8. arXiv:2405.15657  [pdf, other]

    astro-ph.HE

    Multiple Emission Regions in Jets of Low Luminosity Active Galactic Nucleus in NGC 4278

    Authors: Samik Dutta, Nayantara Gupta

    Abstract: The Large High Altitude Airshower Array (LHAASO) has detected very high energy gamma rays from the LINER galaxy NGC 4278, which has a low luminosity active galactic nucleus, and symmetric mildly relativistic S-shaped twin jets detected by radio observations. Few low-luminosity active galactic nuclei are detected in gamma rays due to their faintness. Earlier, several radio-emitting components were…

    Submitted 24 May, 2024; originally announced May 2024.

  9. arXiv:2405.14432  [pdf, other]

    cs.LG

    Boosting Robustness by Clipping Gradients in Distributed Learning

    Authors: Youssef Allouah, Rachid Guerraoui, Nirupam Gupta, Ahmed Jellouli, Geovani Rizk, John Stephan

    Abstract: Robust distributed learning consists in achieving good learning performance despite the presence of misbehaving workers. State-of-the-art (SOTA) robust distributed gradient descent (Robust-DGD) methods, relying on robust aggregation, have been proven to be optimal: Their learning error matches the lower bound established under the standard heterogeneity model of $(G, B)$-gradient dissimilarity. Th…

    Submitted 27 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  10. arXiv:2405.11201  [pdf, ps, other]

    math.ST

    On General Weighted Extropy of Percentile Ranked Set Sampling

    Authors: Pradeep Kumar Sahu, Nitin Gupta

    Abstract: The extropy measure, first proposed by Lad, Sanfilippo, and Agro in their (2015) paper in Statistical Science, has attracted considerable attention in recent years. Our study introduces a fresh approach to representing weighted extropy in the framework of percentile ranked set sampling. Furthermore, we provide additional insights such as stochastic orders, characterizations, and bounds. Our findin…

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2403.02673, arXiv:2207.02003

  11. arXiv:2405.09966  [pdf, ps, other]

    math.PR

    Tempered Fractional Hawkes Process and Its Generalization

    Authors: Neha Gupta, Aditya Maheshwari

    Abstract: Hawkes process (HP) is a point process with a conditionally dependent intensity function. This paper defines the tempered fractional Hawkes process (TFHP) by time-changing the HP with an inverse tempered stable subordinator. We obtained results that generalize the fractional Hawkes process defined in Hainaut (2020) to a tempered version which has semi-heavy tailed decay. We derive the mea…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 15 pages

    MSC Class: 60G22; 60G51; 60G55

  12. arXiv:2405.07205  [pdf, ps, other]

    math.AG math.AC

    On Epimorphism and related problems for linear hypersurfaces

    Authors: Parnashree Ghosh, Neena Gupta, Ananya Pal

    Abstract: Linear hypersurfaces over a field $k$ have been playing a central role in the study of some of the challenging problems on affine spaces. Breakthroughs on such problems have occurred by examining two questions on linear polynomials of the form $H:=α(X_1,\dots,X_m)Y - F(X_1,\dots, X_m,Z,T)\in D:=k[X_1,\ldots,X_m, Y,Z,T]$: (i) Whether the affine variety $\mathbb{V}\in \mathbb{A}^{m+3}_k$ defined b…

    Submitted 12 May, 2024; originally announced May 2024.

    MSC Class: Primary: 14R10; Secondary: 13B25; 13A50; 13A02

  13. arXiv:2405.04374  [pdf, other]

    astro-ph.GA

    ASKAP reveals the radio tail structure of the Corkscrew Galaxy shaped by its passage through the Abell 3627 cluster

    Authors: Bärbel S. Koribalski, Stefan W. Duchesne, Emil Lenc, Tiziana Venturi, Andrea Botteon, Stanislav S. Shabala, Tessa Vernstrom, Ettore Carretti, Ray P. Norris, Craig Anderson, Andrew M. Hopkins, C. J. Riseley, Nikhel Gupta, Velibor Velović

    Abstract: Among the bent tail radio galaxies common in galaxy clusters are some with long, collimated tails (so-called head-tail galaxies) shaped by their interactions with the intracluster medium (ICM). Here we report the discovery of intricate filamentary structure in and beyond the ~28' (570 kpc) long, helical radio tail of the Corkscrew Galaxy (1610-60.5, ESO137-G007), which resides in the X-ray bright…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 11 pages, 7 figures, MNRAS, submitted

  14. arXiv:2405.00491  [pdf, ps, other]

    cs.LG

    On the Relevance of Byzantine Robust Optimization Against Data Poisoning

    Authors: Sadegh Farhadkhani, Rachid Guerraoui, Nirupam Gupta, Rafael Pinot

    Abstract: The success of machine learning (ML) has been intimately linked with the availability of large amounts of data, typically collected from heterogeneous sources and processed on vast networks of computing devices (also called workers). Beyond accuracy, the use of ML in critical domains such as healthcare and autonomous driving calls for robustness against data poisoning and some faul…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 38 pages

  15. arXiv:2404.18462  [pdf, other]

    astro-ph.IM

    Self-supervised contrastive learning of radio data for source detection, classification and peculiar object discovery

    Authors: S. Riggi, T. Cecconello, S. Palazzo, A. M. Hopkins, N. Gupta, C. Bordiu, A. Ingallinera, C. Buemi, F. Bufano, F. Cavallaro, M. D. Filipović, P. Leto, S. Loru, A. C. Ruggeri, C. Trigilio, G. Umana, F. Vitello

    Abstract: New advancements in radio data post-processing are underway within the SKA precursor community, aiming to facilitate the extraction of scientific results from survey images through a semi-automated approach. Several of these developments leverage deep learning (DL) methodologies for diverse tasks, including source detection, object or morphology classification, and anomaly detection. Despite subst…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 21 pages, 16 figures

  16. arXiv:2404.16816  [pdf, other]

    cs.CL

    IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages

    Authors: Harman Singh, Nitish Gupta, Shikhar Bharadwaj, Dinesh Tewari, Partha Talukdar

    Abstract: As large language models (LLMs) see increasing adoption across the globe, it is imperative for LLMs to be representative of the linguistic diversity of the world. India is a linguistically diverse country of 1.4 Billion people. To facilitate research on multilingual LLM evaluation, we release IndicGenBench - the largest benchmark for evaluating LLMs on user-facing generation tasks across a diverse…

    Submitted 25 April, 2024; originally announced April 2024.

  17. arXiv:2404.10136  [pdf, other]

    cs.CL cs.AI cs.LG

    Language Model Cascades: Token-level uncertainty and beyond

    Authors: Neha Gupta, Harikrishna Narasimhan, Wittawat Jitkrittum, Ankit Singh Rawat, Aditya Krishna Menon, Sanjiv Kumar

    Abstract: Recent advances in language models (LMs) have led to significant improvements in quality on complex NLP tasks, but at the expense of increased inference costs. Cascading offers a simple strategy to achieve more favorable cost-quality tradeoffs: here, a small model is invoked for most "easy" instances, while a few "hard" instances are deferred to the large model. While the principles underpinning c…

    Submitted 15 April, 2024; originally announced April 2024.

  18. arXiv:2404.09522  [pdf, other]

    astro-ph.GA astro-ph.CO

    The Physalis system: Discovery of ORC-like radio shells around a massive pair of interacting early-type galaxies with offset X-ray emission

    Authors: Bärbel S. Koribalski, Ildar Khabibullin, Klaus Dolag, Eugene Churazov, Ray P. Norris, Ettore Carretti, Andrew M. Hopkins, Tessa Vernstrom, Stanislav S. Shabala, Nikhel Gupta

    Abstract: We present the discovery of large radio shells around a massive pair of interacting galaxies and extended diffuse X-ray emission within the shells. The radio data were obtained with the Australian Square Kilometre Array Pathfinder (ASKAP) in two frequency bands centred at 944 MHz and 1.4 GHz, respectively, while the X-ray data are from the XMM-Newton observatory. The host galaxy pair, which consis…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 12 pages, 8 figures, submitted to MNRAS

  19. arXiv:2404.05872  [pdf, other]

    cs.CV cs.LG cs.NE

    TabConv: Low-Computation CNN Inference via Table Lookups

    Authors: Neelesh Gupta, Narayanan Kannan, Pengmiao Zhang, Viktor Prasanna

    Abstract: Convolutional Neural Networks (CNNs) have demonstrated remarkable ability throughout the field of computer vision. However, CNN inference requires a large number of arithmetic operations, making them expensive to deploy in hardware. Current approaches alleviate this issue by developing hardware-supported, algorithmic processes to simplify spatial convolution functions. However, these methods still…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 8 pages, Accepted at CF '24

    ACM Class: I.5.1

  20. arXiv:2404.00665  [pdf, ps, other]

    cs.IT

    On cumulative and relative cumulative past information generating function

    Authors: Santosh Kumar Chaudhary, Nitin Gupta, Achintya Roy

    Abstract: In this paper, we introduce the cumulative past information generating function (CPIG) and relative cumulative past information generating function (RCPIG). We study its properties. We establish its relation with generalized cumulative past entropy (GCPE). We defined CPIG stochastic order and its relation with dispersive order. We provide the results for the CPIG measure of the convoluted random v…

    Submitted 22 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  21. arXiv:2403.20327  [pdf, other]

    cs.CL cs.AI

    Gecko: Versatile Text Embeddings Distilled from Large Language Models

    Authors: Jinhyuk Lee, Zhuyun Dai, Xiaoqi Ren, Blair Chen, Daniel Cer, Jeremy R. Cole, Kai Hui, Michael Boratko, Rajvi Kapadia, Wen Ding, Yi Luan, Sai Meher Karthik Duddu, Gustavo Hernandez Abrego, Weiqiang Shi, Nithi Gupta, Aditya Kusupati, Prateek Jain, Siddhartha Reddy Jonnalagadda, Ming-Wei Chang, Iftekhar Naim

    Abstract: We present Gecko, a compact and versatile text embedding model. Gecko achieves strong retrieval performance by leveraging a key idea: distilling knowledge from large language models (LLMs) into a retriever. Our two-step distillation process begins with generating diverse, synthetic paired data using an LLM. Next, we further refine the data quality by retrieving a set of candidate passages for each…

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 18 pages

  22. arXiv:2403.17558  [pdf, ps, other]

    math.CT math.CO

    Neural category

    Authors: Neha Gupta, Suhith K N

    Abstract: A neural code on $n$ neurons is a collection of subsets of the set $[n]=\{1,2,\dots,n\}$. Curto et al. associated a ring $\mathcal{R}_{\mathcal{C}}$ (neural ring) to a neural code $\mathcal{C}$. A special class of ring homomorphisms between two neural rings, called neural ring homomorphism, was introduced by Curto and Youngs. The main work in this…

    Submitted 26 March, 2024; originally announced March 2024.

    MSC Class: 52A37; 92B99; 18A99

  23. arXiv:2403.17548  [pdf, other]

    math.CO

    Properties of graphs of neural codes

    Authors: Suhith K N, Neha Gupta

    Abstract: A neural code on $n$ neurons is a collection of subsets of the set $[n]=\{1,2,\dots,n\}$. In this paper, we study some properties of graphs of neural codes. In particular, we study codeword containment graph (CCG) given by Chan et al. (SIAM J. on Dis. Math., 37(1):114-145, 2017) and general relationship graph (GRG) given by Gross et al. (Adv. in App. Math., 95:65-95, 2018). We provide a suffici…

    Submitted 26 March, 2024; originally announced March 2024.

    MSC Class: 52A37; 92B99; 05C40; 05C99

  24. arXiv:2403.17397  [pdf, ps, other]

    math.AG math.AC

    On Abhyankar-Sathaye Conjecture for a family of linear hypersurfaces in $\mathbb{A}_{k}^4$

    Authors: Parnashree Ghosh, Neena Gupta, Ananya Pal

    Abstract: Let $k$ be a field. In this paper we study the Abhyankar-Sathaye Epimorphism Conjecture for certain hyperplanes in $\mathbb{A}_k^4$ defined by a polynomial of the form $a(X)Y-F(X,Z,T)$. When $k=\mathbb{C}$, Kaliman, Vénéreau and Zaidenberg have proved that such hyperplanes are rectifiable in $\mathbb{A}^4_{\mathbb{C}}$. We extend their result over any field of characteristic zero and when $k$ is a field of arbitrary characteri…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Comments are welcome!

    MSC Class: Primary: 14R10; Secondary: 13B25; 13A50; 13A02

  25. arXiv:2403.14235  [pdf, other]

    astro-ph.GA astro-ph.CO astro-ph.IM cs.CV cs.LG

    RG-CAT: Detection Pipeline and Catalogue of Radio Galaxies in the EMU Pilot Survey

    Authors: Nikhel Gupta, Ray P. Norris, Zeeshan Hayder, Minh Huynh, Lars Petersson, X. Rosalind Wang, Andrew M. Hopkins, Heinz Andernach, Yjan Gordon, Simone Riggi, Miranda Yew, Evan J. Crawford, Bärbel Koribalski, Miroslav D. Filipović, Anna D. Kapińska, Stanislav Shabala, Tessa Vernstrom, Joshua R. Marvil

    Abstract: We present source detection and catalogue construction pipelines to build the first catalogue of radio galaxies from the 270 $\rm deg^2$ pilot survey of the Evolutionary Map of the Universe (EMU-PS) conducted with the Australian Square Kilometre Array Pathfinder (ASKAP) telescope. The detection pipeline uses Gal-DINO computer-vision networks (Gupta et al., 2024) to predict the categories of radio…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted for publication in PASA. The paper has 22 pages, 12 figures and 5 tables

  26. arXiv:2403.07534  [pdf, ps, other]

    math.NT math.CO

    Frobenius numbers associated with Diophantine triples of $x^2+y^2=z^r$ (extended version)

    Authors: Takao Komatsu, Neha Gupta, Manoj Upreti

    Abstract: We give an explicit formula for the $p$-Frobenius number of triples associated with Diophantine equations $x^2+y^2=z^r$, that is, the largest positive integer that can only be represented in $p$ ways by combining the three integers of the solutions of Diophantine equations $x^2+y^2=z^r$. When $r=2$, the Frobenius number has already been given.

    Submitted 12 March, 2024; originally announced March 2024.

  27. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  28. arXiv:2403.03004  [pdf, other]

    astro-ph.CO gr-qc hep-ph

    Ultralight vector dark matter search using data from the KAGRA O3GK run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, et al. (1778 additional authors not shown)

    Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 20 pages, 5 figures

    Report number: LIGO-P2300250

  29. arXiv:2403.02673  [pdf, ps, other]

    math.ST

    On General Weighted Extropy of Extreme Ranked Set Sampling

    Authors: Pradeep Kumar Sahu, Nitin Gupta

    Abstract: The extropy measure, introduced by Lad, Sanfilippo, and Agro in their (2015) paper in Statistical Science, has garnered significant interest over the past years. In this study, we present a novel representation for the weighted extropy within the context of extreme ranked set sampling. Additionally, we offer related findings such as stochastic orders, characterizations, and precise bounds. Our res…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:2207.02003

  30. arXiv:2403.02337  [pdf, other]

    astro-ph.CO

    First Constraints on the Epoch of Reionization Using the non-Gaussianity of the Kinematic Sunyaev-Zel'dovich Effect from the South Pole Telescope and Herschel-SPIRE Observations

    Authors: S. Raghunathan, P. A. R. Ade, A. J. Anderson, B. Ansarinejad, M. Archipley, J. E. Austermann, L. Balkenhol, J. A. Beall, K. Benabed, A. N. Bender, B. A. Benson, F. Bianchini, L. E. Bleem, J. Bock, F. R. Bouchet, L. Bryant, E. Camphuis, J. E. Carlstrom, T. W. Cecil, C. L. Chang, P. Chaubal, H. C. Chiang, P. M. Chichura, T. -L. Chou, R. Citron, et al. (97 additional authors not shown)

    Abstract: We report results from an analysis aimed at detecting the trispectrum of the kinematic Sunyaev-Zel'dovich (kSZ) effect by combining data from the South Pole Telescope (SPT) and Herschel-SPIRE experiments over a 100 ${\rm deg}^{2}$ field. The SPT observations combine data from the previous and current surveys, namely SPTpol and SPT-3G, to achieve depths of 4.5, 3, and 16 $μ{\rm K-arcmin}$ i…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 15 pages, 5 figures (3 in main text and 2 in Appendix); To be submitted to PRL; Comments welcome; Data products and plotting scripts can be downloaded from https://github.com/sriniraghunathan/kSZ_4pt_SPT_SPIRE

  31. arXiv:2402.15329  [pdf, ps, other]

    math.AG

    Iterations of the functor of naive $\mathbb A^1$-connected components of varieties

    Authors: Nidhi Gupta

    Abstract: For any sheaf of sets $\mathcal F$ on $Sm/k$, it is well known that the universal $\mathbb A^1$-invariant quotient of $\mathcal F$ is given as the colimit of sheaves $\mathcal S^n(\mathcal F)$ where $\mathcal S(\mathcal F)$ is the sheaf of naive $\mathbb A^1$-connected components of $\mathcal F$. We show that these infinite iterations of naive $\mathbb A^1$-connected components in the construction of unive…

    Submitted 14 March, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: 10 pages, comments are welcome, v2: added subsection 4.3

    MSC Class: 14F42

  32. PaCKD: Pattern-Clustered Knowledge Distillation for Compressing Memory Access Prediction Models

    Authors: Neelesh Gupta, Pengmiao Zhang, Rajgopal Kannan, Viktor Prasanna

    Abstract: Deep neural networks (DNNs) have proven to be effective models for accurate Memory Access Prediction (MAP), a critical task in mitigating memory latency through data prefetching. However, existing DNN-based MAP models suffer from challenges such as significant physical storage space and poor inference latency, primarily due to their large number of parameters. These limitations render them imp…

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: 6 pages, 2 figures, HPEC '23

    Journal ref: 2023 IEEE High Performance Extreme Computing Conference (HPEC), 2023, pp. 1-7

  33. arXiv:2402.12780  [pdf, other]

    cs.LG

    Byzantine-Robust Federated Learning: Impact of Client Subsampling and Local Updates

    Authors: Youssef Allouah, Sadegh Farhadkhani, Rachid Guerraoui, Nirupam Gupta, Rafael Pinot, Geovani Rizk, Sasha Voitovych

    Abstract: The possibility of adversarial (a.k.a., Byzantine) clients makes federated learning (FL) prone to arbitrary manipulation. The natural approach to robustify FL against adversarial clients is to replace the simple averaging operation at the server in the standard $\mathsf{FedAvg}$ algorithm by a robust averaging rule. While a significant amount of work has been devoted to studying the c…

    Submitted 10 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  34. arXiv:2402.07411  [pdf, other]

    cs.LG

    Potential-Based Reward Shaping For Intrinsic Motivation

    Authors: Grant C. Forbes, Nitish Gupta, Leonardo Villalobos-Arias, Colin M. Potts, Arnav Jhala, David L. Roberts

    Abstract: Recently there has been a proliferation of intrinsic motivation (IM) reward-shaping methods to learn in complex and sparse-reward environments. These methods can often inadvertently change the set of optimal policies in an environment, leading to suboptimal behavior. Previous work on mitigating the risks of reward shaping, particularly through potential-based reward shaping (PBRS), has not been ap…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Extended version of paper appearing in AAMAS 2024

    ACM Class: I.2.6

  35. arXiv:2402.02945  [pdf, other]

    math.ST

    Stochastic ordering of extreme order statistics in Archimax copula

    Authors: Sarikul Islam, Nitin Gupta

    Abstract: An extension of the Archimax copula class to more than two random variables (multivariate) was introduced in (Jágr 2011) for describing dependency structures among random variables in higher dimension, and some properties of Archimax copula were explored in (Charpentier et al. 2014). In this article, some results for stochastic ordering of extreme order statistics in (Li and Fang 2015) are generaliz…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Multivariate Statistics, Stochastic ordering, 18 pages, 8 figures

    ACM Class: G.3

  36. arXiv:2402.00045  [pdf, other]

    cs.MM cs.AI cs.LG

    Detecting Multimedia Generated by Large AI Models: A Survey

    Authors: Li Lin, Neeraj Gupta, Yue Zhang, Hainan Ren, Chun-Hao Liu, Feng Ding, Xin Wang, Xin Li, Luisa Verdoliva, Shu Hu

    Abstract: The rapid advancement of Large AI Models (LAIMs), particularly diffusion models and large language models, has marked a new era where AI-generated multimedia is increasingly integrated into various aspects of daily life. Although beneficial in numerous fields, this content presents significant risks, including potential misuse, societal disruptions, and ethical concerns. Consequently, detecting mu…

    Submitted 7 February, 2024; v1 submitted 22 January, 2024; originally announced February 2024.

  37. arXiv:2401.13065  [pdf, ps, other]

    math.ST

    Extropy and Varextropy estimators with applications

    Authors: Santosh Kumar Chaudhary, Nitin Gupta

    Abstract: In many statistical studies, the measure of uncertainties like entropy, extropy, varentropy and varextropy of a distribution function is of prime interest. This paper proposes estimators of extropy and varextropy. Proposed estimators are consistent. Based on extropy estimator, a test of symmetry is given. The proposed test has the advantage that we do not need to estimate the centre of symmetry. T…

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: text overlap with arXiv:2209.06703

  38. Computational Reverse Engineering Analysis of Scattering Experiments Method for Interpretation of 2D Small-Angle Scattering Profiles (CREASE-2D)

    Authors: Sri Vishnuvardhan Reddy Akepati, Nitant Gupta, Arthi Jayaraman

    Abstract: Characterization of structural diversity within soft materials is key for engineering new materials for various applications. Small-angle scattering (SAS) is a widely used characterization technique that provides structural information in soft materials at varying length scales and typically outputs scattered intensity I(q) as a function of the scattered wavevector represented by its magnitude q a…

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 14 pages, 5 figures, supporting information included

  39. arXiv:2401.06362  [pdf, other]

    cs.NE cs.AR cs.LG cs.OS

    Attention, Distillation, and Tabularization: Towards Practical Neural Network-Based Prefetching

    Authors: Pengmiao Zhang, Neelesh Gupta, Rajgopal Kannan, Viktor K. Prasanna

    Abstract: Attention-based Neural Networks (NN) have demonstrated their effectiveness in accurate memory access prediction, an essential step in data prefetching. However, the substantial computational overheads associated with these models result in high inference latency, limiting their feasibility as practical prefetchers. To close the gap, we propose a new approach based on tabularization that significan…

    Submitted 21 February, 2024; v1 submitted 23 December, 2023; originally announced January 2024.

  40. arXiv:2401.02412  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV

    LLM Augmented LLMs: Expanding Capabilities through Composition

    Authors: Rachit Bansal, Bidisha Samanta, Siddharth Dalmia, Nitish Gupta, Shikhar Vashishth, Sriram Ganapathy, Abhishek Bapna, Prateek Jain, Partha Talukdar

    Abstract: Foundational models with billions of parameters which have been trained on large corpora of data have demonstrated non-trivial skills in a variety of domains. However, due to their monolithic structure, it is challenging and expensive to augment them or impart new skills. On the other hand, due to their adaptation abilities, several new instances of these models are being trained towards new domai…

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: 17 pages, 2 figures, 8 tables

  41. arXiv:2401.02075  [pdf, other]

    astro-ph.CO

    SPT Clusters with DES and HST Weak Lensing. II. Cosmological Constraints from the Abundance of Massive Halos

    Authors: S. Bocquet, S. Grandis, L. E. Bleem, M. Klein, J. J. Mohr, T. Schrabback, T. M. C. Abbott, P. A. R. Ade, M. Aguena, A. Alarcon, S. Allam, S. W. Allen, O. Alves, A. Amon, A. J. Anderson, J. Annis, B. Ansarinejad, J. E. Austermann, S. Avila, D. Bacon, M. Bayliss, J. A. Beall, K. Bechtol, M. R. Becker, A. N. Bender, et al. (171 additional authors not shown)

    Abstract: We present cosmological constraints from the abundance of galaxy clusters selected via the thermal Sunyaev-Zel'dovich (SZ) effect in South Pole Telescope (SPT) data with a simultaneous mass calibration using weak gravitational lensing data from the Dark Energy Survey (DES) and the Hubble Space Telescope (HST). The cluster sample is constructed from the combined SPT-SZ, SPTpol ECS, and SPTpol 500d…

    Submitted 21 June, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

    Comments: Accepted for publication in Phys. Rev. D. arXiv v2 corresponds to published article

  42. arXiv:2312.15790  [pdf, other]

    hep-th quant-ph

    Complexity and Operator Growth for Quantum Systems in Dynamic Equilibrium

    Authors: Cameron Beetar, Nitin Gupta, S. Shajidul Haque, Jeff Murugan, Hendrik J R Van Zyl

    Abstract: Krylov complexity is a measure of operator growth in quantum systems, based on the number of orthogonal basis vectors needed to approximate the time evolution of an operator. In this paper, we study the Krylov complexity of a $\mathsf{PT}$-symmetric system of oscillators, which exhibits two phase transitions that separate a dissipative state, a Rabi-oscillation state, and an ultra-strongly coupled…

    Submitted 25 December, 2023; originally announced December 2023.

    Comments: 24 + 4 pages and appendices

  43. arXiv:2312.07343  [pdf, ps, other]

    cs.HC cs.AI

    Can ChatGPT Play the Role of a Teaching Assistant in an Introductory Programming Course?

    Authors: Anishka, Atharva Mehta, Nipun Gupta, Aarav Balachandran, Dhruv Kumar, Pankaj Jalote

    Abstract: The emergence of Large language models (LLMs) is expected to have a major impact on education. This paper explores the potential of using ChatGPT, an LLM, as a virtual Teaching Assistant (TA) in an Introductory Programming Course. We evaluate ChatGPT's capabilities by comparing its performance with that of human TAs in some of the important TA functions. The TA functions which we focus on include…

    Submitted 22 January, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Under review

  44. arXiv:2312.06728  [pdf, other]

    cs.CV astro-ph.CO astro-ph.GA astro-ph.IM

    A Multimodal Dataset and Benchmark for Radio Galaxy and Infrared Host Detection

    Authors: Nikhel Gupta, Zeeshan Hayder, Ray P. Norris, Minh Huynh, Lars Petersson

    Abstract: We present a novel multimodal dataset developed by expert astronomers to automate the detection and localisation of multi-component extended radio galaxies and their corresponding infrared hosts. The dataset comprises 4,155 instances of galaxies in 2,800 images with both radio and infrared modalities. Each instance contains information on the extended radio galaxy class, its corresponding bounding…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted at the NeurIPS 2023 ML4PS workshop (https://nips.cc/). The full version, accepted in PASA, is available at https://doi.org/10.1017/pasa.2023.64

  45. arXiv:2312.05456  [pdf, other]

    cs.LG physics.soc-ph q-bio.PE

    On the calibration of compartmental epidemiological models

    Authors: Nikunj Gupta, Anh Mai, Azza Abouzied, Dennis Shasha

    Abstract: Epidemiological compartmental models are useful for understanding infectious disease propagation and directing public health policy decisions. Calibration of these models is an important step in offering accurate forecasts of disease dynamics and the effectiveness of interventions. In this study, we present an overview of calibrating strategies that can be employed, including several optimization…

    Submitted 8 December, 2023; originally announced December 2023.

  46. arXiv:2312.00306  [pdf, other]

    astro-ph.IM astro-ph.CO astro-ph.GA cs.CV

    RadioGalaxyNET: Dataset and Novel Computer Vision Algorithms for the Detection of Extended Radio Galaxies and Infrared Hosts

    Authors: Nikhel Gupta, Zeeshan Hayder, Ray P. Norris, Minh Huynh, Lars Petersson

    Abstract: Creating radio galaxy catalogues from next-generation deep surveys requires automated identification of associated components of extended sources and their corresponding infrared hosts. In this paper, we introduce RadioGalaxyNET, a multimodal dataset, and a suite of novel computer vision algorithms designed to automate the detection and localization of multi-component extended radio galaxies and t…

    Submitted 30 November, 2023; originally announced December 2023.

    Comments: Accepted for publication in PASA. The paper has 17 pages, 6 figures, 5 tables

  47. arXiv:2311.07512  [pdf, other]

    astro-ph.CO astro-ph.GA

    Galaxy Clusters Discovered via the Thermal Sunyaev-Zel'dovich Effect in the 500-square-degree SPTpol Survey

    Authors: L. E. Bleem, M. Klein, T. M. C. Abbott, P. A. R. Ade, M. Aguena, O. Alves, A. J. Anderson, F. Andrade-Oliveira, B. Ansarinejad, M. Archipley, M. L. N. Ashby, J. E. Austermann, D. Bacon, J. A. Beall, A. N. Bender, B. A. Benson, F. Bianchini, S. Bocquet, D. Brooks, D. L. Burke, M. Calzadilla, J. E. Carlstrom, A. Carnero Rosell, J. Carretero, C. L. Chang, et al. (103 additional authors not shown)

    Abstract: We present a catalog of 689 galaxy cluster candidates detected at significance $ξ>4$ via their thermal Sunyaev-Zel'dovich (SZ) effect signature in 95 and 150 GHz data from the 500-square-degree SPTpol survey. We use optical and infrared data from the Dark Energy Camera and the Wide-field Infrared Survey Explorer (WISE) and Spitzer satellites, to confirm 544 of these candidates as clusters with…

    Submitted 8 February, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Matches version accepted by OJA. 19 pages + references, 14 figures, cluster candidate table provided in Appendix. Data products available at https://pole.uchicago.edu/public/data/sptpol_500d_clusters/index.html and an interactive sky server at https://skyviewer.ncsa.illinois.edu

    Journal ref: Open Journal of Astrophysics, Volume 7, 2024

  48. arXiv:2311.07481  [pdf, other]

    astro-ph.HE

    HESS J1809-193: Gamma-Ray Emission by Cosmic Rays from Past Explosion

    Authors: Sovan Boxi, Nayantara Gupta

    Abstract: The very high energy gamma-ray source HESS J1809-193 has been detected by the LHAASO and HAWC observatory beyond 100 TeV energy. It is an interesting candidate for exploring the underlying mechanisms of gamma-ray production due to the presence of supernova remnants, pulsar and molecular clouds close to it. We have considered the injection of the energetic cosmic rays from a past explosion, whose r…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 8 pages, 6 figures, Accepted in ApJ

  49. arXiv:2311.00336  [pdf, other]

    astro-ph.GA

    MALS discovery of a rare HI 21-cm absorber at $z\sim1.35$: origin of the absorbing gas in powerful AGN

    Authors: P. P. Deka, N. Gupta, H. W. Chen, S. D. Johnson, P. Noterdaeme, F. Combes, E. Boettcher, S. A. Balashev, K. L. Emig, G. I. G. Józsa, H. -R. Klöckner, J. -K. Krogager, E. Momjian, P. Petitjean, G. C. Rudie, J. Wagenveld, F. S. Zahedy

    Abstract: We report a new, rare detection of HI 21-cm absorption associated with a quasar (only six known at $1<z<2$) here towards J2339-5523 at $z_{em}$ = 1.3531, discovered through the MeerKAT Absorption Line Survey (MALS). The absorption profile is broad ($\sim 400$ km/s), and the peak is redshifted by $\sim 200$ km/s, from $z_{em}$. Interestingly, optical/FUV spectra of the quasar from Magellan-MIKE/HST…

    Submitted 19 February, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: 10 pages, 8 figures, accepted for publication in A&A

  50. arXiv:2310.17204  [pdf, other]

    astro-ph.GA

    Cold molecules in HI 21cm absorbers across redshifts 0.1-4

    Authors: Francoise Combes, Neeraj Gupta

    Abstract: Absorption lines at high redshift in front of quasars are rare in the mm domain. Only five associated and five intervening systems have been reported in the literature. These bring very useful information complementary to emission lines, for instance, to distinguish between inflows and outflows. They are also good candidates to study the variations of the fundamental constants. We report here the…

    Submitted 16 December, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: 14 pages, 10 figures, accepted in A&A