Skip to main content

Showing 1–50 of 379 results for author: Agrawal, A

  1. arXiv:2407.07840  [pdf, other

    cs.CV cs.CL

    Decompose and Compare Consistency: Measuring VLMs' Answer Reliability via Task-Decomposition Consistency Comparison

    Authors: Qian Yang, Weixiang Yan, Aishwarya Agrawal

    Abstract: Despite tremendous advancements, current state-of-the-art Vision-Language Models (VLMs) are still far from perfect. They tend to hallucinate and may generate biased responses. In such circumstances, having a way to assess the reliability of a given response generated by a VLM is quite useful. Existing methods, such as estimating uncertainty using answer likelihoods or prompt-based confidence gener… ▽ More

    Submitted 11 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: Preprint

  2. arXiv:2407.07000  [pdf, other

    cs.LG cs.AI cs.CL cs.DC

    Metron: Holistic Performance Evaluation Framework for LLM Inference Systems

    Authors: Amey Agrawal, Anmol Agarwal, Nitin Kedia, Jayashree Mohan, Souvik Kundu, Nipun Kwatra, Ramachandran Ramjee, Alexey Tumanov

    Abstract: Serving large language models (LLMs) in production can incur substantial costs, which has prompted recent advances in inference system optimizations. Today, these systems are evaluated against conventional latency and throughput metrics (eg. TTFT, TBT, Normalised Latency and TPOT). However, these metrics fail to fully capture the nuances of LLM inference, leading to an incomplete assessment of use… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  3. arXiv:2407.06167  [pdf, other

    cs.CV cs.LG

    DεpS: Delayed ε-Shrinking for Faster Once-For-All Training

    Authors: Aditya Annavajjala, Alind Khare, Animesh Agrawal, Igor Fedorov, Hugo Latapie, Myungjin Lee, Alexey Tumanov

    Abstract: CNNs are increasingly deployed across different hardware, dynamic environments, and low-power embedded devices. This has led to the design and training of CNN architectures with the goal of maximizing accuracy subject to such variable deployment constraints. As the number of deployment scenarios grows, there is a need to find scalable solutions to design and train specialized CNNs. Once-for-all tr… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted to the 18th European Conference on Computer Vision (ECCV 2024)

  4. arXiv:2407.05618  [pdf, other

    nucl-ex hep-ex

    Improved limit on neutrinoless double beta decay of \mohundred~from AMoRE-I

    Authors: A. Agrawal, V. V. Alenkov, P. Aryal, J. Beyer, B. Bhandari, R. S. Boiko, K. Boonin, O. Buzanov, C. R. Byeon, N. Chanthima, M. K. Cheoun, J. S. Choe, Seonho Choi, S. Choudhury, J. S. Chung, F. A. Danevich, M. Djamal, D. Drung, C. Enss, A. Fleischmann, A. M. Gangapshev, L. Gastaldo, Y. M. Gavrilyuk, A. M. Gezhaev, O. Gileva , et al. (83 additional authors not shown)

    Abstract: AMoRE searches for the signature of neutrinoless double beta decay of $^{100}$Mo with a 100 kg sample of enriched $^{100}$Mo. Scintillating molybdate crystals coupled with a metallic magnetic calorimeter operate at milli-Kelvin temperatures to measure the energy of electrons emitted in the decay. As a demonstration of the full-scale AMoRE, we conducted AMoRE-I, a pre-experiment with 18 molybdate c… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 7 pages, 4 figures

  5. arXiv:2407.00835  [pdf, other

    quant-ph

    Distributed Quantum Computing across an Optical Network Link

    Authors: D. Main, P. Drmota, D. P. Nadlinger, E. M. Ainley, A. Agrawal, B. C. Nichol, R. Srinivas, G. Araneda, D. M. Lucas

    Abstract: Distributed quantum computing (DQC) combines the computing power of multiple networked quantum processing modules, enabling the execution of large quantum circuits without compromising on performance and connectivity. Photonic networks are well-suited as a versatile and reconfigurable interconnect layer for DQC; remote entanglement shared between matter qubits across the network enables all-to-all… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 16 pages, 7 figures, 2 tables

  6. arXiv:2407.00548  [pdf, other

    cs.RO

    KOROL: Learning Visualizable Object Feature with Koopman Operator Rollout for Manipulation

    Authors: Hongyi Chen, Abulikemu Abuduweili, Aviral Agrawal, Yunhai Han, Harish Ravichandar, Changliu Liu, Jeffrey Ichnowski

    Abstract: Learning dexterous manipulation skills presents significant challenges due to complex nonlinear dynamics that underlie the interactions between objects and multi-fingered hands. Koopman operators have emerged as a robust method for modeling such nonlinear dynamics within a linear framework. However, current methods rely on runtime access to ground-truth (GT) object states, making them unsuitable f… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  7. arXiv:2406.17037  [pdf, other

    quant-ph

    Cheaper and more noise-resilient quantum state preparation using eigenvector continuation

    Authors: Anjali A. Agrawal, Akhil Francis, A. F. Kemper

    Abstract: Subspace methods are powerful, noise-resilient methods that can effectively prepare ground states on quantum computers. The challenge is to get a subspace with a small condition number that spans the states of interest using minimal quantum resources. In this work, we will use eigenvector continuation (EC) to build a subspace from the low-lying states of a set of Hamiltonians. The basis vectors ar… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 17 pages, 18 figures

  8. arXiv:2406.10266  [pdf

    cs.CL cs.SI

    COVID-19 Twitter Sentiment Classification Using Hybrid Deep Learning Model Based on Grid Search Methodology

    Authors: Jitendra Tembhurne, Anant Agrawal, Kirtan Lakhotia

    Abstract: In the contemporary era, social media platforms amass an extensive volume of social data contributed by their users. In order to promptly grasp the opinions and emotional inclinations of individuals regarding a product or event, it becomes imperative to perform sentiment analysis on the user-generated content. Microblog comments often encompass both lengthy and concise text entries, presenting a c… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 14 pages, 6 figures, 11 tables

  9. arXiv:2406.09998  [pdf, other

    eess.AS cs.AI cs.LG cs.MM cs.SD

    Understanding Pedestrian Movement Using Urban Sensing Technologies: The Promise of Audio-based Sensors

    Authors: Chaeyeon Han, Pavan Seshadri, Yiwei Ding, Noah Posner, Bon Woo Koo, Animesh Agrawal, Alexander Lerch, Subhrajit Guhathakurta

    Abstract: While various sensors have been deployed to monitor vehicular flows, sensing pedestrian movement is still nascent. Yet walking is a significant mode of travel in many cities, especially those in Europe, Africa, and Asia. Understanding pedestrian volumes and flows is essential for designing safer and more attractive pedestrian infrastructure and for controlling periodic overcrowding. This study dis… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: submitted to Urban Informatics

  10. arXiv:2406.09698  [pdf, other

    physics.ins-det hep-ex

    Projected background and sensitivity of AMoRE-II

    Authors: A. Agrawal, V. V. Alenkov, P. Aryal, J. Beyer, B. Bhandari, R. S. Boiko, K. Boonin, O. Buzanov, C. R. Byeon, N. Chanthima, M. K. Cheoun, J. S. Choe, Seonho Choi, S. Choudhury, J. S. Chung, F. A. Danevich, M. Djamal, D. Drung, C. Enss, A. Fleischmann, A. M. Gangapshev, L. Gastaldo, Y. M. Gavrilyuk, A. M. Gezhaev, O. Gileva , et al. (81 additional authors not shown)

    Abstract: AMoRE-II aims to search for neutrinoless double beta decay with an array of 423 Li$_2$$^{100}$MoO$_4$ crystals operating in the cryogenic system as the main phase of the Advanced Molybdenum-based Rare process Experiment (AMoRE). AMoRE has been planned to operate in three phases: AMoRE-pilot, AMoRE-I, and AMoRE-II. AMoRE-II is currently being installed at the Yemi Underground Laboratory, located ap… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  11. arXiv:2406.08412  [pdf, other

    quant-ph

    Experimental Quantum Advantage in the Odd-Cycle Game

    Authors: P. Drmota, D. Main, E. M. Ainley, A. Agrawal, G. Araneda, D. P. Nadlinger, B. C. Nichol, R. Srinivas, A. Cabello, D. M. Lucas

    Abstract: We report the first experimental demonstration of the odd-cycle game. We entangle two ions separated by ~2 m and the players use them to win the odd-cycle game significantly more often than the best classical strategy allows. The experiment implements the optimal quantum strategy, is free of the detection loophole (the detection efficiency is >~99.999 %), and achieves 97.8(3) % of the theoretical… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  12. arXiv:2406.06511  [pdf, other

    quant-ph cond-mat.str-el

    Quantifying fault tolerant simulation of strongly correlated systems using the Fermi-Hubbard model

    Authors: Anjali A. Agrawal, Joshua Job, Tyler L. Wilson, S. N. Saadatmand, Mark J. Hodson, Josh Y. Mutus, Athena Caesura, Peter D. Johnson, Justin E. Elenewski, Kaitlyn J. Morrell, Alexander F. Kemper

    Abstract: Understanding the physics of strongly correlated materials is one of the grand challenge problems for physics today. A large class of scientifically interesting materials, from high-$T_c$ superconductors to spin liquids, involve medium to strong correlations, and building a holistic understanding of these materials is critical. Doing so is hindered by the competition between the kinetic energy and… ▽ More

    Submitted 13 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  13. arXiv:2406.06079  [pdf, other

    cs.CV

    Latent Representation Matters: Human-like Sketches in One-shot Drawing Tasks

    Authors: Victor Boutin, Rishav Mukherji, Aditya Agrawal, Sabine Muzellec, Thomas Fel, Thomas Serre, Rufin VanRullen

    Abstract: Humans can effortlessly draw new categories from a single exemplar, a feat that has long posed a challenge for generative models. However, this gap has started to close with recent advances in diffusion models. This one-shot drawing task requires powerful inductive biases that have not been systematically investigated. Here, we study how different inductive biases shape the latent space of Latent… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  14. arXiv:2406.01241  [pdf, ps, other

    gr-qc

    Black Holes and Wormholes Beyond Classical General Relativity

    Authors: A. S. Agrawal, Sergio Zerbini, B. Mishra

    Abstract: In the paper, only Static Spherically Symmetric space-times in four dimensions are considered within modified gravity models. The non-singular static metrics, including black holes not admitting a de Sitter core in the center and traversable wormholes, are reconsidered within a class of higher-order $F(R)$, satisfying the constraints $F(0)=\frac{dF}{dR}(0)=0$. Furthermore, by making use of the so-… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  15. arXiv:2405.19747  [pdf, other

    cs.LG stat.ML

    Understanding and mitigating difficulties in posterior predictive evaluation

    Authors: Abhinav Agrawal, Justin Domke

    Abstract: Predictive posterior densities (PPDs) are of interest in approximate Bayesian inference. Typically, these are estimated by simple Monte Carlo (MC) averages using samples from the approximate posterior. We observe that the signal-to-noise ratio (SNR) of such estimators can be extremely low. An analysis for exact inference reveals SNR decays exponentially as there is an increase in (a) the mismatch… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  16. arXiv:2405.17247  [pdf, other

    cs.LG

    An Introduction to Vision-Language Modeling

    Authors: Florian Bordes, Richard Yuanzhe Pang, Anurag Ajay, Alexander C. Li, Adrien Bardes, Suzanne Petryk, Oscar Mañas, Zhiqiu Lin, Anas Mahmoud, Bargav Jayaraman, Mark Ibrahim, Melissa Hall, Yunyang Xiong, Jonathan Lebensold, Candace Ross, Srihari Jayakumar, Chuan Guo, Diane Bouchacourt, Haider Al-Tahan, Karthik Padthe, Vasu Sharma, Hu Xu, Xiaoqing Ellen Tan, Megan Richards, Samuel Lavoie , et al. (16 additional authors not shown)

    Abstract: Following the recent popularity of Large Language Models (LLMs), several attempts have been made to extend them to the visual domain. From having a visual assistant that could guide us through unfamiliar environments to generative models that produce images using only a high-level text description, the vision-language model (VLM) applications will significantly impact our relationship with technol… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  17. arXiv:2405.13938  [pdf, other

    cs.LG cs.AI cs.AR math.NA

    eXmY: A Data Type and Technique for Arbitrary Bit Precision Quantization

    Authors: Aditya Agrawal, Matthew Hedlund, Blake Hechtman

    Abstract: eXmY is a novel data type for quantization of ML models. It supports both arbitrary bit widths and arbitrary integer and floating point formats. For example, it seamlessly supports 3, 5, 6, 7, 9 bit formats. For a specific bit width, say 7, it defines all possible formats e.g. e0m6, e1m5, e2m4, e3m3, e4m2, e5m1 and e6m0. For non-power of two bit widths e.g. 5, 6, 7, we created a novel encoding and… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  18. arXiv:2405.12993  [pdf

    q-fin.ST

    A novel portfolio construction strategy based on the core-periphery profile of stocks

    Authors: Imran Ansari, Charu Sharma, Akshay Agrawal, Niteesh Sahni

    Abstract: This paper highlights the significance of mesoscale structures, particularly the core-periphery structure, in financial networks for portfolio optimization. We build portfolios of stocks belonging to the periphery part of the Planar maximally filtered subgraphs of the underlying network of stocks created from Pearson correlations between pairs of stocks and compare its performance with some well-k… ▽ More

    Submitted 27 April, 2024; originally announced May 2024.

  19. arXiv:2405.12859  [pdf, other

    physics.optics cond-mat.mtrl-sci

    Defect-assisted reversible phase transition in mono- and few-layer ReS$_2$

    Authors: George Zograf, Andrew B. Yankovich, Betül Küçüköz, Abhay V. Agrawal, Alexander Yu. Polyakov, Joachim Ciers, Fredrik Eriksson, Åsa Haglund, Paul Erhart, Tomasz J. Antosiewicz, Eva Olsson, Timur O. Shegai

    Abstract: Transition metal dichalcogenide (TMD) materials have attracted substantial interest due to their remarkable excitonic, optical, electrical, and mechanical properties, which are highly dependent on their crystal structure. Controlling the crystal structure of these materials is essential for fine-tuning their performance, $\textit{e.g.}$, linear and nonlinear optical, as well as charge transport pr… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  20. arXiv:2405.05465  [pdf, other

    cs.LG cs.AI cs.CL

    Vidur: A Large-Scale Simulation Framework For LLM Inference

    Authors: Amey Agrawal, Nitin Kedia, Jayashree Mohan, Ashish Panwar, Nipun Kwatra, Bhargav Gulavani, Ramachandran Ramjee, Alexey Tumanov

    Abstract: Optimizing the deployment of Large language models (LLMs) is expensive today since it requires experimentally running an application workload against an LLM implementation while exploring large configuration space formed by system knobs such as parallelization strategies, batching techniques, and scheduling policies. To address this challenge, we present Vidur - a large-scale, high-fidelity, easil… ▽ More

    Submitted 21 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  21. arXiv:2405.01790  [pdf, other

    cs.CL cs.AI

    Understanding Position Bias Effects on Fairness in Social Multi-Document Summarization

    Authors: Olubusayo Olabisi, Ameeta Agrawal

    Abstract: Text summarization models have typically focused on optimizing aspects of quality such as fluency, relevance, and coherence, particularly in the context of news articles. However, summarization models are increasingly being used to summarize diverse sources of text, such as social media data, that encompass a wide demographic user base. It is thus crucial to assess not only the quality of the gene… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: Accepted at VarDial 2024

  22. arXiv:2404.19159  [pdf, other

    cs.CL

    What Drives Performance in Multilingual Language Models?

    Authors: Sina Bagheri Nezhad, Ameeta Agrawal

    Abstract: This study investigates the factors influencing the performance of multilingual large language models (MLLMs) across diverse languages. We study 6 MLLMs, including masked language models, autoregressive models, and instruction-tuned LLMs, on the SIB-200 dataset, a topic classification dataset encompassing 204 languages. Our analysis considers three scenarios: ALL languages, SEEN languages (present… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted at VarDial @ NAACL 2024

    ACM Class: I.2.7

  23. arXiv:2404.18090  [pdf, other

    cs.CR

    A Novel Classification of Attacks on Blockchain Layers: Vulnerabilities, Attacks, Mitigations, and Research Directions

    Authors: Kaustubh Dwivedi, Ankit Agrawal, Ashutosh Bhatia, Kamlesh Tiwari

    Abstract: The widespread adoption of blockchain technology has amplified the spectrum of potential threats to its integrity and security. The ongoing quest to exploit vulnerabilities emphasizes how critical it is to expand on current research initiatives. Thus, using a methodology based on discrete blockchain layers, our survey study aims to broaden the existing body of knowledge by thoroughly discussing bo… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  24. arXiv:2404.13530  [pdf, other

    cs.CV cs.CL cs.LG

    Listen Then See: Video Alignment with Speaker Attention

    Authors: Aviral Agrawal, Carlos Mateo Samudio Lezcano, Iqui Balam Heredia-Marin, Prabhdeep Singh Sethi

    Abstract: Video-based Question Answering (Video QA) is a challenging task and becomes even more intricate when addressing Socially Intelligent Question Answering (SIQA). SIQA requires context understanding, temporal reasoning, and the integration of multimodal information, but in addition, it requires processing nuanced human behavior. Furthermore, the complexities involved are exacerbated by the dominance… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  25. arXiv:2404.13210  [pdf, other

    physics.atom-ph quant-ph

    Laser cooling $^{88}$Sr to microkelvin temperature with an integrated-photonics system

    Authors: Andrew R. Ferdinand, Zheng Luo, Sindhu Jammi, Zachary Newman, Grisha Spektor, Okan Koksal, Parth B. Patel, Daniel Sheredy, William Lunden, Akash Rakholia, Travis C. Briles, Wenqi Zhu, Martin M. Boyd, Amit Agrawal, Scott B. Papp

    Abstract: We report on experiments generating a magneto-optical trap (MOT) of 88-strontium ($^{88}$Sr) atoms at microkelvin temperature, using integrated-photonics devices. With metasurface optics integrated on a fused-silica substrate, we generate six-beam, circularly polarized, counter-propagating MOTs on the blue broad-line, 461 nm, and red narrow-line, 689 nm, Sr cooling transitions without bulk optics.… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 5 pages, 3 figures

  26. arXiv:2404.12241  [pdf, other

    cs.CL cs.AI

    Introducing v0.5 of the AI Safety Benchmark from MLCommons

    Authors: Bertie Vidgen, Adarsh Agrawal, Ahmed M. Ahmed, Victor Akinwande, Namir Al-Nuaimi, Najla Alfaraj, Elie Alhajjar, Lora Aroyo, Trupti Bavalatti, Max Bartolo, Borhane Blili-Hamelin, Kurt Bollacker, Rishi Bomassani, Marisa Ferrara Boston, Siméon Campos, Kal Chakra, Canyu Chen, Cody Coleman, Zacharie Delpierre Coudert, Leon Derczynski, Debojyoti Dutta, Ian Eisenberg, James Ezick, Heather Frase, Brian Fuller , et al. (75 additional authors not shown)

    Abstract: This paper introduces v0.5 of the AI Safety Benchmark, which has been created by the MLCommons AI Safety Working Group. The AI Safety Benchmark has been designed to assess the safety risks of AI systems that use chat-tuned language models. We introduce a principled approach to specifying and constructing the benchmark, which for v0.5 covers only a single use case (an adult chatting to a general-pu… ▽ More

    Submitted 13 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  27. arXiv:2404.09771  [pdf, other

    cs.CG

    Eliminating Crossings in Ordered Graphs

    Authors: Akanksha Agrawal, Sergio Cabello, Michael Kaufmann, Saket Saurabh, Roohani Sharma, Yushi Uno, Alexander Wolff

    Abstract: Drawing a graph in the plane with as few crossings as possible is one of the central problems in graph drawing and computational geometry. Another option is to remove the smallest number of vertices or edges such that the remaining graph can be drawn without crossings. We study both problems in a book-embedding setting for ordered graphs, that is, graphs with a fixed vertex order. In this setting,… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Appears in Proc. 19th Scandinavian Symposium on Algorithm Theory (SWAT 2024)

  28. arXiv:2404.08744  [pdf, other

    cs.NI cs.ET quant-ph

    Routing and Spectrum Allocation in Broadband Quantum Entanglement Distribution

    Authors: Rohan Bali, Ashley N. Tittelbaugh, Shelbi L. Jenkins, Anuj Agrawal, Jerry Horgan, Marco Ruffini, Daniel C. Kilper, Boulat A. Bash

    Abstract: We investigate resource allocation for quantum entanglement distribution over an optical network. We characterize and model a network architecture that employs a single quasi-deterministic time-frequency heralded Einstein-Podolsky-Rosen (EPR) pair source, and develop a routing scheme for distributing entangled photon pairs over such a network. We focus on max-min fairness in entanglement distribut… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2311.14613

  29. arXiv:2404.02139  [pdf, other

    astro-ph.CO

    Lensed Type Ia Supernova "Encore" at z=2: The First Instance of Two Multiply-Imaged Supernovae in the Same Host Galaxy

    Authors: J. D. R. Pierel, A. B. Newman, S. Dhawan, M. Gu, B. A. Joshi, T. Li, S. Schuldt, L. G. Strolger, S. H. Suyu, G. B. Caminha, S. H. Cohen, J. M. Diego, J. C. J. Dsilva, S. Ertl, B. L. Frye, G. Granata, C. Grillo, A. M. Koekemoer, J. Li, A. Robotham, J. Summers, T. Treu, R. A. Windhorst, A. Zitrin, S. Agarwal , et al. (38 additional authors not shown)

    Abstract: A bright ($m_{\rm F150W,AB}$=24 mag), $z=1.95$ supernova (SN) candidate was discovered in JWST/NIRCam imaging acquired on 2023 November 17. The SN is quintuply-imaged as a result of strong gravitational lensing by a foreground galaxy cluster, detected in three locations, and remarkably is the second lensed SN found in the same host galaxy. The previous lensed SN was called "Requiem", and therefore… ▽ More

    Submitted 3 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: Submitted to ApJL

  30. arXiv:2403.18183  [pdf, other

    cs.AI cs.IR

    Can AI Models Appreciate Document Aesthetics? An Exploration of Legibility and Layout Quality in Relation to Prediction Confidence

    Authors: Hsiu-Wei Yang, Abhinav Agrawal, Pavlos Fragkogiannis, Shubham Nitin Mulay

    Abstract: A well-designed document communicates not only through its words but also through its visual eloquence. Authors utilize aesthetic elements such as colors, fonts, graphics, and layouts to shape the perception of information. Thoughtful document design, informed by psychological insights, enhances both the visual appeal and the comprehension of the content. While state-of-the-art document AI models… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  31. arXiv:2403.18121  [pdf, other

    cs.CL cs.HC

    ChatGPT Role-play Dataset: Analysis of User Motives and Model Naturalness

    Authors: Yufei Tao, Ameeta Agrawal, Judit Dombi, Tetyana Sydorenko, Jung In Lee

    Abstract: Recent advances in interactive large language models like ChatGPT have revolutionized various domains; however, their behavior in natural and role-play conversation settings remains underexplored. In our study, we address this gap by deeply investigating how ChatGPT behaves during conversations in different settings by analyzing its interactions in both a normal way and a role-play setting. We int… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC-COLING 2024

  32. arXiv:2403.17804  [pdf, other

    cs.CV cs.CL

    Improving Text-to-Image Consistency via Automatic Prompt Optimization

    Authors: Oscar Mañas, Pietro Astolfi, Melissa Hall, Candace Ross, Jack Urbanek, Adina Williams, Aishwarya Agrawal, Adriana Romero-Soriano, Michal Drozdzal

    Abstract: Impressive advances in text-to-image (T2I) generative models have yielded a plethora of high performing models which are able to generate aesthetically appealing, photorealistic images. Despite the progress, these models still struggle to produce images that are consistent with the input prompt, oftentimes failing to capture object quantities, relations and attributes properly. Existing solutions… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  33. arXiv:2403.16287  [pdf, other

    cs.SE

    Coupled Requirements-driven Testing of CPS: From Simulation To Reality

    Authors: Ankit Agrawal, Philipp Zech, Michael Vierhauser

    Abstract: Failures in safety-critical Cyber-Physical Systems (CPS), both software and hardware-related, can lead to severe incidents impacting physical infrastructure or even harming humans. As a result, extensive simulations and field tests need to be conducted, as part of the verification and validation of system requirements, to ensure system safety. However, current simulation and field testing practice… ▽ More

    Submitted 21 April, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  34. arXiv:2403.14938  [pdf, ps, other

    cs.CL

    On Zero-Shot Counterspeech Generation by LLMs

    Authors: Punyajoy Saha, Aalok Agrawal, Abhik Jana, Chris Biemann, Animesh Mukherjee

    Abstract: With the emergence of numerous Large Language Models (LLM), the usage of such models in various Natural Language Processing (NLP) applications is increasing extensively. Counterspeech generation is one such key task where efforts are made to develop generative models by fine-tuning LLMs with hatespeech - counterspeech pairs, but none of these attempts explores the intrinsic properties of large lan… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 12 pages, 7 tables, accepted at LREC-COLING 2024

  35. arXiv:2403.14208  [pdf, other

    cs.CL

    Automatic Annotation of Grammaticality in Child-Caregiver Conversations

    Authors: Mitja Nikolaus, Abhishek Agrawal, Petros Kaklamanis, Alex Warstadt, Abdellah Fourtassi

    Abstract: The acquisition of grammar has been a central question to adjudicate between theories of language acquisition. In order to conduct faster, more reproducible, and larger-scale corpus studies on grammaticality in child-caregiver conversations, tools for automatic annotation can offer an effective alternative to tedious manual annotation. We propose a coding scheme for context-dependent grammaticalit… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Journal ref: LREC-Coling 2024, May 2024, Turin, Italy

  36. arXiv:2403.07118  [pdf, other

    cs.CL

    Narrating Causal Graphs with Large Language Models

    Authors: Atharva Phatak, Vijay K. Mago, Ameeta Agrawal, Aravind Inbasekaran, Philippe J. Giabbanelli

    Abstract: The use of generative AI to create text descriptions from graphs has mostly focused on knowledge graphs, which connect concepts using facts. In this work we explore the capability of large pretrained language models to generate text from causal graphs, where salient concepts are represented as nodes and causality is represented via directed, typed edges. The causal reasoning encoded in these graph… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: HICSS '24

    Report number: https://hdl.handle.net/10125/107290

    Journal ref: Proceedings of the 57th Hawaii International Conference on System Sciences 2024

  37. arXiv:2403.07089  [pdf, other

    q-bio.NC eess.SP

    Graph learning methods to extract empathy supporting regions in a naturalistic stimuli fMRI

    Authors: Sasanka GRS, Ayushi Agrawal, Santosh Nannuru, Kavita Vemuri

    Abstract: Functional MRI (fMRI) research, employing naturalistic stimuli like movies, explores brain network interactions in complex cognitive processes such as empathy. The empathy network encompasses multiple brain areas, including the Insula, PFC, ACC, and parietal regions. Our novel processing pipeline applies graph learning methods to whole-brain timeseries signals, incorporating high-pass filtering, v… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 9 figures, 2 tables

  38. arXiv:2403.06159  [pdf

    cs.CV q-bio.NC

    Cracking the neural code for word recognition in convolutional neural networks

    Authors: Aakash Agrawal, Stanislas Dehaene

    Abstract: Learning to read places a strong challenge on the visual system. Years of expertise lead to a remarkable capacity to separate highly similar letters and encode their relative positions, thus distinguishing words such as FORM and FROM, invariantly over a large range of sizes and absolute positions. How neural circuits achieve invariant word recognition remains unknown. Here, we address this issue b… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: 33 pages, 6 main figures, 4 supplementary figures

  39. Understanding how social discussion platforms like Reddit are influencing financial behavior

    Authors: Sachin Thukral, Suyash Sangwan, Arnab Chatterjee, Lipika Dey, Aaditya Agrawal, Pramit Kumar Chandra, Animesh Mukherjee

    Abstract: This study proposes content and interaction analysis techniques for a large repository created from social media content. Though we have presented our study for a large platform dedicated to discussions around financial topics, the proposed methods are generic and applicable to all platforms. Along with an extension of topic extraction method using Latent Dirichlet Allocation, we propose a few mea… ▽ More

    Submitted 12 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: 8 pages, 8 figures, 3 tables, and 1 algorithm; Published in WI-IAT 2022 (The 21st IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology)

    Journal ref: IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT) 2022 (pp. 612-619)

  40. arXiv:2403.02310  [pdf, other

    cs.LG cs.DC

    Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve

    Authors: Amey Agrawal, Nitin Kedia, Ashish Panwar, Jayashree Mohan, Nipun Kwatra, Bhargav S. Gulavani, Alexey Tumanov, Ramachandran Ramjee

    Abstract: Each LLM serving request goes through two phases. The first is prefill which processes the entire input prompt and produces the first output token and the second is decode which generates the rest of output tokens, one-at-a-time. Prefill iterations have high latency but saturate GPU compute due to parallel processing of the input prompt. In contrast, decode iterations have low latency but also low… ▽ More

    Submitted 17 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  41. Wormholes in the f(R,L,T) theory of gravity

    Authors: P. H. R. S. Moraes, A. S. Agrawal, B. Mishra

    Abstract: Morris and Thorne developed wormhole solutions in the late 1980s when they discovered a recipe that wormholes must follow for travelers to cross them safely. They describe exotic matter as satisfying $-p_{r} > ρ$, where $p_{r}$ is the radial pressure and $ρ$ is the energy density of the wormhole. This is a notable characteristic of the General Relativity Theory. The current article discusses trave… ▽ More

    Submitted 22 June, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: 5 Pages, 4 figures

    Journal ref: Physics Letter B, Volume 855, August 2024

  42. arXiv:2403.00286  [pdf, other

    quant-ph cond-mat.supr-con

    Niobium coaxial cavities with internal quality factors exceeding 1.5 billion for circuit quantum electrodynamics

    Authors: Andrew E. Oriani, Fang Zhao, Tanay Roy, Alexander Anferov, Kevin He, Ankur Agrawal, Riju Banerjee, Srivatsan Chakram, David I. Schuster

    Abstract: Group-V materials such as niobium and tantalum have become popular choices for extending the performance of circuit quantum electrodynamics (cQED) platforms allowing for quantum processors and memories with reduced error rates and more modes. The complex surface chemistry of niobium however makes identifying the main modes of decoherence difficult at millikelvin temperatures and single-photon powe… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 14 pages, 10 figures

  43. arXiv:2402.16714  [pdf, other

    quant-ph cs.AI cs.CL

    Quantum linear algebra is all you need for Transformer architectures

    Authors: Naixu Guo, Zhan Yu, Matthew Choi, Aman Agrawal, Kouhei Nakaji, Alán Aspuru-Guzik, Patrick Rebentrost

    Abstract: Generative machine learning methods such as large-language models are revolutionizing the creation of text and images. While these models are powerful they also harness a large amount of computational resources. The transformer is a key component in large language models that aims to generate a suitable completion of a given partial sequence. In this work, we investigate transformer architectures… ▽ More

    Submitted 30 May, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: 31 pages, 4 figures, 2 tables, comments are welcome

  44. arXiv:2402.16159  [pdf, other

    cs.CL

    DistALANER: Distantly Supervised Active Learning Augmented Named Entity Recognition in the Open Source Software Ecosystem

    Authors: Somnath Banerjee, Avik Dutta, Aaditya Agrawal, Rima Hazra, Animesh Mukherjee

    Abstract: With the AI revolution in place, the trend for building automated systems to support professionals in different domains such as the open source software systems, healthcare systems, banking systems, transportation systems and many others have become increasingly prominent. A crucial requirement in the automation of support tools for such systems is the early identification of named entities, which… ▽ More

    Submitted 20 June, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: Accepted at ECML-PKDD 2024 (Long Paper)

  45. arXiv:2402.11465  [pdf, other

    cs.DS

    Odd Cycle Transversal on $P_5$-free Graphs in Polynomial Time

    Authors: Akanksha Agrawal, Paloma T. Lima, Daniel Lokshtanov, Pawel Rzążewski, Saket Saurabh, Roohani Sharma

    Abstract: An independent set in a graph G is a set of pairwise non-adjacent vertices. A graph $G$ is bipartite if its vertex set can be partitioned into two independent sets. In the Odd Cycle Transversal problem, the input is a graph $G$ along with a weight function $w$ associating a rational weight with each vertex, and the task is to find a smallest weight vertex subset $S$ in $G$ such that $G - S$ is bip… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    MSC Class: 68Q25; 05C85 ACM Class: F.2

  46. arXiv:2402.08885  [pdf, other

    physics.optics physics.app-ph physics.atom-ph physics.ins-det

    Three-dimensional, multi-wavelength beam formation with integrated metasurface optics for Sr laser cooling

    Authors: Sindhu Jammi, Andrew R. Ferdinand, Zheng Luo, Zachary L. Newman, Gregory Spektor, Junyeob Song, Okan Koksal, Akash V. Rakholia, William Lunden, Daniel Sheredy, Parth B. Patel, Martin M. Boyd, Wenqi Zhu, Amit Agrawal, Travis C. Briles, Scott B. Papp

    Abstract: We demonstrate the formation of a complex, multi-wavelength, three-dimensional laser beam configuration with integrated metasurface optics. Our experiments support the development of a compact Sr optical-lattice clock, which leverages magneto-optical trapping on atomic transitions at 461 nm and 689 nm without bulk free-space optics. We integrate six, mm-scale metasurface optics on a fused-silica s… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 5 pages, 3 figures

  47. arXiv:2402.06895  [pdf, other

    gr-qc

    Bouncing Scenario and Cosmic Dynamics in Modified Theories of Gravity

    Authors: A. S. Agrawal

    Abstract: The main objective of this study is to investigate the phenomenon of the bouncing scenario of the universe. The most widely recognized cosmological framework is the standard cosmological model, sometimes referred to as the Big Bang model. This is mainly because of its inherent properties and its consistent alignment with recent observational studies. However, the standard cosmological model faces… ▽ More

    Submitted 10 February, 2024; originally announced February 2024.

    Comments: Ph.D. Thesis. The thesis contains the publications: Gravitation and Cosmology 29 (3), 294-304, Fortschritte der Physik 70 (1), 2100065, The European Physical Journal C 83 (113), Physics of the Dark Universe 33, 100863, The European Physical Journal C 84 (1), 56

  48. arXiv:2402.05983  [pdf, other

    eess.IV cs.LG physics.app-ph physics.ins-det

    Capability enhancement of the X-ray micro-tomography system via ML-assisted approaches

    Authors: Dhruvi Shah, Shruti Mehta, Ashish Agrawal, Shishir Purohit, Bhaskar Chaudhury

    Abstract: Ring artifacts in X-ray micro-CT images are one of the primary causes of concern in their accurate visual interpretation and quantitative analysis. The geometry of X-ray micro-CT scanners is similar to the medical CT machines, except the sample is rotated with a stationary source and detector. The ring artifacts are caused by a defect or non-linear responses in detector pixels during the MicroCT d… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  49. arXiv:2402.05127  [pdf

    cs.CL cs.AI cs.LG

    Illuminate: A novel approach for depression detection with explainable analysis and proactive therapy using prompt engineering

    Authors: Aryan Agrawal

    Abstract: This paper introduces a novel paradigm for depression detection and treatment using advanced Large Language Models (LLMs): Generative Pre-trained Transformer 4 (GPT-4), Llama 2 chat, and Gemini. These LLMs are fine-tuned with specialized prompts to diagnose, explain, and suggest therapeutic interventions for depression. A unique few-shot prompting method enhances the models' ability to analyze and… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 10 pages, 9 figures, 9 tables

  50. arXiv:2402.02080  [pdf, other

    cs.CL

    Translation Errors Significantly Impact Low-Resource Languages in Cross-Lingual Learning

    Authors: Ashish Sunil Agrawal, Barah Fazili, Preethi Jyothi

    Abstract: Popular benchmarks (e.g., XNLI) used to evaluate cross-lingual language understanding consist of parallel versions of English evaluation sets in multiple target languages created with the help of professional translators. When creating such parallel data, it is critical to ensure high-quality translations for all target languages for an accurate characterization of cross-lingual transfer. In this… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: Accepted to main proceedings of "The 18th Conference of the European Chapter of the Association for Computational Linguistics"