Skip to main content

Showing 1–50 of 2,910 results for author: Kumar, A

  1. arXiv:2407.08378  [pdf

    physics.flu-dyn

    Flame spread over thin circular PMMA rods

    Authors: Manu B V, Amit Kumar

    Abstract: This article presents a series of opposed flow flame spread experiments, conducted using cast cylindrical PMMA (acrylic) rods, 80 mm long and of diameters 1 mm and 0.5 mm, in normal gravity and microgravity environments. The experiments are primarily conducted for molar oxygen levels of 21%, 23% and 40% at 1 atmosphere pressure and opposed flow speed ranging from 0 cm/s to 25 cm/s. Experiments are… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2407.07612  [pdf, other

    cs.LG cs.AI cs.CL

    Teaching Transformers Causal Reasoning through Axiomatic Training

    Authors: Aniket Vashishtha, Abhinav Kumar, Abbavaram Gowtham Reddy, Vineeth N Balasubramanian, Amit Sharma

    Abstract: For text-based AI systems to interact in the real world, causal reasoning is an essential skill. Since interventional data is costly to generate, we study to what extent an agent can learn causal reasoning from passive data. Specifically, we consider an axiomatic training setup where an agent learns from multiple demonstrations of a causal axiom (or rule), rather than incorporating the axiom as an… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  3. arXiv:2407.07480  [pdf, other

    astro-ph.HE

    The discovery of a nearby 421~s transient with CHIME/FRB/Pulsar

    Authors: Fengqiu Adam Dong, Tracy Clarke, Alice P. Curtin, Ajay Kumar, Ingrid Stairs, Shami Chatterjee, Amanda M. Cook, Emmanuel Fonseca, B. M. Gaensler, Jason W. T. Hessels, Victoria M. Kaspi, Mattias Lazda, Kiyoshi W. Masui, James W. McKee, Bradley W. Meyers, Aaron B. Pearlman, Scott M. Ransom, Paul Scholz, Kaitlyn Shin, Kendrick M. Smith, Chia Min Tan

    Abstract: Neutron stars and white dwarfs are both dense remnants of post-main-sequence stars. Pulsars, magnetars and strongly magnetised white dwarfs have all been seen to been observed to exhibit coherent, pulsed radio emission in relation to their rotational period. Recently, a new type of radio long period transient (LPT) has been discovered. The bright radio emission of LPTs resembles that of radio puls… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Submitted

  4. arXiv:2407.06893  [pdf

    cs.CL cs.CE

    Measuring Sustainability Intention of ESG Fund Disclosure using Few-Shot Learning

    Authors: Mayank Singh, Nazia Nafis, Abhijeet Kumar, Mridul Mishra

    Abstract: Global sustainable fund universe encompasses open-end funds and exchange-traded funds (ETF) that, by prospectus or other regulatory filings, claim to focus on Environment, Social and Governance (ESG). Challengingly, the claims can only be confirmed by examining the textual disclosures to check if there is presence of intentionality and ESG focus on its investment strategy. Currently, there is no r… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: This paper was presented at 'AI applications in ESG Conference' at IIM Bangalore, India (Nov, 2023)

  5. arXiv:2407.06868  [pdf, other

    cs.IT cs.LG eess.SP

    Energy Efficient Fair STAR-RIS for Mobile Users

    Authors: Ashok S. Kumar, Nancy Nayak, Sheetal Kalyani, Himal A. Suraweera

    Abstract: In this work, we propose a method to improve the energy efficiency and fairness of simultaneously transmitting and reflecting reconfigurable intelligent surfaces (STAR-RIS) for mobile users, ensuring reduced power consumption while maintaining reliable communication. To achieve this, we introduce a new parameter known as the subsurface assignment variable, which determines the number of STAR-RIS e… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  6. arXiv:2407.06110  [pdf, other

    cs.CV

    FGA: Fourier-Guided Attention Network for Crowd Count Estimation

    Authors: Yashwardhan Chaudhuri, Ankit Kumar, Arun Balaji Buduru, Adel Alshamrani

    Abstract: Crowd counting is gaining societal relevance, particularly in domains of Urban Planning, Crowd Management, and Public Safety. This paper introduces Fourier-guided attention (FGA), a novel attention mechanism for crowd count estimation designed to address the inefficient full-scale global pattern capture in existing works on convolution-based attention networks. FGA efficiently captures multi-scale… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted to IJCNN'24

  7. arXiv:2407.06093  [pdf, other

    cs.AI

    Artificial Intuition: Efficient Classification of Scientific Abstracts

    Authors: Harsh Sakhrani, Naseela Pervez, Anirudh Ravi Kumar, Fred Morstatter, Alexandra Graddy Reed, Andrea Belz

    Abstract: It is desirable to coarsely classify short scientific texts, such as grant or publication abstracts, for strategic insight or research portfolio management. These texts efficiently transmit dense information to experts possessing a rich body of knowledge to aid interpretation. Yet this task is remarkably difficult to automate because of brevity and the absence of context. To address this gap, we h… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  8. arXiv:2407.05263  [pdf, ps, other

    hep-ph nucl-th

    Impact of finite volume on kaon, antikaon, and $φ$ meson masses and decay width in asymmetric strange hadronic matter

    Authors: Zeeshan Ahmad, Nisha Chahal, Arvind Kumar, Suneel Dutt

    Abstract: In the present work, we investigate the impact of finite volume on the in-medium properties of kaons ($K^+$, $K^0$) and antikaons ($K^-$, $\bar{K^0}$), and $φ$ mesons in the isospin asymmetric strange hadronic medium at finite density and temperature. We use the chiral SU(3) hadronic mean-field model, which accounts for the interactions between baryons through the exchange of scalar ($σ, ζ, δ$) an… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 40 pages and 13 figures

  9. arXiv:2407.04784  [pdf, other

    physics.atom-ph cond-mat.quant-gas quant-ph

    Cavity QED in a High NA Resonator

    Authors: Danial Shadmany, Aishwarya Kumar, Anna Soper, Lukas Palm, Chuan Yin, Henry Ando, Bowen Li, Lavanya Taneja, Matt Jaffe, David Schuster, Jon Simon

    Abstract: From fundamental studies of light-matter interaction to applications in quantum networking and sensing, cavity quantum electrodynamics (QED) provides a platform-crossing toolbox to control interactions between atoms and photons. The coherence of such interactions is determined by the product of the single-pass atomic absorption and the number of photon round-trips. Reducing the cavity loss has ena… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  10. arXiv:2407.04450  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci hep-th

    Massive Dirac-Pauli physics in lead-halide perovskites

    Authors: Abhishek Shiva Kumar, Mikhail Maslov, Mikhail Lemeshko, Artem G. Volosniev, Zhanybek Alpichshev

    Abstract: In standard quantum electrodynamics (QED), the so-called non-minimal (Pauli) coupling is suppressed for elementary particles and has no physical implications. Here, we show that the Pauli term naturally appears in a known family of Dirac materials -- the lead-halide perovskites, suggesting a novel playground for the study of analogue QED effects. We outline measurable manifestations of the Pauli t… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  11. arXiv:2407.04268  [pdf, other

    cs.LG cs.AI cs.SE

    NeuFair: Neural Network Fairness Repair with Dropout

    Authors: Vishnu Asutosh Dasu, Ashish Kumar, Saeid Tizpaz-Niari, Gang Tan

    Abstract: This paper investigates neuron dropout as a post-processing bias mitigation for deep neural networks (DNNs). Neural-driven software solutions are increasingly applied in socially critical domains with significant fairness implications. While neural networks are exceptionally good at finding statistical patterns from data, they may encode and amplify existing biases from the historical data. Existi… ▽ More

    Submitted 12 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: Paper accepted at ACM ISSTA 2024

  12. arXiv:2407.04039  [pdf, ps, other

    physics.plasm-ph

    Flexible Stellarator Physics Facility

    Authors: F. I. Parra, S. -G. Baek, M. Churchill, D. R. Demers, B. Dudson, N. M. Ferraro, B. Geiger, S. Gerhardt, K. C. Hammond, S. Hudson, R. Jorge, E. Kolemen, D. M. Kriete, S. T. A. Kumar, M. Landreman, C. Lowe, D. A. Maurer, F. Nespoli, N. Pablant, M. J. Pueschel, A. Punjabi, J. A. Schwartz, C. P. S. Swanson, A. M. Wright

    Abstract: We propose to build a Flexible Stellarator Physics Facility to explore promising regions of the vast parameter space of disruption-free stellarator solutions for Fusion Pilot Plants (FPPs).

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: White paper submitted to FESAC subcommittee on Facilities, 8 pages

  13. arXiv:2407.03941  [pdf, other

    cs.SE cs.AI cs.CL

    Narrow Transformer: Starcoder-Based Java-LM For Desktop

    Authors: Kamalkumar Rathinasamy, Balaji A J, Ankush Kumar, Gagan Gayari, Harshini K, Rajab Ali Mondal, Sreenivasa Raghavan K S, Swayam Singh

    Abstract: This paper presents NT-Java-1.1B, an open-source specialized code language model built on StarCoderBase-1.1B, designed for coding tasks in Java programming. NT-Java-1.1B achieves state-of-the-art performance, surpassing its base model and majority of other models of similar size on MultiPL-E Java code benchmark. While there have been studies on extending large, generic pre-trained models to improv… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    ACM Class: I.2.7

  14. arXiv:2407.03648  [pdf, other

    eess.AS cs.SD

    High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching

    Authors: Gael Le Lan, Bowen Shi, Zhaoheng Ni, Sidd Srinivasan, Anurag Kumar, Brian Ellis, David Kant, Varun Nagaraja, Ernie Chang, Wei-Ning Hsu, Yangyang Shi, Vikas Chandra

    Abstract: We introduce a simple and efficient text-controllable high-fidelity music generation and editing model. It operates on sequences of continuous latent representations from a low frame rate 48 kHz stereo variational auto encoder codec that eliminates the information loss drawback of discrete representations. Based on a diffusion transformer architecture trained on a flow-matching objective the model… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  15. arXiv:2407.02413  [pdf

    cond-mat.mtrl-sci

    First-principles investigation of multifaceted properties; lattice dynamic, structural stability, mechanical, electronic, magnetic and thermodynamic response of Alkali metals-based semi Heusler alloys

    Authors: Diwaker, Shyam L. Gupta, Anupam, Sumit Kumar, Aadil Fayaz, Ashwani Kumar

    Abstract: Taking into considerations the wide compositional stretch of Heusler alloys, the first principles density functional theory based calculations are excellently suitable for estimating the multifaceted properties of alkali metal based LiVSb and NaVSb Heusler alloys. We calculated ground state stability by optimizing the energy in alpha, beta and gamma phase configurations. The materials are dynamica… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  16. arXiv:2407.01351  [pdf, other

    astro-ph.HE

    Probing the connection between IceCube neutrinos and MOJAVE AGN

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (399 additional authors not shown)

    Abstract: Active Galactic Nuclei (AGN) are prime candidate sources of the high-energy, astrophysical neutrinos detected by IceCube. This is demonstrated by the real-time multi-messenger detection of the blazar TXS 0506+056 and the recent evidence of neutrino emission from NGC 1068 from a separate time-averaged study. However, the production mechanism of the astrophysical neutrinos in AGN is not well establi… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 14 Pages 7 Figures

  17. arXiv:2407.01314  [pdf, other

    hep-ex

    Search for a light sterile neutrino with 7.5 years of IceCube DeepCore data

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (399 additional authors not shown)

    Abstract: We present a search for an eV-scale sterile neutrino using 7.5 years of data from the IceCube DeepCore detector. The analysis uses a sample of 21,914 events with energies between 5 and 150 GeV to search for sterile neutrinos through atmospheric muon neutrino disappearance. Improvements in event selection and treatment of systematic uncertainties provide greater statistical power compared to previo… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 11 pages, 5 figures. To be submitted to Physical Review D

  18. arXiv:2407.01306  [pdf, other

    cs.LG cs.CR

    Unveiling the Unseen: Exploring Whitebox Membership Inference through the Lens of Explainability

    Authors: Chenxi Li, Abhinav Kumar, Zhen Guo, Jie Hou, Reza Tourani

    Abstract: The increasing prominence of deep learning applications and reliance on personalized data underscore the urgent need to address privacy vulnerabilities, particularly Membership Inference Attacks (MIAs). Despite numerous MIA studies, significant knowledge gaps persist, particularly regarding the impact of hidden features (in isolation) on attack efficacy and insufficient justification for the root… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 20 pages, 10 figures, 4 tables

  19. arXiv:2407.00866  [pdf, other

    cs.LG

    Silver Linings in the Shadows: Harnessing Membership Inference for Machine Unlearning

    Authors: Nexhi Sula, Abhinav Kumar, Jie Hou, Han Wang, Reza Tourani

    Abstract: With the continued advancement and widespread adoption of machine learning (ML) models across various domains, ensuring user privacy and data security has become a paramount concern. In compliance with data privacy regulations, such as GDPR, a secure machine learning framework should not only grant users the right to request the removal of their contributed data used for model training but also fa… ▽ More

    Submitted 5 July, 2024; v1 submitted 30 June, 2024; originally announced July 2024.

    Comments: 17 pages, 14 figures, 6 tables

  20. arXiv:2407.00774  [pdf, other

    quant-ph cs.LG

    Advantages of quantum support vector machine in cross-domain classification of quantum states

    Authors: Diksha Sharma, Vivek Balasaheb Sabale, Parvinder Singh, Atul Kumar

    Abstract: In this study, we use cross-domain classification using quantum machine learning for quantum advantages to address the entanglement versus separability paradigm. We further demonstrate the efficient classification of Bell diagonal states into zero and non-zero discord classes. The inherited structure of quantum states and its relation with a particular class of quantum states are exploited to intu… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  21. arXiv:2407.00597  [pdf

    cond-mat.mtrl-sci

    Myriad of Terahertz Magnons with All-Optical Magnetoelectric Functionality for Efficient Spin-Wave Computing in Honeycomb Magnet Co4Ta2O9

    Authors: Brijesh Singh Mehra, Sanjeev Kumar, Gaurav Dubey, Ayyappan Shyam, Ankit Kumar, K Anirudh, Kiran Singh, Dhanvir Singh Rana

    Abstract: Terahertz (THz) magnonics represent the notion of mathematical algebraic operations of magnons such as addition and subtraction in THz regime which is an emergent dissipationless ultrafast alternative to existing data processing technologies. Spin waves on antiferromagnets with a twist in spin order host such magnons in THz regime, which possess advantage of higher processing speeds, additional po… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  22. arXiv:2407.00537  [pdf, other

    eess.IV cs.CV cs.LG

    Accelerating Longitudinal MRI using Prior Informed Latent Diffusion

    Authors: Yonatan Urman, Zachary Shah, Ashwin Kumar, Bruno P. Soares, Kawin Setsompop

    Abstract: MRI is a widely used ionization-free soft-tissue imaging modality, often employed repeatedly over a patient's lifetime. However, prolonged scanning durations, among other issues, can limit availability and accessibility. In this work, we aim to substantially reduce scan times by leveraging prior scans of the same patient. These prior scans typically contain considerable shared information with the… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  23. arXiv:2407.00071  [pdf, other

    cs.AI cs.CL cs.ET cs.LG

    Combinatorial Reasoning: Selecting Reasons in Generative AI Pipelines via Combinatorial Optimization

    Authors: Mert Esencan, Tarun Advaith Kumar, Ata Akbari Asanjan, P. Aaron Lott, Masoud Mohseni, Can Unlu, Davide Venturelli, Alan Ho

    Abstract: Recent Large Language Models (LLMs) have demonstrated impressive capabilities at tasks that require human intelligence and are a significant step towards human-like artificial intelligence (AI). Yet the performance of LLMs at reasoning tasks have been subpar and the reasoning capability of LLMs is a matter of significant debate. While it has been shown that the choice of the prompting technique to… ▽ More

    Submitted 19 June, 2024; originally announced July 2024.

    Comments: 13 pages, 3 figures

  24. arXiv:2406.19421  [pdf, other

    hep-ex physics.ins-det

    The Belle II Detector Upgrades Framework Conceptual Design Report

    Authors: H. Aihara, A. Aloisio, D. P. Auguste, M. Aversano, M. Babeluk, S. Bahinipati, Sw. Banerjee, M. Barbero, J. Baudot, A. Beaubien, F. Becherer, T. Bergauer, F. U. Bernlochner., V. Bertacchi, G. Bertolone, C. Bespin, M. Bessner, S. Bettarini, A. J. Bevan, B. Bhuyan, M. Bona, J. F. Bonis, J. Borah, F. Bosi, R. Boudagga , et al. (186 additional authors not shown)

    Abstract: We describe the planned near-term and potential longer-term upgrades of the Belle II detector at the SuperKEKB electron-positron collider operating at the KEK laboratory in Tsukuba, Japan. These upgrades will allow increasingly sensitive searches for possible new physics beyond the Standard Model in flavor, tau, electroweak and dark sector physics that are both complementary to and competitive wit… ▽ More

    Submitted 4 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: Editor: F. Forti 170 pages

    Report number: KEK-REPORT-2024-1, BELLE2-REPORT-2024-042

  25. arXiv:2406.18290  [pdf, ps, other

    math.DG math.AP math.SP

    The first Steklov eigenvalue on manifolds with nonnegative Ricci curvature and convex boundary

    Authors: Jonah A. J. Duncan, Aditya Kumar

    Abstract: We establish a new lower bound for the first non-zero Steklov eigenvalue of a compact Riemannian manifold with non-negative Ricci curvature and (strictly) convex boundary. Related results are also obtained under weaker geometric hypotheses.

    Submitted 26 June, 2024; originally announced June 2024.

  26. arXiv:2406.17304  [pdf, other

    cs.CL

    Leveraging LLMs for Dialogue Quality Measurement

    Authors: Jinghan Jia, Abi Komma, Timothy Leffel, Xujun Peng, Ajay Nagesh, Tamer Soliman, Aram Galstyan, Anoop Kumar

    Abstract: In task-oriented conversational AI evaluation, unsupervised methods poorly correlate with human judgments, and supervised approaches lack generalization. Recent advances in large language models (LLMs) show robust zeroshot and few-shot capabilities across NLP tasks. This paper explores using LLMs for automated dialogue quality evaluation, experimenting with various configurations on public and pro… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  27. arXiv:2406.16075  [pdf, other

    cond-mat.dis-nn cond-mat.mtrl-sci cond-mat.soft cond-mat.stat-mech

    Odd Dipole Screening in Radial Inflation

    Authors: Yang Fu, H. George E. Hentschel, Pawandeep Kaur, Avanish Kumar, Itamar Procaccia

    Abstract: The inflation of an inner radial (or spherical) cavity in an amorphous solids confined in a disk (or a sphere), served as a fruitful case model for studying the effects of plastic deformations on the mechanical response. It was shown that when the field associated with Eshelby quadrupolar charges is non-uniform, the displacement field is riddled with dipole charges that screen elasticity, reminisc… ▽ More

    Submitted 27 June, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  28. arXiv:2406.16008  [pdf, other

    cs.CL cs.AI cs.LG

    Found in the Middle: Calibrating Positional Attention Bias Improves Long Context Utilization

    Authors: Cheng-Yu Hsieh, Yung-Sung Chuang, Chun-Liang Li, Zifeng Wang, Long T. Le, Abhishek Kumar, James Glass, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister

    Abstract: Large language models (LLMs), even when specifically trained to process long input contexts, struggle to capture relevant information located in the middle of their input. This phenomenon has been known as the lost-in-the-middle problem. In this work, we make three contributions. First, we set out to understand the factors that cause this phenomenon. In doing so, we establish a connection between… ▽ More

    Submitted 3 July, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

    Comments: ACL Findings 2024

  29. arXiv:2406.15649  [pdf, other

    cs.CV

    Efficient Human Pose Estimation: Leveraging Advanced Techniques with MediaPipe

    Authors: Sandeep Singh Sengar, Abhishek Kumar, Owen Singh

    Abstract: This study presents significant enhancements in human pose estimation using the MediaPipe framework. The research focuses on improving accuracy, computational efficiency, and real-time processing capabilities by comprehensively optimising the underlying algorithms. Novel modifications are introduced that substantially enhance pose estimation accuracy across challenging scenarios, such as dynamic m… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  30. arXiv:2406.15646  [pdf, other

    cs.CV

    VigilEye -- Artificial Intelligence-based Real-time Driver Drowsiness Detection

    Authors: Sandeep Singh Sengar, Aswin Kumar, Owen Singh

    Abstract: This study presents a novel driver drowsiness detection system that combines deep learning techniques with the OpenCV framework. The system utilises facial landmarks extracted from the driver's face as input to Convolutional Neural Networks trained to recognise drowsiness patterns. The integration of OpenCV enables real-time video processing, making the system suitable for practical implementation… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  31. arXiv:2406.15565  [pdf, other

    cs.CV cs.LG

    Unseen Object Reasoning with Shared Appearance Cues

    Authors: Paridhi Singh, Arun Kumar

    Abstract: This paper introduces an innovative approach to open world recognition (OWR), where we leverage knowledge acquired from known objects to address the recognition of previously unseen objects. The traditional method of object modeling relies on supervised learning with strict closed-set assumptions, presupposing that objects encountered during inference are already known at the training phase. Howev… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  32. arXiv:2406.14532  [pdf, other

    cs.LG cs.CL

    RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold

    Authors: Amrith Setlur, Saurabh Garg, Xinyang Geng, Naman Garg, Virginia Smith, Aviral Kumar

    Abstract: Training on model-generated synthetic data is a promising approach for finetuning LLMs, but it remains unclear when it helps or hurts. In this paper, we investigate this question for math reasoning via an empirical study, followed by building a conceptual understanding of our observations. First, we find that while the typical approach of finetuning a model on synthetic correct or positive problem… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  33. arXiv:2406.13658  [pdf, ps, other

    math.AC

    Generalized Hamming weights and symbolic powers of Stanley-Reisner ideals of matroids

    Authors: Michael DiPasquale, Louiza Fouli, Arvind Kumar, Ştefan O. Tohǎneanu

    Abstract: It is well-known that the first generalized Hamming weight of a code, more commonly called \textit{the minimum distance} of the code, corresponds to the initial degree of the Stanley-Reisner ideal of the matroid of the dual code. Our starting point in this paper is a generalization of this fact -- namely, the $r$-th generalized Hamming weight of a code is the smallest degree of a squarefree monomi… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 37 pages. Comments welcome!

    MSC Class: 94B05; 05B35; 05E40; 13F55; 51E10

  34. arXiv:2406.13236  [pdf, other

    cs.CL cs.AI

    Data Contamination Can Cross Language Barriers

    Authors: Feng Yao, Yufan Zhuang, Zihao Sun, Sunan Xu, Animesh Kumar, Jingbo Shang

    Abstract: The opacity in developing large language models (LLMs) is raising growing concerns about the potential contamination of public benchmarks in the pre-training data. Existing contamination detection methods are typically based on the text overlap between training and evaluation data, which can be too superficial to reflect deeper forms of contamination. In this paper, we first present a cross-lingua… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 12 pages, 5 figures

  35. arXiv:2406.12804  [pdf, other

    astro-ph.HE

    Varying activity and the bursts properties of FRB 20240114A probed with GMRT down to 300 MHz

    Authors: Ajay Kumar, Yogesh Maan, Yash Bhusare

    Abstract: Repeating fast radio bursts can exhibit a wide range of burst repetition rates, from none to hundreds of bursts per hour. Here, we report the detection and characteristics of 57 bursts from the recently discovered FRB 20240114A, observed with GMRT in the frequency ranges 300-500 MHz and 550-750 MHz. Majority of the bursts show narrow emission-bandwidth with $Δν/ν\sim$ around 10 %. All of the burst… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 13 Pages, 5 Figures, Submitted to ApJ

  36. arXiv:2406.12644  [pdf, other

    cs.CL cs.AI

    Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models

    Authors: Devichand Budagam, Sankalp KJ, Ashutosh Kumar, Vinija Jain, Aman Chadha

    Abstract: Assessing the effectiveness of large language models (LLMs) in addressing diverse tasks is essential for comprehending their strengths and weaknesses. Conventional evaluation techniques typically apply a single prompting strategy uniformly across datasets, not considering the varying degrees of task complexity. We introduce the Hierarchical Prompting Taxonomy (HPT), a taxonomy that employs a Hiera… ▽ More

    Submitted 27 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  37. arXiv:2406.11925  [pdf, other

    cs.SE cs.AI cs.CL

    DocCGen: Document-based Controlled Code Generation

    Authors: Sameer Pimparkhede, Mehant Kammakomati, Srikanth Tamilselvam, Prince Kumar, Ashok Pon Kumar, Pushpak Bhattacharyya

    Abstract: Recent developments show that Large Language Models (LLMs) produce state-of-the-art performance on natural language (NL) to code generation for resource-rich general-purpose languages like C++, Java, and Python. However, their practical usage for structured domain-specific languages (DSLs) such as YAML, JSON is limited due to domain-specific schema, grammar, and customizations generally unseen by… ▽ More

    Submitted 3 July, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  38. arXiv:2406.11896  [pdf, other

    cs.LG

    DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning

    Authors: Hao Bai, Yifei Zhou, Mert Cemri, Jiayi Pan, Alane Suhr, Sergey Levine, Aviral Kumar

    Abstract: Training corpuses for vision language models (VLMs) typically lack sufficient amounts of decision-centric data. This renders off-the-shelf VLMs sub-optimal for decision-making tasks such as in-the-wild device control through graphical user interfaces (GUIs). While training with static demonstrations has shown some promise, we show that such methods fall short for controlling real GUIs due to their… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 11 pages of main text, 28 pages in total

  39. arXiv:2406.11619  [pdf, other

    eess.AS cs.LG

    AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling

    Authors: Vahid Ahmadi Kalkhorani, Cheng Yu, Anurag Kumar, Ke Tan, Buye Xu, DeLiang Wang

    Abstract: Adding visual cues to audio-based speech separation can improve separation performance. This paper introduces AV-CrossNet, an audiovisual (AV) system for speech enhancement, target speaker extraction, and multi-talker speaker separation. AV-CrossNet is extended from the CrossNet architecture, which is a recently proposed network that performs complex spectral mapping for speech separation by lever… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 10 pages, 4 Figures, and 4 Tables

  40. arXiv:2406.10935  [pdf, other

    cs.CV

    Pick-or-Mix: Dynamic Channel Sampling for ConvNets

    Authors: Ashish Kumar, Daneul Kim, Jaesik Park, Laxmidhar Behera

    Abstract: Channel pruning approaches for convolutional neural networks (ConvNets) deactivate the channels, statically or dynamically, and require special implementation. In addition, channel squeezing in representative ConvNets is carried out via 1x1 convolutions which dominates a large portion of computations and network parameters. Given these challenges, we propose an effective multi-purpose module for d… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Published in Computer Vision and Pattern Recognition (CVPR 2024)

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  41. arXiv:2406.10764  [pdf, other

    cs.CL

    GNOME: Generating Negotiations through Open-Domain Mapping of Exchanges

    Authors: Darshan Deshpande, Shambhavi Sinha, Anirudh Ravi Kumar, Debaditya Pal, Jonathan May

    Abstract: Language Models have previously shown strong negotiation capabilities in closed domains where the negotiation strategy prediction scope is constrained to a specific setup. In this paper, we first show that these models are not generalizable beyond their original training domain despite their wide-scale pretraining. Following this, we propose an automated framework called GNOME, which processes exi… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  42. arXiv:2406.10024  [pdf, ps, other

    math.CV

    On Fridman invariant, injectivity radius function and squeezing function

    Authors: Akhil Kumar, Sanjay Kumar Pant

    Abstract: We give a class of domains for which Fridman invariant and injectivity radius function coincide with respect to Carathéodory metric. We give explicit expressions of the squeezing functions for these domains and investigate some of their properties.

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 10 pages, work in progress, comments are welcome

    MSC Class: 32F45; 32H02

  43. arXiv:2406.09749  [pdf, other

    cond-mat.mtrl-sci

    Substrate$-$bias driven Sputter deposited $β-$phase dominated Tungsten film for Spintronic applications

    Authors: Abhay Singh Rajawat, Naim Ahmad, Risvana Nasril, Tasneem Sheikh, Mohammad Muhiuddin, A kumar, Mohammad R Rahman, Waseem Akhtar

    Abstract: $β$-Tungsten ($β$-W), a A15 cubic phase of Tungsten exhibits giant spin hall angle as compared to its bcc-phase $α$-Tungsten ($α$-W), making high quality $β$-W film desirable for spin-based application. We report on the substrate bias driven on-demand growth of $β$-W film on SiO$_2$ coated silicon (SiO$_2… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 5 pages, 4 figures

  44. arXiv:2406.09329  [pdf, other

    cs.LG cs.AI

    Is Value Learning Really the Main Bottleneck in Offline RL?

    Authors: Seohong Park, Kevin Frans, Sergey Levine, Aviral Kumar

    Abstract: While imitation learning requires access to high-quality data, offline reinforcement learning (RL) should, in principle, perform similarly or better with substantially lower data quality by using a value function. However, current results indicate that offline RL often performs worse than imitation learning, and it is often unclear what holds back the performance of offline RL. Motivated by this o… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  45. arXiv:2406.09230  [pdf, other

    quant-ph gr-qc

    Correlations and Signaling in the Schrödinger-Newton Model

    Authors: Jacek Aleksander Gruca, Ankit Kumar, Ray Ganardi, Paramasivan Arumugam, Karolina Kropielnicka, Tomasz Paterek

    Abstract: The Schrödinger-Newton model is a semi-classical theory in which, in addition to mutual attraction, massive quantum particles interact with their own gravitational fields. While there are many studies on the phenomenology of single particles, correlation dynamics in multipartite systems is largely unexplored. Here, we show that the Schrödinger-Newton interactions preserve the product form of initi… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  46. arXiv:2406.08645  [pdf, other

    astro-ph.GA astro-ph.CO

    ODIN: Identifying Protoclusters and Cosmic Filaments Traced by Ly$α$-emitting Galaxies

    Authors: Vandana Ramakrishnan, Kyoung-Soo Lee, Maria Celeste Artale, Eric Gawiser. Yujin Yang, Changbom Park, Robin Ciardullo, Lucia Guaita, Sang Hyeok Im, Seongjae Kim, Ankit Kumar, Jaehyun Lee, Seong-Kook Lee, Byeongha Moon, Nelson Padilla, Alexandra Pope, Roxana Popescu, Hyunmi Song, Paulina Troncoso, Francisco Valdes, Ann Zabludoff

    Abstract: To understand the formation and evolution of massive cosmic structures, studying them at high redshift, in the epoch when they formed the majority of their mass is essential. The One-hundred-deg$^2$ DECam Imaging in Narrowbands (ODIN) survey is undertaking the widest-area narrowband program to date, to use Ly$α$-emitting galaxies (LAEs) to trace the large-scale structure (LSS) of the Universe at t… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 26 pages, 18 figures; submitted to ApJ

  47. arXiv:2406.08067  [pdf, other

    cond-mat.soft

    Synchronous and Asynchronous Updates of Active Ising Spins in One Dimension

    Authors: Anish Kumar, Sudipta Pattanayak, R. K. Singh, Shradha Mishra

    Abstract: How do update rules affect the dynamical and steady state properties of a flock? In this study, we have explored the active Ising spins (s = +-1) in one dimension, where spin updates its orientation according to the Metropolis algorithm (based on the neighbors) via two different update rules. (i) Parallel, and (ii) Random-sequential. We explore the effect of Parallel and Random-sequential updates… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 7 pages, 6 figures. arXiv admin note: text overlap with arXiv:1704.04041

  48. arXiv:2406.07601  [pdf, other

    astro-ph.HE hep-ex

    IceCube Search for Neutrino Emission from X-ray Bright Seyfert Galaxies

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (400 additional authors not shown)

    Abstract: The recent IceCube detection of TeV neutrino emission from the nearby active galaxy NGC 1068 suggests that active galactic nuclei (AGN) could make a sizable contribution to the diffuse flux of astrophysical neutrinos. The absence of TeV $γ$-rays from NGC 1068 indicates neutrino production in the vicinity of the supermassive black hole, where the high radiation density leads to $γ$-ray attenuation.… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 17 pages, 9 figures

  49. arXiv:2406.07470  [pdf, other

    astro-ph.HE

    Exploring non-radial oscillation modes in dark matter admixed neutron stars

    Authors: Pratik Thakur, Anil Kumar, Vivek Baruah Thapa, Vishal Parmar, Monika Sinha

    Abstract: Because of their extreme densities and consequently, gravitational potential, compact objects such as neutron stars can prove to be excellent captors of dark matter particles. Considering purely gravitational interactions between dark and hadronic matter, we construct dark matter admixed stars composed of two-fluid matter subject to current astrophysical constraints of maximum mass and tidal defor… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 23 pages and 5 figures

  50. arXiv:2406.06684  [pdf, other

    astro-ph.HE

    Search for neutrino emission from hard X-ray AGN with IceCube

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (401 additional authors not shown)

    Abstract: Active Galactic Nuclei (AGN) are promising candidate sources of high-energy astrophysical neutrinos since they provide environments rich in matter and photon targets where cosmic ray interactions may lead to the production of gamma rays and neutrinos. We searched for high-energy neutrino emission from AGN using the $\textit{Swift}$-BAT Spectroscopic Survey (BASS) catalog of hard X-ray sources and… ▽ More

    Submitted 12 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.