-
Ambiguous Resonances in Multipulse Quantum Sensing with Nitrogen Vacancy Centers
Authors:
Lucas Tsunaki,
Anmol Singh,
Kseniia Volkova,
Sergei Trofimov,
Tommaso Pregnolato,
Tim Schröder,
Boris Naydenov
Abstract:
Dynamical decoupling multipulse sequences can be applied to solid state spins for sensing weak oscillating fields from nearby single nuclear spins. By periodically reversing the probing system's evolution, other noises are counteracted and filtered out over the total evolution. However, the technique is subject to intricate interactions resulting in additional resonant responses, which can be mistaken for the actual signal intended to be measured. We experimentally characterized three of these effects present in single nitrogen vacancy centers in diamond, where we also developed a numerical simulation model without rotating wave approximations, showing robust correlation to the experimental data. Regarding centers with the $^{15}$N nitrogen isotope, we observed that a small misalignment in the bias magnetic field causes the precession of the nitrogen nuclear spin to be sensed by the electronic spin of the center. Another studied case of ambiguous resonances comes from the coupling with lattice $^{13}$C nuclei, where we reconstructed the interaction Hamiltonian based on echo modulation frequencies and used this Hamiltonian to simulate multipulse sequences. Finally, we also measured and simulated the effects from the free evolution of the quantum system during finite pulse durations. Due to the large data volume and the strong dependence of these ambiguous resonances on specific experimental parameters, we provide a simulation dataset with a user-friendly graphical interface, where users can compare simulations with their own experimental data for spectral disambiguation. Although focused on nitrogen vacancy centers and dynamical decoupling sequences, these results and the developed model can potentially be applied to other solid state spins and quantum sensing techniques.
Submitted 12 July, 2024;
originally announced July 2024.
-
Robustness of LLMs to Perturbations in Text
Authors:
Ayush Singh,
Navpreet Singh,
Shubham Vatsal
Abstract:
Having a clean dataset has been the foundational assumption of most natural language processing (NLP) systems. However, properly written text is rarely found in real-world scenarios, which often invalidates this assumption. Recently, large language models (LLMs) have shown impressive performance, but can they handle the inevitable noise in real-world data? This work tackles this critical question by investigating LLMs' resilience against morphological variations in text. To that end, we artificially introduce varying levels of noise into a diverse set of datasets and systematically evaluate LLMs' robustness against the corrupt variations of the original text. Our findings show that, contrary to popular belief, generative LLMs are quite robust to noisy perturbations in text. This is a departure from pre-trained models like BERT or RoBERTa, whose performance has been shown to be sensitive to deteriorating noisy text. Additionally, we test LLMs' resilience on multiple real-world benchmarks that closely mimic commonly found errors in the wild. With minimal prompting, LLMs achieve a new state-of-the-art on the benchmark tasks of Grammar Error Correction (GEC) and Lexical Semantic Change (LSC). To empower future research, we also release a dataset annotated by humans stating their preference for LLM vs. human-corrected outputs along with the code to reproduce our results.
Submitted 12 July, 2024;
originally announced July 2024.
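The noise-injection step described in the abstract above can be illustrated with a minimal sketch. The abstract does not specify the perturbation scheme, so the adjacent-letter swap used here, the function name `perturb`, and its `rate` and `seed` parameters are all assumptions chosen for illustration only.

```python
import random


def perturb(text, rate, seed=0):
    """Swap adjacent letters with probability `rate` (hypothetical noise model)."""
    rng = random.Random(seed)  # seeded so that runs are repeatable
    chars = list(text)
    i = 0
    while i < len(chars) - 1:
        both_letters = chars[i].isalpha() and chars[i + 1].isalpha()
        if both_letters and rng.random() < rate:
            chars[i], chars[i + 1] = chars[i + 1], chars[i]
            i += 2  # skip the swapped pair so a letter moves at most once
        else:
            i += 1
    return "".join(chars)


# Increasing `rate` gives progressively noisier variants of the same sentence.
for level in (0.0, 0.1, 0.3):
    print(level, perturb("properly written text is rarely found", level))
```

A rate of 0 leaves the text untouched, and every variant keeps the original length and character multiset, which makes it straightforward to pair clean and noisy inputs when comparing model outputs.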
-
Uncovering Semantics and Topics Utilized by Threat Actors to Deliver Malicious Attachments and URLs
Authors:
Andrey Yakymovych,
Abhishek Singh
Abstract:
Recent threat reports highlight that email remains the top vector for delivering malware to endpoints. Despite these statistics, detecting malicious email attachments and URLs often neglects semantic cues, linguistic features, and contextual clues. Our study employs BERTopic unsupervised topic modeling to identify common semantics and themes embedded in email to deliver malicious attachments and call-to-action URLs. We preprocess emails by extracting and sanitizing content and employ multilingual embedding models like BGE-M3 for dense representations, which clustering algorithms (HDBSCAN and OPTICS) use to group emails by semantic similarity. Phi3-Mini-4K-Instruct facilitates semantic analysis, and hLDA aids in thematic analysis, to understand threat actor patterns. Our research will evaluate and compare different clustering algorithms on topic quantity, coherence, and diversity metrics, concluding with insights into the semantics and topics commonly used by threat actors to deliver malicious attachments and URLs, a significant contribution to the field of threat detection.
Submitted 11 July, 2024;
originally announced July 2024.
-
Derivatives of theta functions as Traces of Partition Eisenstein series
Authors:
Tewodros Amdeberhan,
Ken Ono,
Ajit Singh
Abstract:
In his ``lost notebook'', Ramanujan used iterated derivatives of two theta functions to define sequences of $q$-series $\{U_{2t}(q)\}$ and $\{V_{2t}(q)\}$ that he claimed to be quasimodular. We give the first explicit proof of this claim by expressing them in terms of ``partition Eisenstein series'', extensions of the classical Eisenstein series $E_{2k}(q)$ defined by $$λ=(1^{m_1}, 2^{m_2},\dots, n^{m_n}) \vdash n \ \ \ \ \ \longmapsto \ \ \ \ \ E_λ(q):= E_2(q)^{m_1} E_4(q)^{m_2}\cdots E_{2n}(q)^{m_n}. $$ For functions $φ: \mathcal{P}\mapsto \C$ on partitions, the {\it weight $2n$ partition Eisenstein trace} is $$ \Tr_n(φ;q):=\sum_{λ\vdash n} φ(λ)E_λ(q). $$ For all $t$, we prove that $U_{2t}(q)=\Tr_t(φ_u;q)$ and $V_{2t}(q)=\Tr_t(φ_v;q),$ where $φ_u$ and $φ_v$ are natural partition weights, giving the first explicit quasimodular formulas for these series.
Submitted 11 July, 2024;
originally announced July 2024.
-
Entanglement asymmetry in conformal field theory and holography
Authors:
Francesco Benini,
Victor Godet,
Amartya Harsh Singh
Abstract:
Entanglement asymmetry is a measure of symmetry breaking in quantum subsystems, inspired by quantum information theory, particularly suited to study out-of-equilibrium states. We study the entanglement asymmetry of a class of excited "coherent states" in conformal quantum field theories with a U(1) symmetry, employing Euclidean path-integral methods with topological symmetry defects and the replica formalism. We compute, at leading order in perturbation theory, the asymmetry for a variety of subsystems, including finite spherical subregions in flat space, in finite volume, and at positive temperature. We also study its Lorentzian time evolution, showcasing the dynamical restoration of the symmetry due to thermalization, as well as the presence of a quantum Mpemba effect. Our results are universal, and apply in any number of dimensions. We also show that the perturbative entanglement asymmetry is related to the Fisher information metric, which has a known holographic dual called Hollands-Wald canonical energy, and that it is captured by the AdS bulk charge contained in the entanglement wedge.
Submitted 10 July, 2024;
originally announced July 2024.
-
Perfect Matching Complexes of Polygonal Line Tiling
Authors:
Himanshu Chandrakar,
Anurag Singh
Abstract:
The perfect matching complex of a simple graph G is a simplicial complex having facets (maximal faces) as the perfect matchings of G. This article discusses the perfect matching complex of polygonal line tiling and the $\left(2 \times n\right)$-grid graph in particular. We use tools from discrete Morse theory to show that the perfect matching complex of any polygonal line tiling is either contractible or homotopically equivalent to a wedge of spheres. While proving our results, we also characterise all the matchings that can not be extended to form a perfect matching.
Submitted 8 July, 2024;
originally announced July 2024.
-
Carrier Dynamics in High-density Photo-doped MoS$_2$: Monolayer vs Multilayer
Authors:
Durga Prasad Khatua,
Asha Singh,
Sabina Gurung,
J. Jayabalan
Abstract:
Monolayer and multilayer MoS$_2$ are extremely fascinating materials for use in lasers, compact optical parametric amplifiers, and high-power detectors, which demand high-excitation light-matter interaction. Consequently, it is essential to understand the carrier dynamics in both cases at such high excitation densities. In this work, we investigate the carrier dynamics of monolayer and multilayer MoS$_2$ at photo-doping densities around the Mott density. Despite the similarity in band structure near the K-point and the formation of the A-exciton, a substantial difference in the carrier dynamics is observed, reflecting the influence of the entire band structure. Exciton dissociation, bandgap renormalization, and intervalley relaxation play a consequential role in dictating the ultrafast transient properties of these samples. The study in this paper provides a substantial understanding of the fundamental optoelectronic properties of two-dimensional MoS$_2$, paving the way for its potential applications in various photonic and optoelectronic domains.
Submitted 6 July, 2024;
originally announced July 2024.
-
Certain infinite products in terms of MacMahon type series
Authors:
Seokho Jin,
Badri Vishal Pandey,
Ajit Singh
Abstract:
Recently, Ono and the third author discovered that the reciprocals of the theta series $(q;q)_\infty^3$ and $(q^2;q^2)_\infty(q;q^2)_\infty^2$ have infinitely many closed formulas in terms of MacMahon's quasimodular forms $A_k(q)$ and $C_k(q)$. In this article, we use the well-known infinite product identities due to Jacobi, Watson, and Hirschhorn to derive further such closed formulas for reciprocals of other interesting infinite products. Moreover, with these formulas, we approximate these reciprocals to arbitrary order simply using MacMahon's functions and {\it MacMahon type} functions. For example, let $Θ_{6}(q):=\frac{1}{2}\sum_{n\in\mathbb{Z}} χ_6(n) n q^{\frac{n^2-1}{24}}$ be the theta function corresponding to the odd quadratic character modulo $6$. Then for any positive integer $n$, we have $$\frac{1}{Θ_{6}(q)}= q^{-\frac{3n^2+n}{2}}\sum_{\substack{k=r_1\\ k\equiv n\hspace{-0.2cm}\pmod{2}}}^{r_2}(-1)^{\frac{n-k}{2}}A_{k}(q)C_{\frac{3n-k}{2}}(q)+O(q^{n+1}),$$ where $r_1:=\lfloor\frac{3n-1-\sqrt{12n+13}}{3}\rfloor+1$ and $r_2:=\lceil\frac{3n-1+\sqrt{12n+13}}{3}\rceil-1$.
Submitted 5 July, 2024;
originally announced July 2024.
-
Advanced Artificial Intelligence Strategy for Optimizing Urban Rail Network Design using Nature-Inspired Algorithms
Authors:
Hariram Sampath Kumar,
Archana Singh,
Manish Kumar Ojha
Abstract:
This study introduces an innovative methodology for the planning of metro network routes within the urban environment of Chennai, Tamil Nadu, India. A comparative analysis of the modified Ant Colony Optimization (ACO) method (previously developed) with recent breakthroughs in nature-inspired algorithms demonstrates the modified ACO's superiority over modern techniques. By utilizing the modified ACO algorithm, the most efficient routes connecting the origin and destination of the metro route are generated. Additionally, the model is applied to the existing metro network to highlight variations between the model's results and the current network. The Google Maps platform, integrated with Python, handles real-time data, including land utilization, Geographical Information Systems (GIS) data, census information, and points of interest. This processing enables the identification of stops within the city and along the chosen routes. The resulting metro network showcases substantial benefits compared to conventional route planning methods, with noteworthy enhancements in workforce productivity, decreased planning time, and cost-efficiency. This study significantly enhances the efficiency of urban transport systems, specifically in rapidly changing metropolitan settings such as Chennai.
Submitted 4 July, 2024;
originally announced July 2024.
-
Large-scale quantum reservoir learning with an analog quantum computer
Authors:
Milan Kornjača,
Hong-Ye Hu,
Chen Zhao,
Jonathan Wurtz,
Phillip Weinberg,
Majd Hamdan,
Andrii Zhdanov,
Sergio H. Cantu,
Hengyun Zhou,
Rodrigo Araiza Bravo,
Kevin Bagnall,
James I. Basham,
Joseph Campo,
Adam Choukri,
Robert DeAngelo,
Paige Frederick,
David Haines,
Julian Hammett,
Ning Hsu,
Ming-Guang Hu,
Florian Huber,
Paul Niklas Jepsen,
Ningyuan Jia,
Thomas Karolyshyn,
Minho Kwon
, et al. (28 additional authors not shown)
Abstract:
Quantum machine learning has gained considerable attention as quantum technology advances, presenting a promising approach for efficiently learning complex data patterns. Despite this promise, most contemporary quantum methods require significant resources for variational parameter optimization and face issues with vanishing gradients, leading to experiments that are either limited in scale or lack potential for quantum advantage. To address this, we develop a general-purpose, gradient-free, and scalable quantum reservoir learning algorithm that harnesses the quantum dynamics of neutral-atom analog quantum computers to process data. We experimentally implement the algorithm, achieving competitive performance across various categories of machine learning tasks, including binary and multi-class classification, as well as timeseries prediction. Effective and improving learning is observed with increasing system sizes of up to 108 qubits, demonstrating the largest quantum machine learning experiment to date. We further observe comparative quantum kernel advantage in learning tasks by constructing synthetic datasets based on the geometric differences between generated quantum and classical data kernels. Our findings demonstrate the potential of utilizing classically intractable quantum correlations for effective machine learning. We expect these results to stimulate further extensions to different quantum hardware and machine learning paradigms, including early fault-tolerant hardware and generative machine learning tasks.
Submitted 2 July, 2024;
originally announced July 2024.
-
Localization beyond Dirac and Weyl fermions
Authors:
Adesh Singh,
Gargee Sharma
Abstract:
In condensed matter, limited symmetry constraints allow free fermionic excitations to exist beyond the conventional Weyl and Dirac electrons of high-energy physics. These excitations carry a higher pseudospin, providing a natural generalization to the Weyl fermion. How do electrons beyond the conventional Dirac and Weyl fermions localize under disorder? In this Letter, we solve the problem of localization of free fermionic excitations carrying an arbitrary pseudospin-s. We derive exact analytical expressions for fermionic wavefunctions, scattering time, renormalized velocity, Cooperon, and the magnetoconductivity. We discover that the gapless Cooperon mode solely depends on the pseudospin even when the Fermi surface is composed of multiple pockets, leading to weak localization (antilocalization) behavior for even (odd) s. Remarkably, we find the localization correction to scale exponentially with s, i.e., faster moving electrons are strongly susceptible to disorder effects. This opens up intriguing possibilities for Anderson localization and many-body localization in these materials.
Submitted 1 July, 2024;
originally announced July 2024.
-
Brevity is the soul of wit: Pruning long files for code generation
Authors:
Aaditya K. Singh,
Yu Yang,
Kushal Tirumala,
Mostafa Elhoushi,
Ari S. Morcos
Abstract:
Data curation is commonly considered a "secret-sauce" for LLM training, with higher quality data usually leading to better LLM performance. Given the scale of internet-scraped corpora, data pruning has become a larger and larger focus. Specifically, many have shown that de-duplicating data, or sub-selecting higher quality data, can lead to efficiency or performance improvements. Generally, three types of methods are used to filter internet-scale corpora: embedding-based, heuristic-based, and classifier-based. In this work, we contrast the former two in the domain of finetuning LLMs for code generation. We find that embedding-based methods are often confounded by length, and that a simple heuristic--pruning long files--outperforms other methods in compute-limited regimes. Our method can yield up to a 2x efficiency benefit in training (while matching performance) or a 3.5% absolute performance improvement on HumanEval (while matching compute). However, we find that perplexity on held-out long files can increase, raising the question of whether optimizing data mixtures for common coding benchmarks (HumanEval, MBPP) actually best serves downstream use cases. Overall, we hope our work builds useful intuitions about code data (specifically, the low quality of extremely long code files), provides a compelling heuristic-based method for data pruning, and brings to light questions in how we evaluate code generation models.
Submitted 29 June, 2024;
originally announced July 2024.
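The "pruning long files" heuristic named in the abstract above can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' pipeline: the helper names `token_length` and `prune_long_files`, the whitespace-based token count, and the `max_tokens` cutoff are all hypothetical, since the abstract does not say how length is measured or where the threshold sits.

```python
def token_length(text):
    # Assumption: whitespace-separated tokens stand in for a real tokenizer.
    return len(text.split())


def prune_long_files(files, max_tokens):
    """Keep only (path, text) pairs whose token count is at most max_tokens."""
    return [(path, text) for path, text in files if token_length(text) <= max_tokens]


# Toy corpus: two short files and one long one (hypothetical data).
corpus = [
    ("a.py", "x = 1"),
    ("b.py", "print ( 'hi' )"),
    ("c.py", " ".join(["y"] * 50)),
]
kept = prune_long_files(corpus, max_tokens=10)
print([path for path, _ in kept])
```

In practice the cutoff would more likely be chosen from the length distribution of the corpus, for example as a percentile, rather than fixed by hand.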
-
Arithmetic properties for generalized cubic partitions and overpartitions modulo a prime
Authors:
Tewodros Amdeberhan,
James A. Sellers,
Ajit Singh
Abstract:
A cubic partition is an integer partition wherein the even parts can appear in two colors. In this paper, we introduce the notion of generalized cubic partitions and prove a number of new congruences akin to the classical Ramanujan-type. We emphasize two methods of proofs, one elementary (relying significantly on functional equations) and the other based on modular forms. We close by proving analogous results for generalized overcubic partitions.
Submitted 15 June, 2024;
originally announced July 2024.
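The definition in the abstract above (even parts may appear in two colors) corresponds to the generating function $\prod_{n\ge 1} \frac{1}{(1-q^n)(1-q^{2n})}$, which can be expanded numerically. The sketch below covers ordinary cubic partitions only; the generalized version introduced in the paper is not defined in the abstract and is not attempted here. The function name `cubic_partitions` is an illustrative choice.

```python
def cubic_partitions(limit):
    """Return [a(0), ..., a(limit)], the number of cubic partitions of each n."""
    coeffs = [1] + [0] * limit
    for part in range(1, limit + 1):
        # Every part size comes in one colour; even part sizes get a second one.
        colours = 2 if part % 2 == 0 else 1
        for _ in range(colours):
            # Multiply the series by 1 / (1 - q^part), truncated at q^limit.
            for total in range(part, limit + 1):
                coeffs[total] += coeffs[total - part]
    return coeffs


counts = cubic_partitions(11)
print(counts)
# → [1, 1, 3, 4, 9, 12, 23, 31, 54, 73, 118, 159]
print([counts[n] % 3 for n in (2, 5, 8, 11)])
# → [0, 0, 0, 0]
```

The values at $n = 2, 5, 8, 11$ are all divisible by 3, a small numerical instance of the kind of Ramanujan-type congruence the abstract refers to.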
-
On cases where Litt's game is fair
Authors:
Anne-Laure Basdevant,
Olivier Hénard,
Edouard Maurel-Segala,
Arvind Singh
Abstract:
A fair coin is flipped $n$ times, and two finite sequences of heads and tails (words) $A$ and $B$ of the same length are given. Each time the word $A$ appears in the sequence of coin flips, Alice gets a point, and each time the word $B$ appears, Bob gets a point. Who is more likely to win? This puzzle is a slight extension of Litt's game that recently set Twitter abuzz. We show that Litt's game is fair for any value of $n$ and any two words that have the same auto-correlation structure by building up a bijection that exchanges Bob's and Alice's scores; the fact that the inter-correlation does not come into play in this case may come as a surprise.
Submitted 28 June, 2024;
originally announced June 2024.
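The game described in the abstract above can be checked by brute force for small $n$. The sketch below enumerates all $2^n$ flip sequences and counts overlapping occurrences of each word; the function names and the example words are illustrative choices, not taken from the paper. The words HHT and HTT both have only the trivial self-overlap, so they share the same auto-correlation structure.

```python
from itertools import product


def count_occurrences(seq, word):
    """Count occurrences of `word` in `seq`, overlaps included."""
    return sum(seq.startswith(word, i) for i in range(len(seq) - len(word) + 1))


def win_counts(a, b, n):
    """Return (alice_wins, bob_wins, ties) over all 2**n flip sequences."""
    alice = bob = ties = 0
    for flips in product("HT", repeat=n):
        seq = "".join(flips)
        score_a = count_occurrences(seq, a)
        score_b = count_occurrences(seq, b)
        if score_a > score_b:
            alice += 1
        elif score_b > score_a:
            bob += 1
        else:
            ties += 1
    return alice, bob, ties


print(win_counts("HH", "HT", 3))
# → (2, 3, 3)
for n in range(3, 9):
    alice, bob, ties = win_counts("HHT", "HTT", n)
    print(n, alice == bob)
```

For HH against HT the counts already differ at $n = 3$, whereas for HHT against HTT the two win counts agree for every $n$ tried, consistent with the fairness result stated in the abstract.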
-
Orientation reconstruction of transformation $α$ titanium alloys via polarized light microscopy: methodology and assessment
Authors:
Amit Singh,
Mark Obstalecki,
Darren C. Pagan,
Michael Glavicic,
Matthew Kasemer
Abstract:
Emerging microstructural characterization methods have received increased attention owing to their promise of relatively inexpensive and rapid measurement of polycrystalline surface morphology and crystallographic orientations. Among these nascent methods, polarized light microscopy (PLM) is attractive for characterizing alloys comprised of hexagonal crystals, but is hindered by its inability to measure complete crystal orientations. In this study, we explore the potential to reconstruct quasi-deterministic orientations for titanium microstructures characterized via PLM by considering the Burgers orientation relationship between the room temperature $α$ (HCP) phase fibers measured via PLM, and the $β$ (BCC) phase orientations of the parent grains present above the transus temperature. We describe this method -- which is capable of narrowing down the orientations to one of four possibilities -- and demonstrate its abilities on idealized computational samples in which the parent $β$ microstructure is fully, unambiguously known. We further utilize this method to inform the instantiation of samples for crystal plasticity simulations, and demonstrate the significant improvement in deformation field predictions when utilizing this reconstruction method compared to using results from traditional PLM.
Submitted 28 June, 2024;
originally announced June 2024.
-
Malaria Cell Detection Using Deep Neural Networks
Authors:
Saurabh Sawant,
Anurag Singh
Abstract:
Malaria remains one of the most pressing public health concerns globally, causing significant morbidity and mortality, especially in sub-Saharan Africa. Rapid and accurate diagnosis is crucial for effective treatment and disease management. Traditional diagnostic methods, such as microscopic examination of blood smears, are labor-intensive and require significant expertise, which may not be readily available in resource-limited settings. This project aims to automate the detection of malaria-infected cells using a deep learning approach. We employed a convolutional neural network (CNN) based on the ResNet50 architecture, leveraging transfer learning to enhance performance. The Malaria Cell Images Dataset from Kaggle, containing 27,558 images categorized into infected and uninfected cells, was used for training and evaluation. Our model demonstrated high accuracy, precision, and recall, indicating its potential as a reliable tool for assisting in malaria diagnosis. Additionally, a web application was developed using Streamlit to allow users to upload cell images and receive predictions about malaria infection, making the technology accessible and user-friendly. This paper provides a comprehensive overview of the methodology, experiments, and results, highlighting the effectiveness of deep learning in medical image analysis.
Submitted 28 June, 2024;
originally announced June 2024.
-
Ultrafast carrier dynamics in epitaxial graphene nanoribbons studied by time-resolved terahertz spectroscopy
Authors:
Arvind Singh,
Hynek Němec,
Jan Kunc,
Petr Kužel
Abstract:
Optical pump-terahertz probe spectroscopy has been used to investigate ultrafast photo-induced charge carrier transport in epitaxial graphene nanoribbons. The picosecond THz photoconductivity first increases with an increasing pump fluence and then it saturates at high fluences. This behavior is due to an interplay between contributions of the directly photoexcited carriers and the secondary carriers, which are equilibrium conduction band carriers strongly heated by the carrier-carrier scattering process after the photoexcitation. This phenomenon leads to a non-monotonic variation of the carrier mobility and plasmonic resonance frequency as a function of the pump fluence and, at higher fluences, to a balance between a decreasing carrier scattering time and an increasing Drude weight. In addition, a weak carrier localization occurring at low pump fluences is progressively lifted as a result of increasing initial carrier temperature.
Submitted 28 June, 2024;
originally announced June 2024.
-
In vivo and in vitro study of resorbable magnesium wires for medical implants: Mg purity, surface quality, Zn alloying and polymer coating
Authors:
K. Tesar,
J. Lunackova,
M. Jex,
M. Zaloudkova,
R. Vrbova,
M. Bartos,
P. Klein,
L. Vistejnova,
J. Duskova,
E. Filova,
Z. Sucharda,
M. Steinerova,
S. Habr,
K. Balik,
A. Singh
Abstract:
Magnesium is an excellent material in terms of biocompatibility and its corrosion products can serve as an active source for new bone formation. However, localized corrosion and H2 generation limit the potential of Mg-based implants. Utilizing low-alloyed Mg-Zn wires can strongly reduce problems with large H2 bubbles and improve the mechanical properties considerably while maintaining excellent long-term biocompatibility. Acidic pickling and a polymer coating can be effectively used to lower the rate of in vivo degradation. In this work, microstructural, mechanical, and in vitro characterization of 250 µm and 300 µm extruded wires made from ultra-pure Mg, commercially pure Mg, Mg-0.15Zn, Mg-0.4Zn and Mg-1Zn was performed. Additionally, Mg-0.4Zn wires together with a variant coated with a copolymer of L-lactide and ε-caprolactone were tested in vivo on artificially damaged Wistar rat femurs. Based on the observed Mg-induced osteogenesis, polymer-coated Mg wires with a small addition of Zn are a promising material for bone-support applications, such as cerclage and fixation wires.
Submitted 26 June, 2024;
originally announced June 2024.
-
Arboretum: A Large Multimodal Dataset Enabling AI for Biodiversity
Authors:
Chih-Hsuan Yang,
Benjamin Feuer,
Zaki Jubery,
Zi K. Deng,
Andre Nakkab,
Md Zahid Hasan,
Shivani Chiranjeevi,
Kelly Marshall,
Nirmal Baishnab,
Asheesh K Singh,
Arti Singh,
Soumik Sarkar,
Nirav Merchant,
Chinmay Hegde,
Baskar Ganapathysubramanian
Abstract:
We introduce Arboretum, the largest publicly accessible dataset designed to advance AI for biodiversity applications. This dataset, curated from the iNaturalist community science platform and vetted by domain experts to ensure accuracy, includes 134.6 million images, surpassing existing datasets in scale by an order of magnitude. The dataset encompasses image-language paired data for a diverse set of species from birds (Aves), spiders/ticks/mites (Arachnida), insects (Insecta), plants (Plantae), fungus/mushrooms (Fungi), snails (Mollusca), and snakes/lizards (Reptilia), making it a valuable resource for multimodal vision-language AI models for biodiversity assessment and agriculture research. Each image is annotated with scientific names, taxonomic details, and common names, enhancing the robustness of AI model training.
We showcase the value of Arboretum by releasing a suite of CLIP models trained using a subset of 40 million captioned images. We introduce several new benchmarks for rigorous assessment, report accuracy for zero-shot learning, and evaluations across life stages, rare species, confounding species, and various levels of the taxonomic hierarchy.
We anticipate that Arboretum will spur the development of AI models that can enable a variety of digital tools ranging from pest control strategies, crop monitoring, and worldwide biodiversity assessment and environmental conservation. These advancements are critical for ensuring food security, preserving ecosystems, and mitigating the impacts of climate change. Arboretum is publicly available, easily accessible, and ready for immediate use.
Please see the project website (https://baskargroup.github.io/Arboretum/) for links to our data, models, and code.
Submitted 25 June, 2024;
originally announced June 2024.
-
Optimizing Configuration Selection in Reconfigurable-Antenna MIMO Systems: Physics-Inspired Heuristic Solvers
Authors:
I. Krikidis,
C. Psomas,
A. K. Singh,
K. Jamieson
Abstract:
Reconfigurable antenna multiple-input multiple-output (MIMO) is a foundational technology for the continuing evolution of cellular systems, including upcoming 6G communication systems. In this paper, we address the problem of flexible/reconfigurable antenna configuration selection for point-to-point MIMO antenna systems by using physics-inspired heuristics. Firstly, we optimize the antenna configuration to maximize the signal-to-noise ratio (SNR) at the receiver by leveraging two basic heuristic solvers, i.e., coherent Ising machines (CIMs), that mimic quantum mechanical dynamics, and quantum annealing (QA), where a real-world QA architecture is considered (D-Wave). A mathematical framework that converts the configuration selection problem into CIM- and QA- compatible unconstrained quadratic formulations is investigated. Numerical and experimental results show that the proposed designs outperform classical counterparts and achieve near-optimal performance (similar to exhaustive search with exponential complexity) while ensuring polynomial complexity. Moreover, we study the optimal antenna configuration that maximizes the end-to-end Shannon capacity. A simulated annealing (SA) heuristic which achieves near-optimal performance through appropriate parameterization is adopted. A modified version of the basic SA that exploits parallel tempering to avoid local maxima is also studied, which provides additional performance gains. Extended numerical studies show that the SA solutions outperform conventional heuristics (which are also developed for comparison purposes), while the employment of the SNR-based solutions is highly sub-optimal.
Submitted 25 June, 2024;
originally announced June 2024.
-
GraphEval2000: Benchmarking and Improving Large Language Models on Graph Datasets
Authors:
Qiming Wu,
Zichen Chen,
Will Corcoran,
Misha Sra,
Ambuj K. Singh
Abstract:
Large language models (LLMs) have achieved remarkable success in natural language processing (NLP), demonstrating significant capabilities in processing and understanding text data. However, recent studies have identified limitations in LLMs' ability to reason about graph-structured data. To address this gap, we introduce GraphEval2000, the first comprehensive graph dataset, comprising 40 graph data structure problems along with 2000 test cases. Additionally, we introduce an evaluation framework based on GraphEval2000, designed to assess the graph reasoning abilities of LLMs through coding challenges. Our dataset categorizes test cases into four primary and four sub-categories, ensuring a comprehensive evaluation. We evaluate eight popular LLMs on GraphEval2000, revealing that LLMs exhibit a better understanding of directed graphs compared to undirected ones. While private LLMs consistently outperform open-source models, the performance gap is narrowing. Furthermore, to improve the usability of our evaluation framework, we propose Structured Symbolic Decomposition (SSD), an instruction-based method designed to enhance LLM performance on GraphEval2000. Results show that SSD improves the performance of GPT-3.5, GPT-4, and GPT-4o on complex graph problems, with an increase of 11.11%, 33.37%, and 33.37%, respectively.
Submitted 23 June, 2024;
originally announced June 2024.
-
Parabolic vector bundles and Lie algebroid connections
Authors:
David Alfaya,
Indranil Biswas,
Pradip Kumar,
Anoop Singh
Abstract:
Given a holomorphic Lie algebroid on an m-pointed Riemann surface, we define parabolic Lie algebroid connections on any parabolic vector bundle equipped with parabolic structure over the marked points. An analogue of the Atiyah exact sequence for parabolic Lie algebroids is constructed. For any Lie algebroid whose underlying holomorphic vector bundle is stable, we give a complete characterization of all the parabolic vector bundles that admit a parabolic Lie algebroid connection.
Submitted 22 June, 2024;
originally announced June 2024.
-
Keystroke Dynamics Against Academic Dishonesty in the Age of LLMs
Authors:
Debnath Kundu,
Atharva Mehta,
Rajesh Kumar,
Naman Lal,
Avinash Anand,
Apoorv Singh,
Rajiv Ratn Shah
Abstract:
The transition to online examinations and assignments raises significant concerns about academic integrity. Traditional plagiarism detection systems often struggle to identify instances of intelligent cheating, particularly when students utilize advanced generative AI tools to craft their responses. This study proposes a keystroke dynamics-based method to differentiate between bona fide and assisted writing within academic contexts. To facilitate this, a dataset was developed to capture the keystroke patterns of individuals engaged in writing tasks, both with and without the assistance of generative AI. The detector, trained using a modified TypeNet architecture, achieved accuracies ranging from 74.98% to 85.72% in condition-specific scenarios and from 52.24% to 80.54% in condition-agnostic scenarios. The findings highlight significant differences in keystroke dynamics between genuine and assisted writing. The outcomes of this study enhance our understanding of how users interact with generative AI and have implications for improving the reliability of digital educational platforms.
Submitted 21 June, 2024;
originally announced June 2024.
-
Differentiable-Optimization Based Neural Policy for Occlusion-Aware Target Tracking
Authors:
Houman Masnavi,
Arun Kumar Singh,
Farrokh Janabi-Sharifi
Abstract:
Tracking a target in cluttered and dynamic environments is challenging but forms a core component in applications like aerial cinematography. The obstacles in the environment not only pose collision risk but can also occlude the target from the field-of-view of the robot. Moreover, the target future trajectory may be unknown and only its current state can be estimated. In this paper, we propose a learned probabilistic neural policy for safe, occlusion-free target tracking.
The core novelty of our work stems from the structure of our policy network that combines generative modeling based on Conditional Variational Autoencoder (CVAE) with differentiable optimization layers. The role of the CVAE is to provide a base trajectory distribution which is then projected onto a learned feasible set through the optimization layer. Furthermore, both the weights of the CVAE network and the parameters of the differentiable optimization can be learned in an end-to-end fashion through demonstration trajectories.
We improve the state-of-the-art (SOTA) in the following respects. First, we show that our learned policy outperforms existing SOTA in terms of occlusion/collision avoidance capabilities and computation time. Second, we present an extensive ablation showing how different components of our learning pipeline contribute to the overall tracking task. We also demonstrate the real-time performance of our approach on resource-constrained hardware such as NVIDIA Jetson TX2. Finally, our learned policy can also be viewed as a reactive planner for navigation in highly cluttered environments.
Submitted 20 June, 2024;
originally announced June 2024.
-
Invariant rings of the special orthogonal group have nonunimodal $h$-vectors
Authors:
Aldo Conca,
Anurag K. Singh,
Matteo Varbaro
Abstract:
For $K$ an infinite field of characteristic other than two, consider the action of the special orthogonal group $\operatorname{SO}_t(K)$ on a polynomial ring via copies of the regular representation. When $K$ has characteristic zero, Boutot's theorem implies that the invariant ring has rational singularities; when $K$ has positive characteristic, the invariant ring is $F$-regular, as proven by Hashimoto using good filtrations. We give a new proof of this, viewing the invariant ring for $\operatorname{SO}_t(K)$ as a cyclic cover of the invariant ring for the corresponding orthogonal group; this point of view has a number of useful consequences, for example it readily yields the $a$-invariant and information on the Hilbert series. Indeed, we use this to show that the $h$-vector of the invariant ring for $\operatorname{SO}_t(K)$ need not be unimodal.
Submitted 20 June, 2024;
originally announced June 2024.
-
AMC: Access to Miss Correlation Prefetcher for Evolving Graph Analytics
Authors:
Abhishek Singh,
Christian Schulte,
Xiaochen Guo
Abstract:
Modern memory hierarchies work well with applications that have good spatial locality. Evolving (dynamic) graphs are important applications widely used to model graphs and networks with edge and vertex changes. They exhibit irregular memory access patterns and suffer from a high miss ratio and long miss penalty. Prefetching can be employed to predict and fetch future demand misses. However, current hardware prefetchers cannot efficiently predict for applications with irregular memory accesses. In evolving graph applications, vertices that do not change during graph changes exhibit the same access correlation patterns. Current temporal prefetchers use one-to-one or one-to-many correlation to exploit these patterns. Similar patterns are recorded in the same entry, which causes aliasing and can lead to poor prefetch accuracy and coverage. This work proposes a software-assisted hardware prefetcher for evolving graphs. The key idea is to record the correlations between a sequence of vertex accesses and the following misses and then prefetch when the same vertex access sequence occurs in the future. The proposed Access-to-Miss Correlation (AMC) prefetcher provides a lightweight programming interface to identify the data structures of interest and sets the iteration boundary to update the correlation table. For the evaluated applications, AMC achieves a geomean speedup of 1.5x as compared to the best-performing prefetcher in prior work (VLDP). AMC can achieve an average of 62% accuracy and coverage, whereas VLDP has an accuracy of 31% and coverage of 23%.
Submitted 20 June, 2024;
originally announced June 2024.
-
Global Human-guided Counterfactual Explanations for Molecular Properties via Reinforcement Learning
Authors:
Danqing Wang,
Antonis Antoniades,
Kha-Dinh Luong,
Edwin Zhang,
Mert Kosan,
Jiachen Li,
Ambuj Singh,
William Yang Wang,
Lei Li
Abstract:
Counterfactual explanations of Graph Neural Networks (GNNs) offer a powerful way to understand data that can naturally be represented by a graph structure. Furthermore, in many domains, it is highly desirable to derive data-driven global explanations or rules that can better explain the high-level properties of the models and data in question. However, evaluating global counterfactual explanations is hard in real-world datasets due to a lack of human-annotated ground truth, which limits their use in areas like molecular sciences. Additionally, the increasing scale of these datasets provides a challenge for random search-based methods. In this paper, we develop a novel global explanation model RLHEX for molecular property prediction. It aligns the counterfactual explanations with human-defined principles, making the explanations more interpretable and easy for experts to evaluate. RLHEX includes a VAE-based graph generator to generate global explanations and an adapter to adjust the latent representation space to human-defined principles. Optimized by Proximal Policy Optimization (PPO), the global explanations produced by RLHEX cover 4.12% more input graphs and reduce the distance between the counterfactual explanation set and the input set by 0.47% on average across three molecular datasets. RLHEX provides a flexible framework to incorporate different human-designed principles into the counterfactual explanation generation process, aligning these explanations with domain expertise. The code and data are released at https://github.com/dqwang122/RLHEX.
Submitted 19 June, 2024;
originally announced June 2024.
-
Connecting Rashba and Dresselhaus spin-orbit interactions to inversion asymmetry in perovskite oxide heterostructures
Authors:
Nirmal Ganguli,
Avishek Singh,
Vivek Kumar,
Jayita Chakraborty
Abstract:
Inversion asymmetry, combined with spin-orbit interaction, leads to Rashba or Dresselhaus effects, or combinations of them that are promising for technologies based on antiferromagnetic spintronics. Since understanding the exact nature of spin-orbit interaction is crucial for developing a technology based on it, mapping the nature of inversion asymmetry to the type of spin-orbit interaction becomes the key. We simulate a perovskite oxide heterostructure LaAlO$_3|$SrIrO$_3|$SrTiO$_3$ preserving the inversion symmetry within density functional theory to demonstrate the relation between the nature of inversion asymmetry and the corresponding Rashba or Dresselhaus-type interaction. With progressive distortion in the heterostructure, we find how the structure inversion asymmetry sets in with distorted bond lengths and bond angles, leading to the Rashba effect in the system. Further, introduction of tilted IrO$_6$ octahedra leads to bulk inversion asymmetry, helping a combined Rashba-Dresselhaus interaction to set in. A comparison of the spin textures obtained from our DFT calculations and theoretical modeling helps us identify the exact nature of the interactions. Besides demonstrating the connection between the nature of asymmetry and Rashba and Dresselhaus interactions, our work may serve as a guide to identifying different types of Rashba-like spin-orbit interactions.
Submitted 19 June, 2024;
originally announced June 2024.
-
Various Representation Dimensions associated with a Finite Group
Authors:
Anupam Singh,
Ayush Udeep
Abstract:
To a finite group $G$, one can associate several notions of dimensions (or degrees). In this survey, we attempt to bring together some of the notions of dimensions or degrees defined using representations of the group in General Linear Groups and permutation groups. These are embedding degree, minimal faithful irreducible character degree, minimal faithful permutation representation degree, minimal faithful quasi-permutation representation degree and essential dimension. We briefly present the progress in understanding these notions and the related problems.
Submitted 19 June, 2024;
originally announced June 2024.
-
Class-specific Data Augmentation for Plant Stress Classification
Authors:
Nasla Saleem,
Aditya Balu,
Talukder Zaki Jubery,
Arti Singh,
Asheesh K. Singh,
Soumik Sarkar,
Baskar Ganapathysubramanian
Abstract:
Data augmentation is a powerful tool for improving deep learning-based image classifiers for plant stress identification and classification. However, selecting an effective set of augmentations from a large pool of candidates remains a key challenge, particularly in imbalanced and confounding datasets. We propose an approach for automated class-specific data augmentation using a genetic algorithm. We demonstrate the utility of our approach on soybean [Glycine max (L.) Merr] stress classification where symptoms are observed on leaves; a particularly challenging problem due to confounding classes in the dataset. Our approach yields substantial performance, achieving a mean-per-class accuracy of 97.61% and an overall accuracy of 98% on the soybean leaf stress dataset. Our method significantly improves the accuracy of the most challenging classes, with notable enhancements from 83.01% to 88.89% and from 85.71% to 94.05%, respectively.
A key observation we make in this study is that high-performing augmentation strategies can be identified in a computationally efficient manner. We fine-tune only the linear layer of the baseline model with different augmentations, thereby reducing the computational burden associated with training classifiers from scratch for each augmentation policy while achieving exceptional performance. This research represents an advancement in automated data augmentation strategies for plant stress classification, particularly in the context of confounding datasets. Our findings contribute to the growing body of research in tailored augmentation techniques and their potential impact on disease management strategies, crop yields, and global food security. The proposed approach holds the potential to enhance the accuracy and efficiency of deep learning-based tools for managing plant stresses in agriculture.
Submitted 18 June, 2024;
originally announced June 2024.
-
Segregation Kinetics of Miktoarm Star Polymers: A Dissipative Particle Dynamics Study
Authors:
Dorothy Gogoi,
Avinash Chauhan,
Sanjay Puri,
Awaneesh Singh
Abstract:
We study the phase separation kinetics of miktoarm star polymer (MSP) melts and blends with diverse architectures using dissipative particle dynamics simulations. Our study focuses on symmetric and asymmetric miktoarm star polymer (SMSP/AMSP) mixtures based on arm composition and number. For a fixed MSP chain size, the characteristic microphase-separated domains initially show diffusive growth with a growth exponent $φ\sim 1/3$ for both melts, which gradually crosses over to saturation at late times. The simulation results demonstrate that the evolution morphology of SMSP melts exhibits perfect dynamic scaling with varying arm numbers; the time scale follows a power-law decay with an exponent $θ\simeq 1$ as the number of arms increases. The structural constraints on AMSP melts cause the domain growth rate to decrease as the number of one type of arms increases while their length remains fixed. This increase in the number of arms for AMSP corresponds to increased off-criticality. The saturation length in AMSP follows a power-law increase with an exponent $λ\simeq 2/3$ as off-criticality decreases. Additionally, macrophase separation kinetics in SMSP/AMSP blends show a transition from viscous ($φ\sim 1$) to inertial ($φ\sim 2/3$) hydrodynamic growth regimes at late times; this exhibits the same dynamical universality class as linear polymer blends, with slight deviations at early stages.
Submitted 15 June, 2024;
originally announced June 2024.
-
Quantifying Variance in Evaluation Benchmarks
Authors:
Lovish Madaan,
Aaditya K. Singh,
Rylan Schaeffer,
Andrew Poulton,
Sanmi Koyejo,
Pontus Stenetorp,
Sharan Narang,
Dieuwke Hupkes
Abstract:
Evaluation benchmarks are the cornerstone of measuring capabilities of large language models (LLMs), as well as driving progress in said capabilities. Originally designed to make claims about capabilities (or lack thereof) in fully pretrained models, evaluation benchmarks are now also extensively used to decide between various training choices. Despite this widespread usage, we rarely quantify the variance in our evaluation benchmarks, which dictates whether differences in performance are meaningful. Here, we define and measure a range of metrics geared towards measuring variance in evaluation benchmarks, including seed variance across initialisations, and monotonicity during training. By studying a large number of models -- both openly available and pretrained from scratch -- we provide empirical estimates for a variety of variance metrics, with considerations and recommendations for practitioners. We also evaluate the utility and tradeoffs of continuous versus discrete performance measures and explore options for better understanding and reducing this variance. We find that simple changes, such as framing choice tasks (like MMLU) as completion tasks, can often reduce variance for smaller scale ($\sim$7B) models, while more involved methods inspired from human testing literature (such as item analysis and item response theory) struggle to meaningfully reduce variance. Overall, our work provides insights into variance in evaluation benchmarks, suggests LM-specific techniques to reduce variance, and more generally encourages practitioners to carefully factor in variance when comparing models.
Submitted 14 June, 2024;
originally announced June 2024.
-
Temporal Planning via Interval Logic Satisfiability for Autonomous Systems
Authors:
Miquel Ramirez,
Anubhav Singh,
Peter Stuckey,
Chris Manzie
Abstract:
Many automated planning methods and formulations rely on suitably designed abstractions or simplifications of the constrained dynamics associated with agents to attain computational scalability. We consider formulations of temporal planning where intervals are associated with both action and fluent atoms, and relations between these are given as sentences in Allen's Interval Logic. We propose a notion of planning graphs that can account for complex concurrency relations between actions and fluents as a Constraint Programming (CP) model. We test an implementation of our algorithm on a state-of-the-art framework for CP and compare it with PDDL 2.1 planners that capture plans requiring complex concurrent interactions between agents. We demonstrate our algorithm outperforms existing PDDL 2.1 planners in the case studies. Still, scalability remains challenging when plans must comply with intricate concurrent interactions and the sequencing of actions.
Submitted 13 June, 2024;
originally announced June 2024.
-
Distribution of hooks in self-conjugate partitions
Authors:
William Craig,
Ken Ono,
Ajit Singh
Abstract:
We confirm the speculation that the distribution of $t$-hooks among unrestricted integer partitions essentially descends to self-conjugate partitions. Namely, we prove that the number of hooks of length $t$ among the size $n$ self-conjugate partitions is asymptotically normally distributed with mean
$μ_t(n) \sim \frac{\sqrt{6n}}π + \frac{3}{π^2} - \frac{t}{2}$ and variance $σ_t^2(n) \sim \frac{(π^2 - 6) \sqrt{6n}}{π^3}.$
Submitted 13 June, 2024;
originally announced June 2024.
-
Faster Spectral Density Estimation and Sparsification in the Nuclear Norm
Authors:
Yujia Jin,
Ishani Karmarkar,
Christopher Musco,
Aaron Sidford,
Apoorv Vikram Singh
Abstract:
We consider the problem of estimating the spectral density of the normalized adjacency matrix of an $n$-node undirected graph. We provide a randomized algorithm that, with $O(nε^{-2})$ queries to a degree and neighbor oracle and in $O(nε^{-3})$ time, estimates the spectrum up to $ε$ accuracy in the Wasserstein-1 metric. This improves on previous state-of-the-art methods, including an $O(nε^{-7})$ time algorithm from [Braverman et al., STOC 2022] and, for sufficiently small $ε$, a $2^{O(ε^{-1})}$ time method from [Cohen-Steiner et al., KDD 2018]. To achieve this result, we introduce a new notion of graph sparsification, which we call nuclear sparsification. We provide an $O(nε^{-2})$-query and $O(nε^{-2})$-time algorithm for computing $O(nε^{-2})$-sparse nuclear sparsifiers. We show that this bound is optimal in both its sparsity and query complexity, and we separate our results from the related notion of additive spectral sparsification. Of independent interest, we show that our sparsification method also yields the first deterministic algorithm for spectral density estimation that scales linearly with $n$ (sublinear in the representation size of the graph).
Submitted 11 June, 2024;
originally announced June 2024.
-
Hybrid Reinforcement Learning from Offline Observation Alone
Authors:
Yuda Song,
J. Andrew Bagnell,
Aarti Singh
Abstract:
We consider the hybrid reinforcement learning setting where the agent has access to both offline data and online interactive access. While Reinforcement Learning (RL) research typically assumes offline data contains complete action, reward and transition information, datasets with only state information (also known as observation-only datasets) are more general, abundant and practical. This motivates our study of the hybrid RL with observation-only offline dataset framework. While the task of competing with the best policy "covered" by the offline data can be solved if a reset model of the environment is provided (i.e., one that can be reset to any state), we show evidence of hardness when only given the weaker trace model (i.e., one can only reset to the initial states and must produce full traces through the environment), without further assumption of admissibility of the offline data. Under the admissibility assumptions -- that the offline data could actually be produced by the policy class we consider -- we propose the first algorithm in the trace model setting that provably matches the performance of algorithms that leverage a reset model. We also perform proof-of-concept experiments that suggest the effectiveness of our algorithm in practice.
Submitted 11 June, 2024;
originally announced June 2024.
-
Scaling the Vocabulary of Non-autoregressive Models for Efficient Generative Retrieval
Authors:
Ravisri Valluri,
Akash Kumar Mohankumar,
Kushal Dave,
Amit Singh,
Jian Jiao,
Manik Varma,
Gaurav Sinha
Abstract:
Generative Retrieval introduces a new approach to Information Retrieval by reframing it as a constrained generation task, leveraging recent advancements in Autoregressive (AR) language models. However, AR-based Generative Retrieval methods suffer from high inference latency and cost compared to traditional dense retrieval techniques, limiting their practical applicability. This paper investigates fully Non-autoregressive (NAR) language models as a more efficient alternative for generative retrieval. While standard NAR models alleviate latency and cost concerns, they exhibit a significant drop in retrieval performance (compared to AR models) due to their inability to capture dependencies between target tokens. To address this, we question the conventional choice of limiting the target token space to solely words or sub-words. We propose PIXAR, a novel approach that expands the target vocabulary of NAR models to include multi-word entities and common phrases (up to 5 million tokens), thereby reducing token dependencies. PIXAR employs inference optimization strategies to maintain low inference latency despite the significantly larger vocabulary. Our results demonstrate that PIXAR achieves a relative improvement of 31.0% in MRR@10 on MS MARCO and 23.2% in Hits@5 on Natural Questions compared to standard NAR models with similar latency and cost. Furthermore, online A/B experiments on a large commercial search engine show that PIXAR increases ad clicks by 5.08% and revenue by 4.02%.
Submitted 10 June, 2024;
originally announced June 2024.
-
Exploring Topic Modelling of User Reviews as a Monitoring Mechanism for Emergent Issues Within Social VR Communities
Authors:
Angelo Singh,
Joseph O'Hagan
Abstract:
Users of social virtual reality (VR) platforms often use user reviews to document incidents of witnessed and/or experienced user harassment. However, at present, research has yet to explore utilising this data as a monitoring mechanism to identify emergent issues within social VR communities. Such a system would be of much benefit to developers and researchers as it would enable the automatic identification of emergent issues as they occur, provide a means of longitudinally analysing harassment, and reduce the reliance on alternative, high cost, monitoring methodologies, e.g. observation or interview studies. To contribute towards the development of such a system, we collected approximately 40,000 Rec Room user reviews from the Steam storefront. We then analysed our dataset's sentiment and word/term frequencies, and conducted a topic modelling analysis of the negative reviews detected in our dataset. We report that our approach was capable of longitudinally monitoring changes in review sentiment and identifying high level themes related to types of harassment known to occur in social VR platforms.
Submitted 6 June, 2024;
originally announced June 2024.
-
How Good is Zero-Shot MT Evaluation for Low Resource Indian Languages?
Authors:
Anushka Singh,
Ananya B. Sai,
Raj Dabre,
Ratish Puduppully,
Anoop Kunchukuttan,
Mitesh M Khapra
Abstract:
While machine translation evaluation has been studied primarily for high-resource languages, there has been recent interest in evaluation for low-resource languages due to the increasing availability of data and models. In this paper, we study a zero-shot evaluation setting for low-resource Indian languages, namely Assamese, Kannada, Maithili, and Punjabi. We collect sufficient Multi-Dimensional Quality Metrics (MQM) and Direct Assessment (DA) annotations to create test sets and meta-evaluate a plethora of automatic evaluation metrics. We observe that even for learned metrics, which are known to exhibit zero-shot performance, the Kendall Tau and Pearson correlations with human annotations are only as high as 0.32 and 0.45. Synthetic data approaches show mixed results and overall do not help close the gap by much for these languages. This indicates that there is still a long way to go for low-resource evaluation.
Submitted 6 June, 2024;
originally announced June 2024.
-
Room-temperature tunable tunneling magnetoresistance in Fe3GaTe2/WSe2/Fe3GaTe2 van der Waals heterostructures
Authors:
Haiyang Pan,
Anil Kumar Singh,
Chusheng Zhang,
Xueqi Hu,
Jiayu Shi,
Liheng An,
Naizhou Wang,
Ruihuan Duan,
Zheng Liu,
Stuart S. P. Parkin,
Pritam Deb,
Weibo Gao
Abstract:
The exceptional properties of two-dimensional (2D) magnet materials present a novel approach to fabricate functional magnetic tunnel junctions (MTJ) by constructing full van der Waals (vdW) heterostructures with atomically sharp and clean interfaces. The exploration of vdW MTJ devices with high working temperature and adjustable functionalities holds great potential for advancing the application of 2D materials in magnetic sensing and data storage. Here, we report the observation of highly tunable room-temperature tunneling magnetoresistance through electronic means in a full vdW Fe3GaTe2/WSe2/Fe3GaTe2 MTJ. The spin valve effect of the MTJ can be detected even with the current below 1 nA, both at low and room temperatures, yielding a tunneling magnetoresistance (TMR) of 340% at 2 K and 50% at 300 K, respectively. Importantly, the magnitude and sign of TMR can be modulated by a DC bias current, even at room temperature, a capability that was previously unrealized in full vdW MTJs. This tunable TMR arises from the contribution of energy-dependent localized spin states in the metallic ferromagnet Fe3GaTe2 during tunnel transport when a finite electrical bias is applied. Our work offers a new perspective for designing and exploring room-temperature tunable spintronic devices based on vdW magnet heterostructures.
Submitted 5 June, 2024;
originally announced June 2024.
-
Simplicial complexes and matroids with vanishing $T^2$
Authors:
Alexandru Constantinescu,
Patricia Klein,
Thai Thanh Nguyen,
Anurag Singh,
Lorenzo Venturello
Abstract:
We investigate quotients by radical monomial ideals for which $T^2$, the second cotangent cohomology module, vanishes. The dimension of the graded components of $T^2$, and thus their vanishing, depends only on the combinatorics of the corresponding simplicial complex. We give both a complete characterization and a full list of one dimensional complexes with $T^2=0$. We characterize the graded components of $T^2$ when the simplicial complex is a uniform matroid. Finally, we show that $T^2$ vanishes for all matroids of corank at most two and conjecture that all connected matroids with vanishing $T^2$ are of corank at most two.
Submitted 4 June, 2024;
originally announced June 2024.
-
A Study of Optimizations for Fine-tuning Large Language Models
Authors:
Arjun Singh,
Nikhil Pandey,
Anup Shirgaonkar,
Pavan Manoj,
Vijay Aski
Abstract:
Fine-tuning large language models is a popular choice among users trying to adapt them for specific applications. However, fine-tuning these models is a demanding task because the user has to examine several factors, such as resource budget, runtime, model size and context length among others. A specific challenge is that fine-tuning is memory intensive, imposing constraints on the required hardware memory and context length of training data that can be handled. In this work, we share a detailed study on a variety of fine-tuning optimizations across different fine-tuning scenarios. In particular, we assess Gradient Checkpointing, Low-Rank Adaptation, DeepSpeed's Zero Redundancy Optimizer and FlashAttention. With a focus on memory and runtime, we examine the impact of different optimization combinations on GPU memory usage and execution runtime during fine-tuning phase. We provide our recommendation on the best default optimization for balancing memory and runtime across diverse model sizes. We share effective strategies for fine-tuning very large models with tens or hundreds of billions of parameters and enabling large context lengths during fine-tuning. Furthermore, we propose the appropriate optimization mixtures for fine-tuning under GPU resource limitations.
Submitted 6 June, 2024; v1 submitted 4 June, 2024;
originally announced June 2024.
-
Fluids flow in granular aggregate packings reconstructed by high-energy X-ray computed tomography and lattice Boltzmann method
Authors:
Qifeng Lyu,
Anguo Chen,
Jie Jia,
Amardeep Singh,
Pengfei Dai
Abstract:
The properties of fluid flow in granular aggregates are important for the design of pervious infrastructure used to alleviate urban waterlogging. In this work, five groups of aggregate packings with similar average porosities but varying particle sizes were scanned by a high-energy X-ray computed tomography (X-CT) facility. The structures of the packings were reconstructed. Porosities were calculated and compared with those measured from the volume and mass of infilled water in the packing. Pore networks were then extracted and analyzed. Fluid flow in the packings was simulated using a lattice Boltzmann method (LBM) with the BGK (Bhatnagar-Gross-Krook) collision model in the pore-network domain of the packings. The results showed that the wall effect on the porosity of the aggregate packings was significant and that its influence increased with aggregate size. In addition, a Poisson law and a power law can be used to fit the coordination number and coordination volume of the packing's pore network, respectively. Moreover, the mass flow rates of fluids in the aggregates were affected by the porosities. On two-dimensional slices, the mass flow rate decreased as the slice porosity increased, but for three-dimensional blocks the average mass flow rate increased with the volume porosity. As the aggregate size changed, the permeability of the packings followed a trend correlated with the average pore diameter and with the fitting parameters of the coordination volumes. Although merging interfaces caused fluctuations and discontinuities in the micro-scale parameters of the fluid flow, the methods and results presented here may provide knowledge and insights for numerical simulations and optimal design of aggregate-based materials.
Submitted 2 June, 2024;
originally announced June 2024.
-
Understanding Preference Fine-Tuning Through the Lens of Coverage
Authors:
Yuda Song,
Gokul Swamy,
Aarti Singh,
J. Andrew Bagnell,
Wen Sun
Abstract:
Learning from human preference data has emerged as the dominant paradigm for fine-tuning large language models (LLMs). The two most common families of techniques -- online reinforcement learning (RL) such as Proximal Policy Optimization (PPO) and offline contrastive methods such as Direct Preference Optimization (DPO) -- were positioned as equivalent in prior work due to the fact that both have to start from the same offline preference dataset. To further expand our theoretical understanding of the similarities and differences between online and offline techniques for preference fine-tuning, we conduct a rigorous analysis through the lens of dataset coverage, a concept that captures how the training data covers the test distribution and is widely used in RL. We prove that a global coverage condition is both necessary and sufficient for offline contrastive methods to converge to the optimal policy, but a weaker partial coverage condition suffices for online RL methods. This separation provides one explanation of why online RL methods can perform better than offline methods, especially when the offline preference data is not diverse enough. Finally, motivated by our preceding theoretical observations, we derive a hybrid preference optimization (HyPO) algorithm that uses offline data for contrastive-based preference optimization and online data for KL regularization. Theoretically and empirically, we demonstrate that HyPO is more performant than its pure offline counterpart DPO, while still preserving its computation and memory efficiency.
Submitted 3 June, 2024;
originally announced June 2024.
-
Progenitor and explosion properties of SN 2023ixf estimated based on a light-curve model grid of Type II supernovae
Authors:
Takashi J. Moriya,
Avinash Singh
Abstract:
We estimate the progenitor and explosion properties of the nearby Type II SN 2023ixf using a synthetic model grid of Type II supernova light curves. By comparing the light curves of SN 2023ixf with the pre-existing grid of Type II supernovae containing about 228,000 models with different combinations of the progenitor and explosion properties, we obtain the chi2 value for every model and evaluate the properties of the models providing small values of chi2. We found that the light-curve models with the progenitor zero-age main-sequence mass of 10 Msun, the explosion energy of (2-3)e51 erg, the 56Ni mass of 0.04-0.06 Msun, the mass-loss rate of 1e-3 - 1e-2 Msun/yr with a wind velocity of 10 km/s, and the dense, confined circumstellar matter radius of (6-10)e14 cm match well to the observed light curves of SN 2023ixf. The photospheric velocity evolution of these models is also consistent with the observed velocity evolution. Although our parameter estimation is based on a pre-existing model grid and we do not perform any additional computations, the estimated parameters are consistent with those obtained by the detailed modeling of SN 2023ixf previously reported. This result shows that comparing the pre-existing model grid is a reasonable way to obtain a rough estimate for the properties of Type II supernovae. This simple way to estimate the properties of Type II supernovae will be essential in the Vera C. Rubin Observatory's Legacy Survey of Space and Time (LSST) era when thousands of Type II supernovae are expected to be discovered yearly.
Submitted 2 June, 2024;
originally announced June 2024.
-
ViSpeR: Multilingual Audio-Visual Speech Recognition
Authors:
Sanath Narayan,
Yasser Abdelaziz Dahou Djilali,
Ankit Singh,
Eustache Le Bihan,
Hakim Hacid
Abstract:
This work presents an extensive and detailed study on Audio-Visual Speech Recognition (AVSR) for five widely spoken languages: Chinese, Spanish, English, Arabic, and French. We have collected large-scale datasets for each language except for English, and have engaged in the training of supervised learning models. Our model, ViSpeR, is trained in a multi-lingual setting, resulting in competitive performance on newly established benchmarks for each language. The datasets and models are released to the community with an aim to serve as a foundation for triggering and feeding further research work and exploration on Audio-Visual Speech Recognition, an increasingly important area of research. Code available at https://github.com/YasserdahouML/visper.
Submitted 27 May, 2024;
originally announced June 2024.
-
Unravelling the asphericities in the explosion and multi-faceted circumstellar matter of SN 2023ixf
Authors:
Avinash Singh,
R. S. Teja,
T. J. Moriya,
K. Maeda,
K. S. Kawabata,
M. Tanaka,
R. Imazawa,
T. Nakaoka,
A. Gangopadhyay,
M. Yamanaka,
V. Swain,
D. K. Sahu,
G. C. Anupama,
B. Kumar,
R. M. Anche,
Y. Sano,
A. Raj,
V. K. Agnihotri,
V. Bhalerao,
D. Bisht,
M. S. Bisht,
K. Belwal,
S. K. Chakrabarti,
M. Fujii,
T. Nagayama
, et al. (11 additional authors not shown)
Abstract:
We present a detailed investigation of photometric, spectroscopic, and polarimetric observations of the Type II SN 2023ixf. The early detection of highly-ionized flash features and the rapid ascent in ultraviolet flux, coupled with the blueward shift in near-ultraviolet colors and temperature, provide compelling evidence for a delayed shock breakout from a confined dense circumstellar matter (CSM) enveloping the progenitor star. The temporal evolution of polarization in the SN 2023ixf phase revealed three distinct peaks in polarization evolution at 1.4 d, 6.4 d, and 79.2 d, indicating an asymmetric dense CSM, an aspherical shock front and clumpiness in the low-density extended CSM, and an aspherical inner ejecta/He-core. SN 2023ixf displayed two dominant axes, one along the CSM-outer ejecta and the other along the inner ejecta/He-core, showcasing the independent origin of asymmetry in the early and late evolution. The argument for an aspherical shock front is further strengthened by the presence of a high-velocity broad absorption feature in the blue wing of the Balmer features in addition to the P-Cygni absorption post 16 d. Hydrodynamical light curve modeling indicated a progenitor mass of 10 solar masses with a radius of 470 solar radii, explosion energy of 2e51 erg, and 0.06 solar masses of 56Ni. The modeling also indicated a two-zone CSM: a confined dense CSM extending up to 5e14 cm, with a mass-loss rate of 1e-2 solar masses per year, and an extended CSM spanning from 5e14 cm to 1e16 cm with a mass-loss rate of 1e-4 solar masses per year. The early nebular phase observations display an axisymmetric line profile of [OI] and red-ward attenuation of the emission of Halpha post 125 days, marking the onset of dust formation.
Submitted 31 May, 2024;
originally announced May 2024.
-
Can GPT Redefine Medical Understanding? Evaluating GPT on Biomedical Machine Reading Comprehension
Authors:
Shubham Vatsal,
Ayush Singh
Abstract:
Large language models (LLMs) have shown remarkable performance on many tasks in different domains. However, their performance in closed-book biomedical machine reading comprehension (MRC) has not been evaluated in depth. In this work, we evaluate GPT on four closed-book biomedical MRC benchmarks. We experiment with different conventional prompting techniques as well as introduce our own novel prompting method. To solve some of the retrieval problems inherent to LLMs, we propose a prompting strategy named Implicit Retrieval Augmented Generation (RAG) that alleviates the need for using vector databases to retrieve important chunks in traditional RAG setups. Moreover, we report qualitative assessments on the natural language generation outputs from our approach. The results show that our new prompting technique achieves the best performance on two out of four datasets and ranks second on the rest. Experiments show that modern-day LLMs like GPT, even in a zero-shot setting, can outperform supervised models, leading to new state-of-the-art (SoTA) results on two of the benchmarks.
Submitted 28 May, 2024;
originally announced May 2024.
-
Learning Social Welfare Functions
Authors:
Kanad Shrikar Pardeshi,
Itai Shapira,
Ariel D. Procaccia,
Aarti Singh
Abstract:
Is it possible to understand or imitate a policy maker's rationale by looking at past decisions they made? We formalize this question as the problem of learning social welfare functions belonging to the well-studied family of power mean functions. We focus on two learning tasks; in the first, the input is vectors of utilities of an action (decision or policy) for individuals in a group and their associated social welfare as judged by a policy maker, whereas in the second, the input is pairwise comparisons between the welfares associated with a given pair of utility vectors. We show that power mean functions are learnable with polynomial sample complexity in both cases, even if the social welfare information is noisy. Finally, we design practical algorithms for these tasks and evaluate their performance.
Submitted 27 May, 2024;
originally announced May 2024.
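To make the power mean family from the abstract above concrete, here is a minimal sketch of the standard unweighted power mean $M_p(u) = \left(\frac{1}{n}\sum_i u_i^p\right)^{1/p}$ applied to a utility vector. It is an illustration under stated assumptions, not the paper's learning algorithm: utilities are assumed strictly positive, weights are assumed uniform, and the function name `power_mean` is hypothetical. The limiting cases are handled explicitly: $p = 0$ gives the geometric mean, and $p \to \pm\infty$ give the maximum and minimum utility.

```python
import math
from typing import Sequence


def power_mean(utilities: Sequence[float], p: float) -> float:
    """Unweighted power mean of strictly positive utilities.

    p = 1 is the arithmetic (utilitarian) mean, p = 0 the geometric mean,
    p = -inf the minimum (egalitarian) and p = +inf the maximum.
    """
    if len(utilities) == 0:
        raise ValueError("utilities must be non-empty")
    if any(u <= 0 for u in utilities):
        raise ValueError("this sketch assumes strictly positive utilities")

    n = len(utilities)
    if p == 0:
        # Limit p -> 0: geometric mean, computed in log space.
        return math.exp(sum(math.log(u) for u in utilities) / n)
    if p == math.inf:
        return max(utilities)
    if p == -math.inf:
        return min(utilities)
    return (sum(u ** p for u in utilities) / n) ** (1.0 / p)
```

A single parameter `p` thus interpolates between egalitarian and utilitarian judgments, which is what makes the family a natural target for learning from observed welfare values or pairwise comparisons.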
-
Multi-agent Collaborative Perception for Robotic Fleet: A Systematic Review
Authors:
Apoorv Singh,
Gaurav Raut,
Alka Choudhary
Abstract:
Collaborative perception in multi-robot fleets is a way to incorporate the power of unity in robotic fleets. Collaborative perception refers to the collective ability of multiple entities or agents to share and integrate their sensory information for a more comprehensive understanding of their environment. In other words, it involves the collaboration and fusion of data from various sensors or sources to enhance perception and decision-making capabilities. By combining data from diverse sources, such as cameras, lidar, radar, or other sensors, the system can create a more accurate and robust representation of the environment. In this review paper, we have summarized findings from 20+ research papers on collaborative perception. Moreover, we discuss testing and evaluation frameworks commonly accepted in academia and industry for autonomous vehicles and autonomous mobile robots. Our experiments with the trivial perception module show an improvement of over 200% with collaborative perception compared to individual robot perception. Here's our GitHub repository that shows the benefits of collaborative perception: https://github.com/synapsemobility/synapseBEV
Submitted 22 March, 2024;
originally announced May 2024.