subscribe to arXiv mailings

Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting

Authors: Zilong Wang, Zifeng Wang, Long Le, Huaixiu Steven Zheng, Swaroop Mishra, Vincent Perot, Yuwei Zhang, Anush Mattapalli, Ankur Taly, Jingbo Shang, Chen-Yu Lee, Tomas Pfister

Abstract: Retrieval augmented generation (RAG) combines the generative abilities of large language models (LLMs) with external knowledge sources to provide more accurate and up-to-date responses. Recent RAG advancements focus on improving retrieval outcomes through iterative LLM refinement or self-critique capabilities acquired through additional instruction tuning of LLMs. In this work, we introduce Specul… ▽ More Retrieval augmented generation (RAG) combines the generative abilities of large language models (LLMs) with external knowledge sources to provide more accurate and up-to-date responses. Recent RAG advancements focus on improving retrieval outcomes through iterative LLM refinement or self-critique capabilities acquired through additional instruction tuning of LLMs. In this work, we introduce Speculative RAG - a framework that leverages a larger generalist LM to efficiently verify multiple RAG drafts produced in parallel by a smaller, distilled specialist LM. Each draft is generated from a distinct subset of retrieved documents, offering diverse perspectives on the evidence while reducing input token counts per draft. This approach enhances comprehension of each subset and mitigates potential position bias over long context. Our method accelerates RAG by delegating drafting to the smaller specialist LM, with the larger generalist LM performing a single verification pass over the drafts. Extensive experiments demonstrate that Speculative RAG achieves state-of-the-art performance with reduced latency on TriviaQA, MuSiQue, PubHealth, and ARC-Challenge benchmarks. It notably enhances accuracy by up to 12.97% while reducing latency by 51% compared to conventional RAG systems on PubHealth. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: Preprint

arXiv:2407.07956 [pdf, other]

Inflationary Gravitational Waves as a probe of the unknown post-inflationary primordial Universe

Authors: Athul K. Soman, Swagat S. Mishra, Mohammed Shafi, Soumen Basak

Abstract: One of the key predictions of the standard inflationary paradigm is the quantum mechanical generation of the transverse and traceless tensor fluctuations due to the rapid accelerated expansion of space, which later constitute a stochastic background of primordial gravitational waves (GWs). The amplitude of the (nearly) scale-invariant inflationary tensor power spectrum at large scales provides us… ▽ More One of the key predictions of the standard inflationary paradigm is the quantum mechanical generation of the transverse and traceless tensor fluctuations due to the rapid accelerated expansion of space, which later constitute a stochastic background of primordial gravitational waves (GWs). The amplitude of the (nearly) scale-invariant inflationary tensor power spectrum at large scales provides us with crucial information about the energy scale of inflation in the case of the minimal inflaton coupling to gravity. Furthermore, the spectral energy density, $Ω_{_{\rm GW}}(f)$, of the GWs at sufficiently small scales (or, large frequencies $f$) serves as an important observational probe of post-inflationary primordial dynamics. In fact, the small-scale spectral tilt, $n_{_{\rm GW}} = \frac{{\rm d}\log{Ω_{_{\rm GW}}}}{{\rm d}\log{f}}$, of the spectral energy density of GWs is sensitive to the (unknown) post-inflationary equation of state (EoS), $w$, of the universe; with a softer EoS ($w < 1/3$) leading to a red tilt: $n_{_{\rm GW}} < 0$, while a stiffer EoS ($w > 1/3$) resulting in a blue tilt: $n_{_{\rm GW}} > 0$. The post-inflationary dynamics, however, is generically expected to be quite complex, potentially involving a number of distinct phases. Hence, in this work, we discuss the possibility of multiple sharp transitions, namely $w_1 \to w_2 \to w_3 \to ... \to w_n$, in the EoS of the post-inflationary universe and compute the corresponding spectral energy density of the inflationary GWs. We explicitly determine the region of the parameter space $\lbrace{ w_1, \, w_2, \, w_3, ..., w_n\rbrace}$ which leads to a potentially detectable signal in the upcoming GW detectors, without violating the current constraints. △ Less

Submitted 10 July, 2024; originally announced July 2024.

Comments: 47 pages, 12 figures, Github link provided in the paper

arXiv:2407.07132 [pdf, other]

The neutron star mass, distance, and inclination from precision timing of the brilliant millisecond pulsar J0437$-$4715

Authors: Daniel J. Reardon, Matthew Bailes, Ryan M. Shannon, Chris Flynn, Jacob Askew, N. D. Ramesh Bhat, Zu-Cheng Chen, Małgorzata Curyło, Yi Feng, George B. Hobbs, Agastya Kapur, Matthew Kerr, Xiaojin Liu, Richard N. Manchester, Rami Mandow, Saurav Mishra, Christopher J. Russell, Mohsen Shamohammadi, Lei Zhang, Andrew Zic

Abstract: The observation of neutron stars enables the otherwise impossible study of fundamental physical processes. Timing of binary radio pulsars is particularly powerful, as it enables precise characterization of their (three-dimensional) positions and orbits. PSR J0437$-$4715 is an important millisecond pulsar for timing array experiments and is also a primary target for the Neutron Star Interior Compos… ▽ More The observation of neutron stars enables the otherwise impossible study of fundamental physical processes. Timing of binary radio pulsars is particularly powerful, as it enables precise characterization of their (three-dimensional) positions and orbits. PSR J0437$-$4715 is an important millisecond pulsar for timing array experiments and is also a primary target for the Neutron Star Interior Composition ExploreR (NICER). The main aim of the NICER mission is to constrain the neutron star equation of state by inferring the compactness ($M_p/R$) of the star. Direct measurements of the mass $M_p$ from pulsar timing therefore substantially improve constraints on the radius $R$, and the equation of state. Here we use observations spanning 26 years from Murriyang, the 64-m Parkes radio telescope, to improve the timing model for this pulsar. Among the new precise measurements are the pulsar mass $M_p=1.418\pm 0.044$ M$_{\odot}$, distance $D=156.96 \pm 0.11$ pc, and orbital inclination angle $i=137.506 \pm 0.016^\circ$, which can be used to inform the X-ray pulse profile models inferred from NICER observations. We demonstrate that these results are consistent between multiple data sets from the Parkes Pulsar Timing Array (PPTA), each modelled with different noise assumptions. Using the longest available PPTA data set, we measure an apparent second derivative of the pulsar spin frequency and discuss how this can be explained either by kinematic effects due to the proper motion and radial velocity of the pulsar, or excess low-frequency noise such as a gravitational-wave background. △ Less

Submitted 9 July, 2024; originally announced July 2024.

Comments: 13 pages, 3 figures, accepted for publication in Astrophysical Journal Letters

arXiv:2407.06799 [pdf, other]

Investigating the Kinetic Effects on Current Gradient-Driven Instabilities of Electron Current Layers via Particle-in-Cell Simulations

Authors: Sushmita Mishra, Gurudatt Gaur, Bhavesh G. Patel

Abstract: Electron current layers form in various natural and laboratory plasmas and are susceptible to several instabilities. Tearing, driven by current gradients, stands out as a prominent instability in these layers and is considered a potential mechanism for magnetic reconnection in collisionless regimes. Electron inertia serves as a non-ideal factor causing magnetic field lines to break and subsequentl… ▽ More Electron current layers form in various natural and laboratory plasmas and are susceptible to several instabilities. Tearing, driven by current gradients, stands out as a prominent instability in these layers and is considered a potential mechanism for magnetic reconnection in collisionless regimes. Electron inertia serves as a non-ideal factor causing magnetic field lines to break and subsequently reconnect. Another mode driven by current gradients, known as the surface-preserving mode, maintains magnetic field topology. We investigated the kinetic effects on these modes in the presence of finite electron temperatures using two-dimensional particle-in-cell simulations (implemented with the OSIRIS codebase). Temperature stabilizes the tearing mode to a large extent, except at low temperatures, due to increased electron Larmor radius and subsequent magnetic field diffusion. Introducing uniform guide fields revealed that growth rates decrease at higher temperatures due to reduced plasma beta. Conversely, for the surface-preserving mode, growth rates increase with temperature, likely due to enhanced electron flow velocities. Mixed modes, where both tearing and surface-preserving modes coexist, exhibit asymmetric structures characteristic of asymmetric magnetic reconnection. Finally, we propose future research directions building upon our findings. △ Less

Submitted 9 July, 2024; originally announced July 2024.

arXiv:2407.05986 [pdf, other]

KidSat: satellite imagery to map childhood poverty dataset and benchmark

Authors: Makkunda Sharma, Fan Yang, Duy-Nhat Vo, Esra Suel, Swapnil Mishra, Samir Bhatt, Oliver Fiala, William Rudgard, Seth Flaxman

Abstract: Satellite imagery has emerged as an important tool to analyse demographic, health, and development indicators. While various deep learning models have been built for these tasks, each is specific to a particular problem, with few standard benchmarks available. We propose a new dataset pairing satellite imagery and high-quality survey data on child poverty to benchmark satellite feature representat… ▽ More Satellite imagery has emerged as an important tool to analyse demographic, health, and development indicators. While various deep learning models have been built for these tasks, each is specific to a particular problem, with few standard benchmarks available. We propose a new dataset pairing satellite imagery and high-quality survey data on child poverty to benchmark satellite feature representations. Our dataset consists of 33,608 images, each 10 km $\times$ 10 km, from 19 countries in Eastern and Southern Africa in the time period 1997-2022. As defined by UNICEF, multidimensional child poverty covers six dimensions and it can be calculated from the face-to-face Demographic and Health Surveys (DHS) Program . As part of the benchmark, we test spatial as well as temporal generalization, by testing on unseen locations, and on data after the training years. Using our dataset we benchmark multiple models, from low-level satellite imagery models such as MOSAIKS , to deep learning foundation models, which include both generic vision models such as Self-Distillation with no Labels (DINOv2) models and specific satellite imagery models such as SatMAE. We provide open source code for building the satellite dataset, obtaining ground truth data from DHS and running various models assessed in our work. △ Less

Submitted 8 July, 2024; originally announced July 2024.

Comments: 15 pages, 1 figure

arXiv:2407.05861 [pdf, other]

Dynamical Swirl Structures Powered by Microswimmers in Active Nematics

Authors: Partha Sarathi Mondal, Pawan Kumar Mishra, Tamás Vicsek, Shradha Mishra

Abstract: Active nematics, in their pure form, have demonstrated a plethora of dynamic and steady state behaviors, including large-scale dynamic structures, collective flows, and intricate multi-spatial temporal dynamics. This complexity further increases in the presence of external polar agents. We investigate active nematics interspersed with polar microswimmers, akin to active apolar cells infused with a… ▽ More Active nematics, in their pure form, have demonstrated a plethora of dynamic and steady state behaviors, including large-scale dynamic structures, collective flows, and intricate multi-spatial temporal dynamics. This complexity further increases in the presence of external polar agents. We investigate active nematics interspersed with polar microswimmers, akin to active apolar cells infused with active impurities, microswimmers. Our comprehensive numerical study reveals that varying the microswimmers' motility induces a novel spatiotemporal state in the active nematics backdrop. This state is marked by macroscopic swirl-like structures and a reduction in the overall order of the active nematics. Interestingly, this state emerges at intermediate motility levels, where microswimmers form local clusters and exhibit coherent motion. However, at higher motility levels, the swirls become less coherent, and microswimmer clustering intensifies. We show that the effect of the polar microswimmers on active nematics can be interpreted as a spatiotemporally correlated colored noise on active nematics, which promotes bend instability in active nematics, leading to the observed swirling dynamics. Our findings indicate that the spatiotemporal states are highly sensitive to the microswimmers' motility, offering potential avenues for pathogen identification based on known motility characteristics △ Less

Submitted 8 July, 2024; originally announced July 2024.

Comments: 31 pages, 18 figures

arXiv:2407.05271 [pdf, other]

Beyond Binary Gender Labels: Revealing Gender Biases in LLMs through Gender-Neutral Name Predictions

Authors: Zhiwen You, HaeJin Lee, Shubhanshu Mishra, Sullam Jeoung, Apratim Mishra, Jinseok Kim, Jana Diesner

Abstract: Name-based gender prediction has traditionally categorized individuals as either female or male based on their names, using a binary classification system. That binary approach can be problematic in the cases of gender-neutral names that do not align with any one gender, among other reasons. Relying solely on binary gender categories without recognizing gender-neutral names can reduce the inclusiv… ▽ More Name-based gender prediction has traditionally categorized individuals as either female or male based on their names, using a binary classification system. That binary approach can be problematic in the cases of gender-neutral names that do not align with any one gender, among other reasons. Relying solely on binary gender categories without recognizing gender-neutral names can reduce the inclusiveness of gender prediction tasks. We introduce an additional gender category, i.e., "neutral", to study and address potential gender biases in Large Language Models (LLMs). We evaluate the performance of several foundational and large language models in predicting gender based on first names only. Additionally, we investigate the impact of adding birth years to enhance the accuracy of gender prediction, accounting for shifting associations between names and genders over time. Our findings indicate that most LLMs identify male and female names with high accuracy (over 80%) but struggle with gender-neutral names (under 40%), and the accuracy of gender prediction is higher for English-based first names than non-English names. The experimental results show that incorporating the birth year does not improve the overall accuracy of gender prediction, especially for names with evolving gender associations. We recommend using caution when applying LLMs for gender identification in downstream tasks, particularly when dealing with non-binary gender labels. △ Less

Submitted 7 July, 2024; originally announced July 2024.

Comments: Accepted at ACL 2024, GeBNLP Workshop

arXiv:2407.05164 [pdf, other]

doi 10.1007/s10714-024-03265-1

A dynamical system analysis of bouncing cosmology with spatial curvature

Authors: Soumya Chakraborty, Sudip Mishra, Subenoy Chakraborty

Abstract: The present work deals with a FLRW cosmological model with spatial curvature and minimally coupled scalar field as the matter content. The curvature term behaves as a perfect fluid with the equation of state parameter w_K = -1/3 Using suitable transformation of variables, the evolution equations are reduced to an autonomous system for both power law and exponential form of the scalar potential. Th… ▽ More The present work deals with a FLRW cosmological model with spatial curvature and minimally coupled scalar field as the matter content. The curvature term behaves as a perfect fluid with the equation of state parameter w_K = -1/3 Using suitable transformation of variables, the evolution equations are reduced to an autonomous system for both power law and exponential form of the scalar potential. The critical points are analyzed with center manifold theory and stability has been discussed. Also, critical points at infinity have been studied using the notion of Poincare sphere. Finally, the cosmological implications of the critical points and cosmological bouncing scenarios are discussed. It is found that the cosmological bounce takes place near the points at infinity when the non-isolated critical points on the equator of the Poincare sphere are saddle or saddle-node in nature. △ Less

Submitted 6 July, 2024; originally announced July 2024.

arXiv:2407.04173 [pdf, other]

Quantifying Prediction Consistency Under Model Multiplicity in Tabular LLMs

Authors: Faisal Hamman, Pasan Dissanayake, Saumitra Mishra, Freddy Lecue, Sanghamitra Dutta

Abstract: Fine-tuning large language models (LLMs) on limited tabular data for classification tasks can lead to \textit{fine-tuning multiplicity}, where equally well-performing models make conflicting predictions on the same inputs due to variations in the training process (i.e., seed, random weight initialization, retraining on additional or deleted samples). This raises critical concerns about the robustn… ▽ More Fine-tuning large language models (LLMs) on limited tabular data for classification tasks can lead to \textit{fine-tuning multiplicity}, where equally well-performing models make conflicting predictions on the same inputs due to variations in the training process (i.e., seed, random weight initialization, retraining on additional or deleted samples). This raises critical concerns about the robustness and reliability of Tabular LLMs, particularly when deployed for high-stakes decision-making, such as finance, hiring, education, healthcare, etc. This work formalizes the challenge of fine-tuning multiplicity in Tabular LLMs and proposes a novel metric to quantify the robustness of individual predictions without expensive model retraining. Our metric quantifies a prediction's stability by analyzing (sampling) the model's local behavior around the input in the embedding space. Interestingly, we show that sampling in the local neighborhood can be leveraged to provide probabilistic robustness guarantees against a broad class of fine-tuned models. By leveraging Bernstein's Inequality, we show that predictions with sufficiently high robustness (as defined by our measure) will remain consistent with high probability. We also provide empirical evaluation on real-world datasets to support our theoretical results. Our work highlights the importance of addressing fine-tuning instabilities to enable trustworthy deployment of LLMs in high-stakes and safety-critical applications. △ Less

Submitted 4 July, 2024; originally announced July 2024.

arXiv:2407.00900 [pdf, other]

MathCAMPS: Fine-grained Synthesis of Mathematical Problems From Human Curricula

Authors: Shubhra Mishra, Gabriel Poesia, Belinda Mo, Noah D. Goodman

Abstract: Mathematical problem solving is an important skill for Large Language Models (LLMs), both as an important capability and a proxy for a range of reasoning abilities. Existing benchmarks probe a diverse set of skills, but they yield aggregate accuracy metrics, obscuring specific abilities or weaknesses. Furthermore, they are difficult to extend with new problems, risking data contamination over time… ▽ More Mathematical problem solving is an important skill for Large Language Models (LLMs), both as an important capability and a proxy for a range of reasoning abilities. Existing benchmarks probe a diverse set of skills, but they yield aggregate accuracy metrics, obscuring specific abilities or weaknesses. Furthermore, they are difficult to extend with new problems, risking data contamination over time. To address these challenges, we propose MathCAMPS: a method to synthesize high-quality mathematical problems at scale, grounded on 44 fine-grained "standards" from the Mathematics Common Core (CC) Standard for K-8 grades. We encode each standard in a formal grammar, allowing us to sample diverse symbolic problems and their answers. We then use LLMs to realize the symbolic problems into word problems. We propose a cycle-consistency method for validating problem faithfulness. Finally, we derive follow-up questions from symbolic structures and convert them into follow-up word problems - a novel task of mathematical dialogue that probes for robustness in understanding. Experiments on 23 LLMs show surprising failures even in the strongest models (in particular when asked simple follow-up questions). Moreover, we evaluate training checkpoints of Pythia 12B on MathCAMPS, allowing us to analyze when particular mathematical skills develop during its training. Our framework enables the community to reproduce and extend our pipeline for a fraction of the typical cost of building new high-quality datasets. △ Less

Submitted 30 June, 2024; originally announced July 2024.

Comments: Dataset and code: https://github.com/gpoesia/mathcamps/

arXiv:2407.00437 [pdf]

Enhancement in Photoluminescence of Pt/Ag-Pt Embedded ZrO2 Thin Films by Plasma Co-sputtering

Authors: Shailendra Kumar Mishra, Ibnul Farid, Aritra Tarafder, Joyanti Chutia, Subir Biswas, Arup Ratan Pal, Neeraj Shukla

Abstract: Platinum, Silver-Platinum embedded Zirconia (Pt/Ag-Pt ZrO2) thin films have been fabricated on silicon wafers and glass substrates using the plasma co-sputtering method. Zirconia thin films are of significant technological importance due to their remarkable electrical, optical, and mechanical properties, as well as their high melting temperature of 2715°C, which makes them increasingly attractive… ▽ More Platinum, Silver-Platinum embedded Zirconia (Pt/Ag-Pt ZrO2) thin films have been fabricated on silicon wafers and glass substrates using the plasma co-sputtering method. Zirconia thin films are of significant technological importance due to their remarkable electrical, optical, and mechanical properties, as well as their high melting temperature of 2715°C, which makes them increasingly attractive for various applications. In this study, ZrO2 thin films were deposited for 3 minutes, followed by the deposition of Pt-Ag/Pt onto the fabricated zirconia thin films, with deposition times ranging from 15 to 60 seconds. The varying deposition times of Pt-Ag/Pt influenced the optical and electronic properties of the thin films due to alterations in their surface roughness. The characteristics of the grown zirconia and Pt/Ag-Pt sputtered zirconia nanostructures were investigated using Atomic Force Microscopy (AFM), Scanning Electron Microscopy (SEM), X-ray Diffraction (XRD), UV-visible spectroscopy, and Photoluminescence spectroscopy. The optical transmittance of these thin films was examined across the visible and near-infrared spectral ranges. The investigation revealed various properties, such as enhanced photoluminescence and the emergence of new peaks in the visible range spectra. Plasmonic peaks were induced, and an increase in the sharpness of these peaks was observed between 403.15 nm and 512.10 nm for the Pt/Ag-Pt deposited samples. This enhancement in photoluminescence is attributed to the plasmonic properties of Pt-Ag nanoparticles on the zirconia thin film. The study demonstrates that these optically tuned thin film coatings, with their enhanced photoluminescence properties, can significantly improve the heat-resistance capacity of devices, mitigating issues related to overheating and device shutdown. △ Less

Submitted 29 June, 2024; originally announced July 2024.

Comments: 8 figures, 1 table

arXiv:2406.17610 [pdf, other]

YAQQ: Yet Another Quantum Quantizer -- Design Space Exploration of Quantum Gate Sets using Novelty Search

Authors: Aritra Sarkar, Akash Kundu, Matthew Steinberg, Sibasish Mishra, Sebastiaan Fauquenot, Tamal Acharya, Jarosław A. Miszczak, Sebastian Feld

Abstract: In the standard circuit model of quantum computation, the number and quality of the quantum gates composing the circuit influence the runtime and fidelity of the computation. The fidelity of the decomposition of quantum algorithms, represented as unitary matrices, to bounded depth quantum circuits depends strongly on the set of gates available for the decomposition routine. To investigate this dep… ▽ More In the standard circuit model of quantum computation, the number and quality of the quantum gates composing the circuit influence the runtime and fidelity of the computation. The fidelity of the decomposition of quantum algorithms, represented as unitary matrices, to bounded depth quantum circuits depends strongly on the set of gates available for the decomposition routine. To investigate this dependence, we explore the design space of discrete quantum gate sets and present a software tool for comparative analysis of quantum processing units and control protocols based on their native gates. The evaluation is conditioned on a set of unitary transformations representing target use cases on the quantum processors. The cost function considers three key factors: (i) the statistical distribution of the decomposed circuits' depth, (ii) the statistical distribution of process fidelities for the approximate decomposition, and (iii) the relative novelty of a gate set compared to other gate sets in terms of the aforementioned properties. The developed software, YAQQ (Yet Another Quantum Quantizer), enables the discovery of an optimized set of quantum gates through this tunable joint cost function. To identify these gate sets, we use the novelty search algorithm, circuit decomposition techniques, and stochastic optimization to implement YAQQ within the Qiskit quantum simulator environment. YAQQ exploits reachability tradeoffs conceptually derived from quantum algorithmic information theory. Our results demonstrate the pragmatic application of identifying gate sets that are advantageous to popularly used quantum gate sets in representing quantum algorithms. Consequently, we demonstrate pragmatic use cases of YAQQ in comparing transversal logical gate sets in quantum error correction codes, designing optimal quantum instruction sets, and compiling to specific quantum processors. △ Less

Submitted 25 June, 2024; originally announced June 2024.

arXiv:2406.17066 [pdf, other]

Tolerance of Reinforcement Learning Controllers against Deviations in Cyber Physical Systems

Authors: Changjian Zhang, Parv Kapoor, Eunsuk Kang, Romulo Meira-Goes, David Garlan, Akila Ganlath, Shatadal Mishra, Nejib Ammar

Abstract: Cyber-physical systems (CPS) with reinforcement learning (RL)-based controllers are increasingly being deployed in complex physical environments such as autonomous vehicles, the Internet-of-Things(IoT), and smart cities. An important property of a CPS is tolerance; i.e., its ability to function safely under possible disturbances and uncertainties in the actual operation. In this paper, we introduc… ▽ More Cyber-physical systems (CPS) with reinforcement learning (RL)-based controllers are increasingly being deployed in complex physical environments such as autonomous vehicles, the Internet-of-Things(IoT), and smart cities. An important property of a CPS is tolerance; i.e., its ability to function safely under possible disturbances and uncertainties in the actual operation. In this paper, we introduce a new, expressive notion of tolerance that describes how well a controller is capable of satisfying a desired system requirement, specified using Signal Temporal Logic (STL), under possible deviations in the system. Based on this definition, we propose a novel analysis problem, called the tolerance falsification problem, which involves finding small deviations that result in a violation of the given requirement. We present a novel, two-layer simulation-based analysis framework and a novel search heuristic for finding small tolerance violations. To evaluate our approach, we construct a set of benchmark problems where system parameters can be configured to represent different types of uncertainties and disturbancesin the system. Our evaluation shows that our falsification approach and heuristic can effectively find small tolerance violations. △ Less

Submitted 24 June, 2024; originally announced June 2024.

Comments: arXiv admin note: text overlap with arXiv:2311.07462

arXiv:2406.16717 [pdf, other]

Probing Yu-Shiba-Rusinov state via quantum noise and $Δ_T$ noise

Authors: Tusaradri Mohapatra, Sachiraj Mishra, Colin Benjamin

Abstract: Recent attention has been drawn to temperature gradient generated $Δ_T$ noise at vanishing charge current. This study delves into examining the properties of spin-polarised $ΔT$ noise in conjunction with $Δ_T$-shot noise, $Δ_T$-thermal noise, and quantum noise (again both shot and thermal noise) in a one-dimensional (1D) structure comprising metal/spin-flipper/metal/insulator/superconductor juncti… ▽ More Recent attention has been drawn to temperature gradient generated $Δ_T$ noise at vanishing charge current. This study delves into examining the properties of spin-polarised $ΔT$ noise in conjunction with $Δ_T$-shot noise, $Δ_T$-thermal noise, and quantum noise (again both shot and thermal noise) in a one-dimensional (1D) structure comprising metal/spin-flipper/metal/insulator/superconductor junction to probe Yu-Shiba-Rusinov (YSR) bound states. YSR bound states, which are localized states within the superconducting gap of a superconductor are induced by a magnetic impurity acting as a spin-flipper. A YSR bound state should be distinguished from a Majorana bound state (MBS), which too can occur due to interaction with magnetic impurities, e.g., magnetic adatoms on superconductors, and this can lead to false positives in detecting MBS. Clarifying this by providing a unique signature for the YSR-bound state is the main aim of this work. In this paper, we show that YSR bound states can be effectively probed using quantum noise and the recently discovered $Δ_T$ noise, with a focus on especially spin transport. We see that the spin $Δ_T$ noise is a superior tool compared to the charge $Δ_T$ noise as a probe for YSR bound states. Additionally, our analysis of quantum noise reveals that similar to $Δ_T$ noise, spin quantum noise is more effective than charge quantum noise in detecting YSR bound states. △ Less

Submitted 24 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

Comments: 14 pages, 6 figures, 3 tables

arXiv:2406.16273 [pdf, other]

YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals

Authors: Sandeep Mishra, Oindrila Saha, Alan C. Bovik

Abstract: 3D generation guided by text-to-image diffusion models enables the creation of visually compelling assets. However previous methods explore generation based on image or text. The boundaries of creativity are limited by what can be expressed through words or the images that can be sourced. We present YouDream, a method to generate high-quality anatomically controllable animals. YouDream is guided u… ▽ More 3D generation guided by text-to-image diffusion models enables the creation of visually compelling assets. However previous methods explore generation based on image or text. The boundaries of creativity are limited by what can be expressed through words or the images that can be sourced. We present YouDream, a method to generate high-quality anatomically controllable animals. YouDream is guided using a text-to-image diffusion model controlled by 2D views of a 3D pose prior. Our method generates 3D animals that are not possible to create using previous text-to-3D generative methods. Additionally, our method is capable of preserving anatomic consistency in the generated animals, an area where prior text-to-3D approaches often struggle. Moreover, we design a fully automated pipeline for generating commonly found animals. To circumvent the need for human intervention to create a 3D pose, we propose a multi-agent LLM that adapts poses from a limited library of animal 3D poses to represent the desired animal. A user study conducted on the outcomes of YouDream demonstrates the preference of the animal models generated by our method over others. Turntable results and code are released at https://youdream3d.github.io/ △ Less

Submitted 23 June, 2024; originally announced June 2024.

arXiv:2406.15444 [pdf, other]

Investigating the Robustness of LLMs on Math Word Problems

Authors: Ujjwala Anantheswaran, Himanshu Gupta, Kevin Scaria, Shreyas Verma, Chitta Baral, Swaroop Mishra

Abstract: Large Language Models (LLMs) excel at various tasks, including solving math word problems (MWPs), but struggle with real-world problems containing irrelevant information. To address this, we propose a prompting framework that generates adversarial variants of MWPs by adding irrelevant variables. We introduce a dataset, ProbleMATHIC, containing both adversarial and non-adversarial MWPs. Our experim… ▽ More Large Language Models (LLMs) excel at various tasks, including solving math word problems (MWPs), but struggle with real-world problems containing irrelevant information. To address this, we propose a prompting framework that generates adversarial variants of MWPs by adding irrelevant variables. We introduce a dataset, ProbleMATHIC, containing both adversarial and non-adversarial MWPs. Our experiments reveal that LLMs are susceptible to distraction by numerical noise, resulting in an average relative performance drop of ~26% on adversarial MWPs. To mitigate this, we fine-tune LLMs (Llama-2, Mistral) on the adversarial samples from our dataset. Fine-tuning on adversarial training instances improves performance on adversarial MWPs by ~8%, indicating increased robustness to noise and better ability to identify relevant data for reasoning. Finally, to assess the generalizability of our prompting framework, we introduce GSM-8K-Adv, an adversarial variant of the GSM-8K benchmark. LLMs continue to struggle when faced with adversarial information, reducing performance by up to ~6%. △ Less

Submitted 30 May, 2024; originally announced June 2024.

arXiv:2406.12963 [pdf, other]

Weak Superfluidity in Twisted Optical Potentials

Authors: Dean Johnstone, Shanya Mishra, Zhaoxuan Zhu, Hepeng Yao, Laurent Sanchez-Palencia

Abstract: A controlled twist between different underlying lattices allows one to interpolate, under a unified framework, across ordered and (quasi-)disordered matter while drastically changing quantum transport properties. Here, we use quantum Monte Carlo simulations to determine the unique phase diagrams of strongly-correlated ultracold bosons in twisted optical potentials. We show that at commensurate twi… ▽ More A controlled twist between different underlying lattices allows one to interpolate, under a unified framework, across ordered and (quasi-)disordered matter while drastically changing quantum transport properties. Here, we use quantum Monte Carlo simulations to determine the unique phase diagrams of strongly-correlated ultracold bosons in twisted optical potentials. We show that at commensurate twisting angles, spectral gaps govern the formation of insulating patterns, separated by thin superfluid domains. The latter form weak superfluids, which are very sensitive to thermal fluctuations, but can be stabilized under appropriate parameter control. In contrast, slightly changing the twisting angle to a incommensurate value destroys most spectral gaps, leaving behind a prominent Bose glass phase. Our results are directly applicable to current generation experiments that quantum simulate moiré physics. △ Less

Submitted 18 June, 2024; originally announced June 2024.

Comments: 11 pages, 5 figures, comments welcome

arXiv:2406.10692 [pdf, other]

doi 10.1142/S0219887824502505

Dynamical system analysis of quintessence dark energy model

Authors: Soumya Chakraborty, Sudip Mishra, Subenoy Chakraborty

Abstract: Our work deals with the dynamical system analysis of quintessence dark energy scalar field model with exponential potential. A dynamical system analysis has been applied at the background level. Using suitable transformation of variables, the evolution equations are reduced to an autonomous system for exponential form of the scalar potential. The critical points are analyzed with center manifold t… ▽ More Our work deals with the dynamical system analysis of quintessence dark energy scalar field model with exponential potential. A dynamical system analysis has been applied at the background level. Using suitable transformation of variables, the evolution equations are reduced to an autonomous system for exponential form of the scalar potential. The critical points are analyzed with center manifold theory and stability has been discussed by using Schwarzian derivative. Finally, cosmological implications of the critical points are discussed and it is found that the stability of the late-time attractor changes for quintessence dark energy model. △ Less

Submitted 15 June, 2024; originally announced June 2024.

arXiv:2406.09074 [pdf, other]

Entanglement properties of optomagnonic crystal from nonlinear perspective

Authors: M. Wanic, C. Jasiukiewicz, Z. Toklikishvili, V. Jandieri, M. Trybus, E. Jartych, S. K. Mishra, L. Chotorlishvili

Abstract: Optomagnonics is a new field of research in condensed matter physics and quantum optics focused on strong magnon-photon interactions. Particular interest concerns realistic, experimentally feasible materials and prototype cheap elements for futuristic nanodevices implemented in the processing or storing of quantum information. Quantifying the entanglement between two continuous bosonic modes, such… ▽ More Optomagnonics is a new field of research in condensed matter physics and quantum optics focused on strong magnon-photon interactions. Particular interest concerns realistic, experimentally feasible materials and prototype cheap elements for futuristic nanodevices implemented in the processing or storing of quantum information. Quantifying the entanglement between two continuous bosonic modes, such as magnons and photons, is not trivial. The state-of-the-art for today is the logarithmic negativity, calculated through the quantum Langevin equations subjected to thermal noise. However, due to its complexity, this method requires further approximation. In the present work, we propose a new procedure that avoids the linearization of dynamics. Prior analyzing the quantum entanglement, we explore the nonlinear semiclassical dynamics in detail and precisely define the phase space. The typical nonlinear dynamical system holds bifurcation points and fixed points of different characters in its phase space. Our main finding is that entanglement is not defined in the Saddle Point region. On the other hand, the maximum of the entanglement corresponds to the region near the border between the Stable node and Stable spiral regions. In numerical calculations, we considered a particular system: optomagnonic crystal based on the yttrium iron garnet (YIG) slab with the periodic air holes drilled in the slab. In our case, Magnon-photon interaction occurs due to the magneto-electric effect in YIG. We provide explicit derivation of the coupling term. Besides, we calculate photon modes for a particular geometry of the optomagnonic crystal. We analyzed the amplitude-frequency characteristics of the optomagnonic crystal and showed that due to the instability region, one could efficiently switch the mean magnon numbers in the system and control entanglement in the system. △ Less

Submitted 13 June, 2024; originally announced June 2024.

Comments: 18 pages, 16 figures

arXiv:2406.08067 [pdf, other]

Synchronous and Asynchronous Updates of Active Ising Spins in One Dimension

Authors: Anish Kumar, Sudipta Pattanayak, R. K. Singh, Shradha Mishra

Abstract: How do update rules affect the dynamical and steady state properties of a flock? In this study, we have explored the active Ising spins (s = +-1) in one dimension, where spin updates its orientation according to the Metropolis algorithm (based on the neighbors) via two different update rules. (i) Parallel, and (ii) Random-sequential. We explore the effect of Parallel and Random-sequential updates… ▽ More How do update rules affect the dynamical and steady state properties of a flock? In this study, we have explored the active Ising spins (s = +-1) in one dimension, where spin updates its orientation according to the Metropolis algorithm (based on the neighbors) via two different update rules. (i) Parallel, and (ii) Random-sequential. We explore the effect of Parallel and Random-sequential updates on the dynamical properties of flocks in one dimension. Due to the inherent asynchronous nature of the Random-sequential update, the directional switching of the flock is increased compared to the Parallel one. The nature of phase transition is affected by the difference in the updating mechanism: discontinuous for Parallel and continuous for Random-sequential updates. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: 7 pages, 6 figures. arXiv admin note: text overlap with arXiv:1704.04041

arXiv:2406.07742 [pdf, other]

C3DAG: Controlled 3D Animal Generation using 3D pose guidance

Authors: Sandeep Mishra, Oindrila Saha, Alan C. Bovik

Abstract: Recent advancements in text-to-3D generation have demonstrated the ability to generate high quality 3D assets. However while generating animals these methods underperform, often portraying inaccurate anatomy and geometry. Towards ameliorating this defect, we present C3DAG, a novel pose-Controlled text-to-3D Animal Generation framework which generates a high quality 3D animal consistent with a give… ▽ More Recent advancements in text-to-3D generation have demonstrated the ability to generate high quality 3D assets. However while generating animals these methods underperform, often portraying inaccurate anatomy and geometry. Towards ameliorating this defect, we present C3DAG, a novel pose-Controlled text-to-3D Animal Generation framework which generates a high quality 3D animal consistent with a given pose. We also introduce an automatic 3D shape creator tool, that allows dynamic pose generation and modification via a web-based tool, and that generates a 3D balloon animal using simple geometries. A NeRF is then initialized using this 3D shape using depth-controlled SDS. In the next stage, the pre-trained NeRF is fine-tuned using quadruped-pose-controlled SDS. The pipeline that we have developed not only produces geometrically and anatomically consistent results, but also renders highly controlled 3D animals, unlike prior methods which do not allow fine-grained pose control. △ Less

Submitted 11 June, 2024; originally announced June 2024.

arXiv:2406.06661 [pdf, other]

doi 10.3847/1538-4357/ad5555

Constraining extended teleparallel gravity via cosmography: A model-independent approach

Authors: Sai Swagat Mishra, N. S. Kavya, P. K. Sahoo, V. Venkatesha

Abstract: As a classical approach, the dynamics of the Universe, influenced by its dark components, are unveiled through prior modifications of Einstein's equations. Cosmography, on the other hand, is a highly efficient tool for reconstructing any modified theory in a model-independent manner. By employing kinematic variables, it offers a profound explanation for cosmic expansion. Although the cosmographica… ▽ More As a classical approach, the dynamics of the Universe, influenced by its dark components, are unveiled through prior modifications of Einstein's equations. Cosmography, on the other hand, is a highly efficient tool for reconstructing any modified theory in a model-independent manner. By employing kinematic variables, it offers a profound explanation for cosmic expansion. Although the cosmographical approach has been highly successful in several geometric theories in recent years, it has not been extensively explored in coupled gravities. With this in mind, we intend to constrain an extended teleparallel gravity model, $f(T,\mathcal{T})$, through cosmographic parameters. We utilize Taylor series expansion, assuming a minimally coupled form, to constrain the unknowns involved in the series. To achieve this, we conduct a Markov Chain Monte Carlo analysis (MCMC) using three different datasets (CC, BAO, and Pantheon+SH0ES). The constrained results obtained from MCMC are then compared and verified using various cosmological parameters. Finally, we compare the resulting models with \textbf{three} well-known $f(T,\mathcal{T})$ models. △ Less

Submitted 10 June, 2024; originally announced June 2024.

Comments: ApJ accepted version

Journal ref: The Astrophysical Journal (2024)

arXiv:2406.06555 [pdf, other]

An Evaluation Benchmark for Autoformalization in Lean4

Authors: Aryan Gulati, Devanshu Ladsaria, Shubhra Mishra, Jasdeep Sidhu, Brando Miranda

Abstract: Large Language Models (LLMs) hold the potential to revolutionize autoformalization. The introduction of Lean4, a mathematical programming language, presents an unprecedented opportunity to rigorously assess the autoformalization capabilities of LLMs. This paper introduces a novel evaluation benchmark designed for Lean4, applying it to test the abilities of state-of-the-art LLMs, including GPT-3.5,… ▽ More Large Language Models (LLMs) hold the potential to revolutionize autoformalization. The introduction of Lean4, a mathematical programming language, presents an unprecedented opportunity to rigorously assess the autoformalization capabilities of LLMs. This paper introduces a novel evaluation benchmark designed for Lean4, applying it to test the abilities of state-of-the-art LLMs, including GPT-3.5, GPT-4, and Gemini Pro. Our comprehensive analysis reveals that, despite recent advancements, these LLMs still exhibit limitations in autoformalization, particularly in more complex areas of mathematics. These findings underscore the need for further development in LLMs to fully harness their potential in scientific research and development. This study not only benchmarks current LLM capabilities but also sets the stage for future enhancements in autoformalization. △ Less

Submitted 1 June, 2024; originally announced June 2024.

Comments: To appear at ICLR 2024 as part of the Tiny Papers track

arXiv:2406.04520 [pdf, other]

NATURAL PLAN: Benchmarking LLMs on Natural Language Planning

Authors: Huaixiu Steven Zheng, Swaroop Mishra, Hugh Zhang, Xinyun Chen, Minmin Chen, Azade Nova, Le Hou, Heng-Tze Cheng, Quoc V. Le, Ed H. Chi, Denny Zhou

Abstract: We introduce NATURAL PLAN, a realistic planning benchmark in natural language containing 3 key tasks: Trip Planning, Meeting Planning, and Calendar Scheduling. We focus our evaluation on the planning capabilities of LLMs with full information on the task, by providing outputs from tools such as Google Flights, Google Maps, and Google Calendar as contexts to the models. This eliminates the need for… ▽ More We introduce NATURAL PLAN, a realistic planning benchmark in natural language containing 3 key tasks: Trip Planning, Meeting Planning, and Calendar Scheduling. We focus our evaluation on the planning capabilities of LLMs with full information on the task, by providing outputs from tools such as Google Flights, Google Maps, and Google Calendar as contexts to the models. This eliminates the need for a tool-use environment for evaluating LLMs on Planning. We observe that NATURAL PLAN is a challenging benchmark for state of the art models. For example, in Trip Planning, GPT-4 and Gemini 1.5 Pro could only achieve 31.1% and 34.8% solve rate respectively. We find that model performance drops drastically as the complexity of the problem increases: all models perform below 5% when there are 10 cities, highlighting a significant gap in planning in natural language for SoTA LLMs. We also conduct extensive ablation studies on NATURAL PLAN to further shed light on the (in)effectiveness of approaches such as self-correction, few-shot generalization, and in-context planning with long-contexts on improving LLM planning. △ Less

Submitted 6 June, 2024; originally announced June 2024.

arXiv:2406.03622 [pdf, other]

Generalized two-point visual control model of human steering for accurate state estimation

Authors: Rene Mai, Katherine Sears, Grace Roessling, Agung Julius, Sandipan Mishra

Abstract: We derive and validate a generalization of the two-point visual control model, an accepted cognitive science model for human steering behavior. The generalized model is needed as current steering models are either insufficiently accurate or too complex for online state estimation. We demonstrate that the generalized model replicates specific human steering behavior with high precision (85\% reduct… ▽ More We derive and validate a generalization of the two-point visual control model, an accepted cognitive science model for human steering behavior. The generalized model is needed as current steering models are either insufficiently accurate or too complex for online state estimation. We demonstrate that the generalized model replicates specific human steering behavior with high precision (85\% reduction in modeling error) and integrate this model into a human-as-advisor framework where human steering inputs are used for state estimation. As a benchmark study, we use this framework to decipher ambiguous lane markings represented by biased lateral position measurements. We demonstrate that, with the generalized model, the state estimator can accurately estimate the true vehicle state, providing lateral state estimates with under 0.25 m error on average across participants. However, without the generalized model, the estimator cannot accurately estimate the vehicle's lateral state. △ Less

Submitted 5 June, 2024; originally announced June 2024.

Comments: 6 pages, 9 figures, This work has been submitted to IFAC for possible publication

arXiv:2406.02625 [pdf, other]

Progressive Inference: Explaining Decoder-Only Sequence Classification Models Using Intermediate Predictions

Authors: Sanjay Kariyappa, Freddy Lécué, Saumitra Mishra, Christopher Pond, Daniele Magazzeni, Manuela Veloso

Abstract: This paper proposes Progressive Inference - a framework to compute input attributions to explain the predictions of decoder-only sequence classification models. Our work is based on the insight that the classification head of a decoder-only Transformer model can be used to make intermediate predictions by evaluating them at different points in the input sequence. Due to the causal attention mechan… ▽ More This paper proposes Progressive Inference - a framework to compute input attributions to explain the predictions of decoder-only sequence classification models. Our work is based on the insight that the classification head of a decoder-only Transformer model can be used to make intermediate predictions by evaluating them at different points in the input sequence. Due to the causal attention mechanism, these intermediate predictions only depend on the tokens seen before the inference point, allowing us to obtain the model's prediction on a masked input sub-sequence, with negligible computational overheads. We develop two methods to provide sub-sequence level attributions using this insight. First, we propose Single Pass-Progressive Inference (SP-PI), which computes attributions by taking the difference between consecutive intermediate predictions. Second, we exploit a connection with Kernel SHAP to develop Multi Pass-Progressive Inference (MP-PI). MP-PI uses intermediate predictions from multiple masked versions of the input to compute higher quality attributions. Our studies on a diverse set of models trained on text classification tasks show that SP-PI and MP-PI provide significantly better attributions compared to prior work. △ Less

Submitted 3 June, 2024; originally announced June 2024.

arXiv:2406.01899 [pdf, other]

Cross-Domain Graph Data Scaling: A Showcase with Diffusion Models

Authors: Wenzhuo Tang, Haitao Mao, Danial Dervovic, Ivan Brugere, Saumitra Mishra, Yuying Xie, Jiliang Tang

Abstract: Models for natural language and images benefit from data scaling behavior: the more data fed into the model, the better they perform. This 'better with more' phenomenon enables the effectiveness of large-scale pre-training on vast amounts of data. However, current graph pre-training methods struggle to scale up data due to heterogeneity across graphs. To achieve effective data scaling, we aim to d… ▽ More Models for natural language and images benefit from data scaling behavior: the more data fed into the model, the better they perform. This 'better with more' phenomenon enables the effectiveness of large-scale pre-training on vast amounts of data. However, current graph pre-training methods struggle to scale up data due to heterogeneity across graphs. To achieve effective data scaling, we aim to develop a general model that is able to capture diverse data patterns of graphs and can be utilized to adaptively help the downstream tasks. To this end, we propose UniAug, a universal graph structure augmentor built on a diffusion model. We first pre-train a discrete diffusion model on thousands of graphs across domains to learn the graph structural patterns. In the downstream phase, we provide adaptive enhancement by conducting graph structure augmentation with the help of the pre-trained diffusion model via guided generation. By leveraging the pre-trained diffusion model for structure augmentation, we consistently achieve performance improvements across various downstream tasks in a plug-and-play manner. To the best of our knowledge, this study represents the first demonstration of a data-scaling graph structure augmentor on graphs across domains. △ Less

Submitted 3 June, 2024; originally announced June 2024.

arXiv:2406.00108 [pdf, other]

Formation and decay of oscillons after inflation in the presence of an external coupling, Part-I: Lattice simulations

Authors: Mohammed Shafi, Edmund J. Copeland, Rafid Mahbub, Swagat S. Mishra, Soumen Basak

Abstract: We investigate the formation and decay of oscillons during the post-inflationary reheating epoch from inflaton oscillations around asymptotically flat potentials $V(\varphi)$ in the presence of an external coupling of the form $\frac{1}{2}\, g^2 \, \varphi^2 \, χ^2$. It is well-known that in the absence of such an external coupling, the attractive self-interaction term in the potential leads to th… ▽ More We investigate the formation and decay of oscillons during the post-inflationary reheating epoch from inflaton oscillations around asymptotically flat potentials $V(\varphi)$ in the presence of an external coupling of the form $\frac{1}{2}\, g^2 \, \varphi^2 \, χ^2$. It is well-known that in the absence of such an external coupling, the attractive self-interaction term in the potential leads to the formation of copious amounts of long-lived oscillons both for symmetric and asymmetric plateau potentials. We perform a detailed numerical analysis to study the formation of oscillons in the $α$-attractor E- and T-model potentials using the publicly available lattice simulation code ${\cal C}$osmo${\cal L}$attice. We observe the formation of nonlinear oscillon-like structures with the average equation of state $\langle w_\varphi\rangle \simeq 0$ for a range of values of the inflaton self-coupling $λ$ and the external coupling $g^2$. Our results demonstrate that oscillons form even in the presence of an external coupling and we determine the upper bound on $g^2$ which facilitates oscillon formation. We also find that eventually, these oscillons decay into the scalar inflaton radiation as well as into the quanta of the offspring field $χ$. Thus, we establish the possibility that reheating could have proceeded through the channel of oscillon decay, along with the usual decay of the oscillating inflaton condensate into $χ$ particles. For a given value of the self-coupling $λ$, we notice that the lifetime of a population of oscillons decreases with an increase in the strength of the external coupling, following an (approximately) inverse power-law dependence on $g^2$. △ Less

Submitted 31 May, 2024; originally announced June 2024.

Comments: 44 pages, 21 figures, Github link provided in the paper

arXiv:2405.19907 [pdf]

DFT study of structural, electronic and optical properties of 2D MgO monolayer under bi-axial mechanical strain

Authors: Kamal Kumar, Anjali Kumari, Soni Mishra, Ramesh Sharma, Abhishek Kumar Mishra

Abstract: The structural, electronic, and dielectric (optical) properties of graphene-like 2D MgO monolayer have been explored through first-principles calculations under bi-axial tensile and compressive mechanical strain within a range of -10% to +10%. Our findings revealed that the pristine MgO monolayer is an indirect band gap semiconducting material and the semiconducting mature of MgO monolayer remains… ▽ More The structural, electronic, and dielectric (optical) properties of graphene-like 2D MgO monolayer have been explored through first-principles calculations under bi-axial tensile and compressive mechanical strain within a range of -10% to +10%. Our findings revealed that the pristine MgO monolayer is an indirect band gap semiconducting material and the semiconducting mature of MgO monolayer remains consistent under both compressive and tensile mechanical strain. This nature of MgO is confirmed through partial density of states (PDOS) as well as electronic band structure. PDOS exhibits the contribution of different atomic orbitals in bond formation and nature of bond, while band structure provides insight into electron transitions between energy levels of valance and conduction bands. All optical parameters (dielectric function, reflectivity, energy loss, refractive index, extinction coefficient and absorption) are plotted in an energy range 0-15 eV. Within this energy interval, MgO possesses the highest value of the refractive index (2.13) at 3.12 eV energy. Also, a detailed analysis of changes in the geometrical structure of MgO monolayer is provided. △ Less

Submitted 30 May, 2024; originally announced May 2024.

Comments: 6 figures

arXiv:2405.19101 [pdf, other]

Poseidon: Efficient Foundation Models for PDEs

Authors: Maximilian Herde, Bogdan Raonić, Tobias Rohner, Roger Käppeli, Roberto Molinaro, Emmanuel de Bézenac, Siddhartha Mishra

Abstract: We introduce Poseidon, a foundation model for learning the solution operators of PDEs. It is based on a multiscale operator transformer, with time-conditioned layer norms that enable continuous-in-time evaluations. A novel training strategy leveraging the semi-group property of time-dependent PDEs to allow for significant scaling-up of the training data is also proposed. Poseidon is pretrained on… ▽ More We introduce Poseidon, a foundation model for learning the solution operators of PDEs. It is based on a multiscale operator transformer, with time-conditioned layer norms that enable continuous-in-time evaluations. A novel training strategy leveraging the semi-group property of time-dependent PDEs to allow for significant scaling-up of the training data is also proposed. Poseidon is pretrained on a diverse, large scale dataset for the governing equations of fluid dynamics. It is then evaluated on a suite of 15 challenging downstream tasks that include a wide variety of PDE types and operators. We show that Poseidon exhibits excellent performance across the board by outperforming baselines significantly, both in terms of sample efficiency and accuracy. Poseidon also generalizes very well to new physics that is not seen during pretraining. Moreover, Poseidon scales with respect to model and data size, both for pretraining and for downstream tasks. Taken together, our results showcase the surprising ability of Poseidon to learn effective representations from a very small set of PDEs during pretraining in order to generalize well to unseen and unrelated PDEs downstream, demonstrating its potential as an effective, general purpose PDE foundation model. Finally, the Poseidon model as well as underlying pretraining and downstream datasets are open sourced, with code being available at https://github.com/camlab-ethz/poseidon and pretrained models and datasets at https://huggingface.co/camlab-ethz. △ Less

Submitted 29 May, 2024; originally announced May 2024.

arXiv:2405.18875 [pdf, other]

Counterfactual Metarules for Local and Global Recourse

Authors: Tom Bewley, Salim I. Amoukou, Saumitra Mishra, Daniele Magazzeni, Manuela Veloso

Abstract: We introduce T-CREx, a novel model-agnostic method for local and global counterfactual explanation (CE), which summarises recourse options for both individuals and groups in the form of human-readable rules. It leverages tree-based surrogate models to learn the counterfactual rules, alongside 'metarules' denoting their regions of optimality, providing both a global analysis of model behaviour and… ▽ More We introduce T-CREx, a novel model-agnostic method for local and global counterfactual explanation (CE), which summarises recourse options for both individuals and groups in the form of human-readable rules. It leverages tree-based surrogate models to learn the counterfactual rules, alongside 'metarules' denoting their regions of optimality, providing both a global analysis of model behaviour and diverse recourse options for users. Experiments indicate that T-CREx achieves superior aggregate performance over existing rule-based baselines on a range of CE desiderata, while being orders of magnitude faster to run. △ Less

Submitted 29 May, 2024; originally announced May 2024.

Comments: Accepted at ICML 2024

arXiv:2405.14558 [pdf, other]

FUSE: Fast Unified Simulation and Estimation for PDEs

Authors: Levi E. Lingsch, Dana Grund, Siddhartha Mishra, Georgios Kissas

Abstract: The joint prediction of continuous fields and statistical estimation of the underlying discrete parameters is a common problem for many physical systems, governed by PDEs. Hitherto, it has been separately addressed by employing operator learning surrogates for field prediction while using simulation-based inference (and its variants) for statistical parameter determination. Here, we argue that sol… ▽ More The joint prediction of continuous fields and statistical estimation of the underlying discrete parameters is a common problem for many physical systems, governed by PDEs. Hitherto, it has been separately addressed by employing operator learning surrogates for field prediction while using simulation-based inference (and its variants) for statistical parameter determination. Here, we argue that solving both problems within the same framework can lead to consistent gains in accuracy and robustness. To this end, We propose a novel and flexible formulation of the operator learning problem that allows jointly predicting continuous quantities and inferring distributions of discrete parameters, and thus amortizing the cost of both the inverse and the surrogate models to a joint pre-training step. We present the capabilities of the proposed methodology for predicting continuous and discrete biomarkers in full-body haemodynamics simulations under different levels of missing information. We also consider a test case for atmospheric large-eddy simulation of a two-dimensional dry cold bubble, where we infer both continuous time-series and information about the systems conditions. We present comparisons against different baselines to showcase significantly increased accuracy in both the inverse and the surrogate tasks. △ Less

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2405.10385 [pdf, other]

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning

Authors: Mina Ghashami, Soumya Smruti Mishra

Abstract: The SemEval 2024 BRAINTEASER task represents a pioneering venture in Natural Language Processing (NLP) by focusing on lateral thinking, a dimension of cognitive reasoning that is often overlooked in traditional linguistic analyses. This challenge comprises of Sentence Puzzle and Word Puzzle subtasks and aims to test language models' capacity for divergent thinking. In this paper, we present our… ▽ More The SemEval 2024 BRAINTEASER task represents a pioneering venture in Natural Language Processing (NLP) by focusing on lateral thinking, a dimension of cognitive reasoning that is often overlooked in traditional linguistic analyses. This challenge comprises of Sentence Puzzle and Word Puzzle subtasks and aims to test language models' capacity for divergent thinking. In this paper, we present our approach to the BRAINTEASER task. We employ a holistic strategy by leveraging cutting-edge pre-trained models in multiple choice architecture, and diversify the training data with Sentence and Word Puzzle datasets. To gain further improvement, we fine-tuned the model with synthetic humor or jokes dataset and the RiddleSense dataset which helped augmenting the model's lateral thinking abilities. Empirical results show that our approach achieve 92.5% accuracy in Sentence Puzzle subtask and 80.2% accuracy in Word Puzzle subtask. △ Less

Submitted 20 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

Comments: Accepted at SemEval 2024 (Colocated with NAACL 2024)

Journal ref: Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)

arXiv:2405.09898 [pdf]

NH3 gas sensing over 2D Phosphorene sheet: A First-Principles Study

Authors: Naresh Kumar, Yogendra K. Gautam, Soni Mishra, Anuj Kumar, Abhishek Kumar Mishra

Abstract: First-principles based calculations were executed to investigate the sensing properties of ammonia gas molecules on two-dimensional pristine black phosphorene towards its application as a gas sensor and related applications. We discuss in detail, the interaction of ammonia gas molecules on the phosphorene single sheet through the structural change analysis, electronic band gap, Bader charge transf… ▽ More First-principles based calculations were executed to investigate the sensing properties of ammonia gas molecules on two-dimensional pristine black phosphorene towards its application as a gas sensor and related applications. We discuss in detail, the interaction of ammonia gas molecules on the phosphorene single sheet through the structural change analysis, electronic band gap, Bader charge transfer, and density-of-states calculations. Our calculations indicate that the phosphorene could be used as a detector of ammonia, where good sensitivity and very short recovery time at room temperature have confirmed the potential use of phosphorene in the detection of ammonia. △ Less

Submitted 16 May, 2024; originally announced May 2024.

Comments: 21 pages, Figures 8

arXiv:2405.08925 [pdf, other]

Directional cues affect the collective behaviour of Self propelled particles in one dimension

Authors: Pawan Kumar Mishra, Abhra Puitandy, Shradha Mishra

Abstract: This study explores the effect of quenched disorder on the characteristic of self-propelled particles in one-dimension. Here,particles interact with disorder which serve as directional cues. The study investigates how the density of the disorder influence the emergence of ordering and clustering in the collection of the self propelled particles. We introduce the microscopic model as well as corres… ▽ More This study explores the effect of quenched disorder on the characteristic of self-propelled particles in one-dimension. Here,particles interact with disorder which serve as directional cues. The study investigates how the density of the disorder influence the emergence of ordering and clustering in the collection of the self propelled particles. We introduce the microscopic model as well as corresponding coarse-grained equations of motion for the local density and the orientation of particle. Disorder affects the macroscopic ordering in the system, the size of the ordered clusters decays algebraically with disorder. Further, the disorder also affects the clustering of particles; in the presence of disorder, a big macroscopic cluster breaks into small clusters, leads to the localization of particles around it and results in high density around the disorder. △ Less

Submitted 14 May, 2024; originally announced May 2024.

arXiv:2405.06842 [pdf, other]

BitVMX: A CPU for Universal Computation on Bitcoin

Authors: Sergio Demian Lerner, Ramon Amela, Shreemoy Mishra, Martin Jonas, Javier Álvarez Cid-Fuentes

Abstract: BitVMX is a new design for a virtual CPU to optimistically execute arbitrary programs on Bitcoin based on a challenge response game introduced in BitVM. Similar to BitVM1 we create a general-purpose CPU to be verified in Bitcoin script. Our design supports common architectures, such as RISC-V or MIPS. Our main contribution to the state of the art is a design that uses hash chains of program traces… ▽ More BitVMX is a new design for a virtual CPU to optimistically execute arbitrary programs on Bitcoin based on a challenge response game introduced in BitVM. Similar to BitVM1 we create a general-purpose CPU to be verified in Bitcoin script. Our design supports common architectures, such as RISC-V or MIPS. Our main contribution to the state of the art is a design that uses hash chains of program traces, memory mapped registers, and a new challenge-response protocol. We present a new message linking protocol as a means to allow authenticated communication between the participants. This protocol emulates stateful smart contracts by sharing state between transactions. This provides a basis for our verification game which uses a graph of pre-signed transactions to support challenge-response interactions. In case of a dispute, the hash chain of program trace is used with selective pre-signed transactions to locate (via $n$-ary search) and then recover the precise nature of errors in the computation. Unlike BitVM1, our approach does not require the creation of Merkle trees for CPU instructions or memory words. Additionally, it does not rely on signature equivocations. These differences help avoid complexities associated with BitVM1 and make BitVMX a compelling alternative to BitVM2. Our approach is quite flexible, BitVMX can be instantiated to balance transaction cost vs round complexity, prover cost vs verifier cost, and precomputations vs round complexity. △ Less

Submitted 10 May, 2024; originally announced May 2024.

arXiv:2405.05757 [pdf, other]

Design and Implementation of Energy-Efficient Wireless Tire Sensing System with Delay Analysis for Intelligent Vehicles

Authors: Shashank Mishra, Jia-Ming Liang

Abstract: The growing prevalence of Internet of Things (IoT) technologies has led to a rise in the popularity of intelligent vehicles that incorporate a range of sensors to monitor various aspects, such as driving speed, fuel usage, distance proximity and tire anomalies. Nowadays, real-time tire sensing systems play important roles for intelligent vehicles in increasing mileage, reducing fuel consumption, i… ▽ More The growing prevalence of Internet of Things (IoT) technologies has led to a rise in the popularity of intelligent vehicles that incorporate a range of sensors to monitor various aspects, such as driving speed, fuel usage, distance proximity and tire anomalies. Nowadays, real-time tire sensing systems play important roles for intelligent vehicles in increasing mileage, reducing fuel consumption, improving driving safety, and reducing the potential for traffic accidents. However, the current tire sensing system drains a significant vehicle' energy and lacks effective collection of sensing data, which may not guarantee the immediacy of driving safety. Thus, this paper designs an energy-efficient wireless tire sensing system (WTSS), which leverages energy-saving techniques to significantly reduce power consumption while ensuring data retrieval delays during real-time monitoring. Additionally, we mathematically analyze the worst-case transmission delay of the system to ensure the immediacy based on the collision probabilities of sensor transmissions. This system has been implemented and verified by the simulation and field trial experiments. These results show that the proposed scheme provides enhanced performance in energy efficiency and accurately identifies the worst transmission delay. △ Less

Submitted 27 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

arXiv:2405.05354 [pdf, other]

Transfer-LMR: Heavy-Tail Driving Behavior Recognition in Diverse Traffic Scenarios

Authors: Chirag Parikh, Ravi Shankar Mishra, Rohan Chandra, Ravi Kiran Sarvadevabhatla

Abstract: Recognizing driving behaviors is important for downstream tasks such as reasoning, planning, and navigation. Existing video recognition approaches work well for common behaviors (e.g. "drive straight", "brake", "turn left/right"). However, the performance is sub-par for underrepresented/rare behaviors typically found in tail of the behavior class distribution. To address this shortcoming, we propo… ▽ More Recognizing driving behaviors is important for downstream tasks such as reasoning, planning, and navigation. Existing video recognition approaches work well for common behaviors (e.g. "drive straight", "brake", "turn left/right"). However, the performance is sub-par for underrepresented/rare behaviors typically found in tail of the behavior class distribution. To address this shortcoming, we propose Transfer-LMR, a modular training routine for improving the recognition performance across all driving behavior classes. We extensively evaluate our approach on METEOR and HDD datasets that contain rich yet heavy-tailed distribution of driving behaviors and span diverse traffic scenarios. The experimental results demonstrate the efficacy of our approach, especially for recognizing underrepresented/rare driving behaviors. △ Less

Submitted 8 May, 2024; originally announced May 2024.

arXiv:2405.04256 [pdf, other]

doi 10.1103/PhysRevB.109.205115

Fermi surface of the chiral topological semimetal CoSi

Authors: Nico Huber, Sanu Mishra, Ilya Sheikin, Kirill Alpin, Andreas P. Schnyder, Georg Benka, Andreas Bauer, Christian Pfleiderer, Marc A. Wilde

Abstract: We report a study of the Fermi surface of the chiral semimetal CoSi and its relationship to a network of multifold topological crossing points,Weyl points, and topological nodal planes in the electronic band structure. Combining quantum oscillations in the Hall resistivity, magnetization, and torque magnetization with ab initio electronic structure calculations, we identify two groups of Fermi-sur… ▽ More We report a study of the Fermi surface of the chiral semimetal CoSi and its relationship to a network of multifold topological crossing points,Weyl points, and topological nodal planes in the electronic band structure. Combining quantum oscillations in the Hall resistivity, magnetization, and torque magnetization with ab initio electronic structure calculations, we identify two groups of Fermi-surface sheets, one centered at the R point and the other centered at the $Γ$ point. The presence of topological nodal planes at the Brillouin zone boundary enforces topological protectorates on the Fermi-surface sheets centered at the R point. In addition, Weyl points exist close to the Fermi-surface sheets centered at the R and the $Γ$ points. In contrast, topological crossing points at the R point and the $Γ$ point, which have been advertised to feature exceptionally large Chern numbers, are located at a larger distance to the Fermi level. Representing a unique example in which the multitude of topological band crossings has been shown to form a complex network, our observations in CoSi highlight the need for detailed numerical calculations of the Berry curvature at the Fermi level, regardless of the putative existence and the possible character of topological band crossings in the band structure. △ Less

Submitted 7 May, 2024; originally announced May 2024.

Journal ref: Physical Review B 109, 205115 (2024)

arXiv:2405.04223 [pdf, other]

Measurement of gravitational acceleration in a single laser operated atomic fountain

Authors: Kavish Bhardwaj, S. Singh, S. P. Ram, B. Jain, Vijay Kumar, Ayukt Pathak, Shradha Tiwari, V. B. Tiwari, S. R. Mishra

Abstract: We present measurements on Earth's gravitational acceleration (g) using an in-house developed cold atom gravimeter (CAG) in an atomic fountain geometry. In the setup, the laser cooled $^{87}Rb$ atoms are launched vertically up in the fountain geometry and Doppler sensitive two-photon Raman pulse atom interferometry is applied to detect the gravitational acceleration experienced by the atoms. Using… ▽ More We present measurements on Earth's gravitational acceleration (g) using an in-house developed cold atom gravimeter (CAG) in an atomic fountain geometry. In the setup, the laser cooled $^{87}Rb$ atoms are launched vertically up in the fountain geometry and Doppler sensitive two-photon Raman pulse atom interferometry is applied to detect the gravitational acceleration experienced by the atoms. Using our gravimeter setup, we have measured the local value of 'g' in our laboratory with sensitivity of 621 $μ$Gal for integration time of 1350 s. △ Less

Submitted 7 May, 2024; originally announced May 2024.

Comments: 12 pages, 8 figures

arXiv:2405.01183 [pdf, other]

An efficient quantifier elimination procedure for Presburger arithmetic

Authors: Christoph Haase, Shankara Narayanan Krishna, Khushraj Madnani, Om Swostik Mishra, Georg Zetzsche

Abstract: All known quantifier elimination procedures for Presburger arithmetic require doubly exponential time for eliminating a single block of existentially quantified variables. It has even been claimed in the literature that this upper bound is tight. We observe that this claim is incorrect and develop, as the main result of this paper, a quantifier elimination procedure eliminating a block of existent… ▽ More All known quantifier elimination procedures for Presburger arithmetic require doubly exponential time for eliminating a single block of existentially quantified variables. It has even been claimed in the literature that this upper bound is tight. We observe that this claim is incorrect and develop, as the main result of this paper, a quantifier elimination procedure eliminating a block of existentially quantified variables in singly exponential time. As corollaries, we can establish the precise complexity of numerous problems. Examples include deciding (i) monadic decomposability for existential formulas, (ii) whether an existential formula defines a well-quasi ordering or, more generally, (iii) certain formulas of Presburger arithmetic with Ramsey quantifiers. Moreover, despite the exponential blowup, our procedure shows that under mild assumptions, even NP upper bounds for decision problems about quantifier-free formulas can be transferred to existential formulas. The technical basis of our results is a kind of small model property for parametric integer programming that generalizes the seminal results by von zur Gathen and Sieveking on small integer points in convex polytopes. △ Less

Submitted 2 May, 2024; originally announced May 2024.

Comments: Accepted for publication at ICALP 2024

arXiv:2404.16687 [pdf, other]

NTIRE 2024 Quality Assessment of AI-Generated Content Challenge

Authors: Xiaohong Liu, Xiongkuo Min, Guangtao Zhai, Chunyi Li, Tengchuan Kou, Wei Sun, Haoning Wu, Yixuan Gao, Yuqin Cao, Zicheng Zhang, Xiele Wu, Radu Timofte, Fei Peng, Huiyuan Fu, Anlong Ming, Chuanming Wang, Huadong Ma, Shuai He, Zifei Dou, Shu Chen, Huacong Zhang, Haiyi Xie, Chengwei Wang, Baoying Chen, Jishen Zeng , et al. (89 additional authors not shown)

Abstract: This paper reports on the NTIRE 2024 Quality Assessment of AI-Generated Content Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2024. This challenge is to address a major challenge in the field of image and video processing, namely, Image Quality Assessment (IQA) and Video Quality Assessment (VQA) for AI-Generated Conte… ▽ More This paper reports on the NTIRE 2024 Quality Assessment of AI-Generated Content Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2024. This challenge is to address a major challenge in the field of image and video processing, namely, Image Quality Assessment (IQA) and Video Quality Assessment (VQA) for AI-Generated Content (AIGC). The challenge is divided into the image track and the video track. The image track uses the AIGIQA-20K, which contains 20,000 AI-Generated Images (AIGIs) generated by 15 popular generative models. The image track has a total of 318 registered participants. A total of 1,646 submissions are received in the development phase, and 221 submissions are received in the test phase. Finally, 16 participating teams submitted their models and fact sheets. The video track uses the T2VQA-DB, which contains 10,000 AI-Generated Videos (AIGVs) generated by 9 popular Text-to-Video (T2V) models. A total of 196 participants have registered in the video track. A total of 991 submissions are received in the development phase, and 185 submissions are received in the test phase. Finally, 12 participating teams submitted their models and fact sheets. Some methods have achieved better results than baseline methods, and the winning methods in both tracks have demonstrated superior prediction performance on AIGC. △ Less

Submitted 7 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

arXiv:2404.14790 [pdf, other]

Pressure-dependent electronic superlattice in the Kagome-superconductor CsV$\mathrm{_3}$Sb$\mathrm{_5}$

Authors: F. Stier, A. -A. Haghighirad, G. Garbarino, S. Mishra, N. Stilkerich, D. Chen, C. Shekhar, T. Lacmann, C. Felser, T. Ritschel, J. Geck, M. Le Tacon

Abstract: We present a high-resolution single crystal x-ray diffraction study of Kagome superconductor \cvs, exploring its response to variations in pressure and temperature. We discover that at low temperatures, the structural modulations of the electronic superlattice, commonly associated with charge density wave order, undergo a transformation around $p \sim$ 0.7 GPa from the familiar $2\times2$ pattern… ▽ More We present a high-resolution single crystal x-ray diffraction study of Kagome superconductor \cvs, exploring its response to variations in pressure and temperature. We discover that at low temperatures, the structural modulations of the electronic superlattice, commonly associated with charge density wave order, undergo a transformation around $p \sim$ 0.7 GPa from the familiar $2\times2$ pattern to a long-range ordered modulation at wavevector $q=(0, 3/8, 1/2)$. Our observations align with inferred changes in the CDW pattern from prior transport and nuclear magnetic resonance studies, providing new insights into these transitions. Interestingly, the pressure-induced variations in the electronic superlattice correlate with two peaks in the superconducting transition temperature as pressure changes, hinting that fluctuations within the electronic superlattice could be key to stabilizing superconductivity. However, our findings contrast with the minimal pressure dependency anticipated by ab initio calculations of the electronic structure. They also challenge prevailing scenarios based on a Peierls-like nesting mechanism involving van Hove singularities. △ Less

Submitted 23 April, 2024; originally announced April 2024.

arXiv:2404.13747 [pdf]

Evolution of Ring Airy vortex beam and Ring Pearcey vortex beam in turbulent atmosphere and a comparative analysis of their channel efficiency

Authors: Shakti Singh, Sanjay Kumar Mishra, Akhilesh Kumar Mishra

Abstract: An optical vortex beam propagating through turbulent atmosphere encounters distortions in the wavefront that results in modal scattering. Abruptly autofocussing (AAF) beams with orbital angular momentum have gained significant attention due to their non-diffracting and self-healing nature. These warrants understanding of the behaviour of these beams through turbulent atmosphere absolutely necessar… ▽ More An optical vortex beam propagating through turbulent atmosphere encounters distortions in the wavefront that results in modal scattering. Abruptly autofocussing (AAF) beams with orbital angular momentum have gained significant attention due to their non-diffracting and self-healing nature. These warrants understanding of the behaviour of these beams through turbulent atmosphere absolutely necessary. With this intuition, in the present work we investigate the behaviour of two AAF beams namely ring Airy vortex beam (RAVB) and ring Pearcey vortex beam (RPVB) through the turbulent atmosphere in two cases: multiplexed and non-multiplexed. We propagate multiplexed as well as non-multiplexed RAVB and RPVB in different levels of turbulent atmosphere. In non-multiplexed case, channel efficiency declines for both the beams with increase in modes numbers. In multiplexed case, increasing the gap between the mode sets results in decrease in channel efficiency. We also report that in weak atmospheric turbulence RAVB outperform RPVB in terms of channel efficiency. We use optical transformation sorting (log-polar) method to demultiplex the optical beams at the output. Furthermore, we investigate and compare the OAM spectra of both beams in different levels of atmospheric turbulence and at different propagation distances. The comparison reveals that the spectra of RPVB is more dispersive as compared to that of RAVB. △ Less

Submitted 21 April, 2024; originally announced April 2024.

Comments: 11 pages, 8 figures

arXiv:2404.13154 [pdf, other]

Phase-Field Modeling of Fracture with Physics-Informed Deep Learning

Authors: M. Manav, R. Molinaro, S. Mishra, L. De Lorenzis

Abstract: We explore the potential of the deep Ritz method to learn complex fracture processes such as quasistatic crack nucleation, propagation, kinking, branching, and coalescence within the unified variational framework of phase-field modeling of brittle fracture. We elucidate the challenges related to the neural-network-based approximation of the energy landscape, and the ability of an optimization appr… ▽ More We explore the potential of the deep Ritz method to learn complex fracture processes such as quasistatic crack nucleation, propagation, kinking, branching, and coalescence within the unified variational framework of phase-field modeling of brittle fracture. We elucidate the challenges related to the neural-network-based approximation of the energy landscape, and the ability of an optimization approach to reach the correct energy minimum, and we discuss the choices in the construction and training of the neural network which prove to be critical to accurately and efficiently capture all the relevant fracture phenomena. The developed method is applied to several benchmark problems and the results are shown to be in qualitative and quantitative agreement with the finite element solution. The robustness of the approach is tested by using neural networks with different initializations. △ Less

Submitted 19 April, 2024; originally announced April 2024.

Comments: 43 pages, 29 figures

arXiv:2404.10157 [pdf, other]

Salient Object-Aware Background Generation using Text-Guided Diffusion Models

Authors: Amir Erfan Eshratifar, Joao V. B. Soares, Kapil Thadani, Shaunak Mishra, Mikhail Kuznetsov, Yueh-Ning Ku, Paloma de Juan

Abstract: Generating background scenes for salient objects plays a crucial role across various domains including creative design and e-commerce, as it enhances the presentation and context of subjects by integrating them into tailored environments. Background generation can be framed as a task of text-conditioned outpainting, where the goal is to extend image content beyond a salient object's boundaries on… ▽ More Generating background scenes for salient objects plays a crucial role across various domains including creative design and e-commerce, as it enhances the presentation and context of subjects by integrating them into tailored environments. Background generation can be framed as a task of text-conditioned outpainting, where the goal is to extend image content beyond a salient object's boundaries on a blank background. Although popular diffusion models for text-guided inpainting can also be used for outpainting by mask inversion, they are trained to fill in missing parts of an image rather than to place an object into a scene. Consequently, when used for background creation, inpainting models frequently extend the salient object's boundaries and thereby change the object's identity, which is a phenomenon we call "object expansion." This paper introduces a model for adapting inpainting diffusion models to the salient object outpainting task using Stable Diffusion and ControlNet architectures. We present a series of qualitative and quantitative results across models and datasets, including a newly proposed metric to measure object expansion that does not require any human labeling. Compared to Stable Diffusion 2.0 Inpainting, our proposed approach reduces object expansion by 3.6x on average with no degradation in standard visual metrics across multiple datasets. △ Less

Submitted 15 April, 2024; originally announced April 2024.

Comments: Accepted for publication at CVPR 2024's Generative Models for Computer Vision workshop

arXiv:2404.08391 [pdf, other]

On trapping geometry for cold atoms in a radio-frequency (rf) dressed potential

Authors: Sourabh Sarkar, S. P. Ram, Kavish Bhardwaj, Gunjan Verma, V. B. Tiwari, S. R. Mishra

Abstract: We have investigated the atom trapping geometry for trapping of $^{87}{Rb}$ atoms in a radio-frequency (rf) dressed potential generated after superposing a strong linearly polarized rf-field on a static magnetic trap. For this, laser cooled atoms in a magneto-optical trap (MOT) in an ultra-high vacuum (UHV) chamber (pressure $\sim$ 1.5 $\times$ $10^{-10}$ Torr) were trapped in a quadrupole magneti… ▽ More We have investigated the atom trapping geometry for trapping of $^{87}{Rb}$ atoms in a radio-frequency (rf) dressed potential generated after superposing a strong linearly polarized rf-field on a static magnetic trap. For this, laser cooled atoms in a magneto-optical trap (MOT) in an ultra-high vacuum (UHV) chamber (pressure $\sim$ 1.5 $\times$ $10^{-10}$ Torr) were trapped in a quadrupole magnetic trap and evaporatively cooled before transferring them to the rf-dressed potential. The experimentally observed hollow shell type atom trapping geometry has been explained by theoretical modelling of the trapping potential. △ Less

Submitted 12 April, 2024; originally announced April 2024.

arXiv:2404.07243 [pdf, other]

Interacting tachyon with varying mass dark matter

Authors: Goutam Mandal, Sudip Mishra, Abdulla Al Mamon, Sujay Kr. Biswas

Abstract: This paper presents an investigation of cosmological dynamics of tachyon fluid coupled to varyingmass dark matter particles in the background of spatially flat FLRW universe. The mechanism of varying mass particles scenario assumes the mass of the dark matter depends on time t through the scalar field $φ$ in the sense that the decaying of dark matter reproduces the scalar field. First, we analyze… ▽ More This paper presents an investigation of cosmological dynamics of tachyon fluid coupled to varyingmass dark matter particles in the background of spatially flat FLRW universe. The mechanism of varying mass particles scenario assumes the mass of the dark matter depends on time t through the scalar field $φ$ in the sense that the decaying of dark matter reproduces the scalar field. First, we analyze the model from dynamical systems perspective by converting the cosmological evolution equations into an autonomous system of ordinary differential equations with a suitable transformation of variables. We choose the mass of dark matter as exponential function of scalar field and the exponential potential of the tachyon field is undertaken in such a way that the autonomous system is reduced in three dimensional form. The critical points obtained from the system are non-hyperbolic in nature. The center manifold theory is employed to discuss the nature of the critical points. Numerical investigation also carried out for some critical points. From this analysis, we obtain dust dominated decelerated transient phase of the universe followed by dark energy dominated scaling attractor alleviating the coincidence problem. Next, we perform the statefinder diagnostic approach to compare our model to $Λ$CDM and finally we study the evolution of the Hubble parameter and the distance modulus and compare this with observational data. △ Less

Submitted 9 April, 2024; originally announced April 2024.

Comments: 17 pages, 8 Caption figures, 21 figures

arXiv:2404.04706 [pdf, other]

doi 10.1007/978-981-97-0407-1_7

Advances in Differential Privacy and Differentially Private Machine Learning

Authors: Saswat Das, Subhankar Mishra

Abstract: There has been an explosion of research on differential privacy (DP) and its various applications in recent years, ranging from novel variants and accounting techniques in differential privacy to the thriving field of differentially private machine learning (DPML) to newer implementations in practice, like those by various companies and organisations such as census bureaus. Most recent surveys foc… ▽ More There has been an explosion of research on differential privacy (DP) and its various applications in recent years, ranging from novel variants and accounting techniques in differential privacy to the thriving field of differentially private machine learning (DPML) to newer implementations in practice, like those by various companies and organisations such as census bureaus. Most recent surveys focus on the applications of differential privacy in particular contexts like data publishing, specific machine learning tasks, analysis of unstructured data, location privacy, etc. This work thus seeks to fill the gap for a survey that primarily discusses recent developments in the theory of differential privacy along with newer DP variants, viz. Renyi DP and Concentrated DP, novel mechanisms and techniques, and the theoretical developments in differentially private machine learning in proper detail. In addition, this survey discusses its applications to privacy-preserving machine learning in practice and a few practical implementations of DP. △ Less

Submitted 6 April, 2024; originally announced April 2024.

Journal ref: Information Technology Security, 2024, pp 147 to 188, Springer Tracts in Electrical and Electronics Engineering, Springer, Singapore

arXiv:2404.04177 [pdf, other]

Discriminating chaotic and integrable regimes in quenched field Floquet system using saturation of Out-of-time-order correlation

Authors: Rohit Kumar Shukla, Gaurav Rudra Malik, S. Aravinda, Sunil Kumar Mishra

Abstract: The dynamic region of out-of-time-ordered correlators (OTOCs) is a valuable discriminator of chaos in classical and semiclassical systems, as it captures the characteristic exponential growth. However, in spin systems, it does not reliably quantify chaos, exhibiting similar behavior in both integrable and chaotic systems. Instead, we leverage the saturation behavior of OTOCs as a means to differen… ▽ More The dynamic region of out-of-time-ordered correlators (OTOCs) is a valuable discriminator of chaos in classical and semiclassical systems, as it captures the characteristic exponential growth. However, in spin systems, it does not reliably quantify chaos, exhibiting similar behavior in both integrable and chaotic systems. Instead, we leverage the saturation behavior of OTOCs as a means to differentiate between chaotic and integrable regimes. We use integrable and nonintegrable quenched field Floquet systems to describe this discriminator. In the integrable system, the saturation region of OTOCs exhibits oscillatory behavior, whereas, in the chaotic system, it shows exact saturation i.e., system gets thermalized. To gain a clearer understanding of the oscillations, we calculate the inverse participation ratio (IPR) for the normalized Fourier spectrum of OTOC. In order to further substantiate our findings, we propose the nearest-neighbor spacing distribution (NNSD) of time-dependent unitary operators. This distribution effectively differentiates chaotic and regular regions, corroborating the outcomes derived from the saturation behavior of OTOC. △ Less

Submitted 5 April, 2024; originally announced April 2024.

Comments: 12 Pages and 12 Figures

Showing 1–50 of 1,026 results for author: Mishra, S