-
Effect of ground-state deformation on the Isoscalar Giant Monopole Resonance and the first observation of overtones of the Isoscalar Giant Quadrupole Resonance in rare-earth Nd isotopes
Authors:
M. Abdullah,
S. Bagchi,
M. N. Harakeh,
H. Akimune,
D. Das,
T. Doi,
L. M. Donaldson,
Y. Fujikawa,
M. Fujiwara,
T. Furuno,
U. Garg,
Y. K. Gupta,
K. B. Howard,
Y. Hijikata,
K. Inaba,
S. Ishida,
M. Itoh,
N. Kalantar-Nayestanaki,
D. Kar,
T. Kawabata,
S. Kawashima,
K. Khokhar,
K. Kitamura,
N. Kobayashi,
Y. Matsuda
, et al. (11 additional authors not shown)
Abstract:
The strength distributions of the Isoscalar Giant Monopole Resonance (ISGMR) and Isoscalar Giant Quadrupole Resonance (ISGQR) in 142,146-150Nd have been determined via inelastic alpha-particle scattering with the Grand Raiden (GR) Spectrometer at the Research Center for Nuclear Physics (RCNP), Japan. In the deformed nuclei 146-150Nd, the ISGMR strength distributions exhibit a splitting into two co…
▽ More
The strength distributions of the Isoscalar Giant Monopole Resonance (ISGMR) and Isoscalar Giant Quadrupole Resonance (ISGQR) in 142,146-150Nd have been determined via inelastic alpha-particle scattering with the Grand Raiden (GR) Spectrometer at the Research Center for Nuclear Physics (RCNP), Japan. In the deformed nuclei 146-150Nd, the ISGMR strength distributions exhibit a splitting into two components, while the nearly spherical nucleus 142Nd displays a single peak in the ISGMR strength distribution. A noteworthy achievement in this study is the first-time detection of overtones in the Isoscalar Giant Quadrupole Resonance (ISGQR) strength distributions within Nd isotopes at an excitation energy around 25 MeV obtained through Multipole Decomposition Analysis (MDA).
△ Less
Submitted 8 July, 2024;
originally announced July 2024.
-
Segmentation-Free Guidance for Text-to-Image Diffusion Models
Authors:
Kambiz Azarian,
Debasmit Das,
Qiqi Hou,
Fatih Porikli
Abstract:
We introduce segmentation-free guidance, a novel method designed for text-to-image diffusion models like Stable Diffusion. Our method does not require retraining of the diffusion model. At no additional compute cost, it uses the diffusion model itself as an implied segmentation network, hence named segmentation-free guidance, to dynamically adjust the negative prompt for each patch of the generate…
▽ More
We introduce segmentation-free guidance, a novel method designed for text-to-image diffusion models like Stable Diffusion. Our method does not require retraining of the diffusion model. At no additional compute cost, it uses the diffusion model itself as an implied segmentation network, hence named segmentation-free guidance, to dynamically adjust the negative prompt for each patch of the generated image, based on the patch's relevance to concepts in the prompt. We evaluate segmentation-free guidance both objectively, using FID, CLIP, IS, and PickScore, and subjectively, through human evaluators. For the subjective evaluation, we also propose a methodology for subsampling the prompts in a dataset like MS COCO-30K to keep the number of human evaluations manageable while ensuring that the selected subset is both representative in terms of content and fair in terms of model performance. The results demonstrate the superiority of our segmentation-free guidance to the widely used classifier-free method. Human evaluators preferred segmentation-free guidance over classifier-free 60% to 19%, with 18% of occasions showing a strong preference. Additionally, PickScore win-rate, a recently proposed metric mimicking human preference, also indicates a preference for our method over classifier-free.
△ Less
Submitted 3 June, 2024;
originally announced July 2024.
-
Toward Wireless System and Circuit Co-Design for the Internet of Self-Adaptive Things
Authors:
Diptashree Das,
Mohammad Abdi,
Minghan Liu,
Marvin Onabajo,
Francesco Restuccia
Abstract:
The deployment of a growing number of devices in Internet of Things (IoT) networks implies that uninterrupted and seamless adaptation of wireless communication parameters (e.g., carrier frequency, bandwidth and modulation) will become essential. To utilize wireless devices capable of switching several communication parameters requires real-time self-optimizations at the radio frequency integrated…
▽ More
The deployment of a growing number of devices in Internet of Things (IoT) networks implies that uninterrupted and seamless adaptation of wireless communication parameters (e.g., carrier frequency, bandwidth and modulation) will become essential. To utilize wireless devices capable of switching several communication parameters requires real-time self-optimizations at the radio frequency integrated circuit (RFIC) level based on system level performance metrics during the processing of complex modulated signals. This article introduces a novel design verification approach for reconfigurable RFICs based on end-to-end wireless system-level performance metrics while operating in a dynamically changing communication environment. In contrast to prior work, this framework includes two modules that simulate a wireless channel and decode waveforms. These are connected to circuit-level modules that capture device- and circuit-level non-idealities of RFICs for design validation and optimization, such as transistor noises, intermodulation/harmonic distortions, and memory effects from parasitic capacitances. We demonstrate this framework with a receiver (RX) consisting of a reconfigurable complementary metal-oxide semiconductor (CMOS) low-noise amplifier (LNA) designed at the transistor level, a behavioral model of a mixer, and an ideal filter model. The seamless integration between system-level wireless models with circuit-level and behavioral models (such as VerilogA-based models) for RFIC blocks enables to preemptively evaluate circuit and system designs, and to optimize for different communication scenarios with adaptive circuits having extensive tuning ranges. An exemplary case study is presented, in which simulation results reveal that the LNA power consumption can be reduced up to 16x depending on system-level requirements.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
Demand Analysis and Customized Product Offering Design on E-Commerce Platform
Authors:
Dipankar Das
Abstract:
It can be observed that the purchasing decision of an individual consumer in an electronic marketplace is determined by a set of factors, such as personal characteristics of the consumer, product pricing, minimum price-quantity combination offered, decision-making space, and underlying motivation of the consumer. These factors are combined to form a consumer's choice problem domain, which plays a…
▽ More
It can be observed that the purchasing decision of an individual consumer in an electronic marketplace is determined by a set of factors, such as personal characteristics of the consumer, product pricing, minimum price-quantity combination offered, decision-making space, and underlying motivation of the consumer. These factors are combined to form a consumer's choice problem domain, which plays a pivotal role in the product offering. In this study, we attempt to focus on how the products? Offered can be customized by incorporating the quantity and pack size of the products along with the factors above to form a more extensive domain for examining the combined effects of all of these factors on demand. Accordingly, the demand function is defined by a novel method invoking the extended domain of choice problem in the electronic marketplace. Consequently, the predictable uncertainty associated with the consumer's demand function may disappear, increase the likelihood of earning optimum revenue through customized combinations of the components of the extended domain of choice problem, and improve the understanding of the fluctuations in consumer demand. Finally, we propose a generalized price response function with standard properties applicable to E-Commerce.
△ Less
Submitted 29 April, 2024;
originally announced June 2024.
-
Coexistence of local magnetism and superconductivity in the heavy-fermion CeRh$_2$As$_2$ revealed by $μ$SR studies
Authors:
Seunghyun Khim,
Oliver Stockert,
Manuel Brando,
Christoph Geibel,
Chirstopher Baines,
Thomas J. Hicken,
Hubertus Luetkens,
Debarchan Das,
Toni Shiroka,
Zurab Guguchia,
Robert Scheuermann
Abstract:
The superconducting (SC) state ($T_\mathrm{c}$ = 0.3 K) of the heavy-fermion compound CeRh$_2$As$_2$, which undergoes an unusual field-induced transition to another high-field SC state, emerges from an unknown ordered state below $T_\mathrm{o}$ = 0.55 K. While an electronic multipolar order of itinerant Ce-4$f$ states was proposed to account for the $T_\mathrm{o}$ phase, the exact order parameter…
▽ More
The superconducting (SC) state ($T_\mathrm{c}$ = 0.3 K) of the heavy-fermion compound CeRh$_2$As$_2$, which undergoes an unusual field-induced transition to another high-field SC state, emerges from an unknown ordered state below $T_\mathrm{o}$ = 0.55 K. While an electronic multipolar order of itinerant Ce-4$f$ states was proposed to account for the $T_\mathrm{o}$ phase, the exact order parameter has not been known to date. Here, we report on muon spin relaxation ($μ$SR) studies of the magnetic and SC properties in CeRh$_2$As$_2$ single crystals at low temperatures. We reveal a magnetic origin of the $T_\mathrm{o}$ order by identifying a spontaneous internal field below $T_\mathrm{o}$ = 0.55 K. Furthermore, we find evidence of a microscopic coexistence of local magnetism with bulk superconductivity. Our findings open the possibility that the $T_\mathrm{o}$ phase involves both dipole and higher order Ce-4$f$ moment degrees of freedom and accounts for the unusual non-Fermi liquid behavior.
△ Less
Submitted 26 June, 2024; v1 submitted 24 June, 2024;
originally announced June 2024.
-
Blind Baselines Beat Membership Inference Attacks for Foundation Models
Authors:
Debeshee Das,
Jie Zhang,
Florian Tramèr
Abstract:
Membership inference (MI) attacks try to determine if a data sample was used to train a machine learning model. For foundation models trained on unknown Web data, MI attacks can be used to detect copyrighted training materials, measure test set contamination, or audit machine unlearning. Unfortunately, we find that evaluations of MI attacks for foundation models are flawed, because they sample mem…
▽ More
Membership inference (MI) attacks try to determine if a data sample was used to train a machine learning model. For foundation models trained on unknown Web data, MI attacks can be used to detect copyrighted training materials, measure test set contamination, or audit machine unlearning. Unfortunately, we find that evaluations of MI attacks for foundation models are flawed, because they sample members and non-members from different distributions. For 8 published MI evaluation datasets, we show that blind attacks -- that distinguish the member and non-member distributions without looking at any trained model -- outperform state-of-the-art MI attacks. Existing evaluations thus tell us nothing about membership leakage of a foundation model's training data.
△ Less
Submitted 23 June, 2024;
originally announced June 2024.
-
Molecule Graph Networks with Many-body Equivariant Interactions
Authors:
Zetian Mao,
Jiawen Li,
Chen Liang,
Diptesh Das,
Masato Sumita,
Koji Tsuda
Abstract:
Message passing neural networks have demonstrated significant efficacy in predicting molecular interactions. Introducing equivariant vectorial representations augments expressivity by capturing geometric data symmetries, thereby improving model accuracy. However, two-body bond vectors in opposition may cancel each other out during message passing, leading to the loss of directional information on…
▽ More
Message passing neural networks have demonstrated significant efficacy in predicting molecular interactions. Introducing equivariant vectorial representations augments expressivity by capturing geometric data symmetries, thereby improving model accuracy. However, two-body bond vectors in opposition may cancel each other out during message passing, leading to the loss of directional information on their shared node. In this study, we develop Equivariant N-body Interaction Networks (ENINet) that explicitly integrates equivariant many-body interactions to preserve directional information in the message passing scheme. Experiments indicate that integrating many-body equivariant representations enhances prediction accuracy across diverse scalar and tensorial quantum chemical properties. Ablation studies show an average performance improvement of 7.9% across 11 out of 12 properties in QM9, 27.9% in forces in MD17, and 11.3% in polarizabilities (CCSD) in QM7b.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Non-Kerr Constraints using Binary Black Hole inspirals considering phase modifications up to 4 PN order
Authors:
Debtroy Das,
Swarnim Shashank,
Cosimo Bambi
Abstract:
The gravitational field around an astrophysical black hole (BH) is thought to be described by the Kerr spacetime, which is a solution of the Einstein equation. Signatures of binary black hole (BBH) coalescence in gravitational waves (GW) follow the Kerr spacetime as the theoretical foundation. Hence, any possible deviations from the Kerr spacetime around BHs serve as a test of the nature of gravit…
▽ More
The gravitational field around an astrophysical black hole (BH) is thought to be described by the Kerr spacetime, which is a solution of the Einstein equation. Signatures of binary black hole (BBH) coalescence in gravitational waves (GW) follow the Kerr spacetime as the theoretical foundation. Hence, any possible deviations from the Kerr spacetime around BHs serve as a test of the nature of gravity in the strong-field regime and of the predictions of General Relativity. In our study, we perform a theory-agnostic test of the Kerr hypothesis using BBH inspirals from the third Gravitational-wave Transient Catalog (GWTC-3). Considering the Johannsen metric, we compute the leading-order deviation to the emitted GW in the frequency domain. Our results provide constraints on two deformation parameters ($α_{13}$ and $ε_3$) and demonstrate the degeneracy between these two non-Kerr parameters.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Task Planning for Object Rearrangement in Multi-room Environments
Authors:
Karan Mirakhor,
Sourav Ghosh,
Dipanjan Das,
Brojeshwar Bhowmick
Abstract:
Object rearrangement in a multi-room setup should produce a reasonable plan that reduces the agent's overall travel and the number of steps. Recent state-of-the-art methods fail to produce such plans because they rely on explicit exploration for discovering unseen objects due to partial observability and a heuristic planner to sequence the actions for rearrangement. This paper proposes a novel hie…
▽ More
Object rearrangement in a multi-room setup should produce a reasonable plan that reduces the agent's overall travel and the number of steps. Recent state-of-the-art methods fail to produce such plans because they rely on explicit exploration for discovering unseen objects due to partial observability and a heuristic planner to sequence the actions for rearrangement. This paper proposes a novel hierarchical task planner to efficiently plan a sequence of actions to discover unseen objects and rearrange misplaced objects within an untidy house to achieve a desired tidy state. The proposed method introduces several novel techniques, including (i) a method for discovering unseen objects using commonsense knowledge from large language models, (ii) a collision resolution and buffer prediction method based on Cross-Entropy Method to handle blocked goal and swap cases, (iii) a directed spatial graph-based state space for scalability, and (iv) deep reinforcement learning (RL) for producing an efficient planner. The planner interleaves the discovery of unseen objects and rearrangement to minimize the number of steps taken and overall traversal of the agent. The paper also presents new metrics and a benchmark dataset called MoPOR to evaluate the effectiveness of the rearrangement planning in a multi-room setting. The experimental results demonstrate that the proposed method effectively addresses the multi-room rearrangement problem.
△ Less
Submitted 1 June, 2024;
originally announced June 2024.
-
A Stochastic Incentive-based Demand Response Program for Virtual Power Plant with Solar, Battery, Electric Vehicles, and Controllable Loads
Authors:
Pratik Harsh,
Hongjian Sun,
Debapriya Das,
Goyal Awagan,
Jing Jiang
Abstract:
The growing integration of distributed energy resources (DERs) into the power grid necessitates an effective coordination strategy to maximize their benefits. Acting as an aggregator of DERs, a virtual power plant (VPP) facilitates this coordination, thereby amplifying their impact on the transmission level of the power grid. Further, a demand response program enhances the scheduling approach by m…
▽ More
The growing integration of distributed energy resources (DERs) into the power grid necessitates an effective coordination strategy to maximize their benefits. Acting as an aggregator of DERs, a virtual power plant (VPP) facilitates this coordination, thereby amplifying their impact on the transmission level of the power grid. Further, a demand response program enhances the scheduling approach by managing the energy demands in parallel with the uncertain energy outputs of the DERs. This work presents a stochastic incentive-based demand response model for the scheduling operation of VPP comprising solar-powered generating stations, battery swapping stations, electric vehicle charging stations, and consumers with controllable loads. The work also proposes a priority mechanism to consider the individual preferences of electric vehicle users and consumers with controllable loads. The scheduling approach for the VPP is framed as a multi-objective optimization problem, normalized using the utopia-tracking method. Subsequently, the normalized optimization problem is transformed into a stochastic formulation to address uncertainties in energy demand from charging stations and controllable loads. The proposed VPP scheduling approach is addressed on a 33-node distribution system simulated using MATLAB software, which is further validated using a real-time digital simulator.
△ Less
Submitted 31 May, 2024;
originally announced June 2024.
-
QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge
Authors:
Hongwei Bran Li,
Fernando Navarro,
Ivan Ezhov,
Amirhossein Bayat,
Dhritiman Das,
Florian Kofler,
Suprosanna Shit,
Diana Waldmannstetter,
Johannes C. Paetzold,
Xiaobin Hu,
Benedikt Wiestler,
Lucas Zimmer,
Tamaz Amiranashvili,
Chinmay Prabhakar,
Christoph Berger,
Jonas Weidner,
Michelle Alonso-Basant,
Arif Rashid,
Ujjwal Baid,
Wesam Adel,
Deniz Ali,
Bhakti Baheti,
Yingbin Bai,
Ishaan Bhatt,
Sabri Can Cetindag
, et al. (55 additional authors not shown)
Abstract:
Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the de…
▽ More
Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the development and evaluation of automated segmentation algorithms. Accurately modeling and quantifying this variability is essential for enhancing the robustness and clinical applicability of these algorithms. We report the set-up and summarize the benchmark results of the Quantification of Uncertainties in Biomedical Image Quantification Challenge (QUBIQ), which was organized in conjunction with International Conferences on Medical Image Computing and Computer-Assisted Intervention (MICCAI) 2020 and 2021. The challenge focuses on the uncertainty quantification of medical image segmentation which considers the omnipresence of inter-rater variability in imaging datasets. The large collection of images with multi-rater annotations features various modalities such as MRI and CT; various organs such as the brain, prostate, kidney, and pancreas; and different image dimensions 2D-vs-3D. A total of 24 teams submitted different solutions to the problem, combining various baseline models, Bayesian neural networks, and ensemble model techniques. The obtained results indicate the importance of the ensemble models, as well as the need for further research to develop efficient 3D methods for uncertainty quantification methods in 3D segmentation tasks.
△ Less
Submitted 24 June, 2024; v1 submitted 19 March, 2024;
originally announced May 2024.
-
Drastic modification in thermal conductivity of TiCoSb Half-Heusler alloy: Phonon engineering by lattice softening and ionic polarization
Authors:
S. Mahakal,
Avijit Jana,
Diptasikha Das,
Nabakumar Rana,
Pallabi Sardar,
Aritra Banerjee,
Shamima Hussain,
Santanu K. Maiti,
K. Malik
Abstract:
A drastic variation in thermal conductivity (\k{appa}) for synthesized samples (TiCoSb1+x, x=0.0, 0.01, 0.02, 0.03, 0.04, and 0.06) is observed and ~47% reduction in \k{appa} is reported for TiCoSb1.02 sample. In depth structural analysis is performed, employing mixed-phase Rietveld refinement technique. Embedded phases and vacancy are analyzed from X-ray diffraction (XRD) and Scanning electron mi…
▽ More
A drastic variation in thermal conductivity (\k{appa}) for synthesized samples (TiCoSb1+x, x=0.0, 0.01, 0.02, 0.03, 0.04, and 0.06) is observed and ~47% reduction in \k{appa} is reported for TiCoSb1.02 sample. In depth structural analysis is performed, employing mixed-phase Rietveld refinement technique. Embedded phases and vacancy are analyzed from X-ray diffraction (XRD) and Scanning electron microscopy data. Local structures of the synthesized samples are explored for the first time by X-ray absorption spectroscopy measurements for TiCoSb system and corroborated with Rietveld refinement data. Lattice dynamics are revealed using Raman Spectroscopy (RS) measurements in unprecedented attempts for TiCoSb system. XRD and RS data accomplishes that variation in \k{appa} as a function of Sb concentration is observed owing to an alteration in phonon group velocity related to lattice softening. Polar nature of TiCoSb HH sample is revealed. LO-TO splitting (related to polar optical phonon scattering) in phonon vibration is observed due to polar nature of TiCoSb synthesized samples. Tailoring in LO-TO splitting due to screening effect, correlated with Co vacancies is reported for TiCoSb1+x synthesized samples. Lattice softening and LO-TO splitting lead to decreases in \k{appa}~47% for TiCoSb1.02 synthesized sample.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
IoT-enabled Stability Chamber for the Pharmaceutical Industry
Authors:
Nitol Saha,
Md Masruk Aulia,
Dibakar Das,
Md. Mostafizur Rahman
Abstract:
A stability chamber is a critical piece of equipment for any pharmaceutical facility to retain the manufactured product for testing the stability and quality of the products over a certain period of time by keeping the products in different sets of environmental conditions. In this paper, we proposed an IoT-enabled stability chamber for the pharmaceutical industry. We developed four stability cham…
▽ More
A stability chamber is a critical piece of equipment for any pharmaceutical facility to retain the manufactured product for testing the stability and quality of the products over a certain period of time by keeping the products in different sets of environmental conditions. In this paper, we proposed an IoT-enabled stability chamber for the pharmaceutical industry. We developed four stability chambers by using the existing utilities of a manufacturing facility. The state-of-the-art automatic PID controlling system of Siemens S7-1200 PLC was used to control each chamber. PC-based Siemens WinCC Runtime Advanced visualization platform was used to visualize the data of the chamber which is FDA 21 CFR Part 11 Compliant. Additionally, an Internet of Things-based (IoT-based) application was also developed to monitor the sensor's data remotely using any client application.
△ Less
Submitted 21 May, 2024; v1 submitted 14 May, 2024;
originally announced May 2024.
-
DOLOMITES: Domain-Specific Long-Form Methodical Tasks
Authors:
Chaitanya Malaviya,
Priyanka Agrawal,
Kuzman Ganchev,
Pranesh Srinivasan,
Fantine Huot,
Jonathan Berant,
Mark Yatskar,
Dipanjan Das,
Mirella Lapata,
Chris Alberti
Abstract:
Experts in various fields routinely perform methodical writing tasks to plan, organize, and report their work. From a clinician writing a differential diagnosis for a patient, to a teacher writing a lesson plan for students, these tasks are pervasive, requiring to methodically generate structured long-form output for a given input. We develop a typology of methodical tasks structured in the form o…
▽ More
Experts in various fields routinely perform methodical writing tasks to plan, organize, and report their work. From a clinician writing a differential diagnosis for a patient, to a teacher writing a lesson plan for students, these tasks are pervasive, requiring to methodically generate structured long-form output for a given input. We develop a typology of methodical tasks structured in the form of a task objective, procedure, input, and output, and introduce DoLoMiTes, a novel benchmark with specifications for 519 such tasks elicited from hundreds of experts from across 25 fields. Our benchmark further contains specific instantiations of methodical tasks with concrete input and output examples (1,857 in total) which we obtain by collecting expert revisions of up to 10 model-generated examples of each task. We use these examples to evaluate contemporary language models highlighting that automating methodical tasks is a challenging long-form generation problem, as it requires performing complex inferences, while drawing upon the given context as well as domain knowledge.
△ Less
Submitted 28 May, 2024; v1 submitted 9 May, 2024;
originally announced May 2024.
-
Probabilistic Interval Analysis of Unreliable Programs
Authors:
Dibyendu Das,
Soumyajit Dey
Abstract:
Advancement of chip technology will make future computer chips faster. Power consumption of such chips shall also decrease. But this speed gain shall not come free of cost, there is going to be a trade-off between speed and efficiency, i.e accuracy of the computation. In order to achieve this extra speed we will simply have to let our computers make more mistakes in computations. Consequently, sys…
▽ More
Advancement of chip technology will make future computer chips faster. Power consumption of such chips shall also decrease. But this speed gain shall not come free of cost, there is going to be a trade-off between speed and efficiency, i.e accuracy of the computation. In order to achieve this extra speed we will simply have to let our computers make more mistakes in computations. Consequently, systems built with these type of chips will possess an innate unreliability lying within. Programs written for these systems will also have to incorporate this unreliability. Researchers have already started developing programming frameworks for unreliable architectures as such.
In the present work, we use a restricted version of C-type languages to model the programs written for unreliable architectures. We propose a technique for statically analyzing codes written for these kind of architectures. Our technique, which primarily focuses on Interval/Range Analysis of this type of programs, uses the well established theory of abstract interpretation. While discussing unreliability of hardware, there comes scope of failure of the hardware components implicitly. There are two types of failure models, namely: 1) permanent failure model, where the hardware stops execution on failure and 2) transient failure model, where on failure, the hardware continues subsequent operations with wrong operand values. In this paper, we've only taken transient failure model into consideration. The goal of this analysis is to predict the probability with which a program variable assumes values from a given range at a given program point.
△ Less
Submitted 25 April, 2024;
originally announced April 2024.
-
Transformer-based Joint Modelling for Automatic Essay Scoring and Off-Topic Detection
Authors:
Sourya Dipta Das,
Yash Vadi,
Kuldeep Yadav
Abstract:
Automated Essay Scoring (AES) systems are widely popular in the market as they constitute a cost-effective and time-effective option for grading systems. Nevertheless, many studies have demonstrated that the AES system fails to assign lower grades to irrelevant responses. Thus, detecting the off-topic response in automated essay scoring is crucial in practical tasks where candidates write unrelate…
▽ More
Automated Essay Scoring (AES) systems are widely popular in the market as they constitute a cost-effective and time-effective option for grading systems. Nevertheless, many studies have demonstrated that the AES system fails to assign lower grades to irrelevant responses. Thus, detecting the off-topic response in automated essay scoring is crucial in practical tasks where candidates write unrelated text responses to the given task in the question. In this paper, we are proposing an unsupervised technique that jointly scores essays and detects off-topic essays. The proposed Automated Open Essay Scoring (AOES) model uses a novel topic regularization module (TRM), which can be attached on top of a transformer model, and is trained using a proposed hybrid loss function. After training, the AOES model is further used to calculate the Mahalanobis distance score for off-topic essay detection. Our proposed method outperforms the baseline we created and earlier conventional methods on two essay-scoring datasets in off-topic detection as well as on-topic scoring. Experimental evaluation results on different adversarial strategies also show how the suggested method is robust for detecting possible human-level perturbations.
△ Less
Submitted 24 March, 2024;
originally announced April 2024.
-
Simple lift of non-simple closed curves
Authors:
Deblina Das,
Arpan Kabiraj
Abstract:
Given a compact, oriented surface $S$ of finite genus and finitely many boundary components, we provide examples of finite covers $\tilde{S}$ of $S$ and non-simple closed curves $γ$ on $S$ which lifts to simple closed curves on $\tilde{S}$. In particular, given any positive integer $n\geq 2$, we construct explicit non-simple closed curves on $S$ which has a simple lift to a degree $n$ cover of…
▽ More
Given a compact, oriented surface $S$ of finite genus and finitely many boundary components, we provide examples of finite covers $\tilde{S}$ of $S$ and non-simple closed curves $γ$ on $S$ which lifts to simple closed curves on $\tilde{S}$. In particular, given any positive integer $n\geq 2$, we construct explicit non-simple closed curves on $S$ which has a simple lift to a degree $n$ cover of $S$.
△ Less
Submitted 6 June, 2024; v1 submitted 11 April, 2024;
originally announced April 2024.
-
Anticipate & Collab: Data-driven Task Anticipation and Knowledge-driven Planning for Human-robot Collaboration
Authors:
Shivam Singh,
Karthik Swaminathan,
Raghav Arora,
Ramandeep Singh,
Ahana Datta,
Dipanjan Das,
Snehasis Banerjee,
Mohan Sridharan,
Madhava Krishna
Abstract:
An agent assisting humans in daily living activities can collaborate more effectively by anticipating upcoming tasks. Data-driven methods represent the state of the art in task anticipation, planning, and related problems, but these methods are resource-hungry and opaque. Our prior work introduced a proof of concept framework that used an LLM to anticipate 3 high-level tasks that served as goals f…
▽ More
An agent assisting humans in daily living activities can collaborate more effectively by anticipating upcoming tasks. Data-driven methods represent the state of the art in task anticipation, planning, and related problems, but these methods are resource-hungry and opaque. Our prior work introduced a proof of concept framework that used an LLM to anticipate 3 high-level tasks that served as goals for a classical planning system that computed a sequence of low-level actions for the agent to achieve these goals. This paper describes DaTAPlan, our framework that significantly extends our prior work toward human-robot collaboration. Specifically, DaTAPlan planner computes actions for an agent and a human to collaboratively and jointly achieve the tasks anticipated by the LLM, and the agent automatically adapts to unexpected changes in human action outcomes and preferences. We evaluate DaTAPlan capabilities in a realistic simulation environment, demonstrating accurate task anticipation, effective human-robot collaboration, and the ability to adapt to unexpected changes. Project website: https://dataplan-hrc.github.io
△ Less
Submitted 4 April, 2024;
originally announced April 2024.
-
On Bootstrapping Lasso in Generalized Linear Models and the Cross Validation
Authors:
Mayukh Choudhury,
Debraj Das
Abstract:
Generalized linear models or GLM constitutes an important set of models which generalizes the ordinary linear regression by connecting the response variable with the covariates through arbitrary link functions. On the other hand, Lasso is a popular and easy to implement penalization method in regression when all the covariates are not relevant. However, Lasso generally has non-tractable asymptotic…
▽ More
Generalized linear models or GLM constitutes an important set of models which generalizes the ordinary linear regression by connecting the response variable with the covariates through arbitrary link functions. On the other hand, Lasso is a popular and easy to implement penalization method in regression when all the covariates are not relevant. However, Lasso generally has non-tractable asymptotic distribution and hence development of an alternative method of distributional approximation is required for the purpose of statistical inference. In this paper, we develop a Bootstrap method which works as an approximation of the distribution of the Lasso estimator for all the sub-models of GLM. To connect the distributional approximation theory based on the proposed Bootstrap method with the practical implementation of Lasso, we explore the asymptotic properties of K-fold cross validation-based penalty parameter. The results established essentially justifies drawing valid statistical inference regarding the unknown parameters based on the proposed Bootstrap method for any sub model of GLM after selecting the penalty parameter using K-fold cross validation. Good finite sample properties are also shown through a moderately large simulation study. The method is also implemented on a real data set.
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
qIoV: A Quantum-Driven Internet-of-Vehicles-Based Approach for Environmental Monitoring and Rapid Response Systems
Authors:
Ankur Nahar,
Koustav Kumar Mondal,
Debasis Das,
Rajkumar Buyya
Abstract:
This research addresses the critical necessity for advanced rapid response operations in managing a spectrum of environmental hazards. We propose a novel framework, qIoV that integrates quantum computing with the Internet-of-Vehicles (IoV) to leverage the computational efficiency, parallelism, and entanglement properties of quantum mechanics. Our approach involves the use of environmental sensors…
▽ More
This research addresses the critical necessity for advanced rapid response operations in managing a spectrum of environmental hazards. We propose a novel framework, qIoV that integrates quantum computing with the Internet-of-Vehicles (IoV) to leverage the computational efficiency, parallelism, and entanglement properties of quantum mechanics. Our approach involves the use of environmental sensors mounted on vehicles for precise air quality assessment. These sensors are designed to be highly sensitive and accurate, leveraging the principles of quantum mechanics to detect and measure environmental parameters. A salient feature of our proposal is the Quantum Mesh Network Fabric (QMF), a system designed to dynamically adjust the quantum network topology in accordance with vehicular movements. This capability is critical to maintaining the integrity of quantum states against environmental and vehicular disturbances, thereby ensuring reliable data transmission and processing. Moreover, our methodology is further augmented by the incorporation of a variational quantum classifier (VQC) with advanced quantum entanglement techniques. This integration offers a significant reduction in latency for hazard alert transmission, thus enabling expedited communication of crucial data to emergency response teams and the public. Our study on the IBM OpenQSAM 3 platform, utilizing a 127 Qubit system, revealed significant advancements in pair plot analysis, achieving over 90% in precision, recall, and F1-Score metrics and an 83% increase in the speed of toxic gas detection compared to conventional methods.Additionally, theoretical analyses validate the efficiency of quantum rotation, teleportation protocols, and the fidelity of quantum entanglement, further underscoring the potential of quantum computing in enhancing analytical performance.
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
eRST: A Signaled Graph Theory of Discourse Relations and Organization
Authors:
Amir Zeldes,
Tatsuya Aoyama,
Yang Janet Liu,
Siyao Peng,
Debopam Das,
Luke Gessler
Abstract:
In this article we present Enhanced Rhetorical Structure Theory (eRST), a new theoretical framework for computational discourse analysis, based on an expansion of Rhetorical Structure Theory (RST). The framework encompasses discourse relation graphs with tree-breaking, nonprojective and concurrent relations, as well as implicit and explicit signals which give explainable rationales to our analyses…
▽ More
In this article we present Enhanced Rhetorical Structure Theory (eRST), a new theoretical framework for computational discourse analysis, based on an expansion of Rhetorical Structure Theory (RST). The framework encompasses discourse relation graphs with tree-breaking, nonprojective and concurrent relations, as well as implicit and explicit signals which give explainable rationales to our analyses. We survey shortcomings of RST and other existing frameworks, such as Segmented Discourse Representation Theory (SDRT), the Penn Discourse Treebank (PDTB) and Discourse Dependencies, and address these using constructs in the proposed theory. We provide annotation, search and visualization tools for data, and present and evaluate a freely available corpus of English annotated according to our framework, encompassing 12 spoken and written genres with over 200K tokens. Finally, we discuss automatic parsing, evaluation metrics and applications for data in our framework.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
Ultrahigh Frequency and Multi-channel Output in Skyrmion Based Nano-oscillator
Authors:
Abhishek Sharma,
Saumya Gupta,
Debasis Das,
Ashwin. A. Tulapurkar,
Bhaskaran Muralidharan
Abstract:
Spintronic nano-oscillators can generate tunable microwave signals that find a wide range of applications in the field of telecommunication to modern neuromorphic computing systems. Among other spintronic devices, a magnetic skyrmion is a promising candidate for the next generation of low-power devices due to its small size and topological stability. In this work, we propose a multi-channel oscill…
▽ More
Spintronic nano-oscillators can generate tunable microwave signals that find a wide range of applications in the field of telecommunication to modern neuromorphic computing systems. Among other spintronic devices, a magnetic skyrmion is a promising candidate for the next generation of low-power devices due to its small size and topological stability. In this work, we propose a multi-channel oscillator design based on the synthetic anti-ferromagnetic (SAF) skyrmion pair. The mitigation of the skyrmion Hall effect in SAF and the associated decimation of the Magnus force endows the proposed oscillator with an ultra-high frequency of 41GHz and a multi-channel frequency output driven by the same current. The ultrahigh operational frequency represents an $\sim$342 times improvement compared to the monolayer single skyrmion oscillator featuring a constant uniaxial anisotropy profile. Using micromagnetic simulations, we demonstrate the effectiveness of our proposed multi-channel oscillator design by introducing multi-channel nanotracks along with multiple skyrmions for enhanced frequency operation. The ultrahigh operational frequency and multi-channel output are attributed to three key factors: The oscillator design accounting for a finite spin-flip length of the spacer (such as Ru) material, tangential velocity proportionality on input spin current along with weak dependence on the radius of rotation of the skyrmion-pair, skyrmion interlocking in the channel enabled by the multi-channel high Ku rings and skyrmion-skyrmion repulsion, therefore resulting ultrahigh frequency and multi-channel outputs.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
PosSAM: Panoptic Open-vocabulary Segment Anything
Authors:
Vibashan VS,
Shubhankar Borse,
Hyojin Park,
Debasmit Das,
Vishal Patel,
Munawar Hayat,
Fatih Porikli
Abstract:
In this paper, we introduce an open-vocabulary panoptic segmentation model that effectively unifies the strengths of the Segment Anything Model (SAM) with the vision-language CLIP model in an end-to-end framework. While SAM excels in generating spatially-aware masks, it's decoder falls short in recognizing object class information and tends to oversegment without additional guidance. Existing appr…
▽ More
In this paper, we introduce an open-vocabulary panoptic segmentation model that effectively unifies the strengths of the Segment Anything Model (SAM) with the vision-language CLIP model in an end-to-end framework. While SAM excels in generating spatially-aware masks, it's decoder falls short in recognizing object class information and tends to oversegment without additional guidance. Existing approaches address this limitation by using multi-stage techniques and employing separate models to generate class-aware prompts, such as bounding boxes or segmentation masks. Our proposed method, PosSAM is an end-to-end model which leverages SAM's spatially rich features to produce instance-aware masks and harnesses CLIP's semantically discriminative features for effective instance classification. Specifically, we address the limitations of SAM and propose a novel Local Discriminative Pooling (LDP) module leveraging class-agnostic SAM and class-aware CLIP features for unbiased open-vocabulary classification. Furthermore, we introduce a Mask-Aware Selective Ensembling (MASE) algorithm that adaptively enhances the quality of generated masks and boosts the performance of open-vocabulary classification during inference for each image. We conducted extensive experiments to demonstrate our methods strong generalization properties across multiple datasets, achieving state-of-the-art performance with substantial improvements over SOTA open-vocabulary panoptic segmentation methods. In both COCO to ADE20K and ADE20K to COCO settings, PosSAM outperforms the previous state-of-the-art methods by a large margin, 2.4 PQ and 4.6 PQ, respectively. Project Website: https://vibashan.github.io/possam-web/.
△ Less
Submitted 14 March, 2024;
originally announced March 2024.
-
SNOW-SCA: ML-assisted Side-Channel Attack on SNOW-V
Authors:
Harshit Saurabh,
Anupam Golder,
Samarth Shivakumar Titti,
Suparna Kundu,
Chaoyun Li,
Angshuman Karmakar,
Debayan Das
Abstract:
This paper presents SNOW-SCA, the first power side-channel analysis (SCA) attack of a 5G mobile communication security standard candidate, SNOW-V, running on a 32-bit ARM Cortex-M4 microcontroller. First, we perform a generic known-key correlation (KKC) analysis to identify the leakage points. Next, a correlation power analysis (CPA) attack is performed, which reduces the attack complexity to two…
▽ More
This paper presents SNOW-SCA, the first power side-channel analysis (SCA) attack of a 5G mobile communication security standard candidate, SNOW-V, running on a 32-bit ARM Cortex-M4 microcontroller. First, we perform a generic known-key correlation (KKC) analysis to identify the leakage points. Next, a correlation power analysis (CPA) attack is performed, which reduces the attack complexity to two key guesses for each key byte. The correct secret key is then uniquely identified utilizing linear discriminant analysis (LDA). The profiled SCA attack with LDA achieves 100% accuracy after training with $<200$ traces, which means the attack succeeds with just a single trace. Overall, using the \textit{combined CPA and LDA attack} model, the correct secret key byte is recovered with <50 traces collected using the ChipWhisperer platform. The entire 256-bit secret key of SNOW-V can be recovered incrementally using the proposed SCA attack. Finally, we suggest low-overhead countermeasures that can be used to prevent these SCA attacks.
△ Less
Submitted 13 March, 2024;
originally announced March 2024.
-
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Authors:
Gemini Team,
Petko Georgiev,
Ving Ian Lei,
Ryan Burnell,
Libin Bai,
Anmol Gulati,
Garrett Tanzer,
Damien Vincent,
Zhufeng Pan,
Shibo Wang,
Soroosh Mariooryad,
Yifan Ding,
Xinyang Geng,
Fred Alcober,
Roy Frostig,
Mark Omernick,
Lexi Walker,
Cosmin Paduraru,
Christina Sorokin,
Andrea Tacchetti,
Colin Gaffney,
Samira Daruki,
Olcan Sercinoglu,
Zach Gleicher,
Juliette Love
, et al. (1092 additional authors not shown)
Abstract:
In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…
▽ More
In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February version on the great majority of capabilities and benchmarks; (2) Gemini 1.5 Flash, a more lightweight variant designed for efficiency with minimal regression in quality. Gemini 1.5 models achieve near-perfect recall on long-context retrieval tasks across modalities, improve the state-of-the-art in long-document QA, long-video QA and long-context ASR, and match or surpass Gemini 1.0 Ultra's state-of-the-art performance across a broad set of benchmarks. Studying the limits of Gemini 1.5's long-context ability, we find continued improvement in next-token prediction and near-perfect retrieval (>99%) up to at least 10M tokens, a generational leap over existing models such as Claude 3.0 (200k) and GPT-4 Turbo (128k). Finally, we highlight real-world use cases, such as Gemini 1.5 collaborating with professionals on completing their tasks achieving 26 to 75% time savings across 10 different job categories, as well as surprising new capabilities of large language models at the frontier; when given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content.
△ Less
Submitted 14 June, 2024; v1 submitted 8 March, 2024;
originally announced March 2024.
-
Reversibility in the Seifert-fibered spaces
Authors:
Anushree Das,
Debattam Das
Abstract:
An element $a$ in a group $Γ$ is called \emph{reversible} if there exists $g \in Γ$ such that $gag^{-1}=a^{-1}$. The reversible elements are also known as `real elements' or `reciprocal elements' in literature. In this paper, we classify the reversible elements in Fuchsian groups, and use this classification to find all reversible elements in a Seifert-fibered group. In the last section we apply t…
▽ More
An element $a$ in a group $Γ$ is called \emph{reversible} if there exists $g \in Γ$ such that $gag^{-1}=a^{-1}$. The reversible elements are also known as `real elements' or `reciprocal elements' in literature. In this paper, we classify the reversible elements in Fuchsian groups, and use this classification to find all reversible elements in a Seifert-fibered group. In the last section we apply this classification to the braid groups, particularly to the braid group on $3$ strands.
△ Less
Submitted 6 March, 2024; v1 submitted 3 March, 2024;
originally announced March 2024.
-
A Unified Evaluation Framework for Spiking Neural Network Hardware Accelerators Based on Emerging Non-Volatile Memory Devices
Authors:
Debasis Das,
Xuanyao Fong
Abstract:
Spiking Neural Networks (SNNs) have emerged as a promising paradigm, offering event-driven and energy-efficient computation. In recent studies, various devices tailored for SNN synapses and neurons have been proposed, leveraging the unique characteristics of emerging non-volatile memory (eNVM) technologies. While substantial progress has been made in exploring the capabilities of SNNs and designin…
▽ More
Spiking Neural Networks (SNNs) have emerged as a promising paradigm, offering event-driven and energy-efficient computation. In recent studies, various devices tailored for SNN synapses and neurons have been proposed, leveraging the unique characteristics of emerging non-volatile memory (eNVM) technologies. While substantial progress has been made in exploring the capabilities of SNNs and designing dedicated hardware components, there exists a critical gap in establishing a unified approach for evaluating hardware-level metrics. Specifically, metrics such as latency, and energy consumption, are pivotal in assessing the practical viability and efficiency of the constructed neural network. In this article, we address this gap by presenting a comprehensive framework for evaluating hardware-level metrics in SNNs based on non-volatile memory devices. We systematically analyze the impact of synaptic and neuronal components on energy consumption providing a unified perspective for assessing the overall efficiency of the network. In this study, our emphasis lies on the neuron and synaptic device based on magnetic skyrmions. Nevertheless, our framework is versatile enough to encompass other emerging devices as well. Utilizing our proposed skyrmionic devices, the constructed SNN demonstrates an inference accuracy of approximately 98% and achieves energy consumption on the order of pJ when processing the Modified National Institute of Standards and Technology (MNIST) handwritten digit dataset.
△ Less
Submitted 29 February, 2024;
originally announced February 2024.
-
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning
Authors:
Debrup Das,
Debopriyo Banerjee,
Somak Aditya,
Ashish Kulkarni
Abstract:
Tool-augmented Large Language Models (TALMs) are known to enhance the skillset of large language models (LLMs), thereby, leading to their improved reasoning abilities across many tasks. While, TALMs have been successfully employed in different question-answering benchmarks, their efficacy on complex mathematical reasoning benchmarks, and the potential complementary benefits offered by tools for kn…
▽ More
Tool-augmented Large Language Models (TALMs) are known to enhance the skillset of large language models (LLMs), thereby, leading to their improved reasoning abilities across many tasks. While, TALMs have been successfully employed in different question-answering benchmarks, their efficacy on complex mathematical reasoning benchmarks, and the potential complementary benefits offered by tools for knowledge retrieval and mathematical equation solving are open research questions. In this work, we present MathSensei, a tool-augmented large language model for mathematical reasoning. We study the complementary benefits of the tools - knowledge retriever (Bing Web Search), program generator + executor (Python), and symbolic equation solver (Wolfram-Alpha API) through evaluations on mathematical reasoning datasets. We perform exhaustive ablations on MATH, a popular dataset for evaluating mathematical reasoning on diverse mathematical disciplines. We also conduct experiments involving well-known tool planners to study the impact of tool sequencing on the model performance. MathSensei achieves 13.5% better accuracy over gpt-3.5-turbo with Chain-of-Thought on the MATH dataset. We further observe that TALMs are not as effective for simpler math word problems (in GSM-8K), and the benefit increases as the complexity and required knowledge increases (progressively over AQuA, MMLU-Math, and higher level complex questions in MATH). The code and data are available at https://github.com/Debrup-61/MathSensei.
△ Less
Submitted 3 April, 2024; v1 submitted 27 February, 2024;
originally announced February 2024.
-
Charge orders with distinct magnetic response in a prototypical kagome superconductor LaRu$_{3}$Si$_{2}$
Authors:
C. Mielke III,
V. Sazgari,
I. Plokhikh,
S. Shin,
H. Nakamura,
J. N. Graham,
J. Küspert,
I. Bialo,
G. Garbarino,
D. Das,
M. Medarde,
M. Bartkowiak,
S. S. Islam,
R. Khasanov,
H. Luetkens,
M. Z. Hasan,
E. Pomjakushina,
J. -X. Yin,
M. H. Fischer,
J. Chang,
T. Neupert,
S. Nakatsuji,
B. Wehinger,
D. J. Gawryluk,
Z. Guguchia
Abstract:
The kagome lattice has emerged as a promising platform for hosting unconventional chiral charge order at high temperatures. Notably, in LaRu$_{3}$Si$_{2}$, a room-temperature charge-ordered state with a propagation vector of ($\frac{1}{4}$,~0,~0) has been recently identified. However, understanding the interplay between this charge order and superconductivity, particularly with respect to time-rev…
▽ More
The kagome lattice has emerged as a promising platform for hosting unconventional chiral charge order at high temperatures. Notably, in LaRu$_{3}$Si$_{2}$, a room-temperature charge-ordered state with a propagation vector of ($\frac{1}{4}$,~0,~0) has been recently identified. However, understanding the interplay between this charge order and superconductivity, particularly with respect to time-reversal-symmetry breaking, remains elusive. In this study, we employ single crystal X-ray diffraction, magnetotransport, and muon-spin rotation experiments to investigate the charge order and its electronic and magnetic responses in LaRu$_{3}$Si$_{2}$ across a wide temperature range down to the superconducting state. Our findings reveal the emergence of a charge order with a propagation vector of ($\frac{1}{6}$,~0,~0) below $T_{\rm CO,2}$ ${\simeq}$ 80 K, coexisting with the previously identified room-temperature primary charge order ($\frac{1}{4}$,~0,~0). The primary charge-ordered state exhibits zero magnetoresistance. In contrast, the appearance of the secondary charge order at $T_{\rm CO,2}$ is accompanied by a notable magnetoresistance response and a pronounced temperature-dependent Hall effect, which experiences a sign reversal, switching from positive to negative below $T^{*}$ ${\simeq}$ 35 K. Intriguingly, we observe an enhancement in the internal field width sensed by the muon ensemble below $T^{*}$ ${\simeq}$ 35 K. Moreover, the muon spin relaxation rate exhibits a substantial increase upon the application of an external magnetic field below $T_{\rm CO,2}$ ${\simeq}$ 80 K. Our results highlight the coexistence of two distinct types of charge order in LaRu$_{3}$Si$_{2}$ within the correlated kagome lattice, namely a non-magnetic charge order ($\frac{1}{4}$,~0,~0) below $T_{\rm co,1}$ ${\simeq}$ 400 K and a time-reversal-symmetry-breaking charge order below $T_{\rm CO,2}$.
△ Less
Submitted 28 February, 2024; v1 submitted 25 February, 2024;
originally announced February 2024.
-
Research status of the Mendeleev Periodic Table: a bibliometric analysis
Authors:
Kamna Sharma,
Deepak Kumar Das,
Saibal Ray
Abstract:
In this paper, we present a bibliometric analysis of the Mendeleev Periodic Table. We have conducted a comprehensive analysis of the Scopus-based database using the keyword "Mendeleev Periodic Table". Our findings suggest that the Mendeleev Periodic Table is an influential topic in the field of Inorganic as well as Organic Chemistry. Future researchers may focus on expanding our analysis to includ…
▽ More
In this paper, we present a bibliometric analysis of the Mendeleev Periodic Table. We have conducted a comprehensive analysis of the Scopus-based database using the keyword "Mendeleev Periodic Table". Our findings suggest that the Mendeleev Periodic Table is an influential topic in the field of Inorganic as well as Organic Chemistry. Future researchers may focus on expanding our analysis to include other bibliometric indicators to gain a more comprehensive understanding of the impact of the Mendeleev Periodic Table in chemistry-based scientific investigations and even in the field of astrochemistry.
△ Less
Submitted 18 February, 2024;
originally announced February 2024.
-
Depth-dependent study of time-reversal symmetry-breaking in the kagome superconductor $A$V$_{3}$Sb$_{5}$
Authors:
J. N. Graham,
C. Mielke III,
D. Das,
T. Morresi,
V. Sazgari,
A. Suter,
T. Prokscha,
H. Deng,
R. Khasanov,
S. D. Wilson,
A. C. Salinas,
M. M. Martins,
Y. Zhong,
K. Okazaki,
Z. Wang,
M. Z. Hasan,
M. Fischer,
T. Neupert,
J. -X. Yin,
S. Sanna,
H. Luetkens,
Z. Salman,
P. Bonfa,
Z. Guguchia
Abstract:
The breaking of time-reversal symmetry (TRS) in the normal state of kagome superconductors $A$V$_{3}$Sb$_{5}$ stands out as a significant feature. Yet the extent to which this effect can be tuned remains uncertain, a crucial aspect to grasp in light of the varying details of TRS breaking observed through different techniques. Here, we employ the unique low-energy muon spin rotation technique combi…
▽ More
The breaking of time-reversal symmetry (TRS) in the normal state of kagome superconductors $A$V$_{3}$Sb$_{5}$ stands out as a significant feature. Yet the extent to which this effect can be tuned remains uncertain, a crucial aspect to grasp in light of the varying details of TRS breaking observed through different techniques. Here, we employ the unique low-energy muon spin rotation technique combined with local field numerical analysis to study the TRS breaking response as a function of depth from the surface in single crystals of RbV$_{3}$Sb$_{5}$ with charge order and Cs(V$_{0.86}$Ta$_{0.14}$)$_{3}$Sb$_{5}$ without charge order. In the bulk (i.e., > 33 nm from the surface) of RbV$_{3}$Sb$_{5}$, we have detected a notable increase in the internal magnetic field width experienced by the muon ensemble. This increase occurs only within the charge ordered state. Intriguingly, the muon spin relaxation rate is significantly enhanced near the surface (i.e., < 33 nm from the surface) of RbV$_{3}$Sb$_{5}$, and this effect commences at temperatures significantly higher than the onset of charge order. Conversely, in Cs(V$_{0.86}$Ta$_{0.14}$)$_{3}$Sb$_{5}$, we do not observe a similar enhancement in the internal field width, neither in the bulk nor near the surface. These observations indicate a strong connection between charge order and TRS breaking on one hand, and on the other hand, suggest that TRS breaking can occur prior to long-range charge order. This research offers compelling evidence for depth-dependent magnetism in $A$V$_{3}$Sb$_{5}$ superconductors in the presence of charge order. Such findings are likely to elucidate the intricate microscopic mechanisms that underpin the TRS breaking phenomena in these materials.
△ Less
Submitted 16 February, 2024;
originally announced February 2024.
-
Sign of the $hZZ$ coupling and implication for new physics
Authors:
Dipankar Das,
Anirban Kundu,
Miguel Levy,
Anugrah M. Prasad,
Ipsita Saha,
Agnivo Sarkar
Abstract:
The magnitudes of the couplings of the scalar resonance at 125 GeV with the SM particles are found to be consistent with those of the SM Higgs boson. However, the signs are not experimentally determined in most of the cases, a prime example being that with the $Z$-boson pair. In other words, $κ_Z^h$, the ratio of the couplings of the actual 125 GeV resonance with $ZZ$ and that of the SM Higgs boso…
▽ More
The magnitudes of the couplings of the scalar resonance at 125 GeV with the SM particles are found to be consistent with those of the SM Higgs boson. However, the signs are not experimentally determined in most of the cases, a prime example being that with the $Z$-boson pair. In other words, $κ_Z^h$, the ratio of the couplings of the actual 125 GeV resonance with $ZZ$ and that of the SM Higgs boson with the same, is consistent with both $+1$ and $-1$, the latter being the `wrong-sign'. We argue that the wrong-sign $hZZ$ coupling will necessitate the intervention of new physics below $\mathcal{O}\left(620\right)$ GeV to safeguard the underlying theory from unitarity violation. The strength of the new nonstandard couplings can be derived from the unitarity sum rules, which are comparable to the SM-Higgs couplings in magnitude. Thus the strong limits from the direct searches at the LHC can help us rule out the existence of such nonstandard particles with unusually large couplings thereby disfavoring the possibility of a wrong-sign $hZZ$ coupling.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
Universal stress correlations in crystalline and amorphous packings
Authors:
Roshan Maharana,
Debankur Das,
Pinaki Chaudhuri,
Kabir Ramola
Abstract:
We present a universal characterization of stress correlations in athermal systems, across crystalline to amorphous packings. Via numerical analysis of static configurations of particles interacting through harmonic as well as Lennard-Jones potentials, for a variety of preparation protocols and ranges of microscopic disorder, we show that the properties of the stress correlations at large lengthsc…
▽ More
We present a universal characterization of stress correlations in athermal systems, across crystalline to amorphous packings. Via numerical analysis of static configurations of particles interacting through harmonic as well as Lennard-Jones potentials, for a variety of preparation protocols and ranges of microscopic disorder, we show that the properties of the stress correlations at large lengthscales are surprisingly universal across all situations, independent of structural correlations, or the correlations in orientational order. In the near-crystalline limit, we present exact results for the stress correlations for both models, which work surprisingly well at large lengthscales, even in the amorphous phase. Finally, we study the differences in stress fluctuations across the amorphization transition, where stress correlations reveal the loss of periodicity in the structure at short lengthscales with increasing disorder.
△ Less
Submitted 25 April, 2024; v1 submitted 13 February, 2024;
originally announced February 2024.
-
Ir-Sb Binary System: Unveiling Nodeless Unconventional Superconductivity Proximate to Honeycomb-Vacancy Ordering
Authors:
V. Sazgari,
Tianping Ying,
J. N. Graham,
C. Mielke III,
D. Das,
S. S. Islam,
M. Bartkowiak,
R. Khasanov,
H. Luetkens,
H. Hosono,
Z. Guguchia
Abstract:
Vacancies play a crucial role in solid-state physics, but their impact on materials with strong electron-electron correlations has been underexplored. A recent study on the Ir-Sb binary system, Ir$_{16}$Sb$_{18}$ revealed a novel extended buckled-honeycomb vacancy (BHV) order. Superconductivity is induced by suppressing the BHV ordering through high-pressure growth with excess Ir atoms or isovalen…
▽ More
Vacancies play a crucial role in solid-state physics, but their impact on materials with strong electron-electron correlations has been underexplored. A recent study on the Ir-Sb binary system, Ir$_{16}$Sb$_{18}$ revealed a novel extended buckled-honeycomb vacancy (BHV) order. Superconductivity is induced by suppressing the BHV ordering through high-pressure growth with excess Ir atoms or isovalent Rh substitution, although the nature of superconducting pairing has remained unexplored. Here, we conduct muon spin rotation experiments probing the temperature-dependence of the effective magnetic penetration depth $λ_{eff}\left(T\right)$ in Ir$_{1-δ}$Sb (synthesized at 5.5 GPa with $T_{\rm c}$ = 4.2 K) and ambient pressure synthesized optimally Rh-doped Ir$_{1-x}$Rh$_{x}$Sb ($x$=0.3, $T_{\rm c}$ = 2.7 K). The exponential temperature dependence of the superfluid density $n_{\rm s}$/m$^{*}$ at low temperatures indicates a fully gapped superconducting state in both samples. Notably, the ratio of $T_{\rm c}$ to the superfluid density is comparable to previously measured unconventional superconductors. A significant increase in $n_{\rm s}$/m$^{*}$ in the high-pressure synthesized sample correlates with $T_{\rm c}$, a hallmark feature of unconventional superconductivity. We further demonstrate a similar effect induced by chemical pressure (Rh substitution) and hydrostatic pressure in Ir$_{1-x}$Rh$_{x}$Sb, highlighting that the dome-shaped phase diagram is a fundamental feature of the material. These findings underscore the unconventional nature of the observed superconductivity, and classifies IrSb as the first unconventional superconducting parent phase with ordered vacancies.
△ Less
Submitted 9 February, 2024;
originally announced February 2024.
-
First (calibration) experiment using proton beam from FRENA at SINP
Authors:
C. Basu,
K. Banerjee,
T. K. Ghosh,
G. Mukherjee,
C. Bhattacharya,
Shraddha S Desai,
R. Shil,
A. K. Saha,
J. K. Meena,
T. Bar,
D. Basak,
L. K. Sahoo,
S. Saha,
C. Marick,
D. Das,
D. Das,
D. Das,
M. Kujur,
S. Roy,
S. S. Basu,
U. Gond,
A. Saha,
A. Das,
M. Samanta,
P. Saha
, et al. (1 additional authors not shown)
Abstract:
This work presents the first calibration experiment of a 3 MV Tandetron accelerator, FRENA, performed in May 2022. The $^7$Li(p,n) reaction threshold was measured to calibrate the terminal voltage measuring device. A LiF target of thickness 175 $μ$g/cm$^2$ was used in the experiment. The measured threshold was 1872$\pm$2.7 keV, indicating 6$-$10 keV energy shift.
This work presents the first calibration experiment of a 3 MV Tandetron accelerator, FRENA, performed in May 2022. The $^7$Li(p,n) reaction threshold was measured to calibrate the terminal voltage measuring device. A LiF target of thickness 175 $μ$g/cm$^2$ was used in the experiment. The measured threshold was 1872$\pm$2.7 keV, indicating 6$-$10 keV energy shift.
△ Less
Submitted 24 January, 2024;
originally announced February 2024.
-
Under the Surface: Tracking the Artifactuality of LLM-Generated Data
Authors:
Debarati Das,
Karin De Langis,
Anna Martin-Boyle,
Jaehyung Kim,
Minhwa Lee,
Zae Myung Kim,
Shirley Anugrah Hayati,
Risako Owan,
Bin Hu,
Ritik Parkar,
Ryan Koo,
Jonginn Park,
Aahan Tyagi,
Libby Ferland,
Sanjali Roy,
Vincent Liu,
Dongyeop Kang
Abstract:
This work delves into the expanding role of large language models (LLMs) in generating artificial data. LLMs are increasingly employed to create a variety of outputs, including annotations, preferences, instruction prompts, simulated dialogues, and free text. As these forms of LLM-generated data often intersect in their application, they exert mutual influence on each other and raise significant c…
▽ More
This work delves into the expanding role of large language models (LLMs) in generating artificial data. LLMs are increasingly employed to create a variety of outputs, including annotations, preferences, instruction prompts, simulated dialogues, and free text. As these forms of LLM-generated data often intersect in their application, they exert mutual influence on each other and raise significant concerns about the quality and diversity of the artificial data incorporated into training cycles, leading to an artificial data ecosystem. To the best of our knowledge, this is the first study to aggregate various types of LLM-generated text data, from more tightly constrained data like "task labels" to more lightly constrained "free-form text". We then stress test the quality and implications of LLM-generated artificial data, comparing it with human data across various existing benchmarks. Despite artificial data's capability to match human performance, this paper reveals significant hidden disparities, especially in complex tasks where LLMs often miss the nuanced understanding of intrinsic human-generated content. This study critically examines diverse LLM-generated data and emphasizes the need for ethical practices in data creation and when using LLMs. It highlights the LLMs' shortcomings in replicating human traits and behaviors, underscoring the importance of addressing biases and artifacts produced in LLM-generated content for future research and development. All data and code are available on our project page.
△ Less
Submitted 30 January, 2024; v1 submitted 26 January, 2024;
originally announced January 2024.
-
Tuning of Charge Order by Uniaxial Stress in a Cuprate Superconductor
Authors:
Laure Thomarat,
Frank Elson,
Elisabetta Nocerino,
Debarchan Das,
Oleh Ivashko,
Marek Bartkowiak,
Martin Månsson,
Yasmine Sassa,
Tadashi Adachi,
Martin v. Zimmermann,
Hubertus Luetkens,
Johan Chang,
Marc Janoschek,
Zurab Guguchia,
Gediminas Simutis
Abstract:
Strongly correlated electron materials are often characterized by competition and interplay of multiple quantum states. For example, in high-temperature cuprate superconductors unconventional superconductivity, spin- and charge-density wave orders coexist. A key question is whether competing states coexist on the atomic scale or if they segregate into distinct 'islands'. Using X-ray diffraction, w…
▽ More
Strongly correlated electron materials are often characterized by competition and interplay of multiple quantum states. For example, in high-temperature cuprate superconductors unconventional superconductivity, spin- and charge-density wave orders coexist. A key question is whether competing states coexist on the atomic scale or if they segregate into distinct 'islands'. Using X-ray diffraction, we investigate the competition between charge order and superconductivity in the archetypal cuprate La(2-x)BaxCuO4, around the x = 1/8-doping, where uniaxial stress restores optimal 3D superconductivity at approximately 0.06 GPa. We find that the charge order peaks and the correlation length along the stripe are strongly reduced up to the critical stress, above which they stay constant. Simultaneously, the charge order onset temperature only shows a modest decrease. Our findings suggest that optimal 3D superconductivity is not linked to the absence of charge stripes but instead requires their arrangement into smaller 'islands'. Our results provide insight into the length scales over which the interplay between superconductivity and charge order takes place.
△ Less
Submitted 24 January, 2024;
originally announced January 2024.
-
Ti4Ir2O a time-reversal-invariant fully gapped unconventional superconductor
Authors:
Debarchan Das,
KeYuan Ma,
Jan Jaroszynski,
Vahid Sazgari,
Tomasz Klimczuk,
Fabian O. von Rohr,
Zurab Guguchia
Abstract:
Here we report muon spin rotation (muSR) experiments on the temperature and field dependence of the effective magnetic penetration depth (lambda) in the eta-carbide-type suboxide Ti4Ir2O, a superconductor with an considerably high upper critical field. Temperature dependence of penetration depth, obtained from transverse-field (TF)-muSR measurements, is in perfect agreement with an isotropic fully…
▽ More
Here we report muon spin rotation (muSR) experiments on the temperature and field dependence of the effective magnetic penetration depth (lambda) in the eta-carbide-type suboxide Ti4Ir2O, a superconductor with an considerably high upper critical field. Temperature dependence of penetration depth, obtained from transverse-field (TF)-muSR measurements, is in perfect agreement with an isotropic fully gaped superconducting state. Furthermore, our ZF muSR results confirm that the time-reversal symmetry is preserved in the superconducting state. We find, however, a notably low ratio of 1.22 between the superconducting critical temperature and the superfluid density. This value is close to most unconventional superconductors, showing that a very small superfluid density is present in the superconducting state of Ti4Ir2O. The presented results will pave the way for further theoretical and experimental investigations to obtain a microscopic understanding of the origin of such a high upper critical field in an isotropic single gap superconducting system.
△ Less
Submitted 24 January, 2024;
originally announced January 2024.
-
The "Colonial Impulse" of Natural Language Processing: An Audit of Bengali Sentiment Analysis Tools and Their Identity-based Biases
Authors:
Dipto Das,
Shion Guha,
Jed Brubaker,
Bryan Semaan
Abstract:
While colonization has sociohistorically impacted people's identities across various dimensions, those colonial values and biases continue to be perpetuated by sociotechnical systems. One category of sociotechnical systems--sentiment analysis tools--can also perpetuate colonial values and bias, yet less attention has been paid to how such tools may be complicit in perpetuating coloniality, althoug…
▽ More
While colonization has sociohistorically impacted people's identities across various dimensions, those colonial values and biases continue to be perpetuated by sociotechnical systems. One category of sociotechnical systems--sentiment analysis tools--can also perpetuate colonial values and bias, yet less attention has been paid to how such tools may be complicit in perpetuating coloniality, although they are often used to guide various practices (e.g., content moderation). In this paper, we explore potential bias in sentiment analysis tools in the context of Bengali communities that have experienced and continue to experience the impacts of colonialism. Drawing on identity categories most impacted by colonialism amongst local Bengali communities, we focused our analytic attention on gender, religion, and nationality. We conducted an algorithmic audit of all sentiment analysis tools for Bengali, available on the Python package index (PyPI) and GitHub. Despite similar semantic content and structure, our analyses showed that in addition to inconsistencies in output from different tools, Bengali sentiment analysis tools exhibit bias between different identity categories and respond differently to different ways of identity expression. Connecting our findings with colonially shaped sociocultural structures of Bengali communities, we discuss the implications of downstream bias of sentiment analysis tools.
△ Less
Submitted 19 January, 2024;
originally announced January 2024.
-
CrisisKAN: Knowledge-infused and Explainable Multimodal Attention Network for Crisis Event Classification
Authors:
Shubham Gupta,
Nandini Saini,
Suman Kundu,
Debasis Das
Abstract:
Pervasive use of social media has become the emerging source for real-time information (like images, text, or both) to identify various events. Despite the rapid growth of image and text-based event classification, the state-of-the-art (SOTA) models find it challenging to bridge the semantic gap between features of image and text modalities due to inconsistent encoding. Also, the black-box nature…
▽ More
Pervasive use of social media has become the emerging source for real-time information (like images, text, or both) to identify various events. Despite the rapid growth of image and text-based event classification, the state-of-the-art (SOTA) models find it challenging to bridge the semantic gap between features of image and text modalities due to inconsistent encoding. Also, the black-box nature of models fails to explain the model's outcomes for building trust in high-stakes situations such as disasters, pandemic. Additionally, the word limit imposed on social media posts can potentially introduce bias towards specific events. To address these issues, we proposed CrisisKAN, a novel Knowledge-infused and Explainable Multimodal Attention Network that entails images and texts in conjunction with external knowledge from Wikipedia to classify crisis events. To enrich the context-specific understanding of textual information, we integrated Wikipedia knowledge using proposed wiki extraction algorithm. Along with this, a guided cross-attention module is implemented to fill the semantic gap in integrating visual and textual data. In order to ensure reliability, we employ a model-specific approach called Gradient-weighted Class Activation Mapping (Grad-CAM) that provides a robust explanation of the predictions of the proposed model. The comprehensive experiments conducted on the CrisisMMD dataset yield in-depth analysis across various crisis-specific tasks and settings. As a result, CrisisKAN outperforms existing SOTA methodologies and provides a novel view in the domain of explainable multimodal event classification.
△ Less
Submitted 11 January, 2024;
originally announced January 2024.
-
An operational approach to classifying measurement incompatibility
Authors:
Arun Kumar Das,
Saheli Mukherjee,
Debashis Saha,
Debarshi Das,
A. S. Majumdar
Abstract:
Measurement incompatibility has proved to be an important resource for information-processing tasks. In this work, we analyze various levels of incompatibility of measurement sets. We provide operational classification of measurement incompatibility with respect to two elementary classical operations, viz., coarse-graining of measurement outcomes and convex mixing of different measurements. We der…
▽ More
Measurement incompatibility has proved to be an important resource for information-processing tasks. In this work, we analyze various levels of incompatibility of measurement sets. We provide operational classification of measurement incompatibility with respect to two elementary classical operations, viz., coarse-graining of measurement outcomes and convex mixing of different measurements. We derive analytical criteria for determining when a set of projective measurements is fully incompatible with respect to coarse-graining or convex mixing. Robustness against white noise is investigated for mutually unbiased bases that can sustain full incompatibility. Furthermore, we propose operational witnesses for different levels of incompatibility subject to classical operations, using the input-output statistics of Bell-type experiments as well as experiments in the prepare-and-measure scenario.
△ Less
Submitted 2 January, 2024;
originally announced January 2024.
-
On the area swept by a biased diffusion till its first-exit time: Martingale approach and gambling opportunities
Authors:
Yonathan Sarmiento,
Debraj Das,
Édgar Roldán
Abstract:
Using martingale theory, we compute, in very few lines, exact analytical expressions for various first-exit-time statistics associated with one-dimensional biased diffusion. Examples include the distribution for the first-exit time from an interval, moments for the first-exit site, and functionals of the position, which involve memory and time integration. As a key example, we compute analytically…
▽ More
Using martingale theory, we compute, in very few lines, exact analytical expressions for various first-exit-time statistics associated with one-dimensional biased diffusion. Examples include the distribution for the first-exit time from an interval, moments for the first-exit site, and functionals of the position, which involve memory and time integration. As a key example, we compute analytically the mean area swept by a biased diffusion until it escapes an interval that may be asymmetric and have arbitrary length. The mean area allows us to derive the hitherto unexplored cross-correlation function between the first-exit time and the first-exit site, which vanishes only for exit problems from symmetric intervals. As a colophon, we explore connections of our results with gambling, showing that betting on the time-integrated value of a losing game it is possible to design a strategy that leads to a net average win.
△ Less
Submitted 10 May, 2024; v1 submitted 31 December, 2023;
originally announced January 2024.
-
Tensile strain induced brightening of momentum forbidden dark exciton in WS$_2$
Authors:
Tamaghna Chowdhury,
Sagnik Chatterjee,
Dibyasankar Das,
Ivan Timokhin,
Pablo Díaz Núñez,
Gokul M. A.,
Suman Chatterjee,
Kausik Majumdar,
Prasenjit Ghosh,
Artem Mishchenko,
Atikur Rahman
Abstract:
Transition-metal dichalcogenides (TMDs) host tightly bound quasi-particles called excitons. Based on spin and momentum selection rules, these excitons can be either optically bright or dark. In tungsten-based TMDs, momentum-forbidden dark exciton is the energy ground state and therefore it strongly affect the emission properties. In this work, we brighten the momentum forbidden dark exciton by pla…
▽ More
Transition-metal dichalcogenides (TMDs) host tightly bound quasi-particles called excitons. Based on spin and momentum selection rules, these excitons can be either optically bright or dark. In tungsten-based TMDs, momentum-forbidden dark exciton is the energy ground state and therefore it strongly affect the emission properties. In this work, we brighten the momentum forbidden dark exciton by placing WS$_2$ on top of nanotextured substrates which put the WS$_2$ layer under tensile strain, modifying electronic bandstructure. This enables phonon assisted scattering of exciton between momentum valleys, thereby brightening momentum forbidden dark excitons. Our results will pave the way to design ultrasensitive strain sensing devices based on TMDs.
△ Less
Submitted 26 December, 2023;
originally announced December 2023.
-
New physics interpretations for nonstandard values of $h\to Zγ$
Authors:
Rafael Boto,
Dipankar Das,
Jorge C. Romao,
Ipsita Saha,
Joao P. Silva
Abstract:
Current measurement of the $h\to Zγ$ signal strength invite us to speculate about possible new physics interactions that exclusively affect $μ_{Zγ}$ without altering the other signal strengths. Additional consideration of tree-unitarity enables us to correlate the nonstandard values of $μ_{Zγ}$ with an upper limit on the scale of new physics. We find that even when $μ_{Zγ}$ deviates from the SM va…
▽ More
Current measurement of the $h\to Zγ$ signal strength invite us to speculate about possible new physics interactions that exclusively affect $μ_{Zγ}$ without altering the other signal strengths. Additional consideration of tree-unitarity enables us to correlate the nonstandard values of $μ_{Zγ}$ with an upper limit on the scale of new physics. We find that even when $μ_{Zγ}$ deviates from the SM value by only $20\%$, the scale of new physics should be well within the reach of the LHC.
△ Less
Submitted 27 March, 2024; v1 submitted 20 December, 2023;
originally announced December 2023.
-
Gemini: A Family of Highly Capable Multimodal Models
Authors:
Gemini Team,
Rohan Anil,
Sebastian Borgeaud,
Jean-Baptiste Alayrac,
Jiahui Yu,
Radu Soricut,
Johan Schalkwyk,
Andrew M. Dai,
Anja Hauth,
Katie Millican,
David Silver,
Melvin Johnson,
Ioannis Antonoglou,
Julian Schrittwieser,
Amelia Glaese,
Jilin Chen,
Emily Pitler,
Timothy Lillicrap,
Angeliki Lazaridou,
Orhan Firat,
James Molloy,
Michael Isard,
Paul R. Barham,
Tom Hennigan,
Benjamin Lee
, et al. (1325 additional authors not shown)
Abstract:
This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…
▽ More
This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultra model advances the state of the art in 30 of 32 of these benchmarks - notably being the first model to achieve human-expert performance on the well-studied exam benchmark MMLU, and improving the state of the art in every one of the 20 multimodal benchmarks we examined. We believe that the new capabilities of the Gemini family in cross-modal reasoning and language understanding will enable a wide variety of use cases. We discuss our approach toward post-training and deploying Gemini models responsibly to users through services including Gemini, Gemini Advanced, Google AI Studio, and Cloud Vertex AI.
△ Less
Submitted 17 June, 2024; v1 submitted 18 December, 2023;
originally announced December 2023.
-
Chaotic and Thermal Aspects in the $| HES \rangle$ S-Matrix
Authors:
Diptarka Das,
Santanu Mandal,
Anurag Sarkar
Abstract:
We compute tree level scattering amplitudes involving more than one highly excited states and tachyons in bosonic string theory. We use these amplitudes to understand chaotic and thermal aspects of the excited string states lending support to the Susskind-Horowitz-Polchinski correspondence principle. The unaveraged amplitudes exhibit chaos in the resonance distribution as a function of kinematic p…
▽ More
We compute tree level scattering amplitudes involving more than one highly excited states and tachyons in bosonic string theory. We use these amplitudes to understand chaotic and thermal aspects of the excited string states lending support to the Susskind-Horowitz-Polchinski correspondence principle. The unaveraged amplitudes exhibit chaos in the resonance distribution as a function of kinematic parameters, which can be described by random matrix theory. Upon coarse-graining these amplitudes are shown to exponentiate, and capture various thermal features, including features of a stringy version of the eigenstate thermalization hypothesis as well as notions of typicality. Further, we compute the effective string form factor corresponding to the highly excited states, and argue for the random walk behaviour of the long strings.
△ Less
Submitted 18 December, 2023; v1 submitted 4 December, 2023;
originally announced December 2023.
-
Neural Parametric Gaussians for Monocular Non-Rigid Object Reconstruction
Authors:
Devikalyan Das,
Christopher Wewer,
Raza Yunus,
Eddy Ilg,
Jan Eric Lenssen
Abstract:
Reconstructing dynamic objects from monocular videos is a severely underconstrained and challenging problem, and recent work has approached it in various directions. However, owing to the ill-posed nature of this problem, there has been no solution that can provide consistent, high-quality novel views from camera positions that are significantly different from the training views. In this work, we…
▽ More
Reconstructing dynamic objects from monocular videos is a severely underconstrained and challenging problem, and recent work has approached it in various directions. However, owing to the ill-posed nature of this problem, there has been no solution that can provide consistent, high-quality novel views from camera positions that are significantly different from the training views. In this work, we introduce Neural Parametric Gaussians (NPGs) to take on this challenge by imposing a two-stage approach: first, we fit a low-rank neural deformation model, which then is used as regularization for non-rigid reconstruction in the second stage. The first stage learns the object's deformations such that it preserves consistency in novel views. The second stage obtains high reconstruction quality by optimizing 3D Gaussians that are driven by the coarse model. To this end, we introduce a local 3D Gaussian representation, where temporally shared Gaussians are anchored in and deformed by local oriented volumes. The resulting combined model can be rendered as radiance fields, resulting in high-quality photo-realistic reconstructions of the non-rigidly deforming objects. We demonstrate that NPGs achieve superior results compared to previous works, especially in challenging scenarios with few multi-view cues.
△ Less
Submitted 31 March, 2024; v1 submitted 2 December, 2023;
originally announced December 2023.
-
Friction of a driven chain: Role of momentum conservation, Goldstone and radiation modes
Authors:
Debankur Das,
Richard Vink,
Matthias Krüger
Abstract:
We analytically study friction and dissipation of a driven bead in a 1D harmonic chain, and analyze the role of internal damping mechanism as well as chain length. Specifically, we investigate Dissipative Particle Dynamics and Langevin Dynamics, as paradigmatic examples that do and do not display translational symmetry, with distinct results: For identical parameters, the friction forces can diffe…
▽ More
We analytically study friction and dissipation of a driven bead in a 1D harmonic chain, and analyze the role of internal damping mechanism as well as chain length. Specifically, we investigate Dissipative Particle Dynamics and Langevin Dynamics, as paradigmatic examples that do and do not display translational symmetry, with distinct results: For identical parameters, the friction forces can differ by many orders of magnitude. For slow driving, a Goldstone mode traverses the entire system, resulting in friction of the driven bead that grows arbitrarily large (Langevin) or gets arbitrarily small (Dissipative Particle Dynamics) with system size. For a long chain, the friction for DPD is shown to be bound, while it shows a singularity (i.e. can be arbitrarily large) for Langevin damping. For long underdamped chains, a radiation mode is recovered in either case, with friction independent of damping mechanism. For medium length chains, the chain shows the expected resonant behavior. At the resonance, friction is non-analytic in damping parameter $γ$, depending on it as $γ^{-1}$. Generally, no zero frequency bulk friction coefficient can be determined, as the limits of small frequency and infinite chain length do not commute, and we discuss the regimes where "simple" macroscopic friction occurs.
△ Less
Submitted 23 November, 2023;
originally announced November 2023.
-
Exactly Solvable Floquet Dynamics for Conformal Field Theories in Dimensions Greater than Two
Authors:
Diptarka Das,
Sumit R. Das,
Arnab Kundu,
Krishnendu Sengupta
Abstract:
We find classes of driven conformal field theories (CFT) in d + 1 dimensions with d > 1, whose quench and Floquet dynamics can be computed exactly. The setup is suitable for studying periodic drives, consisting of square pulse protocols for which Hamiltonian evolution takes place with different deformations of the original CFT Hamiltonian in successive time intervals. These deformations are realiz…
▽ More
We find classes of driven conformal field theories (CFT) in d + 1 dimensions with d > 1, whose quench and Floquet dynamics can be computed exactly. The setup is suitable for studying periodic drives, consisting of square pulse protocols for which Hamiltonian evolution takes place with different deformations of the original CFT Hamiltonian in successive time intervals. These deformations are realized by specific combinations of conformal generators with a deformation parameter $β$; the $β< 1$ ($β> 1$) Hamiltonians can be unitarily related to the standard (Luscher-Mack) CFT Hamiltonian. The resulting time evolution can be then calculated by conformal transformations. For $d\leq 3$ we show that the transformations can be obtained in a quaternion formalism. Evolution with such a single Hamiltonian yields qualitatively different time dependences of observables depending on the value of $β$, ranging from exponential decays characteristic of heating to oscillations and power law decays. This manifests in the behavior of the fidelity, unequal-time correlator, and the energy density at the end of a single cycle of a square pulse protocol with different hamiltonians in successive time intervals. When the Hamiltonians in a cycle involve generators of a single SU(1, 1) subalgebra we calculate the Floquet Hamiltonian. We show that one can get dynamical phase transitions by varying the time period of a cycle, where the system can go from a non-heating phase which is oscillatory as a function of the time period to a heating phase with an exponentially damped behavior. Our methods can be generalized to other discrete and continuous protocols. We also point out that our results are expected to hold for a broader class of QFTs that possesses an SL(2, C) symmetry with fields that transform as quasi-primaries under this. As an example, we briefly comment on celestial CFTs in this context.
△ Less
Submitted 29 May, 2024; v1 submitted 22 November, 2023;
originally announced November 2023.
-
Human perceptual decision making of nonequilibrium fluctuations
Authors:
Aybüke Durmaz,
Yonathan Sarmiento,
Gianfranco Fortunato,
Debraj Das,
Mathew Ernst Diamond,
Domenica Bueti,
Édgar Roldán
Abstract:
Perceptual decision-making frequently requires making rapid, reliable choices upon encountering noisy sensory inputs. To better define the statistical processes underlying perceptual decision-making, here we characterize the choices of human participants visualizing a system of nonequilibrium stationary physical dynamics and compare such choices to the performance of an optimal agent computing Wal…
▽ More
Perceptual decision-making frequently requires making rapid, reliable choices upon encountering noisy sensory inputs. To better define the statistical processes underlying perceptual decision-making, here we characterize the choices of human participants visualizing a system of nonequilibrium stationary physical dynamics and compare such choices to the performance of an optimal agent computing Wald's sequential probability ratio test (SPRT). Participants viewed movies of a particle endowed with drifted Brownian dynamics and had to judge the motion as leftward or rightward. Overall, the results uncovered fundamental performance limits, consistent with recently established thermodynamic trade-offs involving speed, accuracy, and dissipation. Specifically, decision times are sensitive to entropy production rates. Moreover, to achieve a given level of observed accuracy, participants require more time than predicted by SPRT, indicating suboptimal integration of available information. In view of such suboptimality, we develop an alternative account based on evidence integration with a memory time constant. Setting the time constant proportionately to the deviation from equilibrium in the stimuli significantly improved trial-by-trial predictions of decision metrics with respect to SPRT. This study shows that perceptual psychophysics using stimuli rooted in nonequilibrium physical processes provides a robust platform for understanding how the brain takes decisions on stochastic information inputs.
△ Less
Submitted 22 November, 2023; v1 submitted 21 November, 2023;
originally announced November 2023.