-
Diphoton signals for the Georgi-Machacek scenario at the Large Hadron Collider
Authors:
Satyaki Bhattacharya,
Rituparna Ghosh,
Biswarup Mukhopadhyaya
Abstract:
The diphoton channel for exploring the Georgi-Machacek (GM) scenario containing scalar triplets at the Large Hadron Collider (LHC) has been identified as germane, and subjected to a detailed study. The scalar spectrum of the model, which imposes a custodial SU(2) on the potential, gets classified into a 5-plet, a 3-plet and two singlets under the custodial symmetry. While most attempts to probe or…
▽ More
The diphoton channel for exploring the Georgi-Machacek (GM) scenario containing scalar triplets at the Large Hadron Collider (LHC) has been identified as germane, and subjected to a detailed study. The scalar spectrum of the model, which imposes a custodial SU(2) on the potential, gets classified into a 5-plet, a 3-plet and two singlets under the custodial symmetry. While most attempts to probe or constrain the scenario at the LHC depend largely on signals of charged scalars, we point out that the custodial SU(2) singlet state H can have a substantial branching ratio (amounting to a few per cent) into two photons. We carry out a detailed simulation of the resulting signal and the standard model backgrounds, obtaining the signal significance in different regions of the parameter space using the profile likelihood ratio method. Substantial regions of the GM parameter space is thus shown to be accessible to LHC studies, both at the high-luminosity run with $\int {\cal L} dt = 3000 fb^{-1}$, and also in Run-3 with $\int {\cal L} dt = 300 fb^{-1}$, even after folding in systematic errors.
△ Less
Submitted 11 July, 2024; v1 submitted 10 July, 2024;
originally announced July 2024.
-
Impact of Network Topology on Byzantine Resilience in Decentralized Federated Learning
Authors:
Siddhartha Bhattacharya,
Daniel Helo,
Joshua Siegel
Abstract:
Federated learning (FL) enables a collaborative environment for training machine learning models without sharing training data between users. This is typically achieved by aggregating model gradients on a central server. Decentralized federated learning is a rising paradigm that enables users to collaboratively train machine learning models in a peer-to-peer manner, without the need for a central…
▽ More
Federated learning (FL) enables a collaborative environment for training machine learning models without sharing training data between users. This is typically achieved by aggregating model gradients on a central server. Decentralized federated learning is a rising paradigm that enables users to collaboratively train machine learning models in a peer-to-peer manner, without the need for a central aggregation server. However, before applying decentralized FL in real-world use training environments, nodes that deviate from the FL process (Byzantine nodes) must be considered when selecting an aggregation function. Recent research has focused on Byzantine-robust aggregation for client-server or fully connected networks, but has not yet evaluated such aggregation schemes for complex topologies possible with decentralized FL. Thus, the need for empirical evidence of Byzantine robustness in differing network topologies is evident. This work investigates the effects of state-of-the-art Byzantine-robust aggregation methods in complex, large-scale network structures. We find that state-of-the-art Byzantine robust aggregation strategies are not resilient within large non-fully connected networks. As such, our findings point the field towards the development of topology-aware aggregation schemes, especially necessary within the context of large scale real-world deployment.
△ Less
Submitted 6 July, 2024;
originally announced July 2024.
-
Suppressed Electric Quadrupole Collectivity in $^{49}$Ti
Authors:
T. J. Gray,
J. M. Allmond,
C. Benetti,
C. Wibisono,
L. Baby,
A. Gargano,
T. Miyagi,
A. O. Macchiavelli,
A. E. Stuchbery,
J. L. Wood,
S. Ajayi,
J. Aragon,
B. W. Asher,
P. Barber,
S. Bhattacharya,
R. Boisseau,
J. M. Christie,
A. L. Conley,
P. De Rosa,
D. T. Dowling,
C. Esparza,
J. Gibbons,
K. Hanselman,
J. D. Holt,
S. Lopez-Caceres
, et al. (12 additional authors not shown)
Abstract:
Single-step Coulomb excitation of $^{46,48,49,50}$Ti is presented. A complete set of $E2$ matrix elements for the quintuplet of states in $^{49}$Ti, centered on the $2^+$ core excitation, was measured for the first time. A total of nine $E2$ matrix elements are reported, four of which were previously unknown. $^{49}_{22}$Ti$_{27}$ shows a $20\%$ quenching in electric quadrupole transition strength…
▽ More
Single-step Coulomb excitation of $^{46,48,49,50}$Ti is presented. A complete set of $E2$ matrix elements for the quintuplet of states in $^{49}$Ti, centered on the $2^+$ core excitation, was measured for the first time. A total of nine $E2$ matrix elements are reported, four of which were previously unknown. $^{49}_{22}$Ti$_{27}$ shows a $20\%$ quenching in electric quadrupole transition strength as compared to its semi-magic $^{50}_{22}$Ti$_{28}$ neighbour. This $20\%$ quenching, while empirically unprecedented, can be explained with a remarkably simple two-state mixing model, which is also consistent with other ground-state properties such as the magnetic dipole moment and electric quadrupole moment. A connection to nucleon transfer data and the quenching of single-particle strength is also demonstrated. The simplicity of the $^{49}$Ti-$^{50}$Ti pair (i.e., approximate single-$j$ $0f_{7/2}$ valence space and isolation of yrast states from non-yrast states) provides a unique opportunity to disentangle otherwise competing effects in the ground-state properties of atomic nuclei, the emergence of collectivity, and the role of proton-neutron interactions.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Data Efficient Evaluation of Large Language Models and Text-to-Image Models via Adaptive Sampling
Authors:
Cong Xu,
Gayathri Saranathan,
Mahammad Parwez Alam,
Arpit Shah,
James Lim,
Soon Yee Wong,
Foltin Martin,
Suparna Bhattacharya
Abstract:
Evaluating LLMs and text-to-image models is a computationally intensive task often overlooked. Efficient evaluation is crucial for understanding the diverse capabilities of these models and enabling comparisons across a growing number of new models and benchmarks. To address this, we introduce SubLIME, a data-efficient evaluation framework that employs adaptive sampling techniques, such as cluster…
▽ More
Evaluating LLMs and text-to-image models is a computationally intensive task often overlooked. Efficient evaluation is crucial for understanding the diverse capabilities of these models and enabling comparisons across a growing number of new models and benchmarks. To address this, we introduce SubLIME, a data-efficient evaluation framework that employs adaptive sampling techniques, such as clustering and quality-based methods, to create representative subsets of benchmarks. Our approach ensures statistically aligned model rankings compared to full datasets, evidenced by high Pearson correlation coefficients. Empirical analysis across six NLP benchmarks reveals that: (1) quality-based sampling consistently achieves strong correlations (0.85 to 0.95) with full datasets at a 10\% sampling rate such as Quality SE and Quality CPD (2) clustering methods excel in specific benchmarks such as MMLU (3) no single method universally outperforms others across all metrics. Extending this framework, we leverage the HEIM leaderboard to cover 25 text-to-image models on 17 different benchmarks. SubLIME dynamically selects the optimal technique for each benchmark, significantly reducing evaluation costs while preserving ranking integrity and score distribution. Notably, a minimal sampling rate of 1% proves effective for benchmarks like MMLU. Additionally, we demonstrate that employing difficulty-based sampling to target more challenging benchmark segments enhances model differentiation with broader score distributions. We also combine semantic search, tool use, and GPT-4 review to identify redundancy across benchmarks within specific LLM categories, such as coding benchmarks. This allows us to further reduce the number of samples needed to maintain targeted rank preservation. Overall, SubLIME offers a versatile and cost-effective solution for the robust evaluation of LLMs and text-to-image models.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
LHC EFT WG Note: SMEFT predictions, event reweighting, and simulation
Authors:
Alberto Belvedere,
Saptaparna Bhattacharya,
Giacomo Boldrini,
Suman Chatterjee,
Alessandro Calandri,
Sergio Sánchez Cruz,
Jennet Dickinson,
Franz J. Glessgen,
Reza Goldouzian,
Alexander Grohsjean,
Laurids Jeppe,
Charlotte Knight,
Olivier Mattelaer,
Kelci Mohrman,
Hannah Nelson,
Vasilije Perovic,
Matteo Presilla,
Robert Schöfbeck,
Nick Smith
Abstract:
This note gives an overview of the tools for predicting expectations in the Standard Model effective field theory (SMEFT) at the tree level and one loop available through event generators. Methods of event reweighting, the separate simulation of squared matrix elements, and the simulation of the full SMEFT process are compared in terms of statistical efficacy and potential biases.
This note gives an overview of the tools for predicting expectations in the Standard Model effective field theory (SMEFT) at the tree level and one loop available through event generators. Methods of event reweighting, the separate simulation of squared matrix elements, and the simulation of the full SMEFT process are compared in terms of statistical efficacy and potential biases.
△ Less
Submitted 28 June, 2024; v1 submitted 20 June, 2024;
originally announced June 2024.
-
Generalization error of min-norm interpolators in transfer learning
Authors:
Yanke Song,
Sohom Bhattacharya,
Pragya Sur
Abstract:
This paper establishes the generalization error of pooled min-$\ell_2$-norm interpolation in transfer learning where data from diverse distributions are available. Min-norm interpolators emerge naturally as implicit regularized limits of modern machine learning algorithms. Previous work characterized their out-of-distribution risk when samples from the test distribution are unavailable during trai…
▽ More
This paper establishes the generalization error of pooled min-$\ell_2$-norm interpolation in transfer learning where data from diverse distributions are available. Min-norm interpolators emerge naturally as implicit regularized limits of modern machine learning algorithms. Previous work characterized their out-of-distribution risk when samples from the test distribution are unavailable during training. However, in many applications, a limited amount of test data may be available during training, yet properties of min-norm interpolation in this setting are not well-understood. We address this gap by characterizing the bias and variance of pooled min-$\ell_2$-norm interpolation under covariate and model shifts. The pooled interpolator captures both early fusion and a form of intermediate fusion. Our results have several implications: under model shift, for low signal-to-noise ratio (SNR), adding data always hurts. For higher SNR, transfer learning helps as long as the shift-to-signal (SSR) ratio lies below a threshold that we characterize explicitly. By consistently estimating these ratios, we provide a data-driven method to determine: (i) when the pooled interpolator outperforms the target-based interpolator, and (ii) the optimal number of target samples that minimizes the generalization error. Under covariate shift, if the source sample size is small relative to the dimension, heterogeneity between between domains improves the risk, and vice versa. We establish a novel anisotropic local law to achieve these characterizations, which may be of independent interest in random matrix theory. We supplement our theoretical characterizations with comprehensive simulations that demonstrate the finite-sample efficacy of our results.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Lepton Collider as a window to Reheating
Authors:
Basabendu Barman,
Subhaditya Bhattacharya,
Sahabub Jahedi,
Dipankar Pradhan,
Abhik Sarkar
Abstract:
We propose a search strategy for MeV-scale feebly interacting massive particle (FIMP) dark matter (DM) at the $e^+e^-$ collider. We argue, detection of a mono-$γ$ signal plus missing energy can indicate to an MeV-scale reheating temperature of the Universe, after addressing observed DM abundance and other relevant constraints.
We propose a search strategy for MeV-scale feebly interacting massive particle (FIMP) dark matter (DM) at the $e^+e^-$ collider. We argue, detection of a mono-$γ$ signal plus missing energy can indicate to an MeV-scale reheating temperature of the Universe, after addressing observed DM abundance and other relevant constraints.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
Synergetic effect of edge states and point defects to tune ferromagnetism in CVD-grown vertical nanostructured MoS2: A correlation between electronic structure and theoretical study
Authors:
Sharmistha Dey,
Pankaj Srivastava,
Ankita Phutela,
Saswata Bhattacharya,
Fouran Singh,
Santanu Ghosh
Abstract:
Room-temperature ferromagnetism (RTFM) exhibited by nanostructured two-dimensional semiconductors for spintronics applications is a fascinating area of research. The present work reports on the correlation between the electronic structure and magnetic properties of defect-engineered nano-structured MoS2 thin films. Low-energy light and heavy-mass ion irradiation have been performed to create defec…
▽ More
Room-temperature ferromagnetism (RTFM) exhibited by nanostructured two-dimensional semiconductors for spintronics applications is a fascinating area of research. The present work reports on the correlation between the electronic structure and magnetic properties of defect-engineered nano-structured MoS2 thin films. Low-energy light and heavy-mass ion irradiation have been performed to create defects and tune magnetic properties. Vertical nanosheets with edge state termination in the pristine sample have been examined by field emission scanning electron microscopy (FESEM). Deterioration of vertical nanosheets is observed in low-energy Ar+ and Xe+ irradiated samples. An appreciably high magnetization value of 1.7 emu/g was observed for edge-oriented nanostructured pristine MoS2 thin films, which decreased after ion irradiation. From X-ray photoelectron spectroscopy (XPS) data, it is evident that, due to oxygen incorporation in the sulfur vacancy sites, Mo5+ and 6+ states increase after ion irradiation. The density functional theory (DFT) calculations suggest that the edge-oriented spins of the prismatic edges of the vertical nanosheets are primarily responsible for the high magnetic moment in the pristine film, and the edge degradation and reduction in sulfur vacancies by the incorporation of oxygen upon irradiation result in a decrease in the magnetic moment.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Measuring the magnetic dipole moment and magnetospheric fluctuations of accretion-powered pulsars in the Small Magellanic Cloud with an unscented Kalman filter
Authors:
Joseph O'Leary,
Andrew Melatos,
Tom Kimpson,
Nicholas J. O'Neill,
Patrick M. Meyers,
Dimitris M. Christodoulou,
Sayantan Bhattacharya,
Silas G. T. Laycock
Abstract:
Many accretion-powered pulsars rotate in magnetocentrifugal disequilibrium, spinning up or down secularly over multi-year intervals. The magnetic dipole moment $μ$ of such systems cannot be inferred uniquely from the time-averaged aperiodic X-ray flux $\langle L(t) \rangle$ and pulse period $\langle P(t) \rangle$, because the radiative efficiency of the accretion is unknown and degenerate with the…
▽ More
Many accretion-powered pulsars rotate in magnetocentrifugal disequilibrium, spinning up or down secularly over multi-year intervals. The magnetic dipole moment $μ$ of such systems cannot be inferred uniquely from the time-averaged aperiodic X-ray flux $\langle L(t) \rangle$ and pulse period $\langle P(t) \rangle$, because the radiative efficiency of the accretion is unknown and degenerate with the mass accretion rate. Here we circumvent the degeneracy by tracking the fluctuations in the unaveraged time series $L(t)$ and $P(t)$ using an unscented Kalman filter, whereupon $μ$ can be estimated uniquely, up to the uncertainties in the mass, radius and distance of the star. The analysis is performed on Rossi X-ray Timing Explorer observations for $24$ X-ray transients in the Small Magellanic Cloud, which have been monitored regularly for $\sim 16$ years. As well as independent estimates of $μ$, the analysis yields time-resolved histories of the mass accretion rate and the Maxwell stress at the disk-magnetosphere boundary for each star, and hence auto- and cross-correlations involving the latter two state variables. The inferred fluctuation statistics convey important information about the complex accretion physics at the disk-magnetosphere boundary.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Les Houches 2023: Physics at TeV Colliders: Standard Model Working Group Report
Authors:
J. Andersen,
B. Assi,
K. Asteriadis,
P. Azzurri,
G. Barone,
A. Behring,
A. Benecke,
S. Bhattacharya,
E. Bothmann,
S. Caletti,
X. Chen,
M. Chiesa,
A. Cooper-Sarkar,
T. Cridge,
A. Cueto Gomez,
S. Datta,
P. K. Dhani,
M. Donega,
T. Engel,
S. Ferrario Ravasio,
S. Forte,
P. Francavilla,
M. V. Garzelli,
A. Ghira,
A. Ghosh
, et al. (59 additional authors not shown)
Abstract:
This report presents a short summary of the activities of the "Standard Model" working group for the "Physics at TeV Colliders" workshop (Les Houches, France, 12-30 June, 2023).
This report presents a short summary of the activities of the "Standard Model" working group for the "Physics at TeV Colliders" workshop (Les Houches, France, 12-30 June, 2023).
△ Less
Submitted 2 June, 2024;
originally announced June 2024.
-
Learning from Litigation: Graphs and LLMs for Retrieval and Reasoning in eDiscovery
Authors:
Sounak Lahiri,
Sumit Pai,
Tim Weninger,
Sanmitra Bhattacharya
Abstract:
Electronic Discovery (eDiscovery) involves identifying relevant documents from a vast collection based on legal production requests. The integration of artificial intelligence (AI) and natural language processing (NLP) has transformed this process, helping document review and enhance efficiency and cost-effectiveness. Although traditional approaches like BM25 or fine-tuned pre-trained models are c…
▽ More
Electronic Discovery (eDiscovery) involves identifying relevant documents from a vast collection based on legal production requests. The integration of artificial intelligence (AI) and natural language processing (NLP) has transformed this process, helping document review and enhance efficiency and cost-effectiveness. Although traditional approaches like BM25 or fine-tuned pre-trained models are common in eDiscovery, they face performance, computational, and interpretability challenges. In contrast, Large Language Model (LLM)-based methods prioritize interpretability but sacrifice performance and throughput. This paper introduces DISCOvery Graph (DISCOG), a hybrid approach that combines the strengths of two worlds: a heterogeneous graph-based method for accurate document relevance prediction and subsequent LLM-driven approach for reasoning. Graph representational learning generates embeddings and predicts links, ranking the corpus for a given request, and the LLMs provide reasoning for document relevance. Our approach handles datasets with balanced and imbalanced distributions, outperforming baselines in F1-score, precision, and recall by an average of 12%, 3%, and 16%, respectively. In an enterprise context, our approach drastically reduces document review costs by 99.9% compared to manual processes and by 95% compared to LLM-based classification methods
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Understanding stellar populations in thin & thick discs of edge-on galaxies with MUSE -- I. The case of the reignited S0 galaxy ESO 544-27
Authors:
Devang Somawanshi,
Souradeep Bhattacharya,
Manish Kataria,
Chiaki Kobayashi
Abstract:
Edge-on galaxies act as the best laboratories to understand the origin of thin and thick discs in galaxies. Measurement of spatially resolved stellar population properties in such galaxies, particularly age, metallicity and [$α$/Fe], are crucial to understanding the formation and evolution of disc galaxies. Such measurements are made possible from stellar population model fits to deep integral fie…
▽ More
Edge-on galaxies act as the best laboratories to understand the origin of thin and thick discs in galaxies. Measurement of spatially resolved stellar population properties in such galaxies, particularly age, metallicity and [$α$/Fe], are crucial to understanding the formation and evolution of disc galaxies. Such measurements are made possible from stellar population model fits to deep integral field spectroscopic (IFU) observations of resolved galaxies. We utilise archival MUSE IFU observations of the edge-on galaxy ESO 544-27 to uncover the formation history of its thin and thick discs through its stellar populations. We find the thin disc of the galaxy is dominated by an old ($>9$ Gyr) low [$α$/Fe] metal-rich stellar population. Its outer thick disc is dominated by an old ($>9$ Gyr) high [$α$/Fe] metal-rich component that should have formed with higher star-formation efficiency than the Milky Way thick disc. We thus find [$α$/Fe] dichotomy in ESO 544-27 with its thin and thick discs dominated by low and high [$α$/Fe] stellar populations respectively. However, we also find a metal-rich younger ($<2$ Gyr old) stellar population in ESO 544-27. The galaxy was nearly quenched until its star-formation was reignited recently first in the outer and inner thick disc ($\sim$1 Gyr ago) and then in the thin disc ($\sim$600 Myr ago). We thus find that both the low [$α$/Fe] thin and high [$α$/Fe] thick discs of ESO 544-27 are inhabited primarily by similarly old metal-rich stellar populations, a contrast to that of other galaxies with known thin and thick disc stellar population properties.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
Cosmological constraints on curved quintessence
Authors:
Sukannya Bhattacharya,
Giulia Borghetto,
Ameek Malhotra,
Susha Parameswaran,
Gianmassimo Tasinato,
Ivonne Zavala
Abstract:
Dynamical dark energy has gained renewed interest due to recent theoretical and observational developments. In the present paper, we focus on a string-motivated dark energy set-up, and perform a detailed cosmological analysis of exponential quintessence with potential $V=V_0 e^{-λφ}$, allowing for non-zero spatial curvature. We first gain some physical intuition into the full evolution of such a s…
▽ More
Dynamical dark energy has gained renewed interest due to recent theoretical and observational developments. In the present paper, we focus on a string-motivated dark energy set-up, and perform a detailed cosmological analysis of exponential quintessence with potential $V=V_0 e^{-λφ}$, allowing for non-zero spatial curvature. We first gain some physical intuition into the full evolution of such a scenario by analysing the corresponding dynamical system. Then, we test the model using a combination of Planck CMB data, DESI BAO data, as well as recent supernovae datasets. For the model parameter $λ$, we obtain a preference for nonzero values: $λ= 0.48^{+0.28}_{-0.21},\; 0.68^{+0.31}_{-0.20},\; 0.77^{+0.18}_{-0.15}$ at 68% C.L. when combining CMB+DESI with Pantheon+, Union3 and DES-Y5 supernovae datasets respectively. We find no significant hint for spatial curvature. We discuss the implications of current cosmological results for the exponential quintessence model, and more generally for dark energy in string theory.
△ Less
Submitted 30 May, 2024; v1 submitted 27 May, 2024;
originally announced May 2024.
-
Faster $(Δ+ 1)$-Edge Coloring: Breaking the $m \sqrt{n}$ Time Barrier
Authors:
Sayan Bhattacharya,
Din Carmon,
Martín Costa,
Shay Solomon,
Tianyi Zhang
Abstract:
Vizing's theorem states that any $n$-vertex $m$-edge graph of maximum degree $Δ$ can be {\em edge colored} using at most $Δ+ 1$ different colors [Diskret.~Analiz, '64]. Vizing's original proof is algorithmic and shows that such an edge coloring can be found in $\tilde{O}(mn)$ time. This was subsequently improved to $\tilde O(m\sqrt{n})$, independently by Arjomandi [1982] and by Gabow et al.~[1985]…
▽ More
Vizing's theorem states that any $n$-vertex $m$-edge graph of maximum degree $Δ$ can be {\em edge colored} using at most $Δ+ 1$ different colors [Diskret.~Analiz, '64]. Vizing's original proof is algorithmic and shows that such an edge coloring can be found in $\tilde{O}(mn)$ time. This was subsequently improved to $\tilde O(m\sqrt{n})$, independently by Arjomandi [1982] and by Gabow et al.~[1985].
In this paper we present an algorithm that computes such an edge coloring in $\tilde O(mn^{1/3})$ time, giving the first polynomial improvement for this fundamental problem in over 40 years.
△ Less
Submitted 24 May, 2024;
originally announced May 2024.
-
Assessment of the Role and Origin of S* in Orange Carotenoid Protein Photoconversion
Authors:
James P. Pidgeon,
George A. Sutherland,
Matthew S. Proctor,
Shuangqing Wang,
Dimitri Chekulaev,
Sayantan Bhattacharya,
Rahul Jayaprakash,
Andrew Hitchcock,
Ravi Kumar Venkatraman,
Matthew P. Johnson,
C. Neil Hunter,
Jenny Clark
Abstract:
The orange carotenoid protein (OCP) is the water-soluble mediator of non-photochemical quenching in cyanobacteria, a crucial photoprotective mechanism in response to excess illumination. OCP converts from a globular, inactive state (OCPo) to an extended, active conformation (OCPr) under high-light conditions, resulting in a concomitant redshift in the absorption of the bound carotenoid. Here, OCP…
▽ More
The orange carotenoid protein (OCP) is the water-soluble mediator of non-photochemical quenching in cyanobacteria, a crucial photoprotective mechanism in response to excess illumination. OCP converts from a globular, inactive state (OCPo) to an extended, active conformation (OCPr) under high-light conditions, resulting in a concomitant redshift in the absorption of the bound carotenoid. Here, OCP was trapped in either the active or inactive state by fixing each protein conformation in trehalose-sucrose glass. Glass-encapsulated OCPo did not convert under intense illumination and OCPr did not convert in darkness, allowing the optical properties of each conformation to be determined at room temperature. We measured pump wavelength-dependent transient absorption of OCPo in glass films and found that initial OCP photoproducts are still formed, despite the glass preventing completion of the photocycle. By comparison to the pump wavelength dependence of the OCPo to OCPr photoconversion yield in buffer, we show that the long-lived carotenoid singlet-like feature (S*) is associated with ground-state heterogeneity within OCPo, rather than triggering OCP photoconversion.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
Some Classes of Interacting Two-Fluid Model of the Expanding Universe
Authors:
Subhra Bhattacharya
Abstract:
We consider interacting dark matter-dark energy models arising out of a general interaction term $Q=f(ρ_{m},ρ_{d},\dotρ_{m},\dotρ_{d}).$ Here $f$ is a functional relation connecting the energy densities $ρ_{m}$ and $ρ_{d}$ and their derivatives w.r.t. time $t.$ In our model we consider two interacting barotropic fluid with constant equation of state $ω_{m}$ and $ω_{d}.$ By considering a dynamical…
▽ More
We consider interacting dark matter-dark energy models arising out of a general interaction term $Q=f(ρ_{m},ρ_{d},\dotρ_{m},\dotρ_{d}).$ Here $f$ is a functional relation connecting the energy densities $ρ_{m}$ and $ρ_{d}$ and their derivatives w.r.t. time $t.$ In our model we consider two interacting barotropic fluid with constant equation of state $ω_{m}$ and $ω_{d}.$ By considering a dynamical interaction between them we trace out the cosmological evolution dynamics of the universe. We analytically solve the model by considering a constant ratio between the two fluids and then track the corresponding analytical results using observational data from the baryon acoustic oscillation measurements, Type Ia supernovae measurements and the local Hubble constant measurements. From this general setting we introduce three different models and nine different interaction function. Our final aim is to set up a comparative analysis of the various class of models under the different interaction function using common theoretical and numerical analysis.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Phonon Assisted Exciton Processes in Two-Dimensional Tungsten Monocarbide
Authors:
Rishabh Saraswat,
Miroslav Kolos,
Rekha Verma,
František Karlický,
Sitangshu Bhattacharya
Abstract:
n this study, we utilize a rigorous ab initio-based finite momentum Bethe-Salpeter equation to investigate the photoluminescence emission in two-dimensional hexagonal tungsten carbide (h-WC). This thermodynamically stable monolayer exhibits an indirect optical gap, resulting in phonon-assisted emission. We observe that light absorption is a direct process centered around the direct quasiparticle g…
▽ More
n this study, we utilize a rigorous ab initio-based finite momentum Bethe-Salpeter equation to investigate the photoluminescence emission in two-dimensional hexagonal tungsten carbide (h-WC). This thermodynamically stable monolayer exhibits an indirect optical gap, resulting in phonon-assisted emission. We observe that light absorption is a direct process centered around the direct quasiparticle gap, while light emission is indirect and requires modes between $Γ$-$M$ in the phonon dispersion. The emission lines feature prominent phonon replicas at cryogenic temperatures, particularly near-infrared wavelengths (1.09 and 1.17 eV), and we observe exciton thermalization with the crystal beyond 25 K. Additionally, non-radiative recombination is a remarkably fast process, occurring at order of a few femtoseconds (4.8 fs at 0 K and 2.8 fs at 300 K) compared to radiative recombination (2.3 ps at 0 K and 214 ns at 300 K). These optical characteristics of 2D h-WC may facilitate the promise of photon-emitter devices for near-infrared signal communication.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Generalized parton distributions from the pseudo-distribution approach on the lattice
Authors:
Shohini Bhattacharya,
Krzysztof Cichy,
Martha Constantinou,
Andreas Metz,
Niilo Nurminen,
Fernanda Steffens
Abstract:
Generalized parton distributions (GPDs) are key quantities for the description of a hadron's three-dimensional structure. They are the current focus of all areas of hadronic physics -- phenomenological, experimental, and theoretical, including lattice QCD. Synergies between these areas are desirable and essential to achieve precise quantification and understanding of the structure of, particularly…
▽ More
Generalized parton distributions (GPDs) are key quantities for the description of a hadron's three-dimensional structure. They are the current focus of all areas of hadronic physics -- phenomenological, experimental, and theoretical, including lattice QCD. Synergies between these areas are desirable and essential to achieve precise quantification and understanding of the structure of, particularly nucleons, as the basic ingredients of matter. In this paper, we investigate, for the first time, the numerical implementation of the pseudo-distribution approach for the extraction of zero-skewness GPDs for unpolarized quarks. Pseudo-distributions are Euclidean parton correlators computable in lattice QCD that can be perturbatively matched to the light-cone parton distributions of interest. Being closely related to the quasi-distributions and coming from the same lattice-extracted matrix elements, they are, however, subject to different systematic effects. We use the data previously utilized for quasi-GPDs and extend it with other momentum transfers and nucleon boosts, in particular a higher one ($P_3=1.67$ GeV) with eight-fold larger statistics than the largest one used for quasi-distributions ($P_3=1.25$ GeV). We renormalize the matrix elements with a ratio scheme and match the resulting Ioffe time distributions to the light cone in coordinate space. The matched distributions are then used to reconstruct the $x$-dependence with a fitting ansatz.We investigate some systematic effects related to this procedure, and we also compare the results with the ones obtained in the framework of quasi-GPDs. Our final results involve the invariant four-momentum transfer squared ($-t$) dependence of the flavor non-singlet ($u-d$) $H$ and $E$ GPDs.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Detectability of axisymmetric magnetic fields from the core to the surface of oscillating post-main sequence stars
Authors:
Shatanik Bhattacharya,
Srijan Bharati Das,
Lisa Bugnet,
Subrata Panda,
Shravan M. Hanasoge
Abstract:
Magnetic fields in the stellar interiors are key candidates to explain observed core rotation rates inside solar-like stars along their evolution. Recently, asteroseismic estimates of radial magnetic field amplitudes near the hydrogen-burning shell (H-shell) inside about 24 red-giants (RGs) have been obtained by measuring frequency splittings from their power spectra. Using general Lorentz-stress…
▽ More
Magnetic fields in the stellar interiors are key candidates to explain observed core rotation rates inside solar-like stars along their evolution. Recently, asteroseismic estimates of radial magnetic field amplitudes near the hydrogen-burning shell (H-shell) inside about 24 red-giants (RGs) have been obtained by measuring frequency splittings from their power spectra. Using general Lorentz-stress (magnetic) kernels, we investigated the potential for detectability of near-surface magnetism in a 1.3 $M_{\odot}$ star of super-solar metallicity as it evolves from a mid sub-giant to a late sub-giant into an RG. Based on these sensitivity kernels, we decompose an RG into three zones - deep core, H-shell, and near-surface. The sub-giants instead required decomposition into an inner core, an outer core, and a near-surface layer. Additionally, we find that for a low-frequency g-dominated dipolar mode in the presence of a typical stable magnetic field, ~25% of the frequency shift comes from the H-shell and the remaining from deeper layers. The ratio of the subsurface tangential field to the radial field in H-burning shell decides if subsurface fields may be potentially detectable. For p-dominated dipole modes close to $ν_\rm{max}$, this ratio is around two orders of magnitude smaller in subgiant phases than the corresponding RG. Further, with the availability of magnetic kernels, we propose lower limits of field strengths in crucial layers in our stellar model during its evolutionary phases. The theoretical prescription outlined here provides the first formal way to devise inverse problems for stellar magnetism and can be seamlessly employed for slow rotators.
△ Less
Submitted 26 April, 2024;
originally announced April 2024.
-
Theoretical Insights into Inorganic Antiperovskite Nitrides (X$_3$NA; X = Mg, Sr, Ca, Ba; A = Sb, As): An Emerging Class of Materials for Photovoltaics
Authors:
Sanchi Monga,
Manjari Jain,
Claudia Draxl,
Saswata Bhattacharya
Abstract:
Antiperovskite nitrides are potential candidates for applications harvesting solar light. With a comprehensive state-of-the-art approach combining hybrid density-functional theory, many-body perturbation theory, the Wannier-Mott model, density-functional perturbation theory, and the Feynman polaron model, we explore excitonic and polaronic effects in X$_3$NA (X: Mg, Ca, Sr, Ba, A = Sb, As). For al…
▽ More
Antiperovskite nitrides are potential candidates for applications harvesting solar light. With a comprehensive state-of-the-art approach combining hybrid density-functional theory, many-body perturbation theory, the Wannier-Mott model, density-functional perturbation theory, and the Feynman polaron model, we explore excitonic and polaronic effects in X$_3$NA (X: Mg, Ca, Sr, Ba, A = Sb, As). For all of them, we uncover a significant influence of the ionic dielectric screening on the static dielectric constant. Small exciton binding energies, weak electron-phonon coupling, and high charge-carrier mobilities facilitate enhanced charge transport in Mg$_3$NSb, Sr$_3$NSb, and Ba$_3$NSb. Our results highlight the potential of these nitrides as optimal candidates for efficient photovoltaic absorbers.
△ Less
Submitted 25 April, 2024;
originally announced April 2024.
-
The Geometry of the Set of Equivalent Linear Neural Networks
Authors:
Jonathan Richard Shewchuk,
Sagnik Bhattacharya
Abstract:
We characterize the geometry and topology of the set of all weight vectors for which a linear neural network computes the same linear transformation $W$. This set of weight vectors is called the fiber of $W$ (under the matrix multiplication map), and it is embedded in the Euclidean weight space of all possible weight vectors. The fiber is an algebraic variety that is not necessarily a manifold. We…
▽ More
We characterize the geometry and topology of the set of all weight vectors for which a linear neural network computes the same linear transformation $W$. This set of weight vectors is called the fiber of $W$ (under the matrix multiplication map), and it is embedded in the Euclidean weight space of all possible weight vectors. The fiber is an algebraic variety that is not necessarily a manifold. We describe a natural way to stratify the fiber--that is, to partition the algebraic variety into a finite set of manifolds of varying dimensions called strata. We call this set of strata the rank stratification. We derive the dimensions of these strata and the relationships by which they adjoin each other. Although the strata are disjoint, their closures are not. Our strata satisfy the frontier condition: if a stratum intersects the closure of another stratum, then the former stratum is a subset of the closure of the latter stratum. Each stratum is a manifold of class $C^\infty$ embedded in weight space, so it has a well-defined tangent space and normal space at every point (weight vector). We show how to determine the subspaces tangent to and normal to a specified stratum at a specified point on the stratum, and we construct elegant bases for those subspaces.
To help achieve these goals, we first derive what we call a Fundamental Theorem of Linear Neural Networks, analogous to what Strang calls the Fundamental Theorem of Linear Algebra. We show how to decompose each layer of a linear neural network into a set of subspaces that show how information flows through the neural network. Each stratum of the fiber represents a different pattern by which information flows (or fails to flow) through the neural network. The topology of a stratum depends solely on this decomposition. So does its geometry, up to a linear transformation in weight space.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
Understanding the role of FFNs in driving multilingual behaviour in LLMs
Authors:
Sunit Bhattacharya,
Ondřej Bojar
Abstract:
Multilingualism in Large Language Models (LLMs) is an yet under-explored area. In this paper, we conduct an in-depth analysis of the multilingual capabilities of a family of a Large Language Model, examining its architecture, activation patterns, and processing mechanisms across languages. We introduce novel metrics to probe the model's multilingual behaviour at different layers and shed light on…
▽ More
Multilingualism in Large Language Models (LLMs) is an yet under-explored area. In this paper, we conduct an in-depth analysis of the multilingual capabilities of a family of a Large Language Model, examining its architecture, activation patterns, and processing mechanisms across languages. We introduce novel metrics to probe the model's multilingual behaviour at different layers and shed light on the impact of architectural choices on multilingual processing.
Our findings reveal different patterns of multilinugal processing in the sublayers of Feed-Forward Networks of the models. Furthermore, we uncover the phenomenon of "over-layerization" in certain model configurations, where increasing layer depth without corresponding adjustments to other parameters may degrade model performance. Through comparisons within and across languages, we demonstrate the interplay between model architecture, layer depth, and multilingual processing capabilities of LLMs trained on multiple languages.
△ Less
Submitted 21 April, 2024;
originally announced April 2024.
-
Entanglement generation between two comoving Unruh-DeWitt detectors in the cosmological de Sitter spacetime
Authors:
Sourav Bhattacharya,
Shagun Kaushal
Abstract:
We investigate the entanglement generation or harvesting between two identical Unruh-DeWitt detectors in the cosmological de Sitter spacetime. We consider two comoving two-level detectors at a coincident spatial position. The detectors are assumed to be unentangled initially. The detectors are individually coupled to a scalar field, which eventually leads to coupling between the two detectors. We…
▽ More
We investigate the entanglement generation or harvesting between two identical Unruh-DeWitt detectors in the cosmological de Sitter spacetime. We consider two comoving two-level detectors at a coincident spatial position. The detectors are assumed to be unentangled initially. The detectors are individually coupled to a scalar field, which eventually leads to coupling between the two detectors. We consider two kinds of scalar fields -- conformally symmetric and massless minimally coupled, for both real and complex cases. By tracing out the degrees of freedom corresponding to the scalar field, we construct the reduced density matrix for the two detectors, whose eigenvalues characterise transitions between the energy levels of the detectors. By using the existing results for the detector response functions per unit proper time for these fields, we next compute the logarithmic negativity, quantifying the degree of entanglement generated at late times between the two detectors. The similarities and differences of these results for different kind of scalar fields have been discussed.
△ Less
Submitted 20 April, 2024; v1 submitted 18 April, 2024;
originally announced April 2024.
-
Simba: Mamba augmented U-ShiftGCN for Skeletal Action Recognition in Videos
Authors:
Soumyabrata Chaudhuri,
Saumik Bhattacharya
Abstract:
Skeleton Action Recognition (SAR) involves identifying human actions using skeletal joint coordinates and their interconnections. While plain Transformers have been attempted for this task, they still fall short compared to the current leading methods, which are rooted in Graph Convolutional Networks (GCNs) due to the absence of structural priors. Recently, a novel selective state space model, Mam…
▽ More
Skeleton Action Recognition (SAR) involves identifying human actions using skeletal joint coordinates and their interconnections. While plain Transformers have been attempted for this task, they still fall short compared to the current leading methods, which are rooted in Graph Convolutional Networks (GCNs) due to the absence of structural priors. Recently, a novel selective state space model, Mamba, has surfaced as a compelling alternative to the attention mechanism in Transformers, offering efficient modeling of long sequences. In this work, to the utmost extent of our awareness, we present the first SAR framework incorporating Mamba. Each fundamental block of our model adopts a novel U-ShiftGCN architecture with Mamba as its core component. The encoder segment of the U-ShiftGCN is devised to extract spatial features from the skeletal data using downsampling vanilla Shift S-GCN blocks. These spatial features then undergo intermediate temporal modeling facilitated by the Mamba block before progressing to the encoder section, which comprises vanilla upsampling Shift S-GCN blocks. Additionally, a Shift T-GCN (ShiftTCN) temporal modeling unit is employed before the exit of each fundamental block to refine temporal representations. This particular integration of downsampling spatial, intermediate temporal, upsampling spatial, and ultimate temporal subunits yields promising results for skeleton action recognition. We dub the resulting model \textbf{Simba}, which attains state-of-the-art performance across three well-known benchmark skeleton action recognition datasets: NTU RGB+D, NTU RGB+D 120, and Northwestern-UCLA. Interestingly, U-ShiftGCN (Simba without Intermediate Mamba Block) by itself is capable of performing reasonably well and surpasses our baseline.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
Exploring orbital angular momentum and spin-orbit correlation for gluons at the Electron-Ion Collider
Authors:
Shohini Bhattacharya,
Renaud Boussarie,
Yoshitaka Hatta
Abstract:
In our previous work [Phys. Rev. Lett. 128, 182002 (2022)], we introduced a pioneering observable aimed at experimentally detecting the orbital angular momentum (OAM) of gluons. Our focus was on the longitudinal double spin asymmetry observed in exclusive dijet production during electron-proton scattering. We demonstrated the sensitivity of the $\cos φ$ angular correlation between the scattered el…
▽ More
In our previous work [Phys. Rev. Lett. 128, 182002 (2022)], we introduced a pioneering observable aimed at experimentally detecting the orbital angular momentum (OAM) of gluons. Our focus was on the longitudinal double spin asymmetry observed in exclusive dijet production during electron-proton scattering. We demonstrated the sensitivity of the $\cos φ$ angular correlation between the scattered electron and proton as a probe for gluon OAM at small-$x$ and its intricate interplay with gluon helicity. This current work provides a comprehensive exposition, diving further into the aforementioned calculation with added elaboration and in-depth analysis. We reveal that, in addition to the gluon OAM, one also gains access to the spin-orbit correlation of gluons. We supplement our work with a detailed numerical analysis of our observables for the kinematics of the Electron-Ion Collider. In addition to dijet production, we also consider the recently proposed semi-inclusive diffractive deep inelastic scattering process which potentially offers experimental advantages over dijet measurements. Finally, we investigate quark-channel contributions to these processes and find an unexpected breakdown of collinear factorization.
△ Less
Submitted 5 April, 2024;
originally announced April 2024.
-
Spin-orbit entanglement in the Color Glass Condensate
Authors:
Shohini Bhattacharya,
Renaud Boussarie,
Yoshitaka Hatta
Abstract:
We compute the spin-orbit correlations of quarks and gluons at small-$x$ and show that the helicity and the orbital angular momentum of individual partons are strongly anti-aligned even in unpolarized or spinless hadrons and nuclei. Combined with the fact that gluons in the Color Glass Condensate are linearly polarized, our finding indicates that the helicity and the orbital angular momentum of si…
▽ More
We compute the spin-orbit correlations of quarks and gluons at small-$x$ and show that the helicity and the orbital angular momentum of individual partons are strongly anti-aligned even in unpolarized or spinless hadrons and nuclei. Combined with the fact that gluons in the Color Glass Condensate are linearly polarized, our finding indicates that the helicity and the orbital angular momentum of single gluons are maximally entangled in a quantum mechanical sense.
△ Less
Submitted 5 April, 2024;
originally announced April 2024.
-
Excitons, Optical Spectra, and Electronic Properties of Semiconducting Hf-based MXenes
Authors:
Nilesh Kumar,
Miroslav Kolos,
Sitangshu Bhattacharya,
František Karlický
Abstract:
Semiconducting MXenes are an intriguing two-dimensional (2D) material class with promising electronic and optoelectronic properties. Here, we focused on recently prepared Hf-based MXenes, namely Hf$_3$C$_2$O$_2$ and Hf$_2$CO$_2$. Using the first-principles calculation and excited state corrections, we proved its dynamical stability, reconciled its semiconducting behavior, and obtained fundamental…
▽ More
Semiconducting MXenes are an intriguing two-dimensional (2D) material class with promising electronic and optoelectronic properties. Here, we focused on recently prepared Hf-based MXenes, namely Hf$_3$C$_2$O$_2$ and Hf$_2$CO$_2$. Using the first-principles calculation and excited state corrections, we proved its dynamical stability, reconciled its semiconducting behavior, and obtained fundamental gaps by the many-body GW method (indirect 1.1 eV and 2.2 eV, respectively, direct 1.4 eV and 3.5 eV, respectively). Using the Bethe-Salpeter equation (BSE) we subsequently provided optical gaps (0.9 eV and 2.7eV, respectively), exciton binding energies, absorption spectra, and other properties of excitons in both Hf-based MXenes. The indirect character of both 2D materials further allowed a significant decrease of excitation energies by considering indirect excitons with exciton momentum along the $Γ$-M path in the Brillouin zone. The first bright excitons are strongly delocalized in real space while contributed by only a limited number of electron-hole pairs around the M point in the k-space from the valence and conduction band. A diverse range of excitonic states in Hf$_3$C$_2$O$_2$ MXene lead to a 4\% and 13\% absorptance for the first and second peaks in the infrared region of absorption spectra, respectively. In contrast, a prominent 28\% absorptance peak in the visible region appears in Hf$_2$CO$_2$ MXene. Results from radiative lifetime calculations indicate the promising potential of these materials in optoelectric devices requiring sustained and efficient exciton behavior.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
A proof of Sylvester's theorem
Authors:
Saptak Bhattacharya
Abstract:
We give a new elementary proof of existence and uniqueness of a solution to the Sylvester equation $AX-XB=Y$
We give a new elementary proof of existence and uniqueness of a solution to the Sylvester equation $AX-XB=Y$
△ Less
Submitted 29 February, 2024;
originally announced March 2024.
-
A new invariant for a cycle of an interval map
Authors:
Sourav Bhattacharya
Abstract:
We \emph{propose} a new \emph{invariant} for a \emph{cycle} of an \emph{interval map} $f:[0,1] \to [0,1]$, called its \emph{unfolding number}.
We \emph{propose} a new \emph{invariant} for a \emph{cycle} of an \emph{interval map} $f:[0,1] \to [0,1]$, called its \emph{unfolding number}.
△ Less
Submitted 2 June, 2024; v1 submitted 25 March, 2024;
originally announced March 2024.
-
Continuous Gravitational Waves: A New Window to Look for Heavy Non-annihilating Dark Matter
Authors:
Sulagna Bhattacharya,
Andrew L. Miller,
Anupam Ray
Abstract:
Sun-like stars can transmute into comparable mass black holes by steadily accumulating heavy non-annihilating dark matter particles over the course of their lives. If such stars form in binary systems, they could give rise to quasi-monochromatic, persistent gravitational waves, commonly known as continuous gravitational waves, as they inspiral towards one another. We demonstrate that next-generati…
▽ More
Sun-like stars can transmute into comparable mass black holes by steadily accumulating heavy non-annihilating dark matter particles over the course of their lives. If such stars form in binary systems, they could give rise to quasi-monochromatic, persistent gravitational waves, commonly known as continuous gravitational waves, as they inspiral towards one another. We demonstrate that next-generation space-based detectors, e.g., Laser Interferometer Space Antenna (LISA) and Big Bang Observer (BBO), can provide novel constraints on dark matter parameters (dark matter mass and its interaction cross-section with the nucleons) by probing gravitational waves from transmuted Sun-like stars that are in close binaries. Our projected constraints depend on several astrophysical uncertainties, nevertheless, are competitive with the existing constraints obtained from cosmological measurements as well as terrestrial direct searches, demonstrating a notable science-case for these space-based gravitational wave detectors as probes of particle dark matter.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
Intervalence Plasmons in Boron-Doped Diamond
Authors:
Souvik Bhattacharya,
Jonathan Boyd,
Sven Reichardt,
Amir Hossein Talebi,
Nicolò Maccaferri,
Olga Shenderova,
Ludger Wirtz,
Giuseppe Strangi,
R. Mohan Sankaran
Abstract:
Doped semiconductors are capable of exhibiting metallic-like properties ranging from superconductivity to tunable localized surface plasmon resonances. Diamond is a wide-bandgap semiconductor that is rendered electronically active by incorporating a hole dopant, boron. While the effects of boron doping on the electronic band structure of diamond are well-studied, any link between charge carriers a…
▽ More
Doped semiconductors are capable of exhibiting metallic-like properties ranging from superconductivity to tunable localized surface plasmon resonances. Diamond is a wide-bandgap semiconductor that is rendered electronically active by incorporating a hole dopant, boron. While the effects of boron doping on the electronic band structure of diamond are well-studied, any link between charge carriers and plasmons, which could facilitate optical applications, has never been shown. Here, we report intervalence plasmons in boron-doped diamond, defined as collective electronic excitations between the valence subbands, opened up by the presence of holes. Evidence for these low energy excitations is provided by scanning transmission electron microscope-valence electron energy loss spectroscopy and photoinduced force infrared spectroscopy. The measured loss and absorbance spectra are subsequently reproduced by first-principles calculations based on the contribution of intervalence band transitions to the dielectric function. Remarkably, the calculations also reveal that the real part of the dielectric function exhibits a resonance characteristic of metallicity (narrow-banded negative values of the dielectric function). The energy of the zero-crossing and the position of the loss peak are found to coincide, and both increase with the carrier density. Our results provide insight into a new mechanism for inducing plasmon-like behavior in doped semiconductors from intervalence band transitions, and the possibility of attaining such properties in diamond, a key emerging material for biomedical and quantum information technologies.
△ Less
Submitted 18 March, 2024;
originally announced March 2024.
-
CMB spectral distortions from enhanced primordial perturbations: the role of spectator axions
Authors:
Margherita Putti,
Nicola Bartolo,
Sukannya Bhattacharya,
Marco Peloso
Abstract:
Primordial tensor modes can induce Cosmic Microwave Background spectral distortions during horizon re-entry. We investigate a specific mechanism proposed for this purpose, characterized by the coupling of an SU(2) gauge field to an axion undergoing a momentary stage of rapid evolution during inflation. Examining also the scalar perturbations produced by this model, we find that spectral distortion…
▽ More
Primordial tensor modes can induce Cosmic Microwave Background spectral distortions during horizon re-entry. We investigate a specific mechanism proposed for this purpose, characterized by the coupling of an SU(2) gauge field to an axion undergoing a momentary stage of rapid evolution during inflation. Examining also the scalar perturbations produced by this model, we find that spectral distortions from the scalar modes significantly dominate those arising from the tensors. This holds true also for an earlier version of the model based on a U(1) gauge field. The scalar-induced distortions might be observed in future experiments, and the current COBE/FIRAS constraints already limit the parameter space of these models. Additionally, we find that delaying the onset of fast roll in the SU(2) scenario (to enhance the modes at the scales relevant for spectral distortions, while respecting the CMB constraints at larger scales) poses a greater challenge compared to the U(1) case. We propose a way to control the axion speed by varying the size of its coupling to the gauge fields.
△ Less
Submitted 13 March, 2024;
originally announced March 2024.
-
Proton Helicity GPDs from Lattice QCD
Authors:
Joshua Miller,
Shohini Bhattacharya,
Krzysztof Cichy,
Martha Constantinou,
Xiang Gao,
Andreas Metz,
Swagato Mukherjee,
Peter Petreczky,
Fernanda Steffens,
Yong Zhao
Abstract:
First lattice QCD calculations of $x$-dependent GPD have been performed in the (symmetric) Breit frame, where the momentum transfer is evenly divided between the initial and final hadron states. However, employing the asymmetric frame, we are able to obtain proton GPDs for multiple momentum transfers in a computationally efficient setup. In these proceedings, we focus on the helicity twist-2 GPD a…
▽ More
First lattice QCD calculations of $x$-dependent GPD have been performed in the (symmetric) Breit frame, where the momentum transfer is evenly divided between the initial and final hadron states. However, employing the asymmetric frame, we are able to obtain proton GPDs for multiple momentum transfers in a computationally efficient setup. In these proceedings, we focus on the helicity twist-2 GPD at zero skewness that gives access to the $\widetilde{H}$ GPD. We will cover the implementation of the asymmetric frame, its comparison to the Breit frame, and the dependence of the GPD on the squared four-momentum transfer, $-t$. The calculation is performed on an $N_f = 2+1+1$ ensemble of twisted mass fermions with a clover improvement. The mass of the pion for this ensemble is roughly 260 MeV.
△ Less
Submitted 8 March, 2024;
originally announced March 2024.
-
VTruST: Controllable value function based subset selection for Data-Centric Trustworthy AI
Authors:
Soumi Das,
Shubhadip Nag,
Shreyyash Sharma,
Suparna Bhattacharya,
Sourangshu Bhattacharya
Abstract:
Trustworthy AI is crucial to the widespread adoption of AI in high-stakes applications with fairness, robustness, and accuracy being some of the key trustworthiness metrics. In this work, we propose a controllable framework for data-centric trustworthy AI (DCTAI)- VTruST, that allows users to control the trade-offs between the different trustworthiness metrics of the constructed training datasets.…
▽ More
Trustworthy AI is crucial to the widespread adoption of AI in high-stakes applications with fairness, robustness, and accuracy being some of the key trustworthiness metrics. In this work, we propose a controllable framework for data-centric trustworthy AI (DCTAI)- VTruST, that allows users to control the trade-offs between the different trustworthiness metrics of the constructed training datasets. A key challenge in implementing an efficient DCTAI framework is to design an online value-function-based training data subset selection algorithm. We pose the training data valuation and subset selection problem as an online sparse approximation formulation. We propose a novel online version of the Orthogonal Matching Pursuit (OMP) algorithm for solving this problem. Experimental results show that VTruST outperforms the state-of-the-art baselines on social, image, and scientific datasets. We also show that the data values generated by VTruST can provide effective data-centric explanations for different trustworthiness metrics.
△ Less
Submitted 8 March, 2024;
originally announced March 2024.
-
Predicting the Temperature Dependence of Surfactant CMCs Using Graph Neural Networks
Authors:
Christoforos Brozos,
Jan G. Rittig,
Sandip Bhattacharya,
Elie Akanny,
Christina Kohlmann,
Alexander Mitsos
Abstract:
The critical micelle concentration (CMC) of surfactant molecules is an essential property for surfactant applications in industry. Recently, classical QSPR and Graph Neural Networks (GNNs), a deep learning technique, have been successfully applied to predict the CMC of surfactants at room temperature. However, these models have not yet considered the temperature dependency of the CMC, which is hig…
▽ More
The critical micelle concentration (CMC) of surfactant molecules is an essential property for surfactant applications in industry. Recently, classical QSPR and Graph Neural Networks (GNNs), a deep learning technique, have been successfully applied to predict the CMC of surfactants at room temperature. However, these models have not yet considered the temperature dependency of the CMC, which is highly relevant for practical applications. We herein develop a GNN model for temperature-dependent CMC prediction of surfactants. We collect about 1400 data points from public sources for all surfactant classes, i.e., ionic, nonionic, and zwitterionic, at multiple temperatures. We test the predictive quality of the model for following scenarios: i) when CMC data for surfactants are present in the training of the model in at least one different temperature, and ii) CMC data for surfactants are not present in the training, i.e., generalizing to unseen surfactants. In both test scenarios, our model exhibits a high predictive performance of R$^2 \geq $ 0.94 on test data. We also find that the model performance varies by surfactant class. Finally, we evaluate the model for sugar-based surfactants with complex molecular structures, as these represent a more sustainable alternative to synthetic surfactants and are therefore of great interest for future applications in the personal and home care industries.
△ Less
Submitted 6 March, 2024;
originally announced March 2024.
-
Higgs couplings in SMEFT via Zh production at the HL-LHC
Authors:
Subhaditya Bhattacharya,
Abhik Sarkar,
Sanjoy Biswas
Abstract:
We study the Higgs couplings present in the $Zh$ associated production mode at the Large Hadron Collider (LHC) in presence of both CP even and CP odd dimension 6 Standard Model Effective Theory (SMEFT) operators. The analysis is performed mainly in context of the HL-LHC (with $\sqrt{s}=$14 TeV and luminosity 3000 $fb^{-1}$) setup using cut based as well as machine learning techniques. The analysis…
▽ More
We study the Higgs couplings present in the $Zh$ associated production mode at the Large Hadron Collider (LHC) in presence of both CP even and CP odd dimension 6 Standard Model Effective Theory (SMEFT) operators. The analysis is performed mainly in context of the HL-LHC (with $\sqrt{s}=$14 TeV and luminosity 3000 $fb^{-1}$) setup using cut based as well as machine learning techniques. The analysis shows significant betterment in the signal significance by using the machine learning technique. We also do a $χ^2$ analysis, which reveals a significant change in the sensitivity of the coupling modifiers due to the presence of effective operators, in particular due to the four point $qqZh$ interaction. The presence of dimension six CP odd four point operators, which contributes at $\mathcal{O} (Λ^{-4})$ order due to lack of interference with the SM contributions, can only have sensitivity with smaller NP scale at the HL-LHC, after addressing the effective limit and constraints.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
G4-Attention: Deep Learning Model with Attention for predicting DNA G-Quadruplexes
Authors:
Shrimon Mukherjee,
Pulakesh Pramanik,
Partha Basuchowdhuri,
Santanu Bhattacharya
Abstract:
G-Quadruplexes are the four-stranded non-canonical nucleic acid secondary structures, formed by the stacking arrangement of the guanine tetramers. They are involved in a wide range of biological roles because of their exceptionally unique and distinct structural characteristics. After the completion of the human genome sequencing project, a lot of bioinformatic algorithms were introduced to predic…
▽ More
G-Quadruplexes are the four-stranded non-canonical nucleic acid secondary structures, formed by the stacking arrangement of the guanine tetramers. They are involved in a wide range of biological roles because of their exceptionally unique and distinct structural characteristics. After the completion of the human genome sequencing project, a lot of bioinformatic algorithms were introduced to predict the active G4s regions \textit{in vitro} based on the canonical G4 sequence elements, G-\textit{richness}, and G-\textit{skewness}, as well as the non-canonical sequence features. Recently, sequencing techniques like G4-seq and G4-ChIP-seq were developed to map the G4s \textit{in vitro}, and \textit{in vivo} respectively at a few hundred base resolution. Subsequently, several machine learning approaches were developed for predicting the G4 regions using the existing databases. However, their prediction models were simplistic, and the prediction accuracy was notably poor. In response, here, we propose a novel convolutional neural network with Bi-LSTM and attention layers, named G4-attention, to predict the G4 forming sequences with improved accuracy. G4-attention achieves high accuracy and attains state-of-the-art results in the G4 prediction task. Our model also predicts the G4 regions accurately in the highly class-imbalanced datasets. In addition, the developed model trained on the human genome dataset can be applied to any non-human genome DNA sequences to predict the G4 formation propensities.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
Sengupta Transformations and Carrollian Relativistic Theory
Authors:
Rabin Banerjee,
Soumya Bhattacharya,
Bibhas Ranjan Majhi
Abstract:
A detailed and systematic formulation of Carrollian relativity is provided. Based on the transformations, first provided by Sengupta [19], we construct a mapping between Lorentz relativistic and Carrollian relativistic vectors. Using this map the Carroll theory is built from the standard Maxwell action. We show that we get self-consistent equations of motion from the action, both in electric and m…
▽ More
A detailed and systematic formulation of Carrollian relativity is provided. Based on the transformations, first provided by Sengupta [19], we construct a mapping between Lorentz relativistic and Carrollian relativistic vectors. Using this map the Carroll theory is built from the standard Maxwell action. We show that we get self-consistent equations of motion from the action, both in electric and magnetic limits. We introduce Carroll electric and magnetic fields. A new set of maps is derived that connects Carroll electric and magnetic fields with the usual Maxwell ones and yields Carroll equations in terms of fields. Consistency of results with the potential formulation is shown. Carroll version of symmetries like duality, gauge, shift, Noether and boost are treated in details and their implications elaborated. Especially, boost symmetry provides a link to the various maps used in this paper.
△ Less
Submitted 12 June, 2024; v1 submitted 4 March, 2024;
originally announced March 2024.
-
Probing switchable valley-related Hall effects in 2D magnetic MXenes
Authors:
Ankita Phutela,
Sajjan Sheoran,
Saswata Bhattacharya
Abstract:
The search for two-dimensional materials with exotic valley-dependent properties has attracted rapid attention as they are fundamentally intriguing and practically appealing for nanoscale device applications. Here, using first-principles calculations, we report the identification of promising intrinsic valley-related switchable Hall effects in Cr2CSF. With a high out-of-plane magnetic anisotropy,…
▽ More
The search for two-dimensional materials with exotic valley-dependent properties has attracted rapid attention as they are fundamentally intriguing and practically appealing for nanoscale device applications. Here, using first-principles calculations, we report the identification of promising intrinsic valley-related switchable Hall effects in Cr2CSF. With a high out-of-plane magnetic anisotropy, Cr2CSF is a ferrovalley semiconductor with spontaneously polarized valleys having a valley polarization of 27.1 meV in the conduction band. This facilitates the observation of an intrinsic anomalous valley Hall (AVH) effect that is manipulated under an in-plane electric field. The underlying physics of spontaneous valley polarization is also discussed based on the SOC Hamiltonian model. Furthermore, on application of unidirectional compressive strain, Cr2CSF is further transitioned from the AVH phase to a long-sought quasi-half valley metal state. Here, only the electrons are valley polarized such that holes and electrons carriers are separated under the in-plane electric field. Our work enriches materials with the valley-related Hall effects and provides a platform for interplay among valleytronics and spintronics.
△ Less
Submitted 4 March, 2024;
originally announced March 2024.
-
Dynamic Anchor Selection and Real-Time Pose Prediction for Ultra-wideband Tagless Gate
Authors:
Junyoung Choi,
Sagnik Bhattacharya,
Joohyun Lee
Abstract:
Ultra-wideband (UWB) is emerging as a promising solution that can realize proximity services, such as UWB tagless gate (UTG), thanks to centimeter-level localization accuracy based on two different ranging methods such as downlink time-difference of arrival (DL-TDoA) and double-sided two-way ranging (DS-TWR). The UTG is a UWB-based proximity service that provides a seamless gate pass system withou…
▽ More
Ultra-wideband (UWB) is emerging as a promising solution that can realize proximity services, such as UWB tagless gate (UTG), thanks to centimeter-level localization accuracy based on two different ranging methods such as downlink time-difference of arrival (DL-TDoA) and double-sided two-way ranging (DS-TWR). The UTG is a UWB-based proximity service that provides a seamless gate pass system without requiring real-time mobile device (MD) tapping. The location of MD is calculated using DL-TDoA, and the MD communicates with the nearest UTG using DS-TWR to open the gate. Therefore, the knowledge about the exact location of MD is the main challenge of UTG, and hence we provide the solutions for both DL-TDoA and DS-TWR. In this paper, we propose dynamic anchor selection for extremely accurate DL-TDoA localization and pose prediction for DS-TWR, called DynaPose. The pose is defined as the actual location of MD on the human body, which affects the localization accuracy. DynaPose is based on line-of-sight (LOS) and non-LOS (NLOS) classification using deep learning for anchor selection and pose prediction. Deep learning models use the UWB channel impulse response and the inertial measurement unit embedded in the smartphone. DynaPose is implemented on Samsung Galaxy Note20 Ultra and Qorvo UWB board to show the feasibility and applicability. DynaPose achieves a LOS/NLOS classification accuracy of 0.984, 62% higher DL-TDoA localization accuracy, and ultimately detects four different poses with an accuracy of 0.961 in real-time.
△ Less
Submitted 22 February, 2024;
originally announced February 2024.
-
Existance of octupole correlation in 116Sn
Authors:
Prithwijita Ray,
H. Pai,
S. Chakraborty,
A. Mukherjee,
S. Rajbanshi,
Sajad Ali,
G. Gangopadhyay,
S. Bhattacharyya,
G. Mukherjee,
C. Bhattacharya,
Soumik Bhattacharya,
R. Banik,
S. Nandi,
R. Raut,
S. S. Ghugre,
S. Samanta,
S. Das,
S. Chatterjee,
A. Goswami
Abstract:
The negative parity states in 116Sn have been investigated in terms of octupole correlation. The same is probed by using the Indian National Gamma Array (INGA) facility at Variable Energy Cyclotron Centre, Kolkata using the reaction, 114Cd(α,2n) 116Sn at 34 MeV energy. Three new γ-transitions relevant to the present investigation are reported and the spin-parities of the associated levels are assi…
▽ More
The negative parity states in 116Sn have been investigated in terms of octupole correlation. The same is probed by using the Indian National Gamma Array (INGA) facility at Variable Energy Cyclotron Centre, Kolkata using the reaction, 114Cd(α,2n) 116Sn at 34 MeV energy. Three new γ-transitions relevant to the present investigation are reported and the spin-parities of the associated levels are assigned based on the DCO-ratios and polarisation measurements. The interband transitions between the positive parity ground band and the negative parity band are newly observed and the corresponding extracted ratio of transition probability, B(E1)/B(E2) indicated the existence of octupole correlation in these nuclei. The enhanced transition rates for E1 and E3 transitions between these opposite parity bands and corroboration of soft octupole deformation ultimately aid the onset of octupole excitation in this isotope.
△ Less
Submitted 24 February, 2024;
originally announced February 2024.
-
Aaronson-Ambainis Conjecture Is True For Random Restrictions
Authors:
Sreejata Kishor Bhattacharya
Abstract:
In an attempt to show that the acceptance probability of a quantum query algorithm making $q$ queries can be well-approximated almost everywhere by a classical decision tree of depth $\leq \text{poly}(q)$, Aaronson and Ambainis proposed the following conjecture: let $f: \{ \pm 1\}^n \rightarrow [0,1]$ be a degree $d$ polynomial with variance $\geq ε$. Then, there exists a coordinate of $f$ with in…
▽ More
In an attempt to show that the acceptance probability of a quantum query algorithm making $q$ queries can be well-approximated almost everywhere by a classical decision tree of depth $\leq \text{poly}(q)$, Aaronson and Ambainis proposed the following conjecture: let $f: \{ \pm 1\}^n \rightarrow [0,1]$ be a degree $d$ polynomial with variance $\geq ε$. Then, there exists a coordinate of $f$ with influence $\geq \text{poly} (ε, 1/d)$.
We show that for any polynomial $f: \{ \pm 1\}^n \rightarrow [0,1]$ of degree $d$ $(d \geq 2)$ and variance $\text{Var}[f] \geq 1/d$, if $ρ$ denotes a random restriction with survival probability $\dfrac{\log(d)}{C_1 d}$, $$ \text{Pr} \left[f_ρ \text{ has a coordinate with influence} \geq \dfrac{\text{Var}[f]^2 }{d^{C_2}} \right] \geq \dfrac{\text{Var}[f] \log(d)}{50C_1 d}$$ where $C_1, C_2>0$ are universal constants. Thus, Aaronson-Ambainis conjecture is true for a non-negligible fraction of random restrictions of the given polynomial assuming its variance is not too low.
△ Less
Submitted 21 February, 2024;
originally announced February 2024.
-
Measuring the magnetic dipole moment and magnetospheric fluctuations of SXP 18.3 with a Kalman filter
Authors:
J. O'Leary,
A. Melatos,
N. J. O'Neill,
P. M. Meyers,
D. M. Christodoulou,
S. Bhattacharya,
S. G. T. Laycock
Abstract:
The magnetic dipole moment $μ$ of an accretion-powered pulsar in magnetocentrifugal equilibrium cannot be inferred uniquely from time-averaged pulse period and aperiodic X-ray flux data, because the radiative efficiency $η_0$ of the accretion is unknown, as are the mass, radius, and distance of the star. The degeneracy associated with the radiative efficiency is circumvented, if fluctuations of th…
▽ More
The magnetic dipole moment $μ$ of an accretion-powered pulsar in magnetocentrifugal equilibrium cannot be inferred uniquely from time-averaged pulse period and aperiodic X-ray flux data, because the radiative efficiency $η_0$ of the accretion is unknown, as are the mass, radius, and distance of the star. The degeneracy associated with the radiative efficiency is circumvented, if fluctuations of the pulse period and aperiodic X-ray flux are tracked with a Kalman filter, whereupon $μ$ can be measured uniquely up to the uncertainties in the mass, radius, and distance. Here the Kalman filter analysis is demonstrated successfully in practice for the first time on Rossi X-ray Timing Explorer observations of the X-ray transient SXP 18.3 in the Small Magellanic Cloud, which is monitored regularly. The analysis yields $μ= 8.0^{+1.3}_{-1.2} \, \times \, 10^{30} \, {\rm G \, cm^3}$ and $η_0 = 0.04^{+0.02}_{-0.01}$, compared to $μ= 5.0^{+1.0}_{-1.0} \times 10^{30} \, {\rm G \, cm^3}$ as inferred traditionally from time-averaged data assuming $η_0=1$. The analysis also yields time-resolved estimates of two hidden state variables, the mass accretion rate and the Maxwell stress at the disk-magnetosphere boundary. The success of the demonstration confirms that the Kalman filter analysis can be applied in the future to study the magnetic moments and disk-magnetosphere physics of accretion-powered pulsar populations in the Small Magellanic Cloud and elsewhere.
△ Less
Submitted 19 February, 2024;
originally announced February 2024.
-
Power-Efficient Indoor Localization Using Adaptive Channel-aware Ultra-wideband DL-TDOA
Authors:
Sagnik Bhattacharya,
Junyoung Choi,
Joohyun Lee
Abstract:
Among the various Ultra-wideband (UWB) ranging methods, the absence of uplink communication or centralized computation makes downlink time-difference-of-arrival (DL-TDOA) localization the most suitable for large-scale industrial deployments. However, temporary or permanent obstacles in the deployment region often lead to non-line-of-sight (NLOS) channel path and signal outage effects, which result…
▽ More
Among the various Ultra-wideband (UWB) ranging methods, the absence of uplink communication or centralized computation makes downlink time-difference-of-arrival (DL-TDOA) localization the most suitable for large-scale industrial deployments. However, temporary or permanent obstacles in the deployment region often lead to non-line-of-sight (NLOS) channel path and signal outage effects, which result in localization errors. Prior research has addressed this problem by increasing the ranging frequency, which leads to a heavy increase in the user device power consumption. It also does not contribute to any increase in localization accuracy under line-of-sight (LOS) conditions. In this paper, we propose and implement a novel low-power channel-aware dynamic frequency DL-TDOA ranging algorithm. It comprises NLOS probability predictor based on a convolutional neural network (CNN), a dynamic ranging frequency control module, and an IMU sensor-based ranging filter. Based on the conducted experiments, we show that the proposed algorithm achieves 50% higher accuracy in NLOS conditions while having 46% lower power consumption in LOS conditions compared to baseline methods from prior research.
△ Less
Submitted 16 February, 2024;
originally announced February 2024.
-
Deep Learning-based Real-time Smartphone Pose Detection for Ultra-wideband Tagless Gate
Authors:
Junyoung Choi,
Sagnik Bhattacharya
Abstract:
As commercial interest in proximity services increased, the development of various wireless localization techniques was promoted. In line with this trend, Ultra-wideband (UWB) is emerging as a promising solution that can realize proximity services thanks to centimeter-level localization accuracy. In addition, since the actual location of the mobile device (MD) on the human body, called pose, affec…
▽ More
As commercial interest in proximity services increased, the development of various wireless localization techniques was promoted. In line with this trend, Ultra-wideband (UWB) is emerging as a promising solution that can realize proximity services thanks to centimeter-level localization accuracy. In addition, since the actual location of the mobile device (MD) on the human body, called pose, affects the localization accuracy, poses are also important to provide accurate proximity services, especially for the UWB tagless gate (UTG). In this paper, a real-time pose detector, termed D3, is proposed to estimate the pose of MD when users pass through UTG. D3 is based on line-of-sight (LOS) and non-LOS (NLOS) classification using UWB channel impulse response and utilizes the inertial measurement unit embedded in the smartphone to estimate the pose. D3 is implemented on Samsung Galaxy Note20 Ultra (i.e., SMN986B) and Qorvo UWB board to show the feasibility and applicability. D3 achieved an LOS/NLOS classification accuracy of 0.984, and ultimately detected four different poses of MD with an accuracy of 0.961 in real-time.
△ Less
Submitted 13 February, 2024;
originally announced February 2024.
-
EntGPT: Linking Generative Large Language Models with Knowledge Bases
Authors:
Yifan Ding,
Amrit Poudel,
Qingkai Zeng,
Tim Weninger,
Balaji Veeramani,
Sanmitra Bhattacharya
Abstract:
The ability of Large Language Models (LLMs) to generate factually correct output remains relatively unexplored due to the lack of fact-checking and knowledge grounding during training and inference. In this work, we aim to address this challenge through the Entity Disambiguation (ED) task. We first consider prompt engineering, and design a three-step hard-prompting method to probe LLMs' ED perform…
▽ More
The ability of Large Language Models (LLMs) to generate factually correct output remains relatively unexplored due to the lack of fact-checking and knowledge grounding during training and inference. In this work, we aim to address this challenge through the Entity Disambiguation (ED) task. We first consider prompt engineering, and design a three-step hard-prompting method to probe LLMs' ED performance without supervised fine-tuning (SFT). Overall, the prompting method improves the micro-F_1 score of the original vanilla models by a large margin, on some cases up to 36% and higher, and obtains comparable performance across 10 datasets when compared to existing methods with SFT. We further improve the knowledge grounding ability through instruction tuning (IT) with similar prompts and responses. The instruction-tuned model not only achieves higher micro-F1 score performance as compared to several baseline methods on supervised entity disambiguation tasks with an average micro-F_1 improvement of 2.1% over the existing baseline models, but also obtains higher accuracy on six Question Answering (QA) tasks in the zero-shot setting. Our methodologies apply to both open- and closed-source LLMs.
△ Less
Submitted 9 February, 2024;
originally announced February 2024.
-
Exponential Separation Between Powers of Regular and General Resolution Over Parities
Authors:
Sreejata Kishor Bhattacharya,
Arkadev Chattopadhyay,
Pavel Dvořák
Abstract:
Proving super-polynomial lower bounds on the size of proofs of unsatisfiability of Boolean formulas using resolution over parities is an outstanding problem that has received a lot of attention after its introduction by Raz and Tzamaret [Ann. Pure Appl. Log.'08]. Very recently, Efremenko, Garlík and Itsykson [ECCC'23] proved the first exponential lower bounds on the size of ResLin proofs that were…
▽ More
Proving super-polynomial lower bounds on the size of proofs of unsatisfiability of Boolean formulas using resolution over parities is an outstanding problem that has received a lot of attention after its introduction by Raz and Tzamaret [Ann. Pure Appl. Log.'08]. Very recently, Efremenko, Garlík and Itsykson [ECCC'23] proved the first exponential lower bounds on the size of ResLin proofs that were additionally restricted to be bottom-regular. We show that there are formulas for which such regular ResLin proofs of unsatisfiability continue to have exponential size even though there exists short proofs of their unsatisfiability in ordinary, non-regular resolution. This is the first super-polynomial separation between the power of general ResLin and and that of regular ResLin for any natural notion of regularity.
Our argument, while building upon the work of Efremenko et al., uses additional ideas from the literature on lifting theorems.
△ Less
Submitted 23 February, 2024; v1 submitted 6 February, 2024;
originally announced February 2024.
-
Nonsense associations in Markov random fields with pairwise dependence
Authors:
Sohom Bhattacharya,
Rajarshi Mukherjee,
Elizabeth Ogburn
Abstract:
Yule (1926) identified the issue of "nonsense correlations" in time series data, where dependence within each of two random vectors causes overdispersion -- i.e. variance inflation -- for measures of dependence between the two. During the near century since then, much has been written about nonsense correlations -- but nearly all of it confined to the time series literature. In this paper we provi…
▽ More
Yule (1926) identified the issue of "nonsense correlations" in time series data, where dependence within each of two random vectors causes overdispersion -- i.e. variance inflation -- for measures of dependence between the two. During the near century since then, much has been written about nonsense correlations -- but nearly all of it confined to the time series literature. In this paper we provide the first, to our knowledge, rigorous study of this phenomenon for more general forms of (positive) dependence, specifically for Markov random fields on lattices and graphs. We consider both binary and continuous random vectors and three different measures of association: correlation, covariance, and the ordinary least squares coefficient that results from projecting one random vector onto the other. In some settings we find variance inflation consistent with Yule's nonsense correlation. However, surprisingly, we also find variance deflation in some settings, and in others the variance is unchanged under dependence. Perhaps most notably, we find general conditions under which OLS inference that ignores dependence is valid despite positive dependence in the regression errors, contradicting the presentation of OLS in countless textbooks and courses.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
Evaluating the consequences: Impact of sex-selective harvesting on fish population and identifying tipping points via life-history parameters
Authors:
Joydeb Bhattacharyya,
Arnab Chattopadhyay,
Anurag Sau,
Sabyasachi Bhattacharya
Abstract:
Fish harvesting often targets larger individuals, which can be sex-specific due to size dimorphism or differences in behaviors like migration and spawning. Sex-selective harvesting can have dire consequences in the long run, potentially pushing fish populations towards collapse much earlier due to skewed sex ratios and reduced reproduction. To investigate this pressing issue, we used a single-spec…
▽ More
Fish harvesting often targets larger individuals, which can be sex-specific due to size dimorphism or differences in behaviors like migration and spawning. Sex-selective harvesting can have dire consequences in the long run, potentially pushing fish populations towards collapse much earlier due to skewed sex ratios and reduced reproduction. To investigate this pressing issue, we used a single-species sex-structured mathematical model with a weak Allee effect on the fish population. Additionally, we incorporate a realistic harvesting mechanism resembling the Michaelis-Menten function. Our analysis illuminates the intricate interplay between life history traits, harvesting intensity, and population stability. The results demonstrate that fish life history traits, such as a higher reproductive rate, early maturation of juveniles, and increased longevity, confer advantages under intensive harvesting. To anticipate potential population collapse, we employ a novel early warning tool (EWT) based on the concept of basin stability to pinpoint tipping points before they occur. Harvesting yield at our proposed early indicator can act as a potential pathway to achieve optimal yield while keeping the population safely away from the brink of collapse, rather than relying solely on the established maximum sustainable yield (MSY), where the population dangerously approaches the point of no return. Furthermore, we show that density-dependent female stocking upon receiving an EWT signal significantly shifts the tipping point, allowing safe harvesting even at MSY levels, thus can act as a potential intervention strategy.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy Learning
Authors:
Sabariswaran Mani,
Sreyas Venkataraman,
Abhranil Chandra,
Adyan Rizvi,
Yash Sirvi,
Soumojit Bhattacharya,
Aritra Hazra
Abstract:
Robot learning tasks are extremely compute-intensive and hardware-specific. Thus the avenues of tackling these challenges, using a diverse dataset of offline demonstrations that can be used to train robot manipulation agents, is very appealing. The Train-Offline-Test-Online (TOTO) Benchmark provides a well-curated open-source dataset for offline training comprised mostly of expert data and also be…
▽ More
Robot learning tasks are extremely compute-intensive and hardware-specific. Thus the avenues of tackling these challenges, using a diverse dataset of offline demonstrations that can be used to train robot manipulation agents, is very appealing. The Train-Offline-Test-Online (TOTO) Benchmark provides a well-curated open-source dataset for offline training comprised mostly of expert data and also benchmark scores of the common offline-RL and behaviour cloning agents. In this paper, we introduce DiffClone, an offline algorithm of enhanced behaviour cloning agent with diffusion-based policy learning, and measured the efficacy of our method on real online physical robots at test time. This is also our official submission to the Train-Offline-Test-Online (TOTO) Benchmark Challenge organized at NeurIPS 2023. We experimented with both pre-trained visual representation and agent policies. In our experiments, we find that MOCO finetuned ResNet50 performs the best in comparison to other finetuned representations. Goal state conditioning and mapping to transitions resulted in a minute increase in the success rate and mean-reward. As for the agent policy, we developed DiffClone, a behaviour cloning agent improved using conditional diffusion.
△ Less
Submitted 23 May, 2024; v1 submitted 17 January, 2024;
originally announced January 2024.