
Approximate Degree Composition for Recursive Functions

Authors: Sourav Chakraborty, Chandrima Kayal, Rajat Mittal, Manaswi Paraashar, Nitin Saurabh

Abstract: Determining the approximate degree composition for Boolean functions remains a significant unsolved problem in Boolean function complexity. In recent decades, researchers have concentrated on proving that approximate degree composes for special types of inner and outer functions. An important and extensively studied class of functions are the recursive functions, i.e.~functions obtained by composing a base function with itself a number of times. Let $h^d$ denote the standard $d$-fold composition of the base function $h$. The main result of this work is to show that the approximate degree composes if either of the following conditions holds: \begin{itemize} \item The outer function $f:\{0,1\}^n\to \{0,1\}$ is a recursive function of the form $h^d$, with $h$ being any base function and $d= Ω(\log\log n)$. \item The inner function is a recursive function of the form $h^d$, with $h$ being any constant arity base function (other than AND and OR) and $d= Ω(\log\log n)$, where $n$ is the arity of the outer function. \end{itemize} In terms of proof techniques, we first observe that the lower bound for composition can be obtained by introducing majority in between the inner and the outer functions. We then show that majority can be \emph{efficiently eliminated} if the inner or outer function is a recursive function.

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08217 [pdf, ps, other]

doi 10.1142/S0217732323501092

A description of classical and quantum cosmology for a single scalar field torsion gravity

Authors: Dipankar Laya, Roshni Bhaumik, Sourav Dutta, Subenoy Chakraborty

Abstract: In the background of homogeneous and isotropic flat FLRW space-time, both classical and quantum cosmology have been studied for the teleparallel dark energy (DE) model. Using Noether symmetry analysis, not only the symmetry vector but also the coupling function in the Lagrangian and the potential of the scalar field have been determined. The symmetry analysis also identifies a cyclic variable in the Lagrangian along the symmetry vector, and as a result the Lagrangian simplifies to a great extent so that the classical solution is obtained. Subsequently, in quantum cosmology the Wheeler-DeWitt (WD) equation has been constructed, and the quantum version of the conserved momenta corresponding to Noether symmetry identifies the periodic part of the wave function of the universe; as a result the Wheeler-DeWitt equation becomes solvable. Finally, the quantum description shows a finite non-zero probability at the classical big-bang singularity.

Submitted 11 July, 2024; originally announced July 2024.

Comments: 16 Pages, 4 figures

Journal ref: Modern Physics Letters A Vol. 38, Nos. 22 & 23 (2023) 2350109 (15 pages)

arXiv:2407.08207 [pdf, ps, other]

doi 10.1142/S0217751X23500641

Classical and Quantum Cosmology in Einstein-aether Scalar-tensor gravity: Noether Symmetry Analysis

Authors: Dipankar Laya, Roshni Bhaumik, Sourav Dutta, Subenoy Chakraborty

Abstract: The present work deals with Einstein-aether scalar-tensor gravity in the background of the homogeneous and isotropic flat FLRW space-time model. The Noether symmetry vector identifies a transformation in the augmented space so that the field equations become solvable. The cosmological solutions are analyzed from the observational point of view. Finally, for quantum cosmology, the Wheeler-DeWitt (WD) equation has been formulated and solutions have been determined by identifying the periodic nature of the wave function using the conserved (Noether) charge.

Submitted 11 July, 2024; originally announced July 2024.

Comments: 15 pages, 4 figures

Journal ref: International Journal of Modern Physics A Vol. 38, Nos. 12 & 13 (2023) 2350064 (14 pages)

arXiv:2407.07696 [pdf, ps, other]

Layer Resolved Magnetotransport Properties in Antiferromagnetic/Paramagnetic Superlattices

Authors: Sandip Halder, Sourav Chakraborty, Kalpataru Pradhan

Abstract: We investigate the layer resolved magnetotransport properties of antiferromagnetic/paramagnetic superlattices based on the one band half-filled Hubbard model in three dimensions. In our setup the correlated layers (with on-site repulsion strength $U \ne 0$) are intercalated between the uncorrelated ($U = 0$) layers. Our calculations based on the semi-classical Monte-Carlo technique show that magnetic moments are induced in the uncorrelated layers at low temperatures due to kinetic hopping of the carriers across the interface. The average induced magnetic moment in the uncorrelated layer varies nonmonotonically with the $U$ values of the correlated layer. Interestingly, the induced magnetic moments are antiferromagnetically arranged in the uncorrelated layers and mediate the antiferromagnetic ordering between correlated layers. As a result the whole superlattice (SL) system turns out to be an antiferromagnetic insulator at low temperatures. For $U \sim$ bandwidth the local moments in the correlated planes increase as a function of the distance from the interface. As expected, our in-plane resistivity calculations show that the metal insulator transition temperature of the central plane is larger than that of the edge planes in the correlated layers. On the other hand, although the induced moments in the uncorrelated planes decrease considerably as we move from edge planes to center planes, the metal insulator transition temperature remains more or less the same for all planes.
The induced moments in the uncorrelated layers gradually dissipate with increasing thickness of the uncorrelated layer, and as a result the long range antiferromagnetic ordering vanishes in the superlattices, similar to the experiments.

Submitted 10 July, 2024; originally announced July 2024.

Comments: 12 Figs

arXiv:2407.06696 [pdf, ps, other]

doi 10.1142/S0218271823500013

Quantum Cosmology in Coupled Brans-Dicke Gravity: A Noether Symmetry Analysis

Authors: Dipankar Laya, Sourav Dutta, Subenoy Chakraborty

Abstract: The present work deals with a multi-field cosmological model in a spatially flat FLRW space-time geometry. The usual Brans-Dicke (BD) field and another scalar field are minimally coupled to gravity while they interact with each other through the kinetic terms. The main aim of the present work is to examine whether the model is compatible with cosmic observations, so cosmological solutions are obtained using symmetry analysis only. By imposing Noether symmetry on the Lagrangian of the system, the potential of the scalar field as well as the coupling function have been determined. The classical solutions are determined after simplifying the Lagrangian using cyclic variables. Finally, the Wheeler-DeWitt (WD) equation in quantum cosmology has been formulated, and the conserved momenta corresponding to Noether symmetry show the periodic part of the wave function, which helps to obtain the complete integral for the wave function.

Submitted 9 July, 2024; originally announced July 2024.

Comments: 14 pages, 4 figures

arXiv:2407.05982 [pdf, other]

MTL-Split: Multi-Task Learning for Edge Devices using Split Computing

Authors: Luigi Capogrosso, Enrico Fraccaroli, Samarjit Chakraborty, Franco Fummi, Marco Cristani

Abstract: Split Computing (SC), where a Deep Neural Network (DNN) is intelligently split with a part of it deployed on an edge device and the rest on a remote server, is emerging as a promising approach. It allows the power of DNNs to be leveraged for latency-sensitive applications that do not allow the entire DNN to be deployed remotely, while not having sufficient computation bandwidth available locally. In many such embedded systems scenarios, such as those in the automotive domain, computational resource constraints also necessitate Multi-Task Learning (MTL), where the same DNN is used for multiple inference tasks instead of having dedicated DNNs for each task, which would need more computing bandwidth. However, how to partition such a multi-tasking DNN to be deployed within an SC framework has not been sufficiently studied. This paper studies this problem, and MTL-Split, our novel proposed architecture, shows encouraging results on both synthetic and real-world data. The source code is available at https://github.com/intelligolabs/MTL-Split.

Submitted 8 July, 2024; originally announced July 2024.

Comments: Accepted at the 61st Design Automation Conference (DAC 2024)

arXiv:2407.05371 [pdf, other]

AstroSat Observations of the Dipping Low Mass X-ray Binary XB 1254-690

Authors: Nilam R. Navale, Devraj Pawar, A. R. Rao, Ranjeev Misra, Sudip Chakraborty, Sudip Bhattacharyya, Vaishali A. Bambole

Abstract: XB 1254-690 is a neutron star low-mass X-ray binary with an orbital period of 3.88 hrs, and it exhibits energy-dependent intensity dips, thermonuclear bursts, and flares. We present the results of an analysis of a long observation of this source using the AstroSat satellite. The X-ray light curve gradually changed from a high-intensity flaring state to a low-intensity one with a few dips. The hardness intensity diagram showed that the source is in a high-intensity banana state with a gradually changing flux. Based on this, we divide the observation into four flux levels for a flux-resolved spectral study. The X-ray spectra can be explained by a model consisting of absorption, thermal emission from the disc and non-thermal emission from the corona. From our studies, we detect a correlation between the temperature of the thermal component and the flux and we examine the implications of our results for the accretion disc geometry of this source.

Submitted 7 July, 2024; originally announced July 2024.

Comments: Accepted for publication in MNRAS; 11 pages, 12 figures

arXiv:2407.05164 [pdf, other]

doi 10.1007/s10714-024-03265-1

A dynamical system analysis of bouncing cosmology with spatial curvature

Authors: Soumya Chakraborty, Sudip Mishra, Subenoy Chakraborty

Abstract: The present work deals with an FLRW cosmological model with spatial curvature and a minimally coupled scalar field as the matter content. The curvature term behaves as a perfect fluid with the equation of state parameter $w_K = -1/3$. Using a suitable transformation of variables, the evolution equations are reduced to an autonomous system for both power law and exponential forms of the scalar potential. The critical points are analyzed with center manifold theory and stability has been discussed. Also, critical points at infinity have been studied using the notion of the Poincare sphere. Finally, the cosmological implications of the critical points and cosmological bouncing scenarios are discussed. It is found that the cosmological bounce takes place near the points at infinity when the non-isolated critical points on the equator of the Poincare sphere are saddle or saddle-node in nature.

Submitted 6 July, 2024; originally announced July 2024.

arXiv:2407.02439 [pdf, other]

doi 10.1109/TMM.2022.3176942

Predicting Visual Attention in Graphic Design Documents

Authors: Souradeep Chakraborty, Zijun Wei, Conor Kelton, Seoyoung Ahn, Aruna Balasubramanian, Gregory J. Zelinsky, Dimitris Samaras

Abstract: We present a model for predicting visual attention during the free viewing of graphic design documents. While existing works on this topic have aimed at predicting static saliency of graphic designs, our work is the first attempt to predict both spatial attention and dynamic temporal order in which the document regions are fixated by gaze using a deep learning based model. We propose a two-stage model for predicting dynamic attention on such documents, with webpages being our primary choice of document design for demonstration. In the first stage, we predict the saliency maps for each of the document components (e.g. logos, banners, texts, etc. for webpages) conditioned on the type of document layout. These component saliency maps are then jointly used to predict the overall document saliency. In the second stage, we use these layout-specific component saliency maps as the state representation for an inverse reinforcement learning model of fixation scanpath prediction during document viewing. To test our model, we collected a new dataset consisting of eye movements from 41 people freely viewing 450 webpages (the largest dataset of its kind). Experimental results show that our model outperforms existing models in both saliency and scanpath prediction for webpages, and also generalizes very well to other graphic design documents such as comics, posters, mobile UIs, etc. and natural images.

Submitted 2 July, 2024; originally announced July 2024.

Journal ref: IEEE Transactions on Multimedia 25 (2022): 4478-4493

arXiv:2407.01665 [pdf, other]

The M2-M5 Mohawk

Authors: Iosif Bena, Soumangsu Chakraborty, Dimitrios Toulikas, Nicholas P. Warner

Abstract: We show that the near-brane back-reaction of M2 branes ending on M5 branes has a rich "spike structure" that is determined by partitioning the numbers of M2 branes that are terminating on groups of M5 branes. The near-brane limit of the metric describing these branes has an AdS$_3$ factor, implying the existence of a dual CFT. Each partition of the M2 and M5 charges among spikes gives rise to a different "mohawk" revealing a new layer of brane fractionation. We conjecture that all these mohawks are dual to ground states of near-brane-intersection CFT's. We show that the supergravity solutions describing these mohawks are part of the large families of AdS$_3$ $\times S^3 \times S^3$ solutions described in [arXiv:1312.5477]. We identify precisely which of these families are relevant to brane intersections and show that the AdS$_3$ invariance emerges from the self-similarity of the spikes.

Submitted 1 July, 2024; originally announced July 2024.

Comments: 25 pages, 6 figures

arXiv:2407.00573 [pdf, other]

A Simple Representation of Tree Covering Utilizing Balanced Parentheses and Efficient Implementation of Average-Case Optimal RMQs

Authors: Kou Hamada, Sankardeep Chakraborty, Seungbum Jo, Takuto Koriyama, Kunihiko Sadakane, Srinivasa Rao Satti

Abstract: Tree covering is a technique for decomposing a tree into smaller-sized trees with desirable properties, and has been employed in various succinct data structures. However, significant hurdles stand in the way of a practical implementation of tree covering: a lot of pointers are used to maintain the tree-covering hierarchy and many indices for tree navigational queries consume theoretically negligible yet practically vast space. To tackle these problems, we propose a simple representation of tree covering using a balanced parenthesis representation. The key to the proposal is the observation that every micro tree splits into at most two intervals on the BP representation. Utilizing the representation, we propose several data structures that represent a tree and its tree cover, which consequently allow micro tree compression with arbitrary coding and efficient tree navigational queries. We also applied our data structure to average-case optimal RMQ by Munro et al.~[ESA 2021] and implemented the RMQ data structure. Our RMQ data structures spend less than $2n$ bits and process queries in a practical time on several settings of the performance evaluation, reducing the gap between theoretical space complexity and actual space consumption. We also implement tree navigational operations while using the same amount of space as the RMQ data structures. We believe the representation can be widely utilized for designing practically memory-efficient data structures based on tree covering.

Submitted 29 June, 2024; originally announced July 2024.

Comments: To appear in ESA 2024

arXiv:2406.18700 [pdf, other]

On Fourier analysis of sparse Boolean functions over certain Abelian groups

Authors: Sourav Chakraborty, Swarnalipa Datta, Pranjal Dutta, Arijit Ghosh, Swagato Sanyal

Abstract: Given an Abelian group G, a Boolean-valued function f: G -> {-1,+1} is said to be s-sparse if it has at most s-many non-zero Fourier coefficients over the domain G. In a seminal paper, Gopalan et al. proved "Granularity" for Fourier coefficients of Boolean valued functions over Z_2^n, a result that has found many diverse applications in theoretical computer science and combinatorics. They also studied structural results for Boolean functions over Z_2^n which are approximately Fourier-sparse. In this work, we obtain structural results for approximately Fourier-sparse Boolean valued functions over Abelian groups G of the form G := Z_{p_1}^{n_1} \times ... \times Z_{p_t}^{n_t}, for distinct primes p_i. We also obtain a lower bound of the form 1/(m^{2}s)^ceiling(phi(m)/2) on the absolute value of the smallest non-zero Fourier coefficient of an s-sparse function, where m=p_1 ... p_t, and phi(m)=(p_1-1) ... (p_t-1). We carefully apply probabilistic techniques from Gopalan et al. to obtain our structural results, and use some non-trivial results from algebraic number theory to get the lower bound. We construct a family of at most s-sparse Boolean functions over Z_p^n, where p > 2, for arbitrarily large enough s, where the minimum non-zero Fourier coefficient is 1/omega(n). The "Granularity" result of Gopalan et al. implies that the absolute values of non-zero Fourier coefficients of any s-sparse Boolean valued function over Z_2^n are 1/O(s). So, our result shows that one cannot expect such a lower bound for general Abelian groups.
Using our new structural results on the Fourier coefficients of sparse functions, we design an efficient testing algorithm for Fourier-sparse Boolean functions that requires poly((ms)^phi(m),1/epsilon)-many queries. Further, we prove an Omega(sqrt{s}) lower bound on the query complexity of any adaptive sparsity testing algorithm.

Submitted 26 June, 2024; originally announced June 2024.

arXiv:2406.17390 [pdf, ps, other]

Tame sparse exponential random graphs

Authors: Suman Chakraborty, Remco van der Hofstad, Frank den Hollander

Abstract: In this paper, we obtain a precise estimate of the probability that the sparse binomial random graph contains a large number of vertices in a triangle. The estimate of log of this probability is correct up to second order, and enables us to propose an exponential random graph model based on the number of vertices in a triangle. Specifically, by tuning a single parameter, we can with high probability induce any given fraction of vertices in a triangle. Moreover, in the proposed exponential random graph model we derive the large deviation principle for the number of edges. As a byproduct, we propose a consistent estimator of the tuning parameter.

Submitted 25 June, 2024; originally announced June 2024.

Comments: Follow up of arXiv:2112.06526v2

MSC Class: 05C80; 60F10; 62F12

arXiv:2406.15567 [pdf, other]

SAIL: Self-Improving Efficient Online Alignment of Large Language Models

Authors: Mucong Ding, Souradip Chakraborty, Vibhu Agrawal, Zora Che, Alec Koppel, Mengdi Wang, Amrit Bedi, Furong Huang

Abstract: Reinforcement Learning from Human Feedback (RLHF) is a key method for aligning large language models (LLMs) with human preferences. However, current offline alignment approaches like DPO, IPO, and SLiC rely heavily on fixed preference datasets, which can lead to sub-optimal performance. On the other hand, recent literature has focused on designing online RLHF methods but still lacks a unified conceptual formulation and suffers from distribution shift issues. To address this, we establish that online LLM alignment is underpinned by bilevel optimization. By reducing this formulation to an efficient single-level first-order method (using the reward-policy equivalence), our approach generates new samples and iteratively refines model alignment by exploring responses and regulating preference labels. In doing so, we permit alignment methods to operate in an online and self-improving manner, as well as generalize prior online RLHF methods as special cases. Compared to state-of-the-art iterative RLHF methods, our approach significantly improves alignment performance on open-sourced datasets with minimal computational overhead.

Submitted 21 June, 2024; originally announced June 2024.

Comments: 24 pages, 6 figures, 3 tables

arXiv:2406.15113 [pdf, other]

A Dual Attention-aided DenseNet-121 for Classification of Glaucoma from Fundus Images

Authors: Soham Chakraborty, Ayush Roy, Payel Pramanik, Daria Valenkova, Ram Sarkar

Abstract: Deep learning and computer vision methods are nowadays predominantly used in the field of ophthalmology. In this paper, we present an attention-aided DenseNet-121 for classifying normal and glaucomatous eyes from fundus images. It involves the convolutional block attention module to highlight relevant spatial and channel features extracted by DenseNet-121. The channel recalibration module further enriches the features by utilizing edge information along with the statistical features of the spatial dimension. For the experiments, two standard datasets, namely RIM-ONE and ACRIMA, have been used. Our method has shown results superior to those of state-of-the-art models. An ablation study has also been conducted to show the effectiveness of each of the components. The code of the proposed work is available at: https://github.com/Soham2004GitHub/DADGC.

Submitted 21 June, 2024; originally announced June 2024.

arXiv:2406.12696 [pdf]

doi 10.1021/acsenergylett.4c00693

Ultrasmall CsPbBr3 Blue Emissive Perovskite Quantum Dots using K-alloyed Cs4PbBr6 Nanocrystals as Precursors

Authors: Clara Otero Martinez, Matteo L. Zaffalon, Yurii Ivanov, Nikolaos Livakas, Luca Goldoni, Giorgio Divitini, Sankalpa Bora, Gabriele Saleh, Francesco Meinardi, Andrea Fratelli, Sudip Chakraborty, Lakshminarayana Polavarapu, Sergio Brovelli, Liberato Manna

Abstract: We report a colloidal synthesis of blue emissive, stable cube-shaped CsPbBr3 quantum dots (QDs) in the strong quantum confinement regime via a dissolution-recrystallization starting from pre-synthesized (KxCs1-x)4PbBr6 nanocrystals which are then reacted with PbBr2. This is markedly different from the known case of Cs4PbBr6 nanocrystals that react within seconds with PbBr2 and get transformed into much larger, green emitting CsPbBr3 nanocrystals. Here, instead, the conversion of (KxCs1-x)4PbBr6 nanocrystals to CsPbBr3 QDs occurs in a time span of hours, and tuning of the QDs size is achieved by adjusting the concentration of precursors. The QDs exhibit excitonic features in optical absorption that are tunable in the 420 - 452 nm range, accompanied by blue photoluminescence with quantum yield around 60%. Detailed spectroscopic investigations in both the single and multi-exciton regime reveal the exciton fine structure and the effect of Auger recombination of these CsPbBr3 QDs, confirming theoretical predictions for this system.

Submitted 18 June, 2024; originally announced June 2024.

Journal ref: ACS Energy Letters 2024

arXiv:2406.12091 [pdf, other]

Is poisoning a real threat to LLM alignment? Maybe more so than you think

Authors: Pankayaraj Pathmanathan, Souradip Chakraborty, Xiangyu Liu, Yongyuan Liang, Furong Huang

Abstract: Recent advancements in Reinforcement Learning with Human Feedback (RLHF) have significantly impacted the alignment of Large Language Models (LLMs). The sensitivity of reinforcement learning algorithms such as Proximal Policy Optimization (PPO) has led to a new line of work on Direct Policy Optimization (DPO), which treats RLHF in a supervised learning framework. The increased practical use of these RLHF methods warrants an analysis of their vulnerabilities. In this work, we investigate the vulnerabilities of DPO to poisoning attacks under different scenarios and compare the effectiveness of preference poisoning, a first of its kind. We comprehensively analyze DPO's vulnerabilities under different types of attacks, i.e., backdoor and non-backdoor attacks, and different poisoning methods across a wide array of language models, i.e., LLama 7B, Mistral 7B, and Gemma 7B. We find that, unlike PPO-based methods, which require at least 4\% of the data to be poisoned to elicit harmful behavior in backdoor attacks, DPO can be exploited more simply: we can poison the model with as little as 0.5\% of the data. We further investigate the potential reasons behind the vulnerability and how well this vulnerability translates into backdoor vs non-backdoor attacks.

Submitted 19 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

Journal ref: ICML 2024 Workshop MHFAIA

arXiv:2406.10892 [pdf, other]

DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning

Authors: Utsav Singh, Souradip Chakraborty, Wesley A. Suttle, Brian M. Sadler, Vinay P Namboodiri, Amrit Singh Bedi

Abstract: Learning control policies to perform complex robotics tasks from human preference data presents significant challenges. On the one hand, the complexity of such tasks typically requires learning policies to perform a variety of subtasks, then combining them to achieve the overall goal. At the same time, comprehensive, well-engineered reward functions are typically unavailable in such problems, while limited human preference data often is; making efficient use of such data to guide learning is therefore essential. Methods for learning to perform complex robotics tasks from human preference data must overcome both these challenges simultaneously. In this work, we introduce DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning, an efficient hierarchical approach that leverages direct preference optimization to learn a higher-level policy and reinforcement learning to learn a lower-level policy. DIPPER enjoys improved computational efficiency due to its use of direct preference optimization instead of standard preference-based approaches such as reinforcement learning from human feedback, while it also mitigates the well-known hierarchical reinforcement learning issues of non-stationarity and infeasible subgoal generation due to our use of primitive-informed regularization inspired by a novel bi-level optimization formulation of the hierarchical reinforcement learning problem. To validate our approach, we perform extensive experimental analysis on a variety of challenging robotics tasks, demonstrating that DIPPER outperforms hierarchical and non-hierarchical baselines, while ameliorating the non-stationarity and infeasible subgoal generation issues of hierarchical reinforcement learning.

Submitted 16 June, 2024; originally announced June 2024.

arXiv:2406.10692 [pdf, other]

doi 10.1142/S0219887824502505

Dynamical system analysis of quintessence dark energy model

Authors: Soumya Chakraborty, Sudip Mishra, Subenoy Chakraborty

Abstract: Our work deals with the dynamical system analysis of quintessence dark energy scalar field model with exponential potential. A dynamical system analysis has been applied at the background level. Using suitable transformation of variables, the evolution equations are reduced to an autonomous system for exponential form of the scalar potential. The critical points are analyzed with center manifold theory and stability has been discussed by using Schwarzian derivative. Finally, cosmological implications of the critical points are discussed and it is found that the stability of the late-time attractor changes for quintessence dark energy model.

Submitted 15 June, 2024; originally announced June 2024.

arXiv:2406.09543 [pdf, other]

Rotating black holes experience dynamical tides

Authors: Rajendra Prasad Bhatt, Sumanta Chakraborty, Sukanta Bose

Abstract: We calculate the tidal response of a rotating black hole from the Teukolsky equation in the near-horizon and small-frequency regime. While the static tidal Love number of an arbitrarily rotating black hole is still zero, the dynamical tidal Love number is not; rather, it is proportional to the angular velocity of the black hole -- in the linear order of frequency -- and to the square of the angular velocity of the black hole -- in the zeroth order. Intriguingly, the zero-frequency limit of the dynamical tidal response function is not equal to the static tidal response function. We demonstrate that these results hold true for an extremal rotating black hole as well, with the dynamical tidal Love numbers being non-zero. This shows that Kerr black holes experience dynamical tides.

Submitted 13 June, 2024; originally announced June 2024.

Comments: 24 pages, 1 figure

Report number: LIGO-P240025

arXiv:2406.09043 [pdf, other]

Language Models are Crossword Solvers

Authors: Soumadeep Saha, Sutanoya Chakraborty, Saptarshi Saha, Utpal Garain

Abstract: Crosswords are a form of word puzzle that require a solver to demonstrate a high degree of proficiency in natural language understanding, wordplay, reasoning, and world knowledge, along with adherence to character and length constraints. In this paper we tackle the challenge of solving crosswords with Large Language Models (LLMs). We demonstrate that the current generation of state-of-the-art (SoTA) language models show significant competence at deciphering cryptic crossword clues, and outperform previously reported SoTA results by a factor of 2-3 in relevant benchmarks. We also develop a search algorithm that builds off this performance to tackle the problem of solving full crossword grids with LLMs for the very first time, achieving an accuracy of 93\% on New York Times crossword puzzles. Contrary to previous work in this area, which concluded that LLMs lag human expert performance significantly, our research suggests this gap is a lot narrower.

Submitted 14 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

Comments: Edited to include missing citation

ACM Class: I.2.7

arXiv:2406.05986 [pdf, other]

Neural-g: A Deep Learning Framework for Mixing Density Estimation

Authors: Shijie Wang, Saptarshi Chakraborty, Qian Qin, Ray Bai

Abstract: Mixing (or prior) density estimation is an important problem in machine learning and statistics, especially in empirical Bayes $g$-modeling where accurately estimating the prior is necessary for making good posterior inferences. In this paper, we propose neural-$g$, a new neural network-based estimator for $g$-modeling. Neural-$g$ uses a softmax output layer to ensure that the estimated prior is a valid probability density. Under default hyperparameters, we show that neural-$g$ is very flexible and capable of capturing many unknown densities, including those with flat regions, heavy tails, and/or discontinuities. In contrast, existing methods struggle to capture all of these prior shapes. We provide justification for neural-$g$ by establishing a new universal approximation theorem regarding the capability of neural networks to learn arbitrary probability mass functions. To accelerate convergence of our numerical implementation, we utilize a weighted average gradient descent approach to update the network parameters. Finally, we extend neural-$g$ to multivariate prior density estimation. We illustrate the efficacy of our approach through simulations and analyses of real datasets. A software package to implement neural-$g$ is publicly available at https://github.com/shijiew97/neuralG.

Submitted 9 June, 2024; originally announced June 2024.

Comments: 40 pages, 8 figures, 5 tables

arXiv:2406.05004 [pdf, ps, other]

The Choquet-Deny Property for Groupoids

Authors: Tey Berendschot, Soham Chakraborty, Milan Donvil, Se-Jin Kim, Mario Klisse

Abstract: A countable discrete group is called Choquet-Deny if for any non-degenerate probability measure on the group, the corresponding space of bounded harmonic functions is trivial. Building on the previous work of Jaworski, a complete characterization of Choquet-Deny groups was recently achieved by Frisch, Hartman, Tamuz, and Ferdowsi. In this article, we extend the study of the Choquet-Deny property to the framework of discrete measured groupoids. Our primary result offers a complete characterization of this property in terms of the isotropy groups and the equivalence relation associated with the given groupoid. Additionally, we use the implications derived from our main theorem to classify the Choquet-Deny property of transformation groupoids.

Submitted 25 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

Comments: 37 pages, v2: Subsection 2.4.1 on the Choquet-Deny property and icc quotients was added

MSC Class: Primary: 20L05; 43A05; Secondary: 22A22; 45E10; 60J50; 37A30

arXiv:2406.03661 [pdf, other]

Randomness in atomic disorder and consequent squandering of spin-polarization in a ferromagnetically fragile quaternary Heusler alloy FeRuCrSi

Authors: Shuvankar Gupta, Sudip Chakraborty, Vidha Bhasin, Celine Barreteau, Jean-Claude Crivello, Jean-Marc Greneche, S. N. Jha, D. Bhattacharyya, Eric Alleno, Chandan Mazumdar

Abstract: The Ru$_{2-x}$Fe$_x$CrSi ($0 < x < 1$) system is theoretically predicted to be one of the very few known examples of a robust half-metallic ferromagnet with 100\% spin polarization. Since Cr is considered to be the main contributor to magnetism, the Fe/Ru substitution is not expected to disturb its magnetic properties significantly, and hence all Fe-containing members of the series are predicted to follow the Slater-Pauling rule with a saturation magnetic moment of 2 ${μ_B}$/f.u. However, contrary to the theoretical expectations, some experiments instead show a linear variation of the saturation magnetization and Curie temperature with Fe (\textit{x}) substitution. The equiatomic member FeRuCrSi of this family is also considered a technologically important material, where the band structure calculations suggest the material to be a spin gapless semiconductor. Through our in-depth structural analysis of FeRuCrSi using X-ray diffraction, extended X-ray absorption fine structure and $^{57}$Fe Mössbauer spectrometry, we found a random disorder between Fe and Ru sites, while the magnetic moment in this system is actually contributed by Fe atoms, questioning the very basic foundation of the half-metallic character proposed by all theoretical calculations on the Ru$_{2-x}$Fe$_x$CrSi series. Our Mössbauer result also envisions a rather rare scenario where the main physical properties are intricately correlated to the chemistry of the material in the form of random atomic disorder on a localised scale.

Submitted 5 June, 2024; originally announced June 2024.

arXiv:2406.03656 [pdf, other]

Restructuring disorder: Transformation from the antiferromagnetic order in Fe2VSi to the ferromagnetic state in FeRuVSi by substitution of a non-magnetic element

Authors: Shuvankar Gupta, Sudip Chakraborty, Celine Barreteau, Jean-Claude Crivello, Jean-Marc Greneche, Eric Alleno, Chandan Mazumdar

Abstract: The delicate nature of the half-metallic ferromagnetic (HMF) property in Heusler alloys is often compromised by inherent structural disorder within the systems. Fe2VSi is a prime example, where such disorder prevents the realization of the theoretically proposed HMF state as the anti-site disorder leads to the formation of two anti-parallel magnetic lattices resulting in antiferromagnetic order. In this study, we propose an innovative and simple strategy to prevent this atomic disorder by replacing 50% of the magnetic element Fe by a large, isoelectronic, non-magnetic element, Ru. In this way, one of the magnetic sublattices of the antiferromagnetic lattice ceases to order while ferromagnetic order is restored, an essential criterion for exhibiting HMF properties. Through various experimental measurements and theoretical calculations, we have shown that such partial replacement of Fe by Ru prevents the cross-site substitution of V/Si sites and the system regains its ferromagnetic order. Our theoretical calculations suggest that a perfect structural arrangement in Fe and Ru would have restored the HMF property in FeRuVSi. However, the local atomic disorder of Fe and Ru was found to decrease the spin polarization value. The present work sheds light on the complex interplay between structural disorder and magnetic properties in Heusler alloys and provides insights for future design strategies in the pursuit of robust half-metallic ferromagnets.

Submitted 5 June, 2024; originally announced June 2024.

arXiv:2406.03174 [pdf, other]

The Amaterasu particle: constraining the superheavy dark matter origin of UHECRs

Authors: Prantik Sarmah, Nayan Das, Debasish Borah, Sovan Chakraborty, Poonam Mehta

Abstract: Amaterasu, the second most energetic ($244$ EeV) cosmic ray particle, has been recently detected by the Telescope Array (TA) surface detector. The origin of the TA Amaterasu event is puzzling, as its arrival direction points back to a void in the local Universe, lacking conventional astrophysical ultra-high-energy (UHE) cosmic ray (CR) sources. Hence, we explore whether this TA Amaterasu event could have originated from the decay of superheavy dark matter (SHDM) in the Milky Way. Such an origin also opens up multi-messenger detection channels in both UHE gamma-rays and UHE neutrinos. In the present work, using the TA Amaterasu event and the multi-messenger limits/sensitivities from various UHE telescopes, we place stringent constraints on the lifetime and mass of the SHDM. We find that the non-detection of the corresponding gamma-rays at the Pierre Auger Observatory (PAO) and the TA is in severe tension with the SHDM parameter space required to explain the TA Amaterasu event. Additionally, we extend the multi-messenger analysis to the future UHE gamma-ray and UHE neutrino telescopes such as PAO upgrade, GRAND 200k and IceCube-Gen2. We find that the bounds from the future neutrino telescopes will be able to compete with the present UHECR bounds. However, compared to the existing UHE gamma-ray bounds, the future PAO upgrade and the GRAND 200k gamma-ray detectors will improve the bounds on SHDM lifetime by at least one order of magnitude.

Submitted 5 June, 2024; originally announced June 2024.

Comments: 9 pages, 4 figures

arXiv:2406.02697 [pdf, other]

doi 10.1142/S0218271824500342

On the role of closed timelike curves and confinement structure around Kerr-Newman singularity

Authors: Ayanendu Dutta, Dhritimalya Roy, Subenoy Chakraborty

Abstract: In this study, the particle motion around the naked singularity and black hole of Kerr-Newman spacetime is investigated with special attention to the closed timelike orbits. It is found that both in the naked singularity (NS) and in the black hole (BH), the singularity is concealed by causality violating regions, and the Cauchy surface consistently resides inside the inner horizon in non-extremal black holes. For neutral particles and particles with an identical charge to the source, only particles with positive angular momentum are permitted to traverse the closed timelike curves. Conversely, for particles with the opposite charge to the source, the strong Coulomb attraction draws all particles inside the Cauchy surface, allowing them to be present in the closed timelike curves irrespective of their angular momentum. However, in both the NS and BH (both extremal and non-extremal), test particles are confined at a considerable distance from the singular point such that there always exists an empty region surrounding the singularity which prevents particles from interacting with it. The radius of the empty surface, which depends on the source parameters and the particle characteristics, is investigated with an accurate expression.

Submitted 4 June, 2024; originally announced June 2024.

Comments: 17 pages, 9 figures, accepted in IJMPD

Journal ref: Int. J. Mod. Phys. D (2024)

arXiv:2406.00001 [pdf, other]

PhyPlan: Generalizable and Rapid Physical Task Planning with Physics Informed Skill Networks for Robot Manipulators

Authors: Mudit Chopra, Abhinav Barnawal, Harshil Vagadia, Tamajit Banerjee, Shreshth Tuli, Souvik Chakraborty, Rohan Paul

Abstract: Given the task of positioning a ball-like object to a goal region beyond direct reach, humans can often throw, slide, or rebound objects against the wall to attain the goal. However, enabling robots to reason similarly is non-trivial. Existing methods for physical reasoning are data-hungry and struggle with complexity and uncertainty inherent in the real world. This paper presents PhyPlan, a novel physics-informed planning framework that combines physics-informed neural networks (PINNs) with modified Monte Carlo Tree Search (MCTS) to enable embodied agents to perform dynamic physical tasks. PhyPlan leverages PINNs to simulate and predict outcomes of actions in a fast and accurate manner and uses MCTS for planning. It dynamically determines whether to consult a PINN-based simulator (coarse but fast) or engage directly with the actual environment (fine but slow) to determine optimal policy. Given an unseen task, PhyPlan can infer the sequence of actions and learn the latent parameters, resulting in a generalizable approach that can rapidly learn to perform novel physical tasks. Evaluation with robots in simulated 3D environments demonstrates the ability of our approach to solve 3D-physical reasoning tasks involving the composition of dynamic skills. Quantitatively, PhyPlan excels in several aspects: (i) it achieves lower regret when learning novel tasks compared to the state-of-the-art, (ii) it expedites skill learning and enhances the speed of physical reasoning, (iii) it demonstrates higher data efficiency compared to a physics un-informed approach.

Submitted 22 April, 2024; originally announced June 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:2402.15767

arXiv:2405.20699 [pdf, ps, other]

Equivariant Parabolic connections and stack of roots

Authors: Sujoy Chakraborty, Arjun Paul

Abstract: Let $X$ be a smooth complex projective variety equipped with an action of a linear algebraic group $G$ over $\mathbb{C}$. Let $D$ be a reduced effective divisor on $X$ that is invariant under the $G$--action on $X$. Let $s_D$ be the canonical section of $\mathcal{O}_X(D)$ vanishing along $D$. Given a positive integer $r$, consider the stack $\mathfrak{X} := \mathfrak{X}_{(\mathcal{O}_X(D),\, s_D,\, r)}$ of $r$-th roots of $(\mathcal{O}_X(D), s_D)$ together with the natural morphism $π: \mathfrak{X} \to X$. Under the assumption that $G$ has no non-trivial characters, we show that the $G$--action on $X$ naturally lifts to a $G$--action on $\mathfrak{X}$ such that $π$ becomes $G$--equivariant, and the tautological invertible sheaf $\mathscr{M}$ on $\mathfrak{X}$ admits a linearization of this $G$--action. Finally, we define the notions of $G$--equivariant logarithmic connections on $\mathfrak{X}$ and $G$--equivariant parabolic connections on $X$ with rational parabolic weights along $D$, and establish an equivalence between the category of $G$--equivariant logarithmic connections on $\mathfrak{X}$ and the category of $G$--equivariant parabolic connections on $X$ with rational parabolic weights along $D$.

Submitted 31 May, 2024; originally announced May 2024.

MSC Class: 14D23; 14H60; 53B15; 53C05; 14A21

arXiv:2405.20495 [pdf, other]

Transfer Q Star: Principled Decoding for LLM Alignment

Authors: Souradip Chakraborty, Soumya Suvra Ghosal, Ming Yin, Dinesh Manocha, Mengdi Wang, Amrit Singh Bedi, Furong Huang

Abstract: Aligning foundation models is essential for their safe and trustworthy deployment. However, traditional fine-tuning methods are computationally intensive and require updating billions of model parameters. A promising alternative, alignment via decoding, adjusts the response distribution directly without model updates to maximize a target reward $r$, thus providing a lightweight and adaptable framework for alignment. However, principled decoding methods rely on oracle access to an optimal Q-function ($Q^*$), which is often unavailable in practice. Hence, prior SoTA methods either approximate this $Q^*$ using $Q^{π_{\texttt{sft}}}$ (derived from the reference $\texttt{SFT}$ model) or rely on short-term rewards, resulting in sub-optimal decoding performance. In this work, we propose Transfer $Q^*$, which implicitly estimates the optimal value function for a target reward $r$ through a baseline model $ρ_{\texttt{BL}}$ aligned with a baseline reward $r_{\texttt{BL}}$ (which can be different from the target reward $r$). Theoretical analyses of Transfer $Q^*$ provide a rigorous characterization of its optimality, deriving an upper bound on the sub-optimality gap and identifying a hyperparameter to control the deviation from the pre-trained reference $\texttt{SFT}$ model based on user needs. Our approach significantly reduces the sub-optimality gap observed in prior SoTA methods and demonstrates superior empirical performance across key metrics such as coherence, diversity, and quality in extensive tests on several synthetic and real datasets.

Submitted 30 May, 2024; originally announced May 2024.

arXiv:2405.19908 [pdf, other]

Long-Range Correlations in Elastic Moduli and Local Stresses at the Unjamming Transition

Authors: Surajit Chakraborty, Kabir Ramola

Abstract: We explore the behavior of spatially heterogeneous elastic moduli as well as the correlations between local moduli in model solids with short-range repulsive potentials. We show through numerical simulations that local elastic moduli exhibit long-range correlations, similar to correlations in the local stresses. Specifically, the correlations in local shear moduli exhibit anisotropic behavior at large lengthscales characterized by pinch-point singularities in Fourier space, displaying a structural pattern akin to shear stress correlations. Focussing on two-dimensional jammed solids approaching the unjamming transition, we show that stress correlations exhibit universal properties, characterized by a quadratic $p^2$ dependence of the correlations as the pressure $p$ approaches zero, independent of the details of the model. In contrast, the modulus correlations exhibit a power-law dependence with different exponents depending on the specific interaction potential. Furthermore, we illustrate that while affine responses lack long-range correlations, the total modulus, which encompasses non-affine behavior, exhibits long-range correlations.

Submitted 30 May, 2024; originally announced May 2024.

arXiv:2405.19274 [pdf, other]

A Hanani-Tutte Theorem for Cycles

Authors: Sutanoya Chakraborty, Arijit Ghosh

Abstract: Given a drawing $D$ of a graph $G$, we define the crossing number between any two cycles $C_{1}$ and $C_{2}$ in $D$ to be the number of crossings that involve at least one edge from each of $C_1$ and $C_2$ except the crossings between edges that are common to both cycles. We show that if the crossing number between every two cycles in $G$ is even in a drawing of $G$ on the plane, then there is a planar drawing of $G$. This result can be extended to arbitrary surfaces. We also establish an equivalence between our result and a fundamental result due to Cairns-Nikolayevsky and Pelsmajer-Schaefer-Štefankovič, about drawing graphs on surfaces, and derive the Loebl-Masbaum theorem from it.

Submitted 12 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

Comments: Included equivalence with an established result, and derived a previous theorem from the result

arXiv:2405.16948 [pdf, ps, other]

Harmonic Tutte polynomials of matroids III

Authors: Thomas Britz, Himadri Shekhar Chakraborty, Tsuyoshi Miezaki

Abstract: In this paper, we present the harmonic generalizations of well-known polynomials of codes over finite fields, namely the higher weight enumerators and the extended weight enumerators, and we derive the correspondences between these weight enumerators. Moreover, we present the harmonic generalization of Greene's Theorem for the higher (resp. extended) weight enumerators. As an application of this Greene's-type theorem, we provide the MacWilliams-type identity for harmonic higher weight enumerators of codes over finite fields. Finally, we use this new identity to give a new proof of the Assmus-Mattson Theorem for subcode supports of linear codes over finite fields using harmonic higher weight enumerators.

Submitted 27 May, 2024; originally announced May 2024.

Comments: 21 pages

arXiv:2405.16538 [pdf, other]

Gamified AI Approach for Early Detection of Dementia

Authors: Paramita Kundu Maji, Soubhik Acharya, Priti Paul, Sanjay Chakraborty, Saikat Basu

Abstract: This paper aims to develop a new deep learning-inspired gaming approach for early detection of dementia. This research integrates a robust convolutional neural network (CNN)-based model for early dementia detection using health metrics data as well as facial image data through a cognitive assessment-based gaming application. We collected a health metrics dataset of 1000 samples from Apollo Diagnostic Center Kolkata, labeled as either demented or non-demented, for training MOD-1D-CNN for game level 1; a second dataset of 1800 facial images, also labeled as either demented or non-demented, was collected by our research team for training the MOD-2D-CNN model in game level 2. In our work, the loss for the proposed MOD-1D-CNN model is 0.2692 and the highest accuracy is 70.50% for identifying the dementia traits using real-life health metrics data. Similarly, the proposed MOD-2D-CNN model loss is 0.1755 and the highest accuracy obtained is 95.72% for recognizing the dementia status using real-life face-based image data. Therefore, a rule-based weightage method is applied to combine both the proposed methods to achieve the final decision. The MOD-1D-CNN and MOD-2D-CNN models are more lightweight and computationally efficient alternatives because they have a significantly lower number of parameters when compared to the other state-of-the-art models. We have compared their accuracies and parameters with the other state-of-the-art deep learning models.

Submitted 26 May, 2024; originally announced May 2024.

Comments: 50 Pages, 29 Figures

arXiv:2405.15049 [pdf, other]

Breaking Barriers: Investigating Gender Dynamics in Introductory Physics Lab Classes

Authors: Bilas Paul, Shantanu Chakraborty, Ganga Sharma

Abstract: The persistent underrepresentation of women and other gender minorities in physical science fields has been an ongoing concern. This study investigates gender dynamics in introductory physics laboratory courses, specifically exploring whether students of different gender identities exhibit equal inclination and confidence in conducting lab experiments, and whether they face barriers that impact their participation. The study was conducted across three institutions, involving non-physics students enrolled in algebra-based and calculus-based physics courses. Our findings reveal no significant differences in participation levels across genders in various lab activities. However, a subtle yet significant trend was observed: non-male students tend to express greater preference and comfort for note-taking, calculations, and graphing tasks than their male counterparts, who gravitated more towards hands-on equipment handling. Although no overt barriers deterring participation based solely on gender were identified, some students reported experiencing or witnessing instances where gender dynamics hindered full engagement, such as assumptions about competence or difficulty asserting their voices in male-dominated groups. These findings contribute insights into potential gender-based inclinations and experiences within laboratory environments. The results underscore the importance of fostering an inclusive climate that encourages equitable opportunities and engagement from all gender identities in scientific exploration and learning.

Submitted 23 May, 2024; originally announced May 2024.

Comments: 5 pages, 2 figures, talk presented at GEORGIA ACADEMY OF SCIENCE 101st ANNUAL MEETING

arXiv:2405.15027 [pdf, ps, other]

1991T-like Supernovae

Authors: M. M. Phillips, C. Ashall, Peter J. Brown, L. Galbany, M. A. Tucker, Christopher R. Burns, Carlos Contreras, P. Hoeflich, E. Y. Hsiao, S. Kumar, Nidia Morrell, Syed A. Uddin, E. Baron, Wendy L. Freedman, Kevin Krisciunas, S. E. Persson, Anthony L. Piro, B. J. Shappee, Maximilian Stritzinger, Nicholas B. Suntzeff, Sudeshna Chakraborty, R. P. Kirshner, J. Lu, G. H. Marion, Abigail Polin, et al. (1 additional author not shown)

Abstract: Understanding the nature of the luminous 1991T-like supernovae is of great importance to supernova cosmology as they are likely to have been more common in the early universe. In this paper we explore the observational properties of 1991T-like supernovae to study their relationship to other luminous, slow-declining Type Ia supernovae (SNe Ia). From the spectroscopic and photometric criteria defined in Phillips et al. (1992), we identify 17 1991T-like supernovae from the literature. Combining these objects with ten 1991T-like supernovae from the Carnegie Supernova Project-II, the spectra, light curves, and colors of these events, along with their host galaxy properties, are examined in detail. We conclude that 1991T-like supernovae are closely related in essentially all of their UV, optical, and near-infrared properties -- as well as their host galaxy parameters -- to the slow-declining subset of Branch core-normal supernovae and to the intermediate 1999aa-like events, forming a continuum of luminous SNe Ia. The overriding difference between these three subgroups appears to be the extent to which $^{56}$Ni mixes into the ejecta, producing the pre-maximum spectra dominated by Fe III absorption, the broader UV light curves, and the higher luminosities that characterize the 1991T-like events. Nevertheless, the association of 1991T-like SNe with the rare Type Ia CSM supernovae would seem to run counter to this hypothesis, in which case 1991T-like events may form a separate subclass of SNe Ia, possibly arising from single-degenerate progenitor systems.

Submitted 23 May, 2024; originally announced May 2024.

Comments: Accepted for publication in ApJS

arXiv:2405.14902 [pdf, other]

Global urban activity changes from COVID-19 physical distancing restrictions

Authors: Srija Chakraborty, Eleanor Stokes, Olivia Alexander

Abstract: During the COVID-19 pandemic, changes in human activity became widespread through official policies and organically in response to the virus's transmission, which in turn impacted the environment and the economy. The pandemic has been described as a natural experiment that tested how social and economic disruptions impacted different components of the global Earth System. To move this beyond hypotheses, locally-resolved, globally-available measures of how, where, and when human activity changed are critically needed. Here we use satellite-derived nighttime lights and machine learning anomaly detectors to quantify and map daily changes in human activity that are atypical for each urban area globally, for two years after the onset of the pandemic. Metrics characterizing changes in lights from the pre-COVID baseline in human settlements, along with quality assurance measures, are reported. This dataset, TRacking Anomalous COVID-19 induced changEs in NTL (TRACE-NTL), is the first to resolve COVID-19 disruptions for all metropolitan regions globally, daily. It is suitable to support a variety of post-pandemic studies that assess how changes in human activity impact environmental systems.