-
Approximate Degree Composition for Recursive Functions
Authors:
Sourav Chakraborty,
Chandrima Kayal,
Rajat Mittal,
Manaswi Paraashar,
Nitin Saurabh
Abstract:
Determining the approximate degree composition for Boolean functions remains a significant unsolved problem in Boolean function complexity. In recent decades, researchers have concentrated on proving that approximate degree composes for special types of inner and outer functions. An important and extensively studied class of functions are the recursive functions, i.e.~functions obtained by composi…
▽ More
Determining the approximate degree composition for Boolean functions remains a significant unsolved problem in Boolean function complexity. In recent decades, researchers have concentrated on proving that approximate degree composes for special types of inner and outer functions. An important and extensively studied class of functions are the recursive functions, i.e.~functions obtained by composing a base function with itself a number of times. Let $h^d$ denote the standard $d$-fold composition of the base function $h$.
The main result of this work is to show that the approximate degree composes if either of the following conditions holds:
\begin{itemize}
\item The outer function $f:\{0,1\}^n\to \{0,1\}$ is a recursive function of the form $h^d$, with $h$ being any base function and $d= Ω(\log\log n)$.
\item The inner function is a recursive function of the form $h^d$, with $h$ being any constant arity base function (other than AND and OR) and $d= Ω(\log\log n)$, where $n$ is the arity of the outer function.
\end{itemize}
In terms of proof techniques, we first observe that the lower bound for composition can be obtained by introducing majority in between the inner and the outer functions. We then show that majority can be \emph{efficiently eliminated} if the inner or outer function is a recursive function.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
A description of classical and quantum cosmology for a single scalar field torsion gravity
Authors:
Dipankar Laya,
Roshni Bhaumik,
Sourav Dutta,
Subenoy Chakraborty
Abstract:
In the background of homogeneous and isotropic flat FLRW space-time, both classical and quantum cosmology has been studied for teleparallel dark energy (DE) model. Using Noether symmetry analysis, not only the symmetry vector but also the coupling function in the Lagrangian and the potential of the scalar field has been determined. Also symmetry analysis identifies a cyclic variable in the Lagrang…
▽ More
In the background of homogeneous and isotropic flat FLRW space-time, both classical and quantum cosmology has been studied for teleparallel dark energy (DE) model. Using Noether symmetry analysis, not only the symmetry vector but also the coupling function in the Lagrangian and the potential of the scalar field has been determined. Also symmetry analysis identifies a cyclic variable in the Lagrangian along the symmetry vector and as a result the Lagrangian simplifies to a great extend so that classical solution is obtained. Subsequently, in quantum cosmology Wheeler-DeWitt(WD) equation has been constructed and the quantum version of the conserved momenta corresponding to Noether symmetry identifies the periodic part of the wave function of the universe and as a result the Wheeler-DeWitt equation becomes solvable. Finally, quantum description shows finite non-zero probability at the classical big-bang singularity.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Classical and Quantum Cosmology in Einstein-aether Scalar-tensor gravity: Noether Symmetry Analysis
Authors:
Dipanakr Laya,
Roshni Bhaumik,
Sourav Dutta,
Subenoy Chakraborty
Abstract:
The present work deals with Einstein-aether Scalar tensor gravity in the background of homogeneous and isotropic flat FLRW space-time model. The Noether symmetry vector identifies a transformation in the augmented space so that the field equations become solvable. The cosmological solutions are analyzed from the observational point of view. Finally, for quantum cosmology, the Wheeler-DeWitt (WD) h…
▽ More
The present work deals with Einstein-aether Scalar tensor gravity in the background of homogeneous and isotropic flat FLRW space-time model. The Noether symmetry vector identifies a transformation in the augmented space so that the field equations become solvable. The cosmological solutions are analyzed from the observational point of view. Finally, for quantum cosmology, the Wheeler-DeWitt (WD) has been formulated and solutions have been determined by identifying the periodic nature of the wave function using conserved (Noether) charge.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Layer Resolved Magnetotransport Properties in Antiferromagnetic/Paramagnetic Superlattices
Authors:
Sandip Halder,
Sourav Chakraborty,
Kalpataru Pradhan
Abstract:
We investigate the layer resolved magnetotransport properties of the antiferromagnetic/paramagnetic superlattices based on one band half-filled Hubbard model in three dimensions. In our set up the correlated layers (with on-site repulsion strength $U \ne$ 0) are intercalated between the uncorrelated (U = 0) layers. Our calculations based on the semi-classical Monte-Carlo technique show that the ma…
▽ More
We investigate the layer resolved magnetotransport properties of the antiferromagnetic/paramagnetic superlattices based on one band half-filled Hubbard model in three dimensions. In our set up the correlated layers (with on-site repulsion strength $U \ne$ 0) are intercalated between the uncorrelated (U = 0) layers. Our calculations based on the semi-classical Monte-Carlo technique show that the magnetic moments are induced in the uncorrelated layers at low temperatures due to kinetic hopping of the carriers across the interface. The average induced magnetic moment in the uncorrelated layer varies nonmonotonically with the $U$ values of the correlated layer. Interestingly, the induced magnetic moments are antiferromagnetically arranged in uncorrelated layers and mediates the antiferromagnetic ordering between correlated layers. As a result the whole SL system turns out to be antiferromagnetic insulating at low temperatures. For $U \sim$ bandwidth the local moments in the correlated planes increases as a function of the distance from the interface. Expectedly our in-plane resistivity calculations show that the metal insulator transition temperature of the central plane is larger than the edge planes in the correlated layers. On the other hand, although the induced moments in uncorrelated planes decreases considerably as move from edge planes to center planes the metal insulator transition temperature remains more or less same for all planes. The induced moments in uncorrelated layers gradually dissipates with increasing the thickness of uncorrelated layer and as a result the long range antiferromagnetic ordering vanishes in the superlattices similar to the experiments.
△ Less
Submitted 10 July, 2024;
originally announced July 2024.
-
Quantum Cosmology in Coupled Brans-Dicke Gravity: A Noether Symmetry Analysis
Authors:
Dipankar Laya,
Sourav Dutta,
Subenoy Chakraborty
Abstract:
The present work deals with a multi-field cosmological model in a spatially flat FLRW space-time geometry. The usual Brans-Dicke(BD) field and another scalar field are minimally coupled to gravity while they interact with each other through the Kinetic terms. {The main aim of the present work is to examine whether the model is compatible with cosmic observations. So cosmological solutions are obta…
▽ More
The present work deals with a multi-field cosmological model in a spatially flat FLRW space-time geometry. The usual Brans-Dicke(BD) field and another scalar field are minimally coupled to gravity while they interact with each other through the Kinetic terms. {The main aim of the present work is to examine whether the model is compatible with cosmic observations. So cosmological solutions are obtained using symmetry analysis only.} By imposing Noether Symmetry to the Lagrangian of the system the potential of the scalar field as well as the coupling function has been determined. The classical solutions are determined after simplifying the Lagrangian using cyclic variables. Finally, Wheeler-DeWitt(WD) equation in quantum cosmology has been formulated and conserved momenta corresponding to Noether symmetry shows the periodic part of the wave function and it helps to have the complete integral for the wave function.
△ Less
Submitted 9 July, 2024;
originally announced July 2024.
-
MTL-Split: Multi-Task Learning for Edge Devices using Split Computing
Authors:
Luigi Capogrosso,
Enrico Fraccaroli,
Samarjit Chakraborty,
Franco Fummi,
Marco Cristani
Abstract:
Split Computing (SC), where a Deep Neural Network (DNN) is intelligently split with a part of it deployed on an edge device and the rest on a remote server is emerging as a promising approach. It allows the power of DNNs to be leveraged for latency-sensitive applications that do not allow the entire DNN to be deployed remotely, while not having sufficient computation bandwidth available locally. I…
▽ More
Split Computing (SC), where a Deep Neural Network (DNN) is intelligently split with a part of it deployed on an edge device and the rest on a remote server is emerging as a promising approach. It allows the power of DNNs to be leveraged for latency-sensitive applications that do not allow the entire DNN to be deployed remotely, while not having sufficient computation bandwidth available locally. In many such embedded systems scenarios, such as those in the automotive domain, computational resource constraints also necessitate Multi-Task Learning (MTL), where the same DNN is used for multiple inference tasks instead of having dedicated DNNs for each task, which would need more computing bandwidth. However, how to partition such a multi-tasking DNN to be deployed within a SC framework has not been sufficiently studied. This paper studies this problem, and MTL-Split, our novel proposed architecture, shows encouraging results on both synthetic and real-world data. The source code is available at https://github.com/intelligolabs/MTL-Split.
△ Less
Submitted 8 July, 2024;
originally announced July 2024.
-
AstroSat Observations of the Dipping Low Mass X-ray Binary XB 1254-690
Authors:
Nilam R. Navale,
Devraj Pawar,
A. R. Rao,
Ranjeev Misra,
Sudip Chakraborty,
Sudip Bhattacharyya,
Vaishali A. Bambole
Abstract:
XB 1254-690 is a neutron star low-mass X-ray binary with an orbital period of 3.88 hrs, and it exhibits energy-dependent intensity dips, thermonuclear bursts, and flares. We present the results of an analysis of a long observation of this source using the AstroSat satellite. The X-ray light curve gradually changed from a high-intensity flaring state to a low-intensity one with a few dips. The hard…
▽ More
XB 1254-690 is a neutron star low-mass X-ray binary with an orbital period of 3.88 hrs, and it exhibits energy-dependent intensity dips, thermonuclear bursts, and flares. We present the results of an analysis of a long observation of this source using the AstroSat satellite. The X-ray light curve gradually changed from a high-intensity flaring state to a low-intensity one with a few dips. The hardness intensity diagram showed that the source is in a high-intensity banana state with a gradually changing flux. Based on this, we divide the observation into four flux levels for a flux-resolved spectral study. The X-ray spectra can be explained by a model consisting of absorption, thermal emission from the disc and non-thermal emission from the corona. From our studies, we detect a correlation between the temperature of the thermal component and the flux and we examine the implications of our results for the accretion disc geometry of this source.
△ Less
Submitted 7 July, 2024;
originally announced July 2024.
-
A dynamical system analysis of bouncing cosmology with spatial curvature
Authors:
Soumya Chakraborty,
Sudip Mishra,
Subenoy Chakraborty
Abstract:
The present work deals with a FLRW cosmological model with spatial curvature and minimally coupled scalar field as the matter content. The curvature term behaves as a perfect fluid with the equation of state parameter w_K = -1/3 Using suitable transformation of variables, the evolution equations are reduced to an autonomous system for both power law and exponential form of the scalar potential. Th…
▽ More
The present work deals with a FLRW cosmological model with spatial curvature and minimally coupled scalar field as the matter content. The curvature term behaves as a perfect fluid with the equation of state parameter w_K = -1/3 Using suitable transformation of variables, the evolution equations are reduced to an autonomous system for both power law and exponential form of the scalar potential. The critical points are analyzed with center manifold theory and stability has been discussed. Also, critical points at infinity have been studied using the notion of Poincare sphere. Finally, the cosmological implications of the critical points and cosmological bouncing scenarios are discussed. It is found that the cosmological bounce takes place near the points at infinity when the non-isolated critical points on the equator of the Poincare sphere are saddle or saddle-node in nature.
△ Less
Submitted 6 July, 2024;
originally announced July 2024.
-
Predicting Visual Attention in Graphic Design Documents
Authors:
Souradeep Chakraborty,
Zijun Wei,
Conor Kelton,
Seoyoung Ahn,
Aruna Balasubramanian,
Gregory J. Zelinsky,
Dimitris Samaras
Abstract:
We present a model for predicting visual attention during the free viewing of graphic design documents. While existing works on this topic have aimed at predicting static saliency of graphic designs, our work is the first attempt to predict both spatial attention and dynamic temporal order in which the document regions are fixated by gaze using a deep learning based model. We propose a two-stage m…
▽ More
We present a model for predicting visual attention during the free viewing of graphic design documents. While existing works on this topic have aimed at predicting static saliency of graphic designs, our work is the first attempt to predict both spatial attention and dynamic temporal order in which the document regions are fixated by gaze using a deep learning based model. We propose a two-stage model for predicting dynamic attention on such documents, with webpages being our primary choice of document design for demonstration. In the first stage, we predict the saliency maps for each of the document components (e.g. logos, banners, texts, etc. for webpages) conditioned on the type of document layout. These component saliency maps are then jointly used to predict the overall document saliency. In the second stage, we use these layout-specific component saliency maps as the state representation for an inverse reinforcement learning model of fixation scanpath prediction during document viewing. To test our model, we collected a new dataset consisting of eye movements from 41 people freely viewing 450 webpages (the largest dataset of its kind). Experimental results show that our model outperforms existing models in both saliency and scanpath prediction for webpages, and also generalizes very well to other graphic design documents such as comics, posters, mobile UIs, etc. and natural images.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
The M2-M5 Mohawk
Authors:
Iosif Bena,
Soumangsu Chakraborty,
Dimitrios Toulikas,
Nicholas P. Warner
Abstract:
We show that the near-brane back-reaction of M2 branes ending on M5 branes has a rich "spike structure" that is determined by partitioning the numbers of M2 branes that are terminating on groups of M5 branes. The near-brane limit of the metric describing these branes has an AdS$_3$ factor, implying the existence of a dual CFT. Each partition of the M2 and M5 charges among spikes gives rise to a di…
▽ More
We show that the near-brane back-reaction of M2 branes ending on M5 branes has a rich "spike structure" that is determined by partitioning the numbers of M2 branes that are terminating on groups of M5 branes. The near-brane limit of the metric describing these branes has an AdS$_3$ factor, implying the existence of a dual CFT. Each partition of the M2 and M5 charges among spikes gives rise to a different "mohawk" revealing a new layer of brane fractionation. We conjecture that all these mohawks are dual to ground states of near-brane-intersection CFT's. We show that the supergravity solutions describing these mohawks are part of the large families of AdS$_3$ $\times S^3 \times S^3$ solutions described in [arXiv:1312.5477]. We identify precisely which of these families are relevant to brane intersections and show that the AdS$_3$ invariance emerges from the self-similarity of the spikes.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
A Simple Representation of Tree Covering Utilizing Balanced Parentheses and Efficient Implementation of Average-Case Optimal RMQs
Authors:
Kou Hamada,
Sankardeep Chakraborty,
Seungbum Jo,
Takuto Koriyama,
Kunihiko Sadakane,
Srinivasa Rao Satti
Abstract:
Tree covering is a technique for decomposing a tree into smaller-sized trees with desirable properties, and has been employed in various succinct data structures. However, significant hurdles stand in the way of a practical implementation of tree covering: a lot of pointers are used to maintain the tree-covering hierarchy and many indices for tree navigational queries consume theoretically negligi…
▽ More
Tree covering is a technique for decomposing a tree into smaller-sized trees with desirable properties, and has been employed in various succinct data structures. However, significant hurdles stand in the way of a practical implementation of tree covering: a lot of pointers are used to maintain the tree-covering hierarchy and many indices for tree navigational queries consume theoretically negligible yet practically vast space. To tackle these problems, we propose a simple representation of tree covering using a balanced parenthesis representation. The key to the proposal is the observation that every micro tree splits into at most two intervals on the BP representation. Utilizing the representation, we propose several data structures that represent a tree and its tree cover, which consequently allow micro tree compression with arbitrary coding and efficient tree navigational queries. We also applied our data structure to average-case optimal RMQ by Munro et al.~[ESA 2021] and implemented the RMQ data structure. Our RMQ data structures spend less than $2n$ bits and process queries in a practical time on several settings of the performance evaluation, reducing the gap between theoretical space complexity and actual space consumption. We also implement tree navigational operations while using the same amount of space as the RMQ data structures. We believe the representation can be widely utilized for designing practically memory-efficient data structures based on tree covering.
△ Less
Submitted 29 June, 2024;
originally announced July 2024.
-
On Fourier analysis of sparse Boolean functions over certain Abelian groups
Authors:
Sourav Chakraborty,
Swarnalipa Datta,
Pranjal Dutta,
Arijit Ghosh,
Swagato Sanyal
Abstract:
Given an Abelian group G, a Boolean-valued function f: G -> {-1,+1}, is said to be s-sparse, if it has at most s-many non-zero Fourier coefficients over the domain G. In a seminal paper, Gopalan et al. proved "Granularity" for Fourier coefficients of Boolean valued functions over Z_2^n, that have found many diverse applications in theoretical computer science and combinatorics. They also studied s…
▽ More
Given an Abelian group G, a Boolean-valued function f: G -> {-1,+1}, is said to be s-sparse, if it has at most s-many non-zero Fourier coefficients over the domain G. In a seminal paper, Gopalan et al. proved "Granularity" for Fourier coefficients of Boolean valued functions over Z_2^n, that have found many diverse applications in theoretical computer science and combinatorics. They also studied structural results for Boolean functions over Z_2^n which are approximately Fourier-sparse. In this work, we obtain structural results for approximately Fourier-sparse Boolean valued functions over Abelian groups G of the form,G:= Z_{p_1}^{n_1} \times ... \times Z_{p_t}^{n_t}, for distinct primes p_i. We also obtain a lower bound of the form 1/(m^{2}s)^ceiling(phi(m)/2), on the absolute value of the smallest non-zero Fourier coefficient of an s-sparse function, where m=p_1 ... p_t, and phi(m)=(p_1-1) ... (p_t-1). We carefully apply probabilistic techniques from Gopalan et al., to obtain our structural results, and use some non-trivial results from algebraic number theory to get the lower bound.
We construct a family of at most s-sparse Boolean functions over Z_p^n, where p > 2, for arbitrarily large enough s, where the minimum non-zero Fourier coefficient is 1/omega(n). The "Granularity" result of Gopalan et al. implies that the absolute values of non-zero Fourier coefficients of any s-sparse Boolean valued function over Z_2^n are 1/O(s). So, our result shows that one cannot expect such a lower bound for general Abelian groups.
Using our new structural results on the Fourier coefficients of sparse functions, we design an efficient testing algorithm for Fourier-sparse Boolean functions, thata requires poly((ms)^phi(m),1/epsilon)-many queries. Further, we prove an Omega(sqrt{s}) lower bound on the query complexity of any adaptive sparsity testing algorithm.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Tame sparse exponential random graphs
Authors:
Suman Chakraborty,
Remco van der Hofstad,
Frank den Hollander
Abstract:
In this paper, we obtain a precise estimate of the probability that the sparse binomial random graph contains a large number of vertices in a triangle. The estimate of log of this probability is correct up to second order, and enables us to propose an exponential random graph model based on the number of vertices in a triangle. Specifically, by tuning a single parameter, we can with high probabili…
▽ More
In this paper, we obtain a precise estimate of the probability that the sparse binomial random graph contains a large number of vertices in a triangle. The estimate of log of this probability is correct up to second order, and enables us to propose an exponential random graph model based on the number of vertices in a triangle. Specifically, by tuning a single parameter, we can with high probability induce any given fraction of vertices in a triangle. Moreover, in the proposed exponential random graph model we derive the large deviation principle for the number of edges. As a byproduct, we propose a consistent estimator of the tuning parameter.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
SAIL: Self-Improving Efficient Online Alignment of Large Language Models
Authors:
Mucong Ding,
Souradip Chakraborty,
Vibhu Agrawal,
Zora Che,
Alec Koppel,
Mengdi Wang,
Amrit Bedi,
Furong Huang
Abstract:
Reinforcement Learning from Human Feedback (RLHF) is a key method for aligning large language models (LLMs) with human preferences. However, current offline alignment approaches like DPO, IPO, and SLiC rely heavily on fixed preference datasets, which can lead to sub-optimal performance. On the other hand, recent literature has focused on designing online RLHF methods but still lacks a unified conc…
▽ More
Reinforcement Learning from Human Feedback (RLHF) is a key method for aligning large language models (LLMs) with human preferences. However, current offline alignment approaches like DPO, IPO, and SLiC rely heavily on fixed preference datasets, which can lead to sub-optimal performance. On the other hand, recent literature has focused on designing online RLHF methods but still lacks a unified conceptual formulation and suffers from distribution shift issues. To address this, we establish that online LLM alignment is underpinned by bilevel optimization. By reducing this formulation to an efficient single-level first-order method (using the reward-policy equivalence), our approach generates new samples and iteratively refines model alignment by exploring responses and regulating preference labels. In doing so, we permit alignment methods to operate in an online and self-improving manner, as well as generalize prior online RLHF methods as special cases. Compared to state-of-the-art iterative RLHF methods, our approach significantly improves alignment performance on open-sourced datasets with minimal computational overhead.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
A Dual Attention-aided DenseNet-121 for Classification of Glaucoma from Fundus Images
Authors:
Soham Chakraborty,
Ayush Roy,
Payel Pramanik,
Daria Valenkova,
Ram Sarkar
Abstract:
Deep learning and computer vision methods are nowadays predominantly used in the field of ophthalmology. In this paper, we present an attention-aided DenseNet-121 for classifying normal and glaucomatous eyes from fundus images. It involves the convolutional block attention module to highlight relevant spatial and channel features extracted by DenseNet-121. The channel recalibration module further…
▽ More
Deep learning and computer vision methods are nowadays predominantly used in the field of ophthalmology. In this paper, we present an attention-aided DenseNet-121 for classifying normal and glaucomatous eyes from fundus images. It involves the convolutional block attention module to highlight relevant spatial and channel features extracted by DenseNet-121. The channel recalibration module further enriches the features by utilizing edge information along with the statistical features of the spatial dimension. For the experiments, two standard datasets, namely RIM-ONE and ACRIMA, have been used. Our method has shown superior results than state-of-the-art models. An ablation study has also been conducted to show the effectiveness of each of the components. The code of the proposed work is available at: https://github.com/Soham2004GitHub/DADGC.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
Ultrasmall CsPbBr3 Blue Emissive Perovskite Quantum Dots using K-alloyed Cs4PbBr6 Nanocrystals as Precursors
Authors:
Clara Otero Martinez,
Matteo L. Zaffalon,
Yurii Ivanov,
Nikolaos Livakas,
Luca Goldoni,
Giorgio Divitini,
Sankalpa Bora,
Gabriele Saleh,
Francesco Meinardi,
Andrea Fratelli,
Sudip Chakraborty,
Lakshminarayana Polavarapu,
Sergio Brovelli,
Liberato Manna
Abstract:
We report a colloidal synthesis of blue emissive, stable cube-shaped CsPbBr3 quantum dots (QDs) in the strong quantum confinement regime via a dissolution-recrystallization starting from pre-synthesized (KxCs1-x)4PbBr6 nanocrystals which are then reacted with PbBr2. This is markedly different from the known case of Cs4PbBr6 nanocrystals that react within seconds with PbBr2 and get transformed into…
▽ More
We report a colloidal synthesis of blue emissive, stable cube-shaped CsPbBr3 quantum dots (QDs) in the strong quantum confinement regime via a dissolution-recrystallization starting from pre-synthesized (KxCs1-x)4PbBr6 nanocrystals which are then reacted with PbBr2. This is markedly different from the known case of Cs4PbBr6 nanocrystals that react within seconds with PbBr2 and get transformed into much larger, green emitting CsPbBr3 nanocrystals. Here, instead, the conversion of (KxCs1-x)4PbBr6 nanocrystals to CsPbBr3 QDs occurs in a time span of hours, and tuning of the QDs size is achieved by adjusting the concentration of precursors. The QDs exhibit excitonic features in optical absorption that are tunable in the 420 - 452 nm range, accompanied by blue photoluminescence with quantum yield around 60%. Detailed spectroscopic investigations in both the single and multi-exciton regime reveal the exciton fine structure and the effect of Auger recombination of these CsPbBr3 QDs, confirming theoretical predictions for this system.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
Is poisoning a real threat to LLM alignment? Maybe more so than you think
Authors:
Pankayaraj Pathmanathan,
Souradip Chakraborty,
Xiangyu Liu,
Yongyuan Liang,
Furong Huang
Abstract:
Recent advancements in Reinforcement Learning with Human Feedback (RLHF) have significantly impacted the alignment of Large Language Models (LLMs). The sensitivity of reinforcement learning algorithms such as Proximal Policy Optimization (PPO) has led to new line work on Direct Policy Optimization (DPO), which treats RLHF in a supervised learning framework. The increased practical use of these RLH…
▽ More
Recent advancements in Reinforcement Learning with Human Feedback (RLHF) have significantly impacted the alignment of Large Language Models (LLMs). The sensitivity of reinforcement learning algorithms such as Proximal Policy Optimization (PPO) has led to new line work on Direct Policy Optimization (DPO), which treats RLHF in a supervised learning framework. The increased practical use of these RLHF methods warrants an analysis of their vulnerabilities. In this work, we investigate the vulnerabilities of DPO to poisoning attacks under different scenarios and compare the effectiveness of preference poisoning, a first of its kind. We comprehensively analyze DPO's vulnerabilities under different types of attacks, i.e., backdoor and non-backdoor attacks, and different poisoning methods across a wide array of language models, i.e., LLama 7B, Mistral 7B, and Gemma 7B. We find that unlike PPO-based methods, which, when it comes to backdoor attacks, require at least 4\% of the data to be poisoned to elicit harmful behavior, we exploit the true vulnerabilities of DPO more simply so we can poison the model with only as much as 0.5\% of the data. We further investigate the potential reasons behind the vulnerability and how well this vulnerability translates into backdoor vs non-backdoor attacks.
△ Less
Submitted 19 June, 2024; v1 submitted 17 June, 2024;
originally announced June 2024.
-
DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning
Authors:
Utsav Singh,
Souradip Chakraborty,
Wesley A. Suttle,
Brian M. Sadler,
Vinay P Namboodiri,
Amrit Singh Bedi
Abstract:
Learning control policies to perform complex robotics tasks from human preference data presents significant challenges. On the one hand, the complexity of such tasks typically requires learning policies to perform a variety of subtasks, then combining them to achieve the overall goal. At the same time, comprehensive, well-engineered reward functions are typically unavailable in such problems, whil…
▽ More
Learning control policies to perform complex robotics tasks from human preference data presents significant challenges. On the one hand, the complexity of such tasks typically requires learning policies to perform a variety of subtasks, then combining them to achieve the overall goal. At the same time, comprehensive, well-engineered reward functions are typically unavailable in such problems, while limited human preference data often is; making efficient use of such data to guide learning is therefore essential. Methods for learning to perform complex robotics tasks from human preference data must overcome both these challenges simultaneously. In this work, we introduce DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning, an efficient hierarchical approach that leverages direct preference optimization to learn a higher-level policy and reinforcement learning to learn a lower-level policy. DIPPER enjoys improved computational efficiency due to its use of direct preference optimization instead of standard preference-based approaches such as reinforcement learning from human feedback, while it also mitigates the well-known hierarchical reinforcement learning issues of non-stationarity and infeasible subgoal generation due to our use of primitive-informed regularization inspired by a novel bi-level optimization formulation of the hierarchical reinforcement learning problem. To validate our approach, we perform extensive experimental analysis on a variety of challenging robotics tasks, demonstrating that DIPPER outperforms hierarchical and non-hierarchical baselines, while ameliorating the non-stationarity and infeasible subgoal generation issues of hierarchical reinforcement learning.
△ Less
Submitted 16 June, 2024;
originally announced June 2024.
-
Dynamical system analysis of quintessence dark energy model
Authors:
Soumya Chakraborty,
Sudip Mishra,
Subenoy Chakraborty
Abstract:
Our work deals with the dynamical system analysis of quintessence dark energy scalar field model with exponential potential. A dynamical system analysis has been applied at the background level. Using suitable transformation of variables, the evolution equations are reduced to an autonomous system for exponential form of the scalar potential. The critical points are analyzed with center manifold t…
▽ More
Our work deals with the dynamical system analysis of quintessence dark energy scalar field model with exponential potential. A dynamical system analysis has been applied at the background level. Using suitable transformation of variables, the evolution equations are reduced to an autonomous system for exponential form of the scalar potential. The critical points are analyzed with center manifold theory and stability has been discussed by using Schwarzian derivative. Finally, cosmological implications of the critical points are discussed and it is found that the stability of the late-time attractor changes for quintessence dark energy model.
△ Less
Submitted 15 June, 2024;
originally announced June 2024.
-
Rotating black holes experience dynamical tides
Authors:
Rajendra Prasad Bhatt,
Sumanta Chakraborty,
Sukanta Bose
Abstract:
We calculate the tidal response of a rotating black hole from the Teukolsky equation in the near-horizon and small-frequency regime. While the static tidal Love number of an arbitrarily rotating black hole is still zero, the dynamical tidal Love number is not; rather, it is proportional to the angular velocity of the black hole -- in the linear order of frequency -- and to the square of the angula…
▽ More
We calculate the tidal response of a rotating black hole from the Teukolsky equation in the near-horizon and small-frequency regime. While the static tidal Love number of an arbitrarily rotating black hole is still zero, the dynamical tidal Love number is not; rather, it is proportional to the angular velocity of the black hole -- in the linear order of frequency -- and to the square of the angular velocity of the black hole -- in the zeroth order. Intriguingly, the zero-frequency limit of the dynamical tidal response function is not equal to the static tidal response function. We demonstrate that these results hold true for an extremal rotating black hole as well, with the dynamical tidal Love numbers being non-zero. This shows that Kerr black holes experience dynamical tides.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
Language Models are Crossword Solvers
Authors:
Soumadeep Saha,
Sutanoya Chakraborty,
Saptarshi Saha,
Utpal Garain
Abstract:
Crosswords are a form of word puzzle that require a solver to demonstrate a high degree of proficiency in natural language understanding, wordplay, reasoning, and world knowledge, along with adherence to character and length constraints. In this paper we tackle the challenge of solving crosswords with Large Language Models (LLMs). We demonstrate that the current generation of state-of-the art (SoT…
▽ More
Crosswords are a form of word puzzle that require a solver to demonstrate a high degree of proficiency in natural language understanding, wordplay, reasoning, and world knowledge, along with adherence to character and length constraints. In this paper we tackle the challenge of solving crosswords with Large Language Models (LLMs). We demonstrate that the current generation of state-of-the art (SoTA) language models show significant competence at deciphering cryptic crossword clues, and outperform previously reported SoTA results by a factor of 2-3 in relevant benchmarks. We also develop a search algorithm that builds off this performance to tackle the problem of solving full crossword grids with LLMs for the very first time, achieving an accuracy of 93\% on New York Times crossword puzzles. Contrary to previous work in this area which concluded that LLMs lag human expert performance significantly, our research suggests this gap is a lot narrower.
△ Less
Submitted 14 June, 2024; v1 submitted 13 June, 2024;
originally announced June 2024.
-
Neural-g: A Deep Learning Framework for Mixing Density Estimation
Authors:
Shijie Wang,
Saptarshi Chakraborty,
Qian Qin,
Ray Bai
Abstract:
Mixing (or prior) density estimation is an important problem in machine learning and statistics, especially in empirical Bayes $g$-modeling where accurately estimating the prior is necessary for making good posterior inferences. In this paper, we propose neural-$g$, a new neural network-based estimator for $g$-modeling. Neural-$g$ uses a softmax output layer to ensure that the estimated prior is a…
▽ More
Mixing (or prior) density estimation is an important problem in machine learning and statistics, especially in empirical Bayes $g$-modeling where accurately estimating the prior is necessary for making good posterior inferences. In this paper, we propose neural-$g$, a new neural network-based estimator for $g$-modeling. Neural-$g$ uses a softmax output layer to ensure that the estimated prior is a valid probability density. Under default hyperparameters, we show that neural-$g$ is very flexible and capable of capturing many unknown densities, including those with flat regions, heavy tails, and/or discontinuities. In contrast, existing methods struggle to capture all of these prior shapes. We provide justification for neural-$g$ by establishing a new universal approximation theorem regarding the capability of neural networks to learn arbitrary probability mass functions. To accelerate convergence of our numerical implementation, we utilize a weighted average gradient descent approach to update the network parameters. Finally, we extend neural-$g$ to multivariate prior density estimation. We illustrate the efficacy of our approach through simulations and analyses of real datasets. A software package to implement neural-$g$ is publicly available at https://github.com/shijiew97/neuralG.
△ Less
Submitted 9 June, 2024;
originally announced June 2024.
-
The Choquet-Deny Property for Groupoids
Authors:
Tey Berendschot,
Soham Chakraborty,
Milan Donvil,
Se-Jin Kim,
Mario Klisse
Abstract:
A countable discrete group is called Choquet-Deny if for any non-degenerate probability measure on the group, the corresponding space of bounded harmonic functions is trivial. Building on the previous work of Jaworski, a complete characterization of Choquet-Deny groups was recently achieved by Frisch, Hartman, Tamuz, and Ferdowski. In this article, we extend the study of the Choquet-Deny property…
▽ More
A countable discrete group is called Choquet-Deny if for any non-degenerate probability measure on the group, the corresponding space of bounded harmonic functions is trivial. Building on the previous work of Jaworski, a complete characterization of Choquet-Deny groups was recently achieved by Frisch, Hartman, Tamuz, and Ferdowski. In this article, we extend the study of the Choquet-Deny property to the framework of discrete measured groupoids. Our primary result offers a complete characterization of this property in terms of the isotropy groups and the equivalence relation associated with the given groupoid. Additionally, we use the implications derived from our main theorem to classify the Choquet-Deny property of transformation groupoids.
△ Less
Submitted 25 June, 2024; v1 submitted 7 June, 2024;
originally announced June 2024.
-
Randomness in atomic disorder and consequent squandering of spin-polarization in a ferromagnetically fragile quaternary Heusler alloy FeRuCrSi
Authors:
Shuvankar Gupta,
Sudip Chakraborty,
Vidha Bhasin,
Celine Barreteau,
Jean-Claude Crivello,
Jean-Marc Greneche,
S. N. Jha,
D. Bhattacharyya,
Eric Alleno,
Chandan Mazumdar
Abstract:
Ru$_{2-x}$Fe$_x$CrSi ( 0 $<$ x $<$1) system is theoretically predicted to be one of the very few known examples of robust half-metallic ferromagnet with 100\% spin polarization. Since Cr is considered to be the main contributor to magnetism, the Fe/Ru substitution is not expected to disturb its magnetic properties any significantly, and hence all Fe-containing members of the series are predicted t…
▽ More
Ru$_{2-x}$Fe$_x$CrSi ( 0 $<$ x $<$1) system is theoretically predicted to be one of the very few known examples of robust half-metallic ferromagnet with 100\% spin polarization. Since Cr is considered to be the main contributor to magnetism, the Fe/Ru substitution is not expected to disturb its magnetic properties any significantly, and hence all Fe-containing members of the series are predicted to follow Slater-Pauling rule with a saturation magnetic moment of 2 ${μ_B}$/f.u. However, contrarily to the theoretical expectations, some experiments rather show a linear variation of the saturation magnetization and Curie temperature with Fe (\textit{x}) substitution. The equiatomic member FeRuCrSi of this family is also considered as a technologically important material, where the band structure calculations suggest the material to be spin gapless semiconductor. Through our in-depth structural analysis of FeRuCrSi using X-ray diffraction, extended X-ray absorption fine structure and $^{57}$Fe Mössbauer spectrometry, we found a random disorder between Fe and Ru sites, while the magnetic moment in this system is actually contributed by Fe atoms, questioning the very basic foundation of the half-metallic character proposed by all theoretical calculations on Ru$_{2-x}$Fe$_x$CrSi series. Our Mössbauer result also envisions a rather rare scenario where the main physical properties are intricately correlated to the chemistry of the material in the form of random atomic disorder on a localised scale.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
Restructuring disorder: Transformation from the antiferromagnetic order in Fe2VSi to the ferromagnetic state in FeRuVSi by substitution of a non-magnetic element
Authors:
Shuvankar Gupta,
Sudip Chakraborty,
Celine Barreteau,
Jean-Claude Crivello,
Jean-Marc Greneche,
Eric Alleno,
Chandan Mazumdar
Abstract:
The delicate nature of the half-metallic ferromagnetic (HMF) property in Heusler alloys is often compromised by inherent structural disorder within the systems. Fe2VSi is a prime example, where such disorder prevents the realization of the theoretically proposed HMF state as the anti-site disorder leads to the formation of two anti-parallel magnetic lattices resulting in antiferromagnetic order. I…
▽ More
The delicate nature of the half-metallic ferromagnetic (HMF) property in Heusler alloys is often compromised by inherent structural disorder within the systems. Fe2VSi is a prime example, where such disorder prevents the realization of the theoretically proposed HMF state as the anti-site disorder leads to the formation of two anti-parallel magnetic lattices resulting in antiferromagnetic order. In this study, we propose an innovative and simple strategy to prevent this atomic disorder by replacing 50% of the magnetic element Fe by a large, isoelectronic, non-magnetic element, Ru. In this way, one of the magnetic sublattices of the antiferromagnetic lattice ceases to order while ferromagnetic order is restored, an essential criterion for exhibiting HMF properties. Through various experimental measurements and theoretical calculations, we have shown that such partial replacement of Fe by Ru prevents the cross-site substitution of V/Si sites and the system regains its ferromagnetic order. Our theoretical calculations suggest that a perfect structural arrangement in Fe and Ru would have restored the HMF property in FeRuVSi. However, the local atomic disorder of Fe and Ru was found to decrease the spin polarization value. The present work sheds light on the complex interplay between structural disorder and magnetic properties in Heusler alloys and provides insights for future design strategies in the pursuit of robust half-metallic ferromagnets.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
The Amaterasu particle: constraining the superheavy dark matter origin of UHECRs
Authors:
Prantik Sarmah,
Nayan Das,
Debasish Borah,
Sovan Chakraborty,
Poonam Mehta
Abstract:
Amaterasu, the second most energetic ($244$ EeV) cosmic ray particle has been recently detected by the Telescope Array (TA) surface detector. The origin of the TA Amaterasu event is puzzling, as its arrival direction points back to a void in the local Universe, lacking conventional astrophysical ultra-high-energy (UHE) cosmic ray (CR) sources. Hence, we explore the possibility if this TA Amaterasu…
▽ More
Amaterasu, the second most energetic ($244$ EeV) cosmic ray particle has been recently detected by the Telescope Array (TA) surface detector. The origin of the TA Amaterasu event is puzzling, as its arrival direction points back to a void in the local Universe, lacking conventional astrophysical ultra-high-energy (UHE) cosmic ray (CR) sources. Hence, we explore the possibility if this TA Amaterasu event could have originated from the decay of superheavy dark matter (SHDM) in the Milky Way. Such an origin also opens up multi-messenger detection channels in both UHE gamma-rays and UHE neutrinos. In this present work, using the TA Amaterasu event and the multi-messenger limits/sensitivities from various UHE telescopes, we place stringent constraints on the lifetime and mass of the SHDM. We find that the non-detection of the corresponding gamma-rays at the Pierre Auger Observatory (PAO) and the TA is in severe tension with the SHDM parameter space required to explain the TA Amaterasu event. Additionally, we extend the multi-messenger analysis to the future UHE gamma-ray and UHE neutrino telescopes such as PAO upgrade, GRAND 200k and IceCube-Gen2. We find that the bounds from the future neutrino telescopes will be able to compete with the present UHECR bounds. However, compared to the existing UHE gamma-ray bounds, the future PAO upgrade and the GRAND 200k gamma-ray detectors will improve the bounds on SHDM lifetime by at least one order of magnitude.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
On the role of closed timelike curves and confinement structure around Kerr-Newman singularity
Authors:
Ayanendu Dutta,
Dhritimalya Roy,
Subenoy Chakraborty
Abstract:
In this study, the particle motion around the naked singularity and black hole of Kerr-Newman spacetime is investigated with a special attention on the closed timelike orbits. It is found that both in the naked singularity (NS) and in black hole (BH), the singularity is concealed by causality violating regions, and the Cauchy surface consistently resides inside the inner horizon in non-extremal bl…
▽ More
In this study, the particle motion around the naked singularity and black hole of Kerr-Newman spacetime is investigated with a special attention on the closed timelike orbits. It is found that both in the naked singularity (NS) and in black hole (BH), the singularity is concealed by causality violating regions, and the Cauchy surface consistently resides inside the inner horizon in non-extremal black holes. For neutral particles and particles with an identical charge to the source, only particles with positive angular momentum are permitted to traverse the closed timelike curves. Conversely, for particles with the opposite charge to the source, the strong Coulomb attraction draws all particles inside the Cauchy surface, allowing them to be present in the closed timelike curves irrespective of their angular momentum. However, in both the NS and BH (both extremal and non-extremal), test particles are confined at a considerable distance from the singular point such that there always exists an empty region surrounding the singularity which prevents particles from interacting with it. The radius of the empty surface that depends on the source parameters and the particle characteristics, is investigated with an accurate expression.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
PhyPlan: Generalizable and Rapid Physical Task Planning with Physics Informed Skill Networks for Robot Manipulators
Authors:
Mudit Chopra,
Abhinav Barnawal,
Harshil Vagadia,
Tamajit Banerjee,
Shreshth Tuli,
Souvik Chakraborty,
Rohan Paul
Abstract:
Given the task of positioning a ball-like object to a goal region beyond direct reach, humans can often throw, slide, or rebound objects against the wall to attain the goal. However, enabling robots to reason similarly is non-trivial. Existing methods for physical reasoning are data-hungry and struggle with complexity and uncertainty inherent in the real world. This paper presents PhyPlan, a novel…
▽ More
Given the task of positioning a ball-like object to a goal region beyond direct reach, humans can often throw, slide, or rebound objects against the wall to attain the goal. However, enabling robots to reason similarly is non-trivial. Existing methods for physical reasoning are data-hungry and struggle with complexity and uncertainty inherent in the real world. This paper presents PhyPlan, a novel physics-informed planning framework that combines physics-informed neural networks (PINNs) with modified Monte Carlo Tree Search (MCTS) to enable embodied agents to perform dynamic physical tasks. PhyPlan leverages PINNs to simulate and predict outcomes of actions in a fast and accurate manner and uses MCTS for planning. It dynamically determines whether to consult a PINN-based simulator (coarse but fast) or engage directly with the actual environment (fine but slow) to determine optimal policy. Given an unseen task, PhyPlan can infer the sequence of actions and learn the latent parameters, resulting in a generalizable approach that can rapidly learn to perform novel physical tasks. Evaluation with robots in simulated 3D environments demonstrates the ability of our approach to solve 3D-physical reasoning tasks involving the composition of dynamic skills. Quantitatively, PhyPlan excels in several aspects: (i) it achieves lower regret when learning novel tasks compared to the state-of-the-art, (ii) it expedites skill learning and enhances the speed of physical reasoning, (iii) it demonstrates higher data efficiency compared to a physics un-informed approach.
△ Less
Submitted 22 April, 2024;
originally announced June 2024.
-
Equivariant Parabolic connections and stack of roots
Authors:
Sujoy Chakraborty,
Arjun Paul
Abstract:
Let $X$ be a smooth complex projective variety equipped with an action of a linear algebraic group $G$ over $\mathbb{C}$. Let $D$ be a reduced effective divisor on $X$ that is invariant under the $G$--action on $X$. Let $s_D$ be the canonical section of $\mathcal{O}_X(D)$ vanishing along $D$. Given a positive integer $r$, consider the stack…
▽ More
Let $X$ be a smooth complex projective variety equipped with an action of a linear algebraic group $G$ over $\mathbb{C}$. Let $D$ be a reduced effective divisor on $X$ that is invariant under the $G$--action on $X$. Let $s_D$ be the canonical section of $\mathcal{O}_X(D)$ vanishing along $D$. Given a positive integer $r$, consider the stack $\mathfrak{X} := \mathfrak{X}_{(\mathcal{O}_X(D),\, s_D,\, r)}$ of $r$-th roots of $(\mathcal{O}_X, s_D)$ together with the natural morphism $π: \mathfrak{X} \to X$. Under the assumption that $G$ has no non-trivial characters, we show that the $G$--action on $X$ naturally lifts to a $G$--action on $\mathfrak{X}$ such that $π$ become $G$--equivariant, and the tautological invertible sheaf $\mathscr{M}$ on $\mathfrak{X}$ admits a linearization of this $G$--action. Finally, we define the notions of $G$--equivariant logarithmic connections on $\mathfrak{X}$ and $G$--equivariant parabolic connections on $X$ with rational parabolic weights along $D$, and establish an equivalence between the category of $G$--equivariant logarithmic connections on $\mathfrak{X}$ and the category of $G$--equivariant parabolic connections on $X$ with rational parabolic weights along $D$.
△ Less
Submitted 31 May, 2024;
originally announced May 2024.
-
Transfer Q Star: Principled Decoding for LLM Alignment
Authors:
Souradip Chakraborty,
Soumya Suvra Ghosal,
Ming Yin,
Dinesh Manocha,
Mengdi Wang,
Amrit Singh Bedi,
Furong Huang
Abstract:
Aligning foundation models is essential for their safe and trustworthy deployment. However, traditional fine-tuning methods are computationally intensive and require updating billions of model parameters. A promising alternative, alignment via decoding, adjusts the response distribution directly without model updates to maximize a target reward $r$, thus providing a lightweight and adaptable frame…
▽ More
Aligning foundation models is essential for their safe and trustworthy deployment. However, traditional fine-tuning methods are computationally intensive and require updating billions of model parameters. A promising alternative, alignment via decoding, adjusts the response distribution directly without model updates to maximize a target reward $r$, thus providing a lightweight and adaptable framework for alignment. However, principled decoding methods rely on oracle access to an optimal Q-function ($Q^*$), which is often unavailable in practice. Hence, prior SoTA methods either approximate this $Q^*$ using $Q^{π_{\texttt{sft}}}$ (derived from the reference $\texttt{SFT}$ model) or rely on short-term rewards, resulting in sub-optimal decoding performance. In this work, we propose Transfer $Q^*$, which implicitly estimates the optimal value function for a target reward $r$ through a baseline model $ρ_{\texttt{BL}}$ aligned with a baseline reward $ρ_{\texttt{BL}}$ (which can be different from the target reward $r$). Theoretical analyses of Transfer $Q^*$ provide a rigorous characterization of its optimality, deriving an upper bound on the sub-optimality gap and identifying a hyperparameter to control the deviation from the pre-trained reference $\texttt{SFT}$ model based on user needs. Our approach significantly reduces the sub-optimality gap observed in prior SoTA methods and demonstrates superior empirical performance across key metrics such as coherence, diversity, and quality in extensive tests on several synthetic and real datasets.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Long-Range Correlations in Elastic Moduli and Local Stresses at the Unjamming Transition
Authors:
Surajit Chakraborty,
Kabir Ramola
Abstract:
We explore the behavior of spatially heterogeneous elastic moduli as well as the correlations between local moduli in model solids with short-range repulsive potentials. We show through numerical simulations that local elastic moduli exhibit long-range correlations, similar to correlations in the local stresses. Specifically, the correlations in local shear moduli exhibit anisotropic behavior at l…
▽ More
We explore the behavior of spatially heterogeneous elastic moduli as well as the correlations between local moduli in model solids with short-range repulsive potentials. We show through numerical simulations that local elastic moduli exhibit long-range correlations, similar to correlations in the local stresses. Specifically, the correlations in local shear moduli exhibit anisotropic behavior at large lengthscales characterized by pinch-point singularities in Fourier space, displaying a structural pattern akin to shear stress correlations. Focussing on two-dimensional jammed solids approaching the unjamming transition, we show that stress correlations exhibit universal properties, characterized by a quadratic $p^2$ dependence of the correlations as the pressure $p$ approaches zero, independent of the details of the model. In contrast, the modulus correlations exhibit a power-law dependence with different exponents depending on the specific interaction potential. Furthermore, we illustrate that while affine responses lack long-range correlations, the total modulus, which encompasses non-affine behavior, exhibits long-range correlations.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
A Hanani-Tutte Theorem for Cycles
Authors:
Sutanoya Chakraborty,
Arijit Ghosh
Abstract:
Given a drawing $D$ of a graph $G$, we define the crossing number between any two cycles $C_{1}$ and $C_{2}$ in $D$ to be the number of crossings that involve at least one edge from each of $C_1$ and $C_2$ except the crossings between edges that are common to both cycles. We show that if the crossing number between every two cycles in $G$ is even in a drawing of $G$ on the plane, then there is a p…
▽ More
Given a drawing $D$ of a graph $G$, we define the crossing number between any two cycles $C_{1}$ and $C_{2}$ in $D$ to be the number of crossings that involve at least one edge from each of $C_1$ and $C_2$ except the crossings between edges that are common to both cycles. We show that if the crossing number between every two cycles in $G$ is even in a drawing of $G$ on the plane, then there is a planar drawing of $G$. This result can be extended to arbitrary surfaces. We also establish an equivalence between our result and a fundamental result due to Cairns-Nikolayevsky and Pelsmajer-Schaefer-Štefankovič, about drawing graphs on surfaces, and derive the Loebl-Masbaum theorem from it.
△ Less
Submitted 12 June, 2024; v1 submitted 29 May, 2024;
originally announced May 2024.
-
Harmonic Tutte polynomials of matroids III
Authors:
Thomas Britz,
Himadri Shekhar Chakraborty,
Tsuyoshi Miezaki
Abstract:
In this paper, we present the harmonic generalizations of well-known polynomials of codes over finite fields, namely the higher weight enumerators and the extended weight enumerators, and we derive the correspondences between these weight enumerators. Moreover, we present the harmonic generalization of Greene's Theorem for the higher (resp. extended) weight enumerators. As an application of this G…
▽ More
In this paper, we present the harmonic generalizations of well-known polynomials of codes over finite fields, namely the higher weight enumerators and the extended weight enumerators, and we derive the correspondences between these weight enumerators. Moreover, we present the harmonic generalization of Greene's Theorem for the higher (resp. extended) weight enumerators. As an application of this Greene's-type theorem, we provide the MacWilliams-type identity for harmonic higher weight enumerators of codes over finite fields. Finally, we use this new identity to give a new proof of the Assmus-Mattson Theorem for subcode supports of linear codes over finite fields using harmonic higher weight enumerators.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Gamified AI Approch for Early Detection of Dementia
Authors:
Paramita Kundu Maji,
Soubhik Acharya,
Priti Paul,
Sanjay Chakraborty,
Saikat Basu
Abstract:
This paper aims to develop a new deep learning-inspired gaming approach for early detection of dementia. This research integrates a robust convolutional neural network (CNN)-based model for early dementia detection using health metrics data as well as facial image data through a cognitive assessment-based gaming application. We have collected 1000 data samples of health metrics dataset from Apollo…
▽ More
This paper aims to develop a new deep learning-inspired gaming approach for early detection of dementia. This research integrates a robust convolutional neural network (CNN)-based model for early dementia detection using health metrics data as well as facial image data through a cognitive assessment-based gaming application. We have collected 1000 data samples of health metrics dataset from Apollo Diagnostic Center Kolkata that is labeled as either demented or non-demented for the training of MOD-1D-CNN for the game level 1 and another dataset of facial images containing 1800 facial data that are labeled as either demented or non-demented is collected by our research team for the training of MOD-2D-CNN model in-game level 2. In our work, the loss for the proposed MOD-1D-CNN model is 0.2692 and the highest accuracy is 70.50% for identifying the dementia traits using real-life health metrics data. Similarly, the proposed MOD-2D-CNN model loss is 0.1755 and the highest accuracy is obtained here 95.72% for recognizing the dementia status using real-life face-based image data. Therefore, a rule-based weightage method is applied to combine both the proposed methods to achieve the final decision. The MOD-1D-CNN and MOD-2D-CNN models are more lightweight and computationally efficient alternatives because they have a significantly lower number of parameters when compared to the other state-of-the-art models. We have compared their accuracies and parameters with the other state-of-the-art deep learning models.
△ Less
Submitted 26 May, 2024;
originally announced May 2024.
-
Breaking Barriers: Investigating Gender Dynamics in Introductory Physics Lab Classes
Authors:
Bilas Paul,
Shantanu Chakraborty,
Ganga Sharma
Abstract:
The persistent underrepresentation of women and other gender minorities in physical science fields has been an ongoing concern. This study investigates gender dynamics in introductory physics laboratory courses, specifically exploring whether students of different gender identities exhibit equal inclination and confidence in conducting lab experiments, and whether they face barriers that impact th…
▽ More
The persistent underrepresentation of women and other gender minorities in physical science fields has been an ongoing concern. This study investigates gender dynamics in introductory physics laboratory courses, specifically exploring whether students of different gender identities exhibit equal inclination and confidence in conducting lab experiments, and whether they face barriers that impact their participation. The study was conducted across three institutions, involving non-physics students enrolled in algebra-based and calculus-based physics courses. Our findings reveal no significant differences in participation levels across genders in various lab activities. However, a subtle yet significant trended was observed: non-male students tend to express greater preferences and comfort levels for note-taking, calculations, and graphing tasks compared to their male counterparts, who gravitated more towards hands-on equipment handling. Although no overt barriers deterring participation based solely on gender were identified, some students reported experiences or witnessed instances where gender dynamics hindered full engagement, such as assumptions about competence or difficulty asserting voices in male-dominated groups. These findings contribute insights into potential gender-based inclinations and experiences within laboratory environments. The results underscore the importance of fostering an inclusive climate that encourages equitable opportunities and engagement from all gender identities in scientific exploration and learning.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
1991T-like Supernovae
Authors:
M. M. Phillips,
C. Ashall,
Peter J. Brown,
L. Galbany,
M. A. Tucker,
Christopher R. Burns,
Carlos Contreras,
P. Hoeflich,
E. Y. Hsiao,
S. Kumar,
Nidia Morrell,
Syed A. Uddin,
E. Baron,
Wendy L. Freedman,
Kevin Krisciunas,
S. E. Persson,
Anthony L. Piro,
B. J. Shappee,
Maximilian Stritzinger,
Nicholas B. Suntzeff,
Sudeshna Chakraborty,
R. P. Kirshner,
J. Lu,
G. H. Marion,
Abigail Polin
, et al. (1 additional authors not shown)
Abstract:
Understanding the nature of the luminous 1991T-like supernovae is of great importance to supernova cosmology as they are likely to have been more common in the early universe. In this paper we explore the observational properties of 1991T-like supernovae to study their relationship to other luminous, slow-declining Type~Ia supernovae (SNe Ia). From the spectroscopic and photometric criteria define…
▽ More
Understanding the nature of the luminous 1991T-like supernovae is of great importance to supernova cosmology as they are likely to have been more common in the early universe. In this paper we explore the observational properties of 1991T-like supernovae to study their relationship to other luminous, slow-declining Type~Ia supernovae (SNe Ia). From the spectroscopic and photometric criteria defined in Phillips et al. (1992), we identify 17 1991T-like supernovae from the literature. Combining these objects with ten 1991T-like supernovae from the Carnegie Supernova Project-II, the spectra, light curves, and colors of these events, along with their host galaxy properties, are examined in detail. We conclude that 1991T-like supernovae are closely related in essentially all of their UV, optical, and near-infrared properties -- as well as their host galaxy parameters -- to the slow-declining subset of Branch core-normal supernovae and to the intermediate 1999aa-like events, forming a continuum of luminous SNe Ia. The overriding difference between these three subgroups appears to be the extent to which $^{56}$Ni mixes into the ejecta, producing the pre-maximum spectra dominated by Fe III absorption, the broader UV light curves, and the higher luminosities that characterize the 1991T-like events. Nevertheless, the association of 1991T-like SNe with the rare Type Ia CSM supernovae would seem to run counter to this hypothesis, in which case 1991T-like events may form a separate subclass of SNe Ia, possibly arising from single-degenerate progenitor systems.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
Global urban activity changes from COVID-19 physical distancing restrictions
Authors:
Srija Chakraborty,
Eleanor Stokes,
Olivia Alexander
Abstract:
During the COVID-19 pandemic changes in human activity became widespread through official policies and organically in response to the virus's transmission, which in turn, impacted the environment and the economy. The pandemic has been described as a natural experiment that tested how social and economic disruptions impacted different components of the global Earth System. To move this beyond hypot…
▽ More
During the COVID-19 pandemic changes in human activity became widespread through official policies and organically in response to the virus's transmission, which in turn, impacted the environment and the economy. The pandemic has been described as a natural experiment that tested how social and economic disruptions impacted different components of the global Earth System. To move this beyond hypotheses, locally-resolved, globally-available measures of how, where, and when human activity changed are critically needed. Here we use satellite-derived nighttime lights to quantify and map daily changes in human activity that are atypical for each urban area globally for two years after the onset of the pandemic using machine learning anomaly detectors. Metrics characterizing changes in lights from pre-COVID baseline in human settlements and quality assurance measures are reported. This dataset, TRacking Anomalous COVID-19 induced changEs in NTL (TRACE-NTL), is the first to resolve COVID-19 disruptions for all metropolitan regions globally, daily. It is suitable to support a variety of post-pandemic studies that assess how changes in human activity impact environmental systems.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
FLIPHAT: Joint Differential Privacy for High Dimensional Sparse Linear Bandits
Authors:
Sunrit Chakraborty,
Saptarshi Roy,
Debabrota Basu
Abstract:
High dimensional sparse linear bandits serve as an efficient model for sequential decision-making problems (e.g. personalized medicine), where high dimensional features (e.g. genomic data) on the users are available, but only a small subset of them are relevant. Motivated by data privacy concerns in these applications, we study the joint differentially private high dimensional sparse linear bandit…
▽ More
High dimensional sparse linear bandits serve as an efficient model for sequential decision-making problems (e.g. personalized medicine), where high dimensional features (e.g. genomic data) on the users are available, but only a small subset of them are relevant. Motivated by data privacy concerns in these applications, we study the joint differentially private high dimensional sparse linear bandits, where both rewards and contexts are considered as private data. First, to quantify the cost of privacy, we derive a lower bound on the regret achievable in this setting. To further address the problem, we design a computationally efficient bandit algorithm, \textbf{F}orgetfu\textbf{L} \textbf{I}terative \textbf{P}rivate \textbf{HA}rd \textbf{T}hresholding (FLIPHAT). Along with doubling of episodes and episodic forgetting, FLIPHAT deploys a variant of Noisy Iterative Hard Thresholding (N-IHT) algorithm as a sparse linear regression oracle to ensure both privacy and regret-optimality. We show that FLIPHAT achieves optimal regret up to logarithmic factors. We analyze the regret by providing a novel refined analysis of the estimation error of N-IHT, which is of parallel interest.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Towards A Comprehensive Assessment of AI's Environmental Impact
Authors:
Srija Chakraborty
Abstract:
Artificial Intelligence, machine learning (AI/ML) has allowed exploring solutions for a variety of environmental and climate questions ranging from natural disasters, greenhouse gas emission, monitoring biodiversity, agriculture, to weather and climate modeling, enabling progress towards climate change mitigation. However, the intersection of AI/ML and environment is not always positive. The recen…
▽ More
Artificial Intelligence, machine learning (AI/ML) has allowed exploring solutions for a variety of environmental and climate questions ranging from natural disasters, greenhouse gas emission, monitoring biodiversity, agriculture, to weather and climate modeling, enabling progress towards climate change mitigation. However, the intersection of AI/ML and environment is not always positive. The recent surge of interest in ML, made possible by processing very large volumes of data, fueled by access to massive compute power, has sparked a trend towards large-scale adoption of AI/ML. This interest places tremendous pressure on natural resources, that are often overlooked and under-reported. There is a need for a framework that monitors the environmental impact and degradation from AI/ML throughout its lifecycle for informing policymakers, stakeholders to adequately implement standards and policies and track the policy outcome over time. For these policies to be effective, AI's environmental impact needs to be monitored in a spatially-disaggregated, timely manner across the globe at the key activity sites. This study proposes a methodology to track environmental variables relating to the multifaceted impact of AI around datacenters using openly available energy data and globally acquired satellite observations. We present a case study around Northern Virginia, United States that hosts a growing number of datacenters and observe changes in multiple satellite-based environmental metrics. We then discuss the steps to expand this methodology for comprehensive assessment of AI's environmental impact across the planet. We also identify data gaps and formulate recommendations for improving the understanding and monitoring AI-induced changes to the environment and climate.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Self-trapping phenomenon, multistability and chaos in open anisotropic Dicke dimer
Authors:
G. Vivek,
Debabrata Mondal,
Subhadeep Chakraborty,
S. Sinha
Abstract:
We investigate semiclassical dynamics of coupled atom-photon interacting system described by a dimer of anisotropic Dicke model in the presence of photon loss, exhibiting a rich variety of non-linear dynamics. Based on symmetries and dynamical classification, we characterize and chart out various dynamical phases in a phase diagram. A key feature of this system is the multistability of different d…
▽ More
We investigate semiclassical dynamics of coupled atom-photon interacting system described by a dimer of anisotropic Dicke model in the presence of photon loss, exhibiting a rich variety of non-linear dynamics. Based on symmetries and dynamical classification, we characterize and chart out various dynamical phases in a phase diagram. A key feature of this system is the multistability of different dynamical states, particularly the coexistence of various superradiant phases as well as limit cycles. Remarkably, this dimer system manifests self-trapping phenomena, resulting in a photon population imbalance between the cavities. Such a self-trapped state arises from saddle-node bifurcation, which can be understood from an equivalent Landau-Ginzburg description. Additionally, we identify a unique class of oscillatory dynamics self-trapped limit cycle hosting self-trapping of photons. The absence of stable dynamical phases leads to the onset of chaos, which is diagnosed using the saturation value of the decorrelator dynamics. Moreover, in a narrow region, the self-trapped states can coexist with chaotic attractor, which may have intriguing consequences in quantum dynamics. Finally, we discuss the experimental relevance of our findings, which can be tested in cavity and circuit quantum electrodynamics setups.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Quantum circuit model for Hamiltonian simulation via Trotter decomposition
Authors:
Rohit Sarma Sarkar,
Sabyasachi Chakraborty,
Bibhas Adhikari
Abstract:
We devise quantum circuit implementation of exponential of scaled $n$-qubit Pauli-strings using one-qubit rotation gates and CNOT gates. These circuits can be implemented in low-connected quantum hardware, in particular, star graph architecture for digital quantum computation. Then these circuits are employed to simulate classes of 1D Hamiltonian operators that include $2$-sparse Hamiltonian, Isin…
▽ More
We devise quantum circuit implementation of exponential of scaled $n$-qubit Pauli-strings using one-qubit rotation gates and CNOT gates. These circuits can be implemented in low-connected quantum hardware, in particular, star graph architecture for digital quantum computation. Then these circuits are employed to simulate classes of 1D Hamiltonian operators that include $2$-sparse Hamiltonian, Ising Hamiltonian, and both time-independent and time-dependent Random Field Heisenberg Hamiltonian and Transverse Magnetic Random Quantum Ising Hamiltonian by approximating its unitary evolution with first-order Suzuki-Trotter expansion. Finally, we perform noisy Hamiltonian simulation of these circuits using different noise models to investigate Hamiltonian simulation on NISQ devices.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Four-component relativistic third-order algebraic diagrammatic construction theory for electron detachment, attachment, electronic excitation problem and calculation of first order transition properties
Authors:
Sudipta Chakraborty,
Tamoghna Mukhopadhyay,
Malaya K. Nayak,
Achintya Kumar Dutta
Abstract:
An efficient third-order algebraic diagrammatic construction (ADC) theory has been implemented to calculate ionisation potential, electron attachment and excitation energy (IP/EA/EE-ADC(3)) in a four-component relativistic framework. We have used polarisation propagator formulation for third-order perturbation theory to access the excitation energies (EE), and for IP/EA, a single-particle propagat…
▽ More
An efficient third-order algebraic diagrammatic construction (ADC) theory has been implemented to calculate ionisation potential, electron attachment and excitation energy (IP/EA/EE-ADC(3)) in a four-component relativistic framework. We have used polarisation propagator formulation for third-order perturbation theory to access the excitation energies (EE), and for IP/EA, a single-particle propagator has been used based on a non-Dyson formulation. The benchmarking calculations have been performed on various types of systems to test the accuracy of the four component ADC(3) scheme for the computation of IP, EA and EE. We have applied our IP-ADC(3) to demonstrate the computation of splitting in the IP states for halogen monoxides (XO, X = Cl, Br, I ) due to spin-orbital coupling in the 2^Π ground state and compared it with experimental results. Next, we have studied the effect of relativity and the size of the basis set on the electron attachment calculations of halogen atoms (F, Cl, Br, I and At) using EA-ADC(3). As our next step, we have shown the efficiency of four component ADC(3) in computing excitation energies of triiodide ion and compared with relativistic equation of motion coupled cluster with singles and doubles (EOM-CCSD), intermediate Hamiltonian Fock space coupled cluster (IHFS-CC) and other EOM-CCSD schemes in which spin-orbit coupling is incorporated with different degrees of approximation. Finally, we have also investigated the excitation energies and transition dipole moments for the four excited states of Xe atom and compared them with our recent four-component EOM-CCSD implementation and relativistic finite field Fock space coupled cluster results, along with the experimental estimates.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Diagnosing and Predicting Autonomous Vehicle Operational Safety Using Multiple Simulation Modalities and a Virtual Environment
Authors:
Joe Beck,
Shean Huff,
Subhadeep Chakraborty
Abstract:
Even as technology and performance gains are made in the sphere of automated driving, safety concerns remain. Vehicle simulation has long been seen as a tool to overcome the cost associated with a massive amount of on-road testing for development and discovery of safety critical "edge-cases". However, purely software-based vehicle models may leave a large realism gap between their real-world count…
▽ More
Even as technology and performance gains are made in the sphere of automated driving, safety concerns remain. Vehicle simulation has long been seen as a tool to overcome the cost associated with a massive amount of on-road testing for development and discovery of safety critical "edge-cases". However, purely software-based vehicle models may leave a large realism gap between their real-world counterparts in terms of dynamic response, and highly realistic vehicle-in-the-loop (VIL) simulations that encapsulate a virtual world around a physical vehicle may still be quite expensive to produce and similarly time intensive as on-road testing. In this work, we demonstrate an AV simulation test bed that combines the realism of vehicle-in-the-loop (VIL) simulation with the ease of implementation of model-in-the-loop (MIL) simulation. The setup demonstrated in this work allows for response diagnosis for the VIL simulations. By observing causal links between virtual weather and lighting conditions that surround the virtual depiction of our vehicle, the vision-based perception model and controller of Openpilot, and the dynamic response of our physical vehicle under test, we can draw conclusions regarding how the perceived environment contributed to vehicle response. Conversely, we also demonstrate response prediction for the MIL setup, where the need for a physical vehicle is not required to draw richer conclusions around the impact of environmental conditions on AV performance than could be obtained with VIL simulation alone. These combine for a simulation setup with accurate real-world implications for edge-case discovery that is both cost effective and time efficient to implement.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Generative flow induced neural architecture search: Towards discovering optimal architecture in wavelet neural operator
Authors:
Hartej Soin,
Tapas Tripura,
Souvik Chakraborty
Abstract:
We propose a generative flow-induced neural architecture search algorithm. The proposed approach devices simple feed-forward neural networks to learn stochastic policies to generate sequences of architecture hyperparameters such that the generated states are in proportion with the reward from the terminal state. We demonstrate the efficacy of the proposed search algorithm on the wavelet neural ope…
▽ More
We propose a generative flow-induced neural architecture search algorithm. The proposed approach devices simple feed-forward neural networks to learn stochastic policies to generate sequences of architecture hyperparameters such that the generated states are in proportion with the reward from the terminal state. We demonstrate the efficacy of the proposed search algorithm on the wavelet neural operator (WNO), where we learn a policy to generate a sequence of hyperparameters like wavelet basis and activation operators for wavelet integral blocks. While the trajectory of the generated wavelet basis and activation sequence is cast as flow, the policy is learned by minimizing the flow violation between each state in the trajectory and maximizing the reward from the terminal state. In the terminal state, we train WNO simultaneously to guide the search. We propose to use the exponent of the negative of the WNO loss on the validation dataset as the reward function. While the grid search-based neural architecture generation algorithms foresee every combination, the proposed framework generates the most probable sequence based on the positive reward from the terminal state, thereby reducing exploration time. Compared to reinforcement learning schemes, where complete episodic training is required to get the reward, the proposed algorithm generates the hyperparameter trajectory sequentially. Through four fluid mechanics-oriented problems, we illustrate that the learned policies can sample the best-performing architecture of the neural operator, thereby improving the performance of the vanilla wavelet neural operator.
△ Less
Submitted 11 May, 2024;
originally announced May 2024.
-
Does Dynamical Wormhole Evolve From Emergent Scenario?
Authors:
Dhritimalya Roy,
Ayanendu Dutta,
Bikram Ghosh,
Subenoy Chakraborty
Abstract:
In the present work we analyse a dynamical wormhole solution with two fluids system (one isotropic and homogeneous and the other being inhomogeneous and anisotropic in nature) as the matter at the throat. We choose two different forms of Equation of State(EoS) and investigate two solutions of the wormhole geometry. The properties to ensure existence and traversability has been analysed. Also, the…
▽ More
In the present work we analyse a dynamical wormhole solution with two fluids system (one isotropic and homogeneous and the other being inhomogeneous and anisotropic in nature) as the matter at the throat. We choose two different forms of Equation of State(EoS) and investigate two solutions of the wormhole geometry. The properties to ensure existence and traversability has been analysed. Also, the model of the dynamic wormhole has been examined for a possibility of the Emergent Universe(EU) model in cosmological context. Finally, for the dynamical wormholes so obtained, Null Energy Condition(NEC) has been examined near the throat.
△ Less
Submitted 9 May, 2024;
originally announced May 2024.
-
Third Harmonic Enhancement Harnessing Photoexcitation Unveils New Nonlinearities in Zinc Oxide
Authors:
Soham Saha,
Sudip Gurung,
Benjamin T. Diroll,
Suman Chakraborty,
Ohad Segal,
Mordechai Segev,
Vladimir M. Shalaev,
Alexander V. Kildishev,
Alexandra Boltasseva,
Richard D. Schaller
Abstract:
Nonlinear optical phenomena are at the heart of various technological domains such as high-speed data transfer, optical logic applications, and emerging fields such as non-reciprocal optics and photonic time crystal design. However, conventional nonlinear materials exhibit inherent limitations in the post-fabrication tailoring of their nonlinear optical properties. Achieving real-time control over…
▽ More
Nonlinear optical phenomena are at the heart of various technological domains such as high-speed data transfer, optical logic applications, and emerging fields such as non-reciprocal optics and photonic time crystal design. However, conventional nonlinear materials exhibit inherent limitations in the post-fabrication tailoring of their nonlinear optical properties. Achieving real-time control over optical nonlinearities remains a challenge. In this work, we demonstrate a method to switch third harmonic generation (THG), a commonly occurring nonlinear optical response. Third harmonic generation enhancements up to 50 times are demonstrated in zinc oxide films via the photoexcited state generation and tunable electric field enhancement. More interestingly, the enhanced third harmonic generation follows a quadratic scaling with incident power, as opposed to the conventional cubic scaling, which demonstrates a previously unreported mechanism of third harmonic generation. The THG can also be suppressed by modulating the optical losses in the film. This work shows that the photoexcitation of states can not only enhance nonlinearities, but can create new processes for third harmonic generation. Importantly, the proposed method enables real-time manipulation of the nonlinear response of a medium. The process is switchable and reversible, with the modulations occurring at picosecond timescale. Our study paves the way to boost or suppress the nonlinearities of solid-state media, enabling robust, switchable sources for nonlinear optical applications.
△ Less
Submitted 8 May, 2024;
originally announced May 2024.
-
Accreting Schwarzschild-like compact object: Plasma-photon interaction and stability
Authors:
Avijit Chowdhury,
Shauvik Biswas,
Sumanta Chakraborty
Abstract:
Accretion is a common phenomenon associated with any astrophysical compact object, which is best described by plasma, a state of matter composed of electrons and heavy ions. In this paper, we analyze the linear dynamics of electromagnetic (EM) fields propagating through the accreting plasma around a static and spherically symmetric horizon-less, exotic compact object (ECO). The general equations g…
▽ More
Accretion is a common phenomenon associated with any astrophysical compact object, which is best described by plasma, a state of matter composed of electrons and heavy ions. In this paper, we analyze the linear dynamics of electromagnetic (EM) fields propagating through the accreting plasma around a static and spherically symmetric horizon-less, exotic compact object (ECO). The general equations governing the propagation of EM waves in such a background exhibits quasi-bound states, with a characteristic oscillation around the BH values, for both the axial and the polar modes, as well as for homogeneous and inhomogeneous plasma distributions. The amplitude of these oscillations depend on the non-zero reflectivity of the surface of the compact object, while the oscillation length depends on its compactness. This results into slower decay of the quasi-bound states with time for certain parameter space of the plasma frequency, compared to BHs, making these ECOs more prone to instabilities.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Towards Neural Synthesis for SMT-Assisted Proof-Oriented Programming
Authors:
Saikat Chakraborty,
Gabriel Ebner,
Siddharth Bhat,
Sarah Fakhoury,
Sakina Fatima,
Shuvendu Lahiri,
Nikhil Swamy
Abstract:
Proof-oriented programs mix computational content with proofs of program correctness. However, the human effort involved in programming and proving is still substantial, despite the use of Satisfiability Modulo Theories (SMT) solvers to automate proofs in languages such as F*.
Seeking to spur research on using AI to automate the construction of proof-oriented programs, we curate a dataset of 600…
▽ More
Proof-oriented programs mix computational content with proofs of program correctness. However, the human effort involved in programming and proving is still substantial, despite the use of Satisfiability Modulo Theories (SMT) solvers to automate proofs in languages such as F*.
Seeking to spur research on using AI to automate the construction of proof-oriented programs, we curate a dataset of 600K lines of open-source F* programs and proofs, including software used in production systems ranging from Windows and Linux, to Python and Firefox. Our dataset includes around 32K top-level F* definitions, each representing a type-directed program and proof synthesis problem -- producing a definition given a formal specification expressed as an F* type. We provide a program-fragment checker that queries F* to check the correctness of candidate solutions. We believe this is the largest corpus of SMT-assisted program proofs coupled with a reproducible program-fragment checker.
Grounded in this dataset, we investigate the use of AI to synthesize programs and their proofs in F*, with promising results. Our main finding in that the performance of fine-tuned smaller language models (such as Phi-2 or StarCoder) compare favorably with large language models (such as GPT-4), at a much lower computational cost. We also identify various type-based retrieval augmentation techniques and find that they boost performance significantly. With detailed error analysis and case studies, we identify potential strengths and weaknesses of models and techniques and suggest directions for future improvements.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
Human Factors in Model-Driven Engineering: Future Research Goals and Initiatives for MDE
Authors:
Grischa Liebel,
Jil Klünder,
Regina Hebig,
Christopher Lazik,
Inês Nunes,
Isabella Graßl,
Jan-Philipp Steghöfer,
Joeri Exelmans,
Julian Oertel,
Kai Marquardt,
Katharina Juhnke,
Kurt Schneider,
Lucas Gren,
Lucia Happe,
Marc Herrmann,
Marvin Wyrich,
Matthias Tichy,
Miguel Goulão,
Rebekka Wohlrab,
Reyhaneh Kalantari,
Robert Heinrich,
Sandra Greiner,
Satrio Adi Rukmono,
Shalini Chakraborty,
Silvia Abrahão
, et al. (1 additional authors not shown)
Abstract:
Purpose: Software modelling and Model-Driven Engineering (MDE) is traditionally studied from a technical perspective. However, one of the core motivations behind the use of software models is inherently human-centred. Models aim to enable practitioners to communicate about software designs, make software understandable, or make software easier to write through domain-specific modelling languages.…
▽ More
Purpose: Software modelling and Model-Driven Engineering (MDE) is traditionally studied from a technical perspective. However, one of the core motivations behind the use of software models is inherently human-centred. Models aim to enable practitioners to communicate about software designs, make software understandable, or make software easier to write through domain-specific modelling languages. Several recent studies challenge the idea that these aims can always be reached and indicate that human factors play a role in the success of MDE. However, there is an under-representation of research focusing on human factors in modelling. Methods: During a GI-Dagstuhl seminar, topics related to human factors in modelling were discussed by 26 expert participants from research and industry. Results: In breakout groups, five topics were covered in depth, namely modelling human aspects, factors of modeller experience, diversity and inclusion in MDE, collaboration and MDE, and teaching human-aware MDE. Conclusion: We summarise our insights gained during the discussions on the five topics. We formulate research goals, questions, and propositions that support directing future initiatives towards an MDE community that is aware of and supportive of human factors and values.
△ Less
Submitted 29 April, 2024;
originally announced April 2024.
-
Conformalized Ordinal Classification with Marginal and Conditional Coverage
Authors:
Subhrasish Chakraborty,
Chhavi Tyagi,
Haiyan Qiao,
Wenge Guo
Abstract:
Conformal prediction is a general distribution-free approach for constructing prediction sets combined with any machine learning algorithm that achieve valid marginal or conditional coverage in finite samples. Ordinal classification is common in real applications where the target variable has natural ordering among the class labels. In this paper, we discuss constructing distribution-free predicti…
▽ More
Conformal prediction is a general distribution-free approach for constructing prediction sets combined with any machine learning algorithm that achieve valid marginal or conditional coverage in finite samples. Ordinal classification is common in real applications where the target variable has natural ordering among the class labels. In this paper, we discuss constructing distribution-free prediction sets for such ordinal classification problems by leveraging the ideas of conformal prediction and multiple testing with FWER control. Newer conformal prediction methods are developed for constructing contiguous and non-contiguous prediction sets based on marginal and conditional (class-specific) conformal $p$-values, respectively. Theoretically, we prove that the proposed methods respectively achieve satisfactory levels of marginal and class-specific conditional coverages. Through simulation study and real data analysis, these proposed methods show promising performance compared to the existing conformal method.
△ Less
Submitted 25 April, 2024;
originally announced April 2024.