-
Rotating clusters in phase-lagged Kuramoto oscillators with higher-order interactions
Authors:
Bhuwan Moyal,
Priyanka Rajwani,
Subhasanket Dutta,
Sarika Jalan
Abstract:
The effect of phase-lag parameter in pairwise interactions has been a topic of great interest for long. However, real-world systems often have interactions that are beyond pairwise and can be modeled using simplicial complexes. We investigate the effect of the inclusion of phase-lag in coupled Kuramoto oscillators with simplicial interactions and find that it shifts the critical points at which fi…
▽ More
The effect of phase-lag parameter in pairwise interactions has been a topic of great interest for long. However, real-world systems often have interactions that are beyond pairwise and can be modeled using simplicial complexes. We investigate the effect of the inclusion of phase-lag in coupled Kuramoto oscillators with simplicial interactions and find that it shifts the critical points at which first-order transition from cluster synchronized state to incoherent state occurs. In the thermodynamic limit, using the Ott-Antonsen approach we derive a reduced equation for order parameter measuring cluster synchronization. Further, we progress through the self-consistency method to achieve a closed form of the order parameter measuring global synchronization which was lacking in Ott-Antonsen approach. Moreover, considering polar coordinates framework we obtain rotation frequency of the clusters which comes out to be a function of the phase-lag parameter further indicating that phase-lag can be used as a control parameter to achieve a desired cluster frequency.
△ Less
Submitted 1 February, 2024; v1 submitted 27 July, 2023;
originally announced July 2023.
-
Giant conductance of PSS:PEDOT micro-surfaces induced by microbubble lithography
Authors:
Anand Dev Ranjan,
Rakesh Sen,
Sumeet Kumar,
Rahul Vaippully,
Soumya Dutta,
Soumyajit Roy,
Basudev Roy,
Ayan Banerjee
Abstract:
We provide direct evidence of the effects of interface engineering of various substrates by Microbubble lithography (MBL). We choose a model organic plastic (or polymer) poly(3,4-ethylenedioxythiophene) polystyrene sulfonate (PEDOT:PSS), with conductivity of 140 S/cm, as a representative organic system to showcase our technique. Thus, we fabricate permanent patterns of PEDOT:PSS on glass, followed…
▽ More
We provide direct evidence of the effects of interface engineering of various substrates by Microbubble lithography (MBL). We choose a model organic plastic (or polymer) poly(3,4-ethylenedioxythiophene) polystyrene sulfonate (PEDOT:PSS), with conductivity of 140 S/cm, as a representative organic system to showcase our technique. Thus, we fabricate permanent patterns of PEDOT:PSS on glass, followed by a flexible PDMS substrate, and observe conductivity enhancement of 5 times on the former (694 S/cm), and 20 times (2844 S/cm) on the latter, without the use of external doping agents or invasive chemical treatment. Probing the patterned interface, we observe that MBL is able to tune the conformational states of PEDOT:PSS from coils in the pristine form, to extended coils on glass, and almost linear structures in PDMS due to its more malleable liquid-like interface. This results in higher ordering and vanishing grain boundaries leading to the highest conductivity of PEDOT:PSS on PDMS substrates.
△ Less
Submitted 26 July, 2023;
originally announced July 2023.
-
Demystifying Local and Global Fairness Trade-offs in Federated Learning Using Partial Information Decomposition
Authors:
Faisal Hamman,
Sanghamitra Dutta
Abstract:
This work presents an information-theoretic perspective to group fairness trade-offs in federated learning (FL) with respect to sensitive attributes, such as gender, race, etc. Existing works often focus on either $\textit{global fairness}$ (overall disparity of the model across all clients) or $\textit{local fairness}$ (disparity of the model at each client), without always considering their trad…
▽ More
This work presents an information-theoretic perspective to group fairness trade-offs in federated learning (FL) with respect to sensitive attributes, such as gender, race, etc. Existing works often focus on either $\textit{global fairness}$ (overall disparity of the model across all clients) or $\textit{local fairness}$ (disparity of the model at each client), without always considering their trade-offs. There is a lack of understanding regarding the interplay between global and local fairness in FL, particularly under data heterogeneity, and if and when one implies the other. To address this gap, we leverage a body of work in information theory called partial information decomposition (PID), which first identifies three sources of unfairness in FL, namely, $\textit{Unique Disparity}$, $\textit{Redundant Disparity}$, and $\textit{Masked Disparity}$. We demonstrate how these three disparities contribute to global and local fairness using canonical examples. This decomposition helps us derive fundamental limits on the trade-off between global and local fairness, highlighting where they agree or disagree. We introduce the $\textit{Accuracy and Global-Local Fairness Optimality Problem (AGLFOP)}$, a convex optimization that defines the theoretical limits of accuracy and fairness trade-offs, identifying the best possible performance any FL strategy can attain given a dataset and client distribution. We also present experimental results on synthetic datasets and the ADULT dataset to support our theoretical findings.
△ Less
Submitted 4 March, 2024; v1 submitted 20 July, 2023;
originally announced July 2023.
-
Horizontal and Vertical Differentiation: Approaching Endogenous Measurement in Intra-industry Trade
Authors:
Sourish Dutta
Abstract:
Studying intra-industry trade involves theoretical explanations and empirical methods to measure the phenomenon. Indicators have been developed to measure the intensity of intra-industry trade, leading to theoretical models explaining its determinants. It is essential to distinguish between horizontal and vertical differentiation in empirical analyses. The determinants and consequences of intra-in…
▽ More
Studying intra-industry trade involves theoretical explanations and empirical methods to measure the phenomenon. Indicators have been developed to measure the intensity of intra-industry trade, leading to theoretical models explaining its determinants. It is essential to distinguish between horizontal and vertical differentiation in empirical analyses. The determinants and consequences of intra-industry trade depend on whether the traded products differ in quality. A method for distinguishing between vertical and horizontal differentiation involves comparing exports' unit value to imports for each industry's intra-industry trade. This approach has limitations, leading to the need for an alternative method.
△ Less
Submitted 30 August, 2023; v1 submitted 20 July, 2023;
originally announced July 2023.
-
Disentangling Societal Inequality from Model Biases: Gender Inequality in Divorce Court Proceedings
Authors:
Sujan Dutta,
Parth Srivastava,
Vaishnavi Solunke,
Swaprava Nath,
Ashiqur R. KhudaBukhsh
Abstract:
Divorce is the legal dissolution of a marriage by a court. Since this is usually an unpleasant outcome of a marital union, each party may have reasons to call the decision to quit which is generally documented in detail in the court proceedings. Via a substantial corpus of 17,306 court proceedings, this paper investigates gender inequality through the lens of divorce court proceedings. While emerg…
▽ More
Divorce is the legal dissolution of a marriage by a court. Since this is usually an unpleasant outcome of a marital union, each party may have reasons to call the decision to quit which is generally documented in detail in the court proceedings. Via a substantial corpus of 17,306 court proceedings, this paper investigates gender inequality through the lens of divorce court proceedings. While emerging data sources (e.g., public court records) on sensitive societal issues hold promise in aiding social science research, biases present in cutting-edge natural language processing (NLP) methods may interfere with or affect such studies. We thus require a thorough analysis of potential gaps and limitations present in extant NLP resources. In this paper, on the methodological side, we demonstrate that existing NLP resources required several non-trivial modifications to quantify societal inequalities. On the substantive side, we find that while a large number of court cases perhaps suggest changing norms in India where women are increasingly challenging patriarchy, AI-powered analyses of these court proceedings indicate striking gender inequality with women often subjected to domestic violence.
△ Less
Submitted 8 July, 2023;
originally announced July 2023.
-
Gradient Sparsification For Masked Fine-Tuning of Transformers
Authors:
James O' Neill,
Sourav Dutta
Abstract:
Fine-tuning pretrained self-supervised language models is widely adopted for transfer learning to downstream tasks. Fine-tuning can be achieved by freezing gradients of the pretrained network and only updating gradients of a newly added classification layer, or by performing gradient updates on all parameters. Gradual unfreezing makes a trade-off between the two by gradually unfreezing gradients o…
▽ More
Fine-tuning pretrained self-supervised language models is widely adopted for transfer learning to downstream tasks. Fine-tuning can be achieved by freezing gradients of the pretrained network and only updating gradients of a newly added classification layer, or by performing gradient updates on all parameters. Gradual unfreezing makes a trade-off between the two by gradually unfreezing gradients of whole layers during training. This has been an effective strategy to trade-off between storage and training speed with generalization performance. However, it is not clear whether gradually unfreezing layers throughout training is optimal, compared to sparse variants of gradual unfreezing which may improve fine-tuning performance. In this paper, we propose to stochastically mask gradients to regularize pretrained language models for improving overall fine-tuned performance. We introduce GradDrop and variants thereof, a class of gradient sparsification methods that mask gradients during the backward pass, acting as gradient noise. GradDrop is sparse and stochastic unlike gradual freezing. Extensive experiments on the multilingual XGLUE benchmark with XLMR-Large show that GradDrop is competitive against methods that use additional translated data for intermediate pretraining and outperforms standard fine-tuning and gradual unfreezing. A post-analysis shows how GradDrop improves performance with languages it was not trained on, such as under-resourced languages.
△ Less
Submitted 19 July, 2023;
originally announced July 2023.
-
Attention over pre-trained Sentence Embeddings for Long Document Classification
Authors:
Amine Abdaoui,
Sourav Dutta
Abstract:
Despite being the current de-facto models in most NLP tasks, transformers are often limited to short sequences due to their quadratic attention complexity on the number of tokens. Several attempts to address this issue were studied, either by reducing the cost of the self-attention computation or by modeling smaller sequences and combining them through a recurrence mechanism or using a new transfo…
▽ More
Despite being the current de-facto models in most NLP tasks, transformers are often limited to short sequences due to their quadratic attention complexity on the number of tokens. Several attempts to address this issue were studied, either by reducing the cost of the self-attention computation or by modeling smaller sequences and combining them through a recurrence mechanism or using a new transformer model. In this paper, we suggest to take advantage of pre-trained sentence transformers to start from semantically meaningful embeddings of the individual sentences, and then combine them through a small attention layer that scales linearly with the document length. We report the results obtained by this simple architecture on three standard document classification datasets. When compared with the current state-of-the-art models using standard fine-tuning, the studied method obtains competitive results (even if there is no clear best model in this configuration). We also showcase that the studied architecture obtains better results when freezing the underlying transformers. A configuration that is useful when we need to avoid complete fine-tuning (e.g. when the same frozen transformer is shared by different applications). Finally, two additional experiments are provided to further evaluate the relevancy of the studied architecture over simpler baselines.
△ Less
Submitted 18 July, 2023;
originally announced July 2023.
-
AI-assisted Improved Service Provisioning for Low-latency XR over 5G NR
Authors:
Moyukh Laha,
Dibbendu Roy,
Sourav Dutta,
Goutam Das
Abstract:
Extended Reality (XR) is one of the most important 5G/6G media applications that will fundamentally transform human interactions. However, ensuring low latency, high data rate, and reliability to support XR services poses significant challenges. This letter presents a novel AI-assisted service provisioning scheme that leverages predicted frames for processing rather than relying solely on actual f…
▽ More
Extended Reality (XR) is one of the most important 5G/6G media applications that will fundamentally transform human interactions. However, ensuring low latency, high data rate, and reliability to support XR services poses significant challenges. This letter presents a novel AI-assisted service provisioning scheme that leverages predicted frames for processing rather than relying solely on actual frames. This method virtually increases the network delay budget and consequently improves service provisioning, albeit at the expense of minor prediction errors. The proposed scheme is validated by extensive simulations demonstrating a multi-fold increase in supported XR users and also provides crucial network design insights.
△ Less
Submitted 18 July, 2023;
originally announced July 2023.
-
Self-Distilled Quantization: Achieving High Compression Rates in Transformer-Based Language Models
Authors:
James O' Neill,
Sourav Dutta
Abstract:
We investigate the effects of post-training quantization and quantization-aware training on the generalization of Transformer language models. We present a new method called self-distilled quantization (SDQ) that minimizes accumulative quantization errors and outperforms baselines. We apply SDQ to multilingual models XLM-R-Base and InfoXLM-Base and demonstrate that both models can be reduced from…
▽ More
We investigate the effects of post-training quantization and quantization-aware training on the generalization of Transformer language models. We present a new method called self-distilled quantization (SDQ) that minimizes accumulative quantization errors and outperforms baselines. We apply SDQ to multilingual models XLM-R-Base and InfoXLM-Base and demonstrate that both models can be reduced from 32-bit floating point weights to 8-bit integer weights while maintaining a high level of performance on the XGLUE benchmark. Our results also highlight the challenges of quantizing multilingual models, which must generalize to languages they were not fine-tuned on.
△ Less
Submitted 12 July, 2023;
originally announced July 2023.
-
For Women, Life, Freedom: A Participatory AI-Based Social Web Analysis of a Watershed Moment in Iran's Gender Struggles
Authors:
Adel Khorramrouz,
Sujan Dutta,
Ashiqur R. KhudaBukhsh
Abstract:
In this paper, we present a computational analysis of the Persian language Twitter discourse with the aim to estimate the shift in stance toward gender equality following the death of Mahsa Amini in police custody. We present an ensemble active learning pipeline to train a stance classifier. Our novelty lies in the involvement of Iranian women in an active role as annotators in building this AI sy…
▽ More
In this paper, we present a computational analysis of the Persian language Twitter discourse with the aim to estimate the shift in stance toward gender equality following the death of Mahsa Amini in police custody. We present an ensemble active learning pipeline to train a stance classifier. Our novelty lies in the involvement of Iranian women in an active role as annotators in building this AI system. Our annotators not only provide labels, but they also suggest valuable keywords for more meaningful corpus creation as well as provide short example documents for a guided sampling step. Our analyses indicate that Mahsa Amini's death triggered polarized Persian language discourse where both fractions of negative and positive tweets toward gender equality increased. The increase in positive tweets was slightly greater than the increase in negative tweets. We also observe that with respect to account creation time, between the state-aligned Twitter accounts and pro-protest Twitter accounts, pro-protest accounts are more similar to baseline Persian Twitter activity.
△ Less
Submitted 7 July, 2023;
originally announced July 2023.
-
A Study of Electronic and Magnetic Properties of Transition Metal Trihalides
Authors:
Shrestha Dutta,
Sachin Varma U,
Payel Bandyopadhyay,
Rudra Banerjee
Abstract:
We present the electronic and magnetic structure calculations of VCl3, VBr3, CrCl3 and CrBr3. The results are obtained by density functional theory with plane wave basis sets. The trihalides generally optimize either in trigonal or monoclinic structures. We have focused on the effect of symmetry on the electronic and magnetic properties of the systems. We have found that magnetic moments change co…
▽ More
We present the electronic and magnetic structure calculations of VCl3, VBr3, CrCl3 and CrBr3. The results are obtained by density functional theory with plane wave basis sets. The trihalides generally optimize either in trigonal or monoclinic structures. We have focused on the effect of symmetry on the electronic and magnetic properties of the systems. We have found that magnetic moments change considerably depending on the symmetry. Both CrX3 have shown a bandgap around 2eV while the V-based systems have shown half-metallic properties.
△ Less
Submitted 2 July, 2023;
originally announced July 2023.
-
Episodic Accretion in Protostars -- An ALMA Survey of Molecular Jets in the Orion Molecular Cloud
Authors:
Somnath Dutta,
Chin-Fei Lee,
Doug Johnstone,
Jeong-Eun Lee,
Naomi Hirano,
James Di Francesco,
Anthony Moraghan,
Tie Liu,
Dipen Sahu,
Sheng-Yuan Liu,
Kenichi Tatematsu,
Chang Won Lee,
Shanghuo Li,
David Eden,
Mika Juvela,
Leonardo Bronfman,
Shih-Ying Hsu,
Kee-Tae Kim,
Woojin Kwon,
Patricio Sanhueza,
Jesus Alejandro Lopez-Vazquez,
Qiuyi Luo,
Hee-Weon Yi
Abstract:
Protostellar outflows and jets are almost ubiquitous characteristics during the mass accretion phase, and encode the history of stellar accretion, complex-organic molecule (COM) formation, and planet formation. Episodic jets are likely connected to episodic accretion through the disk. Despite the importance, there is a lack of studies of a statistically significant sample of protostars via high-se…
▽ More
Protostellar outflows and jets are almost ubiquitous characteristics during the mass accretion phase, and encode the history of stellar accretion, complex-organic molecule (COM) formation, and planet formation. Episodic jets are likely connected to episodic accretion through the disk. Despite the importance, there is a lack of studies of a statistically significant sample of protostars via high-sensitivity and high-resolution observations. To explore episodic accretion mechanisms and the chronologies of episodic events, we investigated 42 fields containing protostars with ALMA observations of CO, SiO, and 1.3\,mm continuum emission. We detected SiO emission in 21 fields, where 19 sources are driving confirmed molecular jets with high abundances of SiO. Jet velocities, mass-loss rates, mass-accretion rates, and periods of accretion events are found to be dependent on the driving forces of the jet (e.g., bolometric luminosity, envelope mass). Next, velocities and mass-loss rates are positively correlated with the surrounding envelope mass, suggesting that the presence of high mass around protostars increases the ejection-accretion activity. We determine mean periods of ejection events of 20$-$175 years for our sample, which could be associated with perturbation zones of $\sim$ 2$-$25\,au extent around the protostars. Also, mean ejection periods are anti-correlated with the envelope mass, where high-accretion rates may trigger more frequent ejection events. The observed periods of outburst/ejection are much shorter than the freeze-out time scale of the simplest COMs like CH$_3$OH, suggesting that episodic events largely maintain the ice-gas balance inside and around the snowline.
△ Less
Submitted 27 June, 2023;
originally announced June 2023.
-
Robust Classification of High-Dimensional Data using Data-Adaptive Energy Distance
Authors:
Jyotishka Ray Choudhury,
Aytijhya Saha,
Sarbojit Roy,
Subhajit Dutta
Abstract:
Classification of high-dimensional low sample size (HDLSS) data poses a challenge in a variety of real-world situations, such as gene expression studies, cancer research, and medical imaging. This article presents the development and analysis of some classifiers that are specifically designed for HDLSS data. These classifiers are free of tuning parameters and are robust, in the sense that they are…
▽ More
Classification of high-dimensional low sample size (HDLSS) data poses a challenge in a variety of real-world situations, such as gene expression studies, cancer research, and medical imaging. This article presents the development and analysis of some classifiers that are specifically designed for HDLSS data. These classifiers are free of tuning parameters and are robust, in the sense that they are devoid of any moment conditions of the underlying data distributions. It is shown that they yield perfect classification in the HDLSS asymptotic regime, under some fairly general conditions. The comparative performance of the proposed classifiers is also investigated. Our theoretical results are supported by extensive simulation studies and real data analysis, which demonstrate promising advantages of the proposed classification techniques over several widely recognized methods.
△ Less
Submitted 24 June, 2023;
originally announced June 2023.
-
Intermolecular Coulombic decay by concerted transfer of energy from photoreceptors to a reaction center
Authors:
Saroj Barik,
Nihar Ranjan Behera,
Saurav Dutta,
Y. Sajeev,
G. Aravind
Abstract:
Molecular mechanisms that enable concerted transfer of energy from several photoacceptors to a distinct reaction center are most desirable for the utilization of light-energy. Here we show that intermolecular Coulombic decay, a channel which enables non-local disposal of energy in photoexcited molecules, offers an avenue for such a novel energy-transfer mechanism. On irradiation of pyridine-argon…
▽ More
Molecular mechanisms that enable concerted transfer of energy from several photoacceptors to a distinct reaction center are most desirable for the utilization of light-energy. Here we show that intermolecular Coulombic decay, a channel which enables non-local disposal of energy in photoexcited molecules, offers an avenue for such a novel energy-transfer mechanism. On irradiation of pyridine-argon gas mixture at 266 nm and at low laser intensities, we observed a surprisingly dominant formation of argon cations. Our measurements on the laser-power dependence of the yield of the Ar cations reveal that intermolecular Coulombic interactions concertedly localize the excitation energy of several photoexcited pyridines at the argon reaction center and ionize it. The density of the reaction center offers an efficient handle to optimize this concerted energy-transfer. This mechanism paves the way for a new $π$-molecular light-harvesting system, and can also contribute to biomolecular stability against photodamage.
△ Less
Submitted 17 June, 2023;
originally announced June 2023.
-
ALMA Survey of Orion Planck Galactic Cold Clumps (ALMASOP): A forming quadruple system with continuum `ribbons' and intricate outflows
Authors:
Qiu-yi Luo,
Tie Liu,
Aaron T. Lee,
Stella S. R. Offner,
James di Francesco,
Doug Johnstone,
Mika Juvela,
Paul F. Goldsmith,
Sheng-Li Qin,
Xiaofeng Mai,
Xun-chuan Liu,
Patricio Sanhueza,
Feng-Wei Xu,
Ken'ichi Tatematsu,
Somnath Dutta,
Huei-Ru Vivien Chen,
Shanghuo Li,
Aiyuan Yang,
Sheng-Yuan Liu,
Chin-Fei Lee,
Naomi Hirano,
Chang Won Lee,
Dipen Sahu,
Hsien Shang,
Shih-Ying Hsu
, et al. (9 additional authors not shown)
Abstract:
One of the most poorly understood aspects of low-mass star formation is how multiple-star systems are formed. Here we present the results of Atacama Large Millimeter/submillimeter Array (ALMA) Band-6 observations towards a forming quadruple protostellar system, G206.93-16.61E2, in the Orion B molecular cloud. ALMA 1.3 mm continuum emission reveals four compact objects, of which two are Class I you…
▽ More
One of the most poorly understood aspects of low-mass star formation is how multiple-star systems are formed. Here we present the results of Atacama Large Millimeter/submillimeter Array (ALMA) Band-6 observations towards a forming quadruple protostellar system, G206.93-16.61E2, in the Orion B molecular cloud. ALMA 1.3 mm continuum emission reveals four compact objects, of which two are Class I young stellar objects (YSOs), and the other two are likely in prestellar phase. The 1.3 mm continuum emission also shows three asymmetric ribbon-like structures that are connected to the four objects, with lengths ranging from $\sim$500 au to $\sim$2200 au. By comparing our data with magneto-hydrodynamic (MHD) simulations, we suggest that these ribbons trace accretion flows and also function as gas bridges connecting the member protostars. Additionally, ALMA CO J=2-1 line emission reveals a complicated molecular outflow associated with G206.93-16.61E2 with arc-like structures suggestive of an outflow cavity viewed pole-on.
△ Less
Submitted 13 June, 2023;
originally announced June 2023.
-
Multi-objective Anti-swing Trajectory Planning of Double-pendulum Tower Crane Operations using Opposition-based Evolutionary Algorithm
Authors:
Souravik Dutta,
Yiyu Cai,
Jianmin Zheng
Abstract:
Underactuated tower crane lifting requires time-energy optimal trajectories for the trolley/slew operations and reduction of the unactuated swings resulting from the trolley/jib motion. In scenarios involving non-negligible hook mass or long rig-cable, the hook-payload unit exhibits double-pendulum behaviour, making the problem highly challenging. This article introduces an offline multi-objective…
▽ More
Underactuated tower crane lifting requires time-energy optimal trajectories for the trolley/slew operations and reduction of the unactuated swings resulting from the trolley/jib motion. In scenarios involving non-negligible hook mass or long rig-cable, the hook-payload unit exhibits double-pendulum behaviour, making the problem highly challenging. This article introduces an offline multi-objective anti-swing trajectory planning module for a Computer-Aided Lift Planning (CALP) system of autonomous double-pendulum tower cranes, addressing all the transient state constraints. A set of auxiliary outputs are selected by methodically analyzing the payload swing dynamics and are used to prove the differential flatness property of the crane operations. The flat outputs are parameterized via suitable Bézier curves to formulate the multi-objective trajectory optimization problems in the flat output space. A novel multi-objective evolutionary algorithm called Collective Oppositional Generalized Differential Evolution 3 (CO-GDE3) is employed as the optimizer. To obtain faster convergence and better consistency in getting a wide range of good solutions, a new population initialization strategy is integrated into the conventional GDE3. The computationally efficient initialization method incorporates various concepts of computational opposition. Statistical comparisons based on trolley and slew operations verify the superiority of convergence and reliability of CO-GDE3 over the standard GDE3. Trolley and slew operations of a collision-free lifting path computed via the path planner of the CALP system are selected for a simulation study. The simulated trajectories demonstrate that the proposed planner can produce time-energy optimal solutions, keeping all the state variables within their respective limits and restricting the hook and payload swings.
△ Less
Submitted 30 May, 2023;
originally announced May 2023.
-
Robust Counterfactual Explanations for Neural Networks With Probabilistic Guarantees
Authors:
Faisal Hamman,
Erfaun Noorani,
Saumitra Mishra,
Daniele Magazzeni,
Sanghamitra Dutta
Abstract:
There is an emerging interest in generating robust counterfactual explanations that would remain valid if the model is updated or changed even slightly. Towards finding robust counterfactuals, existing literature often assumes that the original model $m$ and the new model $M$ are bounded in the parameter space, i.e., $\|\text{Params}(M){-}\text{Params}(m)\|{<}Δ$. However, models can often change s…
▽ More
There is an emerging interest in generating robust counterfactual explanations that would remain valid if the model is updated or changed even slightly. Towards finding robust counterfactuals, existing literature often assumes that the original model $m$ and the new model $M$ are bounded in the parameter space, i.e., $\|\text{Params}(M){-}\text{Params}(m)\|{<}Δ$. However, models can often change significantly in the parameter space with little to no change in their predictions or accuracy on the given dataset. In this work, we introduce a mathematical abstraction termed $\textit{naturally-occurring}$ model change, which allows for arbitrary changes in the parameter space such that the change in predictions on points that lie on the data manifold is limited. Next, we propose a measure -- that we call $\textit{Stability}$ -- to quantify the robustness of counterfactuals to potential model changes for differentiable models, e.g., neural networks. Our main contribution is to show that counterfactuals with sufficiently high value of $\textit{Stability}$ as defined by our measure will remain valid after potential $\textit{naturally-occurring}$ model changes with high probability (leveraging concentration bounds for Lipschitz function of independent Gaussians). Since our quantification depends on the local Lipschitz constant around a data point which is not always available, we also examine practical relaxations of our proposed measure and demonstrate experimentally how they can be incorporated to find robust counterfactuals for neural networks that are close, realistic, and remain valid after potential model changes. This work also has interesting connections with model multiplicity, also known as, the Rashomon effect.
△ Less
Submitted 16 March, 2024; v1 submitted 19 May, 2023;
originally announced May 2023.
-
Precision versus Shrinkage: A Comparative Analysis of Covariance Estimation Methods for Portfolio Allocation
Authors:
Sumanjay Dutta,
Shashi Jain
Abstract:
In this paper, we perform a comprehensive study of different covariance and precision matrix estimation methods in the context of minimum variance portfolio allocation. The set of models studied by us can be broadly categorized as: Gaussian Graphical Model (GGM) based methods, Shrinkage Methods, Thresholding and Random Matrix Theory (RMT) based methods. Among these, GGM methods estimate the precis…
▽ More
In this paper, we perform a comprehensive study of different covariance and precision matrix estimation methods in the context of minimum variance portfolio allocation. The set of models studied by us can be broadly categorized as: Gaussian Graphical Model (GGM) based methods, Shrinkage Methods, Thresholding and Random Matrix Theory (RMT) based methods. Among these, GGM methods estimate the precision matrix directly while the other approaches estimate the covariance matrix. We perform a synthetic experiment to study the network learning and sample complexity performance of GGM methods. Thereafter, we compare all the covariance and precision matrix estimation methods in terms of their predictive ability for daily, weekly and monthly horizons. We consider portfolio risk as an indicator of estimation error and employ it as a loss function for comparison of the methods under consideration. We find that GGM methods outperform shrinkage and other approaches. Our observations for the performance of GGM methods are consistent with the synthetic experiment. We also propose a new criterion for the hyperparameter tuning of GGM methods. Our tuning approach outperforms the existing methodology in the synthetic setup. We further perform an empirical experiment where we study the properties of the estimated precision matrix. The properties of the estimated precision matrices calculated using our tuning approach are in agreement with the algorithm performances observed in the synthetic experiment and the empirical experiment for predictive ability performance comparison. Apart from this, we perform another synthetic experiment which demonstrates the direct relation between estimation error of the precision matrix and portfolio risk.
△ Less
Submitted 9 May, 2023;
originally announced May 2023.
-
Multilingual LLMs are Better Cross-lingual In-context Learners with Alignment
Authors:
Eshaan Tanwar,
Subhabrata Dutta,
Manish Borthakur,
Tanmoy Chakraborty
Abstract:
In-context learning (ICL) unfolds as large language models become capable of inferring test labels conditioned on a few labeled samples without any gradient update. ICL-enabled large language models provide a promising step forward toward bypassing recurrent annotation costs in a low-resource setting. Yet, only a handful of past studies have explored ICL in a cross-lingual setting, in which the ne…
▽ More
In-context learning (ICL) unfolds as large language models become capable of inferring test labels conditioned on a few labeled samples without any gradient update. ICL-enabled large language models provide a promising step forward toward bypassing recurrent annotation costs in a low-resource setting. Yet, only a handful of past studies have explored ICL in a cross-lingual setting, in which the need for transferring label-knowledge from a high-resource language to a low-resource one is immensely crucial. To bridge the gap, we provide the first in-depth analysis of ICL for cross-lingual text classification. We find that the prevalent mode of selecting random input-label pairs to construct the prompt-context is severely limited in the case of cross-lingual ICL, primarily due to the lack of alignment in the input as well as the output spaces. To mitigate this, we propose a novel prompt construction strategy -- Cross-lingual In-context Source-Target Alignment (X-InSTA). With an injected coherence in the semantics of the input examples and a task-based alignment across the source and target languages, X-InSTA is able to outperform random prompt selection by a large margin across three different tasks using 44 different cross-lingual pairs.
△ Less
Submitted 24 June, 2023; v1 submitted 10 May, 2023;
originally announced May 2023.
-
Characterizing cool, neutral gas and ionized metals in the outskirts of low-z galaxy clusters
Authors:
Sapna Mishra,
Sowgat Muzahid,
Sayak Dutta,
Raghunathan Srianand,
Jane Charlton
Abstract:
We present the first detection of cool, neutral gas in the outskirts of low-z galaxy clusters using a statistically significant sample of 3191 z$\approx$0.2 background quasar - foreground cluster pairs by cross-matching the Hubble Spectroscopic Legacy Archive quasar catalog with optically- and SZ-selected cluster catalogs. The median cluster mass of our sample is $\approx 10^{14.2}$ M_sun, with a…
▽ More
We present the first detection of cool, neutral gas in the outskirts of low-z galaxy clusters using a statistically significant sample of 3191 z$\approx$0.2 background quasar - foreground cluster pairs by cross-matching the Hubble Spectroscopic Legacy Archive quasar catalog with optically- and SZ-selected cluster catalogs. The median cluster mass of our sample is $\approx 10^{14.2}$ M_sun, with a median impact parameter ($ρ_{cl}$) of $\approx5$ Mpc. We detect significant Lya, marginal CIV, but no OVI absorption in the signal-to-noise ratio weighted mean stacked spectra with rest-frame equivalent widths of 0.096$\pm$0.011 A, 0.032$\pm$0.015 A, and <0.009 A (3$σ$) for our sample. The Lya REW shows a declining trend with increasing $ρ_{cl}$ ($ρ_{cl}$ / $R_{500}$) which is well explained by a power-law with a slope of -0.79 (-0.70). The covering fractions (CFs) measured for Lya (21\%), CIV (10\%) and OVI (10\%) in cluster outskirts are significantly lower than in the circumgalatic medium (CGM). We also find that the CGM of galaxies that are closer to cluster centers or that are in massive clusters is considerably deficient in neutral gas. The low CF of the Lya along with the non-detection of Lya signal when the strong absorbers (N(HI) > $10^{13} cm^{-2}$) are excluded, indicate the patchy distribution of cool gas in the outskirts. We argue that the cool gas in cluster outskirts in combination arises from the circumgalactic gas stripped from cluster galaxies and to large-scale filaments feeding the clusters with cool gas.
△ Less
Submitted 6 November, 2023; v1 submitted 9 May, 2023;
originally announced May 2023.
-
Generalized Linear Models of T$_{90}$-T$_{50}$ relation to classify GRBs
Authors:
Sourav Dutta,
Sunanda,
Reetanjali Moharana,
Manish Kumar
Abstract:
Gamma-ray bursts (GRBs) can be classified with their linearly dependent parameters alongside the standard $T_{90}$ distribution. The Generalized linear mixture model(GLM) identifies the number of linear dependencies in a two-parameter space. Classically, GRBs are classified into two classes by the presence of bimodality in the histogram of T$_{90}$. However, additional classes and sub-classes of G…
▽ More
Gamma-ray bursts (GRBs) can be classified with their linearly dependent parameters alongside the standard $T_{90}$ distribution. The Generalized linear mixture model(GLM) identifies the number of linear dependencies in a two-parameter space. Classically, GRBs are classified into two classes by the presence of bimodality in the histogram of T$_{90}$. However, additional classes and sub-classes of GRBs are fascinating topics to explore. In this work, we investigate the GRBs classes in the $ T_{90} {-}T_{50}$ plane using the Generalized Linear Models(GLM) for Fermi GBM and BATSE catalogs. This study shows five linear features for the Fermi GBM catalog and four linear features for the BATSE catalog, directing towards the possibility of more than two GRB classes.
△ Less
Submitted 9 May, 2023; v1 submitted 6 May, 2023;
originally announced May 2023.
-
Impact of phase lag on synchronization in frustrated Kuramoto model with higher-order interactions
Authors:
Sangita Dutta,
Abhijit Mondal,
Prosenjit Kundu,
Pitambar Khanra,
Pinaki Pal,
Chittaranjan Hens
Abstract:
The study of first order transition (explosive synchronization) in an ensemble (network) of coupled oscillators has been the topic of paramount interest among the researchers for more than onedecade. Several frameworks have been proposed to induce explosive synchronization in a network and it has been reported that phase frustration in a network usually suppresses first order transition in the pre…
▽ More
The study of first order transition (explosive synchronization) in an ensemble (network) of coupled oscillators has been the topic of paramount interest among the researchers for more than onedecade. Several frameworks have been proposed to induce explosive synchronization in a network and it has been reported that phase frustration in a network usually suppresses first order transition in the presence of pairwise interactions among the oscillators. However, on the contrary, by considering networks of phase frustrated coupled oscillators in the presence of higher order interactions (upto 2-simplexes) we show here under certain conditions, phase frustration can promote explosive synchronization in a network. A reduced order model of the network in the thermodynamic limit is derived using the Ott-Antonsen ansatz to explain this surprising result. Analytical treatment of the reduced order model including bifurcation analysis explains the apparent counter intuitive result quite clearly.
△ Less
Submitted 2 May, 2023;
originally announced May 2023.
-
A novel distribution with upside down bathtub shape hazard rate: properties, estimation and applications
Authors:
Tuhin Subhra Mahatao,
Subhankar Dutta,
Suchandan Kayal
Abstract:
In this communication, we introduce a new statistical model and study its various mathematical properties. The expressions for hazard rate, reversed hazard rate, and odd functions are provided. We explore the asymptotic behaviors of the density and hazard functions of the newly proposed model. Further, moments, median, quantile, and mode are obtained. The cumulative distribution and density functi…
▽ More
In this communication, we introduce a new statistical model and study its various mathematical properties. The expressions for hazard rate, reversed hazard rate, and odd functions are provided. We explore the asymptotic behaviors of the density and hazard functions of the newly proposed model. Further, moments, median, quantile, and mode are obtained. The cumulative distribution and density functions of the general $k$th order statistic are provided. Sufficient conditions, under which the likelihood ratio order between two inverse generalized linear failure rate (IGLFR) distributed random variables holds, are derived. In addition to these results, we introduce several estimates for the parameters of IGLFR distribution. The maximum likelihood and maximum product spacings estimates are proposed. Bayes estimates are calculated with respect to the squared error loss function. Further, asymptotic confidence and Bayesian credible intervals are obtained. To observe the performance of the proposed estimates, we carry out a Monte Carlo simulation using $R$ software. Finally, two real-life data sets are considered for the purpose of illustration.
△ Less
Submitted 21 April, 2023;
originally announced April 2023.
-
Statistical inference for dependent competing risks data under adaptive Type-II progressive hybrid censoring
Authors:
Subhankar Dutta,
Suchandan Kayal
Abstract:
In this article, we consider statistical inference based on dependent competing risks data from Marshall-Olkin bivariate Weibull distribution. The maximum likelihood estimates of the unknown model parameters have been computed by using the Newton-Raphson method under adaptive Type II progressive hybrid censoring with partially observed failure causes. The existence and uniqueness of maximum likeli…
▽ More
In this article, we consider statistical inference based on dependent competing risks data from Marshall-Olkin bivariate Weibull distribution. The maximum likelihood estimates of the unknown model parameters have been computed by using the Newton-Raphson method under adaptive Type II progressive hybrid censoring with partially observed failure causes. The existence and uniqueness of maximum likelihood estimates are derived. Approximate confidence intervals have been constructed via the observed Fisher information matrix using the asymptotic normality property of the maximum likelihood estimates. Bayes estimates and highest posterior density credible intervals have been calculated under gamma-Dirichlet prior distribution by using the Markov chain Monte Carlo technique. Convergence of Markov chain Monte Carlo samples is tested. In addition, a Monte Carlo simulation is carried out to compare the effectiveness of the proposed methods. Further, three different optimality criteria have been taken into account to obtain the most effective censoring plans. Finally, a real-life data set has been analyzed to illustrate the operability and applicability of the proposed methods.
△ Less
Submitted 19 April, 2023;
originally announced April 2023.
-
HCAM -- Hierarchical Cross Attention Model for Multi-modal Emotion Recognition
Authors:
Soumya Dutta,
Sriram Ganapathy
Abstract:
Emotion recognition in conversations is challenging due to the multi-modal nature of the emotion expression. We propose a hierarchical cross-attention model (HCAM) approach to multi-modal emotion recognition using a combination of recurrent and co-attention neural network models. The input to the model consists of two modalities, i) audio data, processed through a learnable wav2vec approach and, i…
▽ More
Emotion recognition in conversations is challenging due to the multi-modal nature of the emotion expression. We propose a hierarchical cross-attention model (HCAM) approach to multi-modal emotion recognition using a combination of recurrent and co-attention neural network models. The input to the model consists of two modalities, i) audio data, processed through a learnable wav2vec approach and, ii) text data represented using a bidirectional encoder representations from transformers (BERT) model. The audio and text representations are processed using a set of bi-directional recurrent neural network layers with self-attention that converts each utterance in a given conversation to a fixed dimensional embedding. In order to incorporate contextual knowledge and the information across the two modalities, the audio and text embeddings are combined using a co-attention layer that attempts to weigh the utterance level embeddings relevant to the task of emotion recognition. The neural network parameters in the audio layers, text layers as well as the multi-modal co-attention layers, are hierarchically trained for the emotion classification task. We perform experiments on three established datasets namely, IEMOCAP, MELD and CMU-MOSI, where we illustrate that the proposed model improves significantly over other benchmarks and helps achieve state-of-art results on all these datasets.
△ Less
Submitted 9 January, 2024; v1 submitted 13 April, 2023;
originally announced April 2023.
-
The James Webb Space Telescope Mission
Authors:
Jonathan P. Gardner,
John C. Mather,
Randy Abbott,
James S. Abell,
Mark Abernathy,
Faith E. Abney,
John G. Abraham,
Roberto Abraham,
Yasin M. Abul-Huda,
Scott Acton,
Cynthia K. Adams,
Evan Adams,
David S. Adler,
Maarten Adriaensen,
Jonathan Albert Aguilar,
Mansoor Ahmed,
Nasif S. Ahmed,
Tanjira Ahmed,
Rüdeger Albat,
Loïc Albert,
Stacey Alberts,
David Aldridge,
Mary Marsha Allen,
Shaune S. Allen,
Martin Altenburg
, et al. (983 additional authors not shown)
Abstract:
Twenty-six years ago a small committee report, building on earlier studies, expounded a compelling and poetic vision for the future of astronomy, calling for an infrared-optimized space telescope with an aperture of at least $4m$. With the support of their governments in the US, Europe, and Canada, 20,000 people realized that vision as the $6.5m$ James Webb Space Telescope. A generation of astrono…
▽ More
Twenty-six years ago a small committee report, building on earlier studies, expounded a compelling and poetic vision for the future of astronomy, calling for an infrared-optimized space telescope with an aperture of at least $4m$. With the support of their governments in the US, Europe, and Canada, 20,000 people realized that vision as the $6.5m$ James Webb Space Telescope. A generation of astronomers will celebrate their accomplishments for the life of the mission, potentially as long as 20 years, and beyond. This report and the scientific discoveries that follow are extended thank-you notes to the 20,000 team members. The telescope is working perfectly, with much better image quality than expected. In this and accompanying papers, we give a brief history, describe the observatory, outline its objectives and current observing program, and discuss the inventions and people who made it possible. We cite detailed reports on the design and the measured performance on orbit.
△ Less
Submitted 10 April, 2023;
originally announced April 2023.
-
A Unified Approach to Optimally Solving Sensor Scheduling and Sensor Selection Problems in Kalman Filtering
Authors:
Shamak Dutta,
Nils Wilde,
Stephen L. Smith
Abstract:
We consider a general form of the sensor scheduling problem for state estimation of linear dynamical systems, which involves selecting sensors that minimize the trace of the Kalman filter error covariance (weighted by a positive semidefinite matrix) subject to polyhedral constraints on the selected sensors. This general form captures several well-studied problems including sensor placement, sensor…
▽ More
We consider a general form of the sensor scheduling problem for state estimation of linear dynamical systems, which involves selecting sensors that minimize the trace of the Kalman filter error covariance (weighted by a positive semidefinite matrix) subject to polyhedral constraints on the selected sensors. This general form captures several well-studied problems including sensor placement, sensor scheduling with budget constraints, and Linear Quadratic Gaussian (LQG) control and sensing co-design. We present a mixed integer optimization approach that is derived by exploiting the optimality of the Kalman filter. While existing work has focused on approximate methods to specific problem variants, our work provides a unified approach to computing optimal solutions to the general version of sensor scheduling. In simulation, we show this approach finds optimal solutions for systems with 30 to 50 states in seconds.
△ Less
Submitted 11 December, 2023; v1 submitted 5 April, 2023;
originally announced April 2023.
-
Self-organized intracellular twisters
Authors:
Sayantan Dutta,
Reza Farhadifar,
Wen Lu,
Gokberk kabacaoglu,
Robert Blackwell,
David B Stein,
Margot Lakonishok,
Vladimir I. Gelfand,
Stanislav Y. Shvartsman,
Michael J. Shelley
Abstract:
Life in complex systems, such as cities and organisms, comes to a standstill when global coordination of mass, energy, and information flows is disrupted. Global coordination is no less important in single cells, especially in large oocytes and newly formed embryos, which commonly use fast fluid flows for dynamic reorganization of their cytoplasm. Here, we combine theory, computing, and imaging to…
▽ More
Life in complex systems, such as cities and organisms, comes to a standstill when global coordination of mass, energy, and information flows is disrupted. Global coordination is no less important in single cells, especially in large oocytes and newly formed embryos, which commonly use fast fluid flows for dynamic reorganization of their cytoplasm. Here, we combine theory, computing, and imaging to investigate such flows in the Drosophila oocyte, where streaming has been proposed to spontaneously arise from hydrodynamic interactions among cortically anchored microtubules loaded with cargo-carrying molecular motors. We use a fast, accurate, and scalable numerical approach to investigate fluid-structure interactions of 1000s of flexible fibers and demonstrate the robust emergence and evolution of cell-spanning vortices, or twisters. Dominated by a rigid body rotation and secondary toroidal components, these flows are likely involved in rapid mixing and transport of ooplasmic components.
△ Less
Submitted 5 April, 2023; v1 submitted 4 April, 2023;
originally announced April 2023.
-
Comprehensive study of forced convection over a heated elliptical cylinder with varying angle of incidences to uniform free stream
Authors:
Raghav Singhal,
Sailen Dutta,
Jiten C. Kalita
Abstract:
In this paper we carry out a numerical investigation of forced convection heat transfer from a heated elliptical cylinder in a uniform free stream with angle of inclination $θ^{\circ}$. Numerical simulations were carried out for $10 \leq Re \leq 120$, $0^{\circ} \leq θ\leq 180^{\circ}$, and $Pr = 0.71$. Results are reported for both steady and unsteady state regime in terms of streamlines, vortici…
▽ More
In this paper we carry out a numerical investigation of forced convection heat transfer from a heated elliptical cylinder in a uniform free stream with angle of inclination $θ^{\circ}$. Numerical simulations were carried out for $10 \leq Re \leq 120$, $0^{\circ} \leq θ\leq 180^{\circ}$, and $Pr = 0.71$. Results are reported for both steady and unsteady state regime in terms of streamlines, vorticity contours, isotherms, drag and lift coefficients, Strouhal number, and Nusselt number. In the process, we also propose a novel method of computing the Nusselt number by merely gathering flow information along the normal to the ellipse boundary. The critical $Re$ at which which flow becomes unsteady, $Re_c$ is reported for all the values of $θ$ considered and found to be the same for $θ$ and $180^\circ -θ$ for $0^\circ \leq θ\leq 90^\circ$. In the steady regime, the $Re$ at which flow separation occurs progressively decreases as $θ$ increases. The surface averaged Nusselt number ($Nu_{\text{av}}$) increases with $Re$, whereas the drag force experienced by the cylinder decreases with $Re$. The transient regime is characterized by periodic vortex shedding, which is quantified by the Strouhal number ($St$). Vortex shedding frequency increases with $Re$ and decreases with $θ$ for a given $Re$. $Nu_{\text{av}}$ also exhibits a time-varying oscillatory behaviour with a time period which is half the time period of vortex shedding. The amplitude of oscillation of $Nu_{\text{av}}$ increases with $θ$.
△ Less
Submitted 30 March, 2023;
originally announced March 2023.
-
MUSEQuBES: Mapping the distribution of neutral hydrogen around low-redshift galaxies
Authors:
Sayak Dutta,
Sowgat Muzahid,
Joop Schaye,
Sapna Mishra,
Hsiao-Wen Chen,
Sean Johnson,
Lutz Wisotzki,
Sebastiano Cantalupo
Abstract:
We present a detailed study of cool, neutral gas traced by Lya around 4595 z<0.5 galaxies using stacks of background quasar spectra. The galaxies are selected from our MUSEQuBES low-z survey along with data from the literature. These galaxies, with a median stellar mass of log (M*/Msun)= 10.0, are probed by 184 background quasars giving rise to 5054 quasar-galaxy pairs. The median impact parameter…
▽ More
We present a detailed study of cool, neutral gas traced by Lya around 4595 z<0.5 galaxies using stacks of background quasar spectra. The galaxies are selected from our MUSEQuBES low-z survey along with data from the literature. These galaxies, with a median stellar mass of log (M*/Msun)= 10.0, are probed by 184 background quasars giving rise to 5054 quasar-galaxy pairs. The median impact parameter is b = 1.5 pMpc (median b/Rvir=10.4) with 204 (419) quasar-galaxy pairs probing b/Rvir < 1 (2). We find excess absorption out to at least ~ 15 Rvir transverse distance and ~ 600 km/s along the line of sight. We show that the median stacked profile for the full sample, dominated by the pairs with b > Rvir, can be explained by a galaxy-absorber two-point correlation function with r0 = 7.6 pMpc and gamma = -1.57. There are strong indications that the inner regions (< Rvir) of the rest equivalent width profile are better explained by a log-linear (or a Gaussian) relation whereas the outer regions are well described by a power-law, consistent with galaxy-absorber large-scale clustering. Using a sub-sample of 339 galaxies (442 quasar-galaxy pairs, median b/Rvir = 1.6) with star formation rate measurements, we find that the Lya absorption is significantly stronger for star-forming galaxies compared to passive galaxies, but only within the virial radius. The Lya absorption at b ~ Rvir for a redshift-controlled sample peaks at M* ~ 10^9 Msun~ (Mhalo ~ 10^11 Msun).
△ Less
Submitted 25 January, 2024; v1 submitted 29 March, 2023;
originally announced March 2023.
-
Perfect synchronization in complex networks with higher order interactions
Authors:
Sangita Dutta,
Prosenjit Kundu,
Pitambar Khanra,
Chittaranjan Hens,
Pinaki Pal
Abstract:
We propose a framework for achieving perfect synchronization in complex networks of Sakaguchi-Kuramoto oscillators in presence of higher order interactions (simplicial complexes) at a targeted point in the parameter space. It is achieved by using an analytically derived frequency set from the governing equations. The frequency set not only provides stable perfect synchronization in the network at…
▽ More
We propose a framework for achieving perfect synchronization in complex networks of Sakaguchi-Kuramoto oscillators in presence of higher order interactions (simplicial complexes) at a targeted point in the parameter space. It is achieved by using an analytically derived frequency set from the governing equations. The frequency set not only provides stable perfect synchronization in the network at a desired point, but also proves to be very effective in achieving high level of synchronization around it compared to the choice of any other frequency sets (Uniform, Normal etc.). The proposed framework has been verified using scale-free, random and small world networks. In all the cases, stable perfect synchronization is achieved at a targeted point for wide ranges of the coupling parameters and phase-frustration. Both first and second order transitions to synchronizations are observed in the system depending on the type of the network and phase frustration. The stability of perfect synchronization state is checked using the low dimensional reduction approach. The robustness of the perfect synchronization state obtained in the system using the derived frequency set is checked by introducing a Gaussian noise around it.
△ Less
Submitted 16 March, 2023;
originally announced March 2023.
-
Classical and quantum cosmology in $f(T)$-gravity theory: A Noether symmetry approach
Authors:
Roshni Bhaumik,
Sourav Dutta,
Subenoy Chakraborty
Abstract:
In the framework of $f(T)$-gravity theory, classical and quantum cosmology has been studied in the present work for FLRW space-time model. The Noether symmetry, a point-like symmetry of the Lagrangian is used to the physical system and a specific functional form of $f(T)$ is determined. A point transformation in the 2D augmented space restricts one of the variable to be cyclic so that the Lagrangi…
▽ More
In the framework of $f(T)$-gravity theory, classical and quantum cosmology has been studied in the present work for FLRW space-time model. The Noether symmetry, a point-like symmetry of the Lagrangian is used to the physical system and a specific functional form of $f(T)$ is determined. A point transformation in the 2D augmented space restricts one of the variable to be cyclic so that the Lagrangian as well as the field equations are simplified so that they are solvable. Lastly for quantum cosmology, the WD equation is constructed and possible solution has been evaluated.
△ Less
Submitted 14 March, 2023;
originally announced March 2023.
-
AdS Witten Diagrams to Carrollian Correlators
Authors:
Arjun Bagchi,
Prateksh Dhivakar,
Sudipta Dutta
Abstract:
Carrollian Conformal Field Theories (CFTs) have been proposed as co-dimension one holographic duals to asymptotically flat spacetimes as opposed to Celestial CFTs which are co-dimension two. In this paper, drawing inspiration from Celestial holography, we show by a suitable generalisation of the flat space limit of AdS that keeps track of the previously disregarded null direction, one can reproduc…
▽ More
Carrollian Conformal Field Theories (CFTs) have been proposed as co-dimension one holographic duals to asymptotically flat spacetimes as opposed to Celestial CFTs which are co-dimension two. In this paper, drawing inspiration from Celestial holography, we show by a suitable generalisation of the flat space limit of AdS that keeps track of the previously disregarded null direction, one can reproduce Carrollian CFT correlation functions from AdS Witten diagrams. In particular, considering Witten diagrams in AdS4, we reproduce two and three-point correlation functions for three dimensional Carrollian CFTs in the so-called delta-function branch. Along the way, we construct non-trivial Carrollian three-point functions in the delta-branch by considering a collinear limit. We also obtain a generalised anti-podal matching condition that now depends on the retarded time direction.
△ Less
Submitted 24 March, 2023; v1 submitted 13 March, 2023;
originally announced March 2023.
-
Creep response of athermal amorphous solids under imposed shear stress
Authors:
Suman Dutta,
Kirsten Martens,
Pinaki Chaudhuri
Abstract:
Yield stress materials fail when the imposed stress crosses a critical threshold. A well-known dynamical response to the applied stress is the phenomenon of creep where the cumulative deformation grows sublinearly with time, prior to failure or arrest. Using extensive molecular dynamics simulations, we study such response for a model amorphous system, in the athermal limit, and probe how the annea…
▽ More
Yield stress materials fail when the imposed stress crosses a critical threshold. A well-known dynamical response to the applied stress is the phenomenon of creep where the cumulative deformation grows sublinearly with time, prior to failure or arrest. Using extensive molecular dynamics simulations, we study such response for a model amorphous system, in the athermal limit, and probe how the annealing history of the initial state determines the observed behaviour to an applied shear stress. Further, we analyze the microscopic dynamics in the vicinity of the yield threshold, using large systems, and characterize the spatiotemporal signatures towards arrest or flow, at different scales.
△ Less
Submitted 8 March, 2023;
originally announced March 2023.
-
Fractional difference sequence spaces via Pascal mean
Authors:
Salila Dutta,
Diptimayee Jena
Abstract:
The main purpose of this article is to introduce Pascal difference sequence spaces of fractional order $ τ$ over the sequence space $\ell_p$ and $\ell_\infty$. Some topological properties of these spaces are considered here along with the Schauder basis, $α -,β-$ and $γ-$duals of the spaces.
The main purpose of this article is to introduce Pascal difference sequence spaces of fractional order $ τ$ over the sequence space $\ell_p$ and $\ell_\infty$. Some topological properties of these spaces are considered here along with the Schauder basis, $α -,β-$ and $γ-$duals of the spaces.
△ Less
Submitted 4 March, 2023;
originally announced March 2023.
-
arXiv:2303.01171
[pdf]
physics.optics
cond-mat.quant-gas
physics.app-ph
physics.atom-ph
physics.chem-ph
A stable 671 nm external cavity diode laser with output power exceeding 150 mW suitable for laser cooling of lithium atoms
Authors:
Sourav Dutta,
Bubai Rahaman
Abstract:
We report the design and performance of a Littrow-type 671 nm External Cavity Diode Laser (ECDL) that delivers output power greater than 150 mW and features enhanced passive stability. The main body of the ECDL is constructed using titanium to minimize temperature related frequency drifts. The laser diode is mounted in a cylindrical mount that allows vertical adjustments while maintaining thermal…
▽ More
We report the design and performance of a Littrow-type 671 nm External Cavity Diode Laser (ECDL) that delivers output power greater than 150 mW and features enhanced passive stability. The main body of the ECDL is constructed using titanium to minimize temperature related frequency drifts. The laser diode is mounted in a cylindrical mount that allows vertical adjustments while maintaining thermal contact with the temperature stabilized base plate. The wavelength tuning is achieved by horizontal displacement of the diffraction grating about an optimal pivot point. The compact design increases the robustness and passive stability of the ECDL and the stiff but light-weight diffraction grating-arm reduces the susceptibility to low-frequency mechanical vibrations. The linewidth of the ECDL is ~360 kHz. We use the 671 nm ECDL, without any additional power amplification, for laser cooling and trapping of lithium atoms in a magneto-optical trap. This simple, low-cost ECDL design using off-the-shelf laser diodes without anti-reflection coating can also be adapted to other wavelengths.
△ Less
Submitted 2 March, 2023;
originally announced March 2023.
-
Noether Symmetry analysis in Chameleon Field Cosmology
Authors:
Roshni Bhaumik,
Sourav Dutta,
Subenoy Chakraborty
Abstract:
This work deals with chameleon field cosmology (a scalar field nonminimally coupled to cold dark matter) in the background of flat Friedmann-Lemaitre-Robertson-Walker (FLRW) space-time. Both classical and quantum cosmology have been investigated using Noether symmetry analysis of the underlying physical system. The Wheeler-DeWitt (WD) equation has been constructed on the minisuperspace and solutio…
▽ More
This work deals with chameleon field cosmology (a scalar field nonminimally coupled to cold dark matter) in the background of flat Friedmann-Lemaitre-Robertson-Walker (FLRW) space-time. Both classical and quantum cosmology have been investigated using Noether symmetry analysis of the underlying physical system. The Wheeler-DeWitt (WD) equation has been constructed on the minisuperspace and solutions have been obtained using conserved charge.
△ Less
Submitted 28 February, 2023;
originally announced February 2023.
-
Characterization of Asymmetric Gap-Engineered Josephson Junctions and 3D Transmon Qubits
Authors:
Zach Steffen,
S. K. Dutta,
Haozhi Wang,
Kungang Li,
Yizhou Huang,
Yi-Hsiang Huang,
Advait Mathur,
F. C. Wellstood,
B. S. Palmer
Abstract:
We have fabricated and characterized asymmetric gap-engineered junctions and transmon devices. To create Josephson junctions with asymmetric gaps, Ti was used to proximitize and lower the superconducting gap of the Al counter-electrode. DC IV measurements of these small, proximitized Josephson junctions show a reduced gap and larger excess current for voltage biases below the superconducting gap w…
▽ More
We have fabricated and characterized asymmetric gap-engineered junctions and transmon devices. To create Josephson junctions with asymmetric gaps, Ti was used to proximitize and lower the superconducting gap of the Al counter-electrode. DC IV measurements of these small, proximitized Josephson junctions show a reduced gap and larger excess current for voltage biases below the superconducting gap when compared to standard Al/AlOx/Al junctions. The energy relaxation time constant for an Al/AlOx/Al/Ti 3D transmon was T1 = 1 μs, over two orders of magnitude shorter than the measured T1 = 134 μs of a standard Al/AlOx/Al 3D transmon. Intentionally adding disorder between the Al and Ti layers reduces the proximity effect and subgap current while increasing the relaxation time to T1 = 32 μs.
△ Less
Submitted 23 February, 2023;
originally announced February 2023.
-
Using Semantic Information for Defining and Detecting OOD Inputs
Authors:
Ramneet Kaur,
Xiayan Ji,
Souradeep Dutta,
Michele Caprio,
Yahan Yang,
Elena Bernardis,
Oleg Sokolsky,
Insup Lee
Abstract:
As machine learning models continue to achieve impressive performance across different tasks, the importance of effective anomaly detection for such models has increased as well. It is common knowledge that even well-trained models lose their ability to function effectively on out-of-distribution inputs. Thus, out-of-distribution (OOD) detection has received some attention recently. In the vast ma…
▽ More
As machine learning models continue to achieve impressive performance across different tasks, the importance of effective anomaly detection for such models has increased as well. It is common knowledge that even well-trained models lose their ability to function effectively on out-of-distribution inputs. Thus, out-of-distribution (OOD) detection has received some attention recently. In the vast majority of cases, it uses the distribution estimated by the training dataset for OOD detection. We demonstrate that the current detectors inherit the biases in the training dataset, unfortunately. This is a serious impediment, and can potentially restrict the utility of the trained model. This can render the current OOD detectors impermeable to inputs lying outside the training distribution but with the same semantic information (e.g. training class labels). To remedy this situation, we begin by defining what should ideally be treated as an OOD, by connecting inputs with their semantic information content. We perform OOD detection on semantic information extracted from the training data of MNIST and COCO datasets and show that it not only reduces false alarms but also significantly improves the detection of OOD inputs with spurious features from the training data.
△ Less
Submitted 21 February, 2023;
originally announced February 2023.
-
DC4L: Distribution Shift Recovery via Data-Driven Control for Deep Learning Models
Authors:
Vivian Lin,
Kuk Jin Jang,
Souradeep Dutta,
Michele Caprio,
Oleg Sokolsky,
Insup Lee
Abstract:
Deep neural networks have repeatedly been shown to be non-robust to the uncertainties of the real world, even to naturally occurring ones. A vast majority of current approaches have focused on data-augmentation methods to expand the range of perturbations that the classifier is exposed to while training. A relatively unexplored avenue that is equally promising involves sanitizing an image as a pre…
▽ More
Deep neural networks have repeatedly been shown to be non-robust to the uncertainties of the real world, even to naturally occurring ones. A vast majority of current approaches have focused on data-augmentation methods to expand the range of perturbations that the classifier is exposed to while training. A relatively unexplored avenue that is equally promising involves sanitizing an image as a preprocessing step, depending on the nature of perturbation. In this paper, we propose to use control for learned models to recover from distribution shifts online. Specifically, our method applies a sequence of semantic-preserving transformations to bring the shifted data closer in distribution to the training set, as measured by the Wasserstein distance. Our approach is to 1) formulate the problem of distribution shift recovery as a Markov decision process, which we solve using reinforcement learning, 2) identify a minimum condition on the data for our method to be applied, which we check online using a binary classifier, and 3) employ dimensionality reduction through orthonormal projection to aid in our estimates of the Wasserstein distance. We provide theoretical evidence that orthonormal projection preserves characteristics of the data at the distributional level. We apply our distribution shift recovery approach to the ImageNet-C benchmark for distribution shifts, demonstrating an improvement in average accuracy of up to 14.21% across a variety of state-of-the-art ImageNet classifiers. We further show that our method generalizes to composites of shifts from the ImageNet-C benchmark, achieving improvements in average accuracy of up to 9.81%. Finally, we test our method on CIFAR-100-C and report improvements of up to 8.25%.
△ Less
Submitted 15 May, 2024; v1 submitted 20 February, 2023;
originally announced February 2023.
-
Quantum routing in planar graph using perfect state transfer
Authors:
Supriyo Dutta
Abstract:
In this article, we consider a spin-spin interaction network governed by $XX + YY$ Hamiltonian. The vertices and edges of the network represent the spin objects and their interactions, respectively. We take a privilege to switch on or off any interaction, that assists us to perform multiple perfect state transfers in a graph simultaneously. We also build up a salable network allowing quantum commu…
▽ More
In this article, we consider a spin-spin interaction network governed by $XX + YY$ Hamiltonian. The vertices and edges of the network represent the spin objects and their interactions, respectively. We take a privilege to switch on or off any interaction, that assists us to perform multiple perfect state transfers in a graph simultaneously. We also build up a salable network allowing quantum communication between two arbitrary vertices. Later we utilize the combinatorial characteristics of hypercube graphs to propose a static routing schema to communicate simultaneously between a set of senders and a set of receivers in a planar network. Our construction is new and significantly powerful. We elaborate multiple examples of planar graphs supporting quantum routing where classical routing is not possible.
△ Less
Submitted 20 February, 2023;
originally announced February 2023.
-
Credal Bayesian Deep Learning
Authors:
Michele Caprio,
Souradeep Dutta,
Kuk Jin Jang,
Vivian Lin,
Radoslav Ivanov,
Oleg Sokolsky,
Insup Lee
Abstract:
Uncertainty quantification and robustness to distribution shifts are important goals in machine learning and artificial intelligence. Although Bayesian Neural Networks (BNNs) allow for uncertainty in the predictions to be assessed, different sources of uncertainty are indistinguishable. We present Credal Bayesian Deep Learning (CBDL). Heuristically, CBDL allows to train an (uncountably) infinite e…
▽ More
Uncertainty quantification and robustness to distribution shifts are important goals in machine learning and artificial intelligence. Although Bayesian Neural Networks (BNNs) allow for uncertainty in the predictions to be assessed, different sources of uncertainty are indistinguishable. We present Credal Bayesian Deep Learning (CBDL). Heuristically, CBDL allows to train an (uncountably) infinite ensemble of BNNs, using only finitely many elements. This is possible thanks to prior and likelihood finitely generated credal sets (FGCSs), a concept from the imprecise probability literature. Intuitively, convex combinations of a finite collection of prior-likelihood pairs are able to represent infinitely many such pairs. After training, CBDL outputs a set of posteriors on the parameters of the neural network. At inference time, such posterior set is used to derive a set of predictive distributions that is in turn utilized to distinguish between aleatoric and epistemic uncertainties, and to quantify them. The predictive set also produces either (i) a collection of outputs enjoying desirable probabilistic guarantees, or (ii) the single output that is deemed the best, that is, the one having the highest predictive lower probability -- another imprecise-probabilistic concept. CBDL is more robust than single BNNs to prior and likelihood misspecification, and to distribution shift. We show that CBDL is better at quantifying and disentangling different types of uncertainties than single BNNs, ensemble of BNNs, and Bayesian Model Averaging. In addition, we apply CBDL to two case studies to demonstrate its downstream tasks capabilities: one, for motion prediction in autonomous driving scenarios, and two, to model blood glucose and insulin dynamics for artificial pancreas control. We show that CBDL performs better when compared to an ensemble of BNNs baseline.
△ Less
Submitted 22 February, 2024; v1 submitted 19 February, 2023;
originally announced February 2023.
-
Higher Spin Gravity in $AdS_3$ and Folds on Fermi Surface
Authors:
Suvankar Dutta,
Debangshu Mukherjee,
Sanhita Parihar
Abstract:
In this paper, we introduce new sets of boundary conditions for higher spin gravity in $AdS_3$ where the boundary dynamics of spin two and other higher spin fields are governed by the interacting collective field theory Hamiltonian of Avan and Jevicki. We show that the time evolution of spin two and higher spin fields can be captured by the classical dynamics of folded fermi surfaces in the simila…
▽ More
In this paper, we introduce new sets of boundary conditions for higher spin gravity in $AdS_3$ where the boundary dynamics of spin two and other higher spin fields are governed by the interacting collective field theory Hamiltonian of Avan and Jevicki. We show that the time evolution of spin two and higher spin fields can be captured by the classical dynamics of folded fermi surfaces in the similar spirit of Lin, Lunin and Maldacena. We also construct infinite sequences of conserved charges showing the integrable structure of higher spin gravity (for spin 3) under the boundary conditions we considered. Further, we observe that there are two possible sequences of conserved charges depending on whether the underlying boundary fermions are non-relativistic or relativistic.
△ Less
Submitted 30 May, 2023; v1 submitted 16 February, 2023;
originally announced February 2023.
-
Multiscalar field cosmological model and possible solutions using Noether symmetry approach
Authors:
Santu Mondal,
Roshni Bhaumik,
Sourav Dutta,
Subenoy Chakraborty
Abstract:
In this work, a cosmological model is considered having two scalar fields minimally coupled to gravity with a mixed kinetic term. The model is characterized by the coupling function and the potential function which are assumed to depend on one of the scalar fields. Instead of choosing these functions phenomenologically here, they are evaluated assuming the existence of Noether symmetry. By appropr…
▽ More
In this work, a cosmological model is considered having two scalar fields minimally coupled to gravity with a mixed kinetic term. The model is characterized by the coupling function and the potential function which are assumed to depend on one of the scalar fields. Instead of choosing these functions phenomenologically here, they are evaluated assuming the existence of Noether symmetry. By appropriate choice of a point transformation in the augmented space, one of the variables in the Lagrangian becomes cyclic and the evolution equations become much simpler to have solutions. Finally, the solutions are analyzed from cosmological view point.
△ Less
Submitted 16 February, 2023;
originally announced February 2023.
-
Density Structure of Centrally Concentrated Prestellar Cores from Multi-scale Observations
Authors:
Dipen Sahu,
Sheng-Yuan Liu,
Doug Johnstone,
Tie Liu,
Neal J. Evans II,
Naomi Hirano,
Kenichi Tatematsu,
James Di Francesco,
Chin-Fei Lee,
Kee-Tae Kim,
Somnath Dutta,
Shih-Ying Hsu,
Shanghuo Li,
Qiu-Yi Luo,
Patricio Sanhueza,
Hsien Shang,
Alessio Traficante,
Mika Juvela,
Chang Won Lee,
David J. Eden,
Paul F. Goldsmith,
Leonardo Bronfman,
Woojin Kwon,
Jeong-Eun Lee,
Yi-Jehng Kuan
, et al. (1 additional authors not shown)
Abstract:
Starless cores represent the initial stage of evolution toward (proto)star formation, and a subset of them, known as prestellar cores, with high density (~ 10^6 cm^-3 or higher) and being centrally concentrated are expected to be embryos of (proto)stars. Determining the density profile of prestellar cores, therefore provides an important opportunity to gauge the initial conditions of star formatio…
▽ More
Starless cores represent the initial stage of evolution toward (proto)star formation, and a subset of them, known as prestellar cores, with high density (~ 10^6 cm^-3 or higher) and being centrally concentrated are expected to be embryos of (proto)stars. Determining the density profile of prestellar cores, therefore provides an important opportunity to gauge the initial conditions of star formation. In this work, we perform rigorous modeling to estimate the density profiles of three nearly spherical prestellar cores among a sample of five highly dense cores detected by our recent observations. We employed multi-scale observational data of the (sub)millimeter dust continuum emission including those obtained by SCUBA-2 on the JCMT with a resolution of ~5600 au and by multiple ALMA observations with a resolution as high as ~480 au. We are able to consistently reproduce the observed multi-scale dust continuum images of the cores with a simple prescribed density profile, which bears an inner region of flat density and a r^-2 profile toward the outer region. By utilizing the peak density and the size of the inner flat region as a proxy for the dynamical stage of the cores, we find that the three modeled cores are most likely unstable and prone to collapse. The sizes of the inner flat regions, as compact as ~500 au, signify them being the highly evolved prestellar cores rarely found to date.
△ Less
Submitted 14 February, 2023;
originally announced February 2023.
-
Interpreting the HI 21-cm cosmology maps through Largest Cluster Statistics -- I: Impact of the synthetic SKA1-Low observations
Authors:
Saswata Dasgupta,
Samit Kumar Pal,
Satadru Bag,
Sohini Dutta,
Suman Majumdar,
Abhirup Datta,
Aadarsh Pathak,
Mohd Kamran,
Rajesh Mondal,
Prakash Sarkar
Abstract:
We analyse the evolution of the largest ionized region using the topological and morphological evolution of the redshifted 21-cm signal coming from the neutral hydrogen distribution during the different stages of reionization. For this analysis, we use the "Largest Cluster Statistics" - LCS. We mainly study the impact of the array synthesized beam on the LCS analysis of the 21-cm signal considerin…
▽ More
We analyse the evolution of the largest ionized region using the topological and morphological evolution of the redshifted 21-cm signal coming from the neutral hydrogen distribution during the different stages of reionization. For this analysis, we use the "Largest Cluster Statistics" - LCS. We mainly study the impact of the array synthesized beam on the LCS analysis of the 21-cm signal considering the upcoming low-frequency Square Kilometer Array (SKA1-Low) observations using a realistic simulation for such observation based on the 21cmE2E-pipeline using OSKAR. We find that bias in LCS estimation is introduced in synthetic observations due to the array beam. This in turn shifts the apparent percolation transition point towards the later stages of reionization. The biased estimates of LCS, occurring due to the effect of the lower resolution (lack of longer baselines) and the telescope synthesized beam will lead to a biased interpretation of the reionization history. This is important to note while interpreting any future 21-cm signal images from upcoming or future telescopes like the SKA, HERA, etc. We conclude that one may need denser $uv$-coverage at longer baselines for a better deconvolution of the array synthesized beam from the 21-cm images and a relatively unbiased estimate of LCS from such images.
△ Less
Submitted 9 June, 2023; v1 submitted 6 February, 2023;
originally announced February 2023.
-
Hatemongers ride on echo chambers to escalate hate speech diffusion
Authors:
Vasu Goel,
Dhruv Sahnan,
Subhabrata Dutta,
Anil Bandhakavi,
Tanmoy Chakraborty
Abstract:
Recent years have witnessed a swelling rise of hateful and abusive content over online social networks. While detection and moderation of hate speech have been the early go-to countermeasures, the solution requires a deeper exploration of the dynamics of hate generation and propagation. We analyze more than 32 million posts from over 6.8 million users across three popular online social networks to…
▽ More
Recent years have witnessed a swelling rise of hateful and abusive content over online social networks. While detection and moderation of hate speech have been the early go-to countermeasures, the solution requires a deeper exploration of the dynamics of hate generation and propagation. We analyze more than 32 million posts from over 6.8 million users across three popular online social networks to investigate the interrelations between hateful behavior, information dissemination, and polarised organization mediated by echo chambers. We find that hatemongers play a more crucial role in governing the spread of information compared to singled-out hateful content. This observation holds for both the growth of information cascades as well as the conglomeration of hateful actors. Dissection of the core-wise distribution of these networks points towards the fact that hateful users acquire a more well-connected position in the social network and often flock together to build up information cascades. We observe that this cohesion is far from mere organized behavior; instead, in these networks, hatemongers dominate the echo chambers -- groups of users actively align themselves to specific ideological positions. The observed dominance of hateful users to inflate information cascades is primarily via user interactions amplified within these echo chambers. We conclude our study with a cautionary note that popularity-based recommendation of content is susceptible to be exploited by hatemongers given their potential to escalate content popularity via echo-chambered interactions.
△ Less
Submitted 5 February, 2023;
originally announced February 2023.
-
Hyper-parameter Tuning for Fair Classification without Sensitive Attribute Access
Authors:
Akshaj Kumar Veldanda,
Ivan Brugere,
Sanghamitra Dutta,
Alan Mishler,
Siddharth Garg
Abstract:
Fair machine learning methods seek to train models that balance model performance across demographic subgroups defined over sensitive attributes like race and gender. Although sensitive attributes are typically assumed to be known during training, they may not be available in practice due to privacy and other logistical concerns. Recent work has sought to train fair models without sensitive attrib…
▽ More
Fair machine learning methods seek to train models that balance model performance across demographic subgroups defined over sensitive attributes like race and gender. Although sensitive attributes are typically assumed to be known during training, they may not be available in practice due to privacy and other logistical concerns. Recent work has sought to train fair models without sensitive attributes on training data. However, these methods need extensive hyper-parameter tuning to achieve good results, and hence assume that sensitive attributes are known on validation data. However, this assumption too might not be practical. Here, we propose Antigone, a framework to train fair classifiers without access to sensitive attributes on either training or validation data. Instead, we generate pseudo sensitive attributes on the validation data by training a biased classifier and using the classifier's incorrectly (correctly) labeled examples as proxies for minority (majority) groups. Since fairness metrics like demographic parity, equal opportunity and subgroup accuracy can be estimated to within a proportionality constant even with noisy sensitive attribute information, we show theoretically and empirically that these proxy labels can be used to maximize fairness under average accuracy constraints. Key to our results is a principled approach to select the hyper-parameters of the biased classifier in a completely unsupervised fashion (meaning without access to ground truth sensitive attributes) that minimizes the gap between fairness estimated using noisy versus ground-truth sensitive labels.
△ Less
Submitted 21 March, 2024; v1 submitted 2 February, 2023;
originally announced February 2023.
-
Characterising Solutions of Anomalous Cancellation
Authors:
Satvik Saha,
Sohom Gupta,
Sayan Dutta,
Sourin Chatterjee
Abstract:
Anomalous cancellation of fractions is a mathematically inaccurate method where cancelling the common digits of the numerator and denominator correctly reduces it. While it appears to be accidentally successful, the property of anomalous cancellation is intricately connected to the number of digits of the denominator as well as the base in which the fraction is represented. Previous work have been…
▽ More
Anomalous cancellation of fractions is a mathematically inaccurate method where cancelling the common digits of the numerator and denominator correctly reduces it. While it appears to be accidentally successful, the property of anomalous cancellation is intricately connected to the number of digits of the denominator as well as the base in which the fraction is represented. Previous work have been mostly surrounding three digit solutions or specific properties of the same. This paper seeks to get general results regarding the structure of numbers that follow the cancellation property (denoted by $P^*_{\ell; k}$) and an estimate of the total number of solutions possible in a given base representation. In particular, interesting properties regarding the saturation of the number of solutions in general and $p^n$ bases (where $p$ is a prime) have been studied in detail.
△ Less
Submitted 31 January, 2023;
originally announced February 2023.
-
Aspects of the map from Exact RG to Holographic RG in AdS and dS
Authors:
Pavan Dharanipragada,
Semanti Dutta,
B. Sathiapalan
Abstract:
In earlier work the evolution operator for the exact RG equation was mapped to a field theory in Euclidean AdS. This gives a simple way of understanding AdS/CFT. We explore aspects of this map by studying a simple example of a Schroedinger equation for a free particle with time dependent mass. This is an analytic continuation of an ERG like equation. We show for instance that it can be mapped to a…
▽ More
In earlier work the evolution operator for the exact RG equation was mapped to a field theory in Euclidean AdS. This gives a simple way of understanding AdS/CFT. We explore aspects of this map by studying a simple example of a Schroedinger equation for a free particle with time dependent mass. This is an analytic continuation of an ERG like equation. We show for instance that it can be mapped to a harmonic oscillator. We show that the same techniques can lead to an understanding of dS/CFT too.
△ Less
Submitted 31 January, 2023;
originally announced January 2023.