
Quantum Machine Learning with Application to Progressive Supranuclear Palsy Network Classification

Abstract: Machine learning and quantum computing are being progressively explored to shed light on possible computational approaches to deal with hitherto unsolvable problems. Classical methods for machine learning are ubiquitous in pattern recognition, with support vector machines (SVMs) being a prominent technique for network classification. However, there are limitations to the successful resolution of such classification instances when the input feature space becomes large, and the successive evaluation of so-called kernel functions becomes computationally exorbitant. The use of principal component analysis (PCA) substantially reduces the dimensionality of the feature space, thereby enabling computational speed-ups of supervised learning: the creation of a classifier. Further, the application of quantum-based learning to the PCA-reduced input feature space might offer an exponential speedup with fewer parameters. The present learning model is evaluated on a real clinical application: the diagnosis of Progressive Supranuclear Palsy (PSP) disorder. The results suggest that quantum machine learning has led to noticeable advancement and outperforms classical frameworks. The optimized variational quantum classifier classifies the PSP dataset with 86% accuracy as compared to conventional SVM. The other technique, a quantum kernel estimator, approximates the kernel function on the quantum machine and optimizes a classical SVM. In particular, we have demonstrated the successful application of the present model on both a quantum simulator and real chips of the IBM quantum platform.

Submitted 6 July, 2024; originally announced July 2024.

arXiv:2407.03544 [pdf, other]

Analytical Gradient and Hessian Evaluation for System Identification using State-Parameter Transition Tensors

Authors: Premjit Saha, Tarunraj Singh

Abstract: In this work, the Einstein notation is utilized to synthesize state and parameter transition matrices, by solving a set of ordinary differential equations. Additionally, for the system identification problem, it has been demonstrated that the gradient and Hessian of a cost function can be analytically constructed using the same matrix and tensor metrics. A general gradient-based optimization problem is then posed to identify unknown system parameters and unknown initial conditions. Here, the analytical gradient and Hessian of the cost function are derived using these state and parameter transition matrices. The more robust performance of the proposed method for identifying unknown system parameters and unknown initial conditions over an existing conventional quasi-Newton method-based system identification toolbox (available in MATLAB) is demonstrated by using two widely used benchmark datasets from real dynamic systems. In the existing toolbox, gradient and Hessian information, which are derived using a finite difference method, are more susceptible to numerical errors compared to the analytical approach presented. Keywords: Gradient-based Optimization, Transition matrix and tensors, Gradient and Hessian, System identification.

Submitted 3 July, 2024; originally announced July 2024.

Comments: 6 pages, Accepted in 2024 Modeling, Estimation and Control Conference

arXiv:2407.01870 [pdf, ps, other]

A Cosmological Holographic Reconstruction of f(Q) Theory

Authors: Pameli Saha, Prabir Rudra

Abstract: This paper explores a cosmological reconstruction scheme in the background of f(Q) gravity theory from a holographic perspective. The basic motivation for this work is that the reconstruction is performed from a holographic origin, which has its roots in black hole thermodynamics and quantum gravity. Dark energy models inspired by the holographic prescription are used to reconstruct the f(Q) gravity models. Two such models, namely the Granda-Oliveros holographic dark energy model and its generalization, the Chen-Jing model, are considered for the study. Different scale factors are used and a thorough reconstruction scheme is set up using the dark energy models. The observationally constrained values of the free model parameters have been used to form the reconstructed models. Finally, a thorough investigation of the energy conditions has been performed to check the cosmological viability of the reconstructed f(Q) models. As an outcome, we get some very promising and cosmologically viable f(Q) models that present some interesting properties and demand further investigation.

Submitted 1 July, 2024; originally announced July 2024.

Comments: 34 pages, 60 figures

arXiv:2406.19543 [pdf, other]

Demarked: A Strategy for Enhanced Abusive Speech Moderation through Counterspeech, Detoxification, and Message Management

Authors: Seid Muhie Yimam, Daryna Dementieva, Tim Fischer, Daniil Moskovskiy, Naquee Rizwan, Punyajoy Saha, Sarthak Roy, Martin Semmann, Alexander Panchenko, Chris Biemann, Animesh Mukherjee

Abstract: Despite regulations imposed by nations and social media platforms, such as recent EU regulations targeting digital violence, abusive content persists as a significant challenge. Existing approaches primarily rely on binary solutions, such as outright blocking or banning, yet fail to address the complex nature of abusive speech. In this work, we propose a more comprehensive approach called Demarcation, scoring abusive speech based on four aspects -- (i) severity scale; (ii) presence of a target; (iii) context scale; (iv) legal scale -- and suggesting more options of actions like detoxification, counter speech generation, blocking, or, as a final measure, human intervention. Through a thorough analysis of abusive speech regulations across diverse jurisdictions, platforms, and research papers, we highlight the gap in preventive measures and advocate for tailored proactive steps to combat its multifaceted manifestations. Our work aims to inform future strategies for effectively addressing abusive speech online.

Submitted 27 June, 2024; originally announced June 2024.

arXiv:2406.14663 [pdf, other]

MagMar III -- Resisting the Pressure, Is the Magnetic Field Overwhelmed in NGC6334I?

Authors: Paulo C. Cortes, Josep M. Girart, Patricio Sanhueza, Junhao Liu, Sergio Martin, Ian W. Stephens, Henrik Beuther, Patrick M. Koch, M. Fernandez-Lopez, Alvaro Sanchez-Monge, Jia-Wei Wang, Kaho Morii, Shanghuo Li, Piyali Saha, Qizhou Zhang, David Rebolledo, Luis A. Zapata, Ji-hyun Kang, Wenyu Jiao, Jongsoo Kim, Yu Cheng, Jihye Hwang, Eun Jung Chung, Spandan Choudhury, A-Ran Lyo , et al. (1 additional authors not shown)

Abstract: We report on ALMA observations of polarized dust emission at 1.2 mm from NGC6334I, a source known for its significant flux outbursts. Over a five-month interval, our data show no substantial change in total intensity and a modest 8% variation in linear polarization, suggesting a phase of stability or the conclusion of the outburst. The magnetic field, inferred from this polarized emission, displays a predominantly radial pattern from North-West to South-East with intricate disturbances across major cores, hinting at spiral structures. Energy analysis of CS$(J=5 \rightarrow 4)$ emission yields an outflow energy of approximately $3.5\times10^{45}$ ergs, aligning with previous interferometric studies. Utilizing the Davis-Chandrasekhar-Fermi method, we determined magnetic field strengths ranging from 1 to 11 mG, averaging at 1.9 mG. This average increases to 4 $\pm 1$ mG when incorporating Zeeman measurements. Comparative analyses using gravitational, thermal, and kinetic energy maps reveal that magnetic energy is significantly weaker, possibly explaining the observed field morphology. We also find that the energy in the outflows and the expanding cometary H II region is larger than the magnetic energy, suggesting that protostellar feedback may be the dominant driver behind the injection of turbulence in NGC6334I at the scales sampled by our data. The gas in NGC6334I predominantly exhibits supersonic and trans-Alfvenic conditions, transitioning towards a super-Alfvenic regime, underscoring a diminished influence of the magnetic field with increasing gas density. These observations are in agreement with prior polarization studies at 220 GHz, enriching our understanding of the dynamic processes in high-mass star-forming regions.

Submitted 20 June, 2024; originally announced June 2024.

Comments: Accepted for Publication at the Astrophysical Journal

arXiv:2406.12911 [pdf, other]

The Promise of Analog Deep Learning: Recent Advances, Challenges and Opportunities

Authors: Aditya Datar, Pramit Saha

Abstract: Much of present-day Artificial Intelligence (AI) utilizes artificial neural networks, which are sophisticated computational models designed to recognize patterns and solve complex problems by learning from data. However, a major bottleneck occurs during a device's calculation of weighted sums for forward propagation and optimization procedure for backpropagation, especially for deep neural networks, or networks with numerous layers. Exploration into different methods of implementing neural networks is necessary for further advancement of the area. While a great deal of research exists on AI hardware in both directions, analog and digital, many existing surveys lack discussion of the progress of analog deep learning. To this end, we attempt to evaluate and specify the advantages and disadvantages, along with the current progress with regard to deep learning, for analog implementations. In this paper, our focus lies on the comprehensive examination of eight distinct analog deep learning methodologies across multiple key parameters. These parameters include attained accuracy levels, application domains, algorithmic advancements, computational speed, and considerations of energy efficiency and power consumption. We also identify the neural network-based experiments implemented using these hardware devices and discuss comparative performance achieved by the different analog deep learning methods along with an analysis of their current limitations. Overall, we find that Analog Deep Learning has great potential for future consumer-level applications, but there is still a long road ahead in terms of scalability. Most of the current implementations are more proof of concept and are not yet practically deployable for large-scale models.

Submitted 13 June, 2024; originally announced June 2024.

arXiv:2406.11636 [pdf, other]

Feasibility of Federated Learning from Client Databases with Different Brain Diseases and MRI Modalities

Authors: Felix Wagner, Wentian Xu, Pramit Saha, Ziyun Liang, Daniel Whitehouse, David Menon, Natalie Voets, J. Alison Noble, Konstantinos Kamnitsas

Abstract: Segmentation models for brain lesions in MRI are commonly developed for a specific disease and trained on data with a predefined set of MRI modalities. Each such model cannot segment the disease using data with a different set of MRI modalities, nor can it segment any other type of disease. Moreover, this training paradigm does not allow a model to benefit from learning from heterogeneous databases that may contain scans and segmentation labels for different types of brain pathologies and diverse sets of MRI modalities. Is it feasible to use Federated Learning (FL) for training a single model on client databases that contain scans and labels of different brain pathologies and diverse sets of MRI modalities? We demonstrate promising results by combining appropriate, simple, and practical modifications to the model and training strategy: designing a model with input channels that cover the whole set of modalities available across clients, training with random modality drop, and exploring the effects of feature normalization methods. Evaluation on 7 brain MRI databases with 5 different diseases shows that such an FL framework can train a single model that is shown to be very promising in segmenting all disease types seen during training. Importantly, it is able to segment these diseases in new databases that contain sets of modalities different from those in training clients. These results demonstrate, for the first time, the feasibility and effectiveness of using FL to train a single segmentation model on decentralised data with diverse brain diseases and MRI modalities, a necessary step towards leveraging heterogeneous real-world databases. Code will be made available at: https://github.com/FelixWag/FL-MultiDisease-MRI

Submitted 17 June, 2024; originally announced June 2024.

ACM Class: I.4.9; I.4.6; I.2.11; I.4.0

arXiv:2406.06703 [pdf, other]

Video-based Exercise Classification and Activated Muscle Group Prediction with Hybrid X3D-SlowFast Network

Authors: Manvik Pasula, Pramit Saha

Abstract: This paper introduces a simple yet effective strategy for exercise classification and muscle group activation prediction (MGAP). These tasks have significant implications for personal fitness, facilitating more affordable, accessible, safer, and simpler exercise routines. This is particularly relevant for novices and individuals with disabilities. Previous research in the field is mostly dominated by the reliance on mounted sensors and a limited scope of exercises, reducing practicality for everyday use. Furthermore, existing MGAP methodologies suffer from a similar dependency on sensors and a restricted range of muscle groups, often excluding strength training exercises, which are pivotal for a comprehensive fitness regimen. Addressing these limitations, our research employs a video-based deep learning framework that encompasses a broad spectrum of exercises and muscle groups, including those vital for strength training. Utilizing the "Workout/Exercises Video" dataset, our approach integrates the X3D and SlowFast video activity recognition models in an effective way to enhance exercise classification and MGAP performance. Our findings demonstrate that this hybrid method obtained via weighted ensemble outperforms existing baseline models in accuracy. Pretrained models play a crucial role in enhancing overall performance, with optimal channel reduction values for the SlowFast model identified near 10. Through an ablation study that explores fine-tuning, we further elucidate the interrelation between the two tasks. Our composite model, a weighted-average ensemble of X3D and SlowFast, sets a new benchmark in both exercise classification and MGAP across all evaluated categories, offering a robust solution to the limitations of previous approaches.

Submitted 10 June, 2024; originally announced June 2024.

Comments: 16 pages, 7 figures, submitted to IEEE Open Journal of the Computer Society

ACM Class: I.2.10; I.4.8

arXiv:2406.02306 [pdf, other]

Bridging the micro-Hz gravitational wave gap via Doppler tracking with the Uranus Orbiter and Probe Mission: Massive black hole binaries, early universe signals and ultra-light dark matter

Authors: Lorenz Zwick, Deniz Soyuer, Daniel J. D'Orazio, David O'Neill, Andrea Derdzinski, Prasenjit Saha, Diego Blas, Alexander C. Jenkins, Luke Zoltan Kelley

Abstract: With the recent announcement by NASA's Planetary Science and Astrobiology Decadal Survey 2023-2032, a priority flagship mission to the planet Uranus is anticipated. Here, we explore the prospects of using the mission's radio Doppler tracking equipment to detect gravitational waves (GWs) and other analogous signals related to dark matter (DM) over the duration of its interplanetary cruise. By employing a methodology to stack tracking data in combination with Monte-Carlo Markov-Chain parameter recovery tests, we show that the mission will be sensitive to GWs over the wide frequency range of $3\times 10^{-9}$ Hz to $10^{-1}$ Hz, provided that tracking data is taken consistently over a large fraction of the cruise duration. Thus, the mission has the potential to fill the gap between pulsar timing and space-based-interferometry GW observatories. Within this assumption, we forecast the detection of $\mathcal{O}(1 - 100)$ individual massive black hole binaries using two independent population models. Additionally, we determine the mission's sensitivity to both astrophysical and primordial stochastic gravitational wave backgrounds, as well as its capacity to test, or even confirm via detection, ultralight DM models. In all these cases, the tracking of the spacecraft over its interplanetary cruise would enable coverage of unexplored regions of parameter space, where signals from new phenomena in our Universe may be lurking.

Submitted 4 June, 2024; originally announced June 2024.

Comments: Submitted to Pr.D. Comments welcome!

arXiv:2406.00803 [pdf, other]

Impact of Vector like quarks on $(g-2)_μ$ with X-II-2HDM scenario and its phenomenological implications

Authors: Md. Raju, Abhi Mukherjee, Jyoti Prasad Saha

Abstract: The recent observation of the muon $(g-2)_μ$ anomaly continues to challenge the explanations provided by the Standard Model. However, this anomaly can potentially find reconciliation within the framework of two-Higgs doublet models, provided that the pseudoscalar mass remains low. The introduction of additional fermionic components, such as a generation of vector-like quarks, not only broadens the acceptable parameter range for elucidating the anomaly but also presents an opportunity to circumvent conflicts with constraints from B-decays and heavy Higgs searches. We demonstrate the efficacy of fitting the anomaly in the muon magnetic moment within these models, assuming that vector-like quarks do not undergo mixing with Standard Model quarks. Interactions following a type-X pattern for Standard Model quarks and a type-II pattern for vector-like quarks result in models designated as type-XII2HDMVLQ. Additionally, we have explored double Higgs production within this model and observed that, when both the heavy Higgs and the VLQ contribute, the double Higgs production cross section is significantly enhanced.

Submitted 2 June, 2024; originally announced June 2024.

Comments: 21 pages, multiple figures, comments are welcome

arXiv:2405.07498 [pdf, other]

Wafer-Scale Integration of Freestanding Photonic Devices with Color Centers in Silicon Carbide

Authors: Sridhar Majety, Victoria A. Norman, Pranta Saha, Alex H. Rubin, Scott Dhuey, Marina Radulaski

Abstract: Color center platforms have been at the forefront of quantum nanophotonics for applications in quantum networking, computing, and sensing. However, large-scale deployment of this technology has been stifled by a lack of ability to integrate photonic devices at scale while maintaining the properties of quantum emitters. We address this challenge in silicon carbide, which both has commercially available wafer-scale substrates and is a host to color centers with desirable optical and spin properties. Using ion beam etching at an angle, we develop a 5-inch wafer process for the fabrication of triangular cross-section photonic devices in bulk 4H-SiC. The developed process has a variability in etch rate and etch angle of 5.4% and 2.9%, respectively. Furthermore, the integrated color centers maintain their optical properties after the etch, thus achieving the nanofabrication goal of wafer-scale nanofabrication in quantum-grade silicon carbide.

Submitted 13 May, 2024; originally announced May 2024.

arXiv:2405.04023 [pdf, other]

Lumbar Spine Tumor Segmentation and Localization in T2 MRI Images Using AI

Authors: Rikathi Pal, Sudeshna Mondal, Aditi Gupta, Priya Saha, Somoballi Ghoshal, Amlan Chakrabarti, Susmita Sur-Kolay

Abstract: In medical imaging, segmentation and localization of spinal tumors in three-dimensional (3D) space pose significant computational challenges, primarily stemming from limited data availability. In response, this study introduces a novel data augmentation technique, aimed at automating spine tumor segmentation and localization through AI approaches. Leveraging a fusion of fuzzy c-means clustering and Random Forest algorithms, the proposed method achieves successful spine tumor segmentation based on predefined masks initially delineated by domain experts in medical imaging. Subsequently, a Convolutional Neural Network (CNN) architecture is employed for tumor classification. Moreover, 3D vertebral segmentation and labeling techniques are used to help pinpoint the exact location of the tumors in the lumbar spine. Results indicate a remarkable performance, with 99% accuracy for tumor segmentation, 98% accuracy for tumor classification, and 99% accuracy for tumor localization achieved with the proposed approach. These metrics surpass the efficacy of existing state-of-the-art techniques, as evidenced by superior Dice Score, Class Accuracy, and Intersection over Union (IOU) on class accuracy metrics. This innovative methodology holds promise for enhancing the diagnostic capabilities in detecting and characterizing spinal tumors, thereby facilitating more effective clinical decision-making.

Submitted 7 May, 2024; originally announced May 2024.

Comments: 9 pages, 12 figures

arXiv:2404.18606 [pdf, other]

Interference with (Pseudo) Thermal Light; The Hanbury Brown and Twiss Effect

Authors: Km Nitu Rai, Soumen Basak, Subrata Sarangi, Prasenjit Saha

Abstract: The correlation of light from two sources leads to an interference pattern if they belong to a specific time interval known as the coherence time, denoted as $Δτ$. The relationship governing this phenomenon is $ΔτΔν\approx 1$, where $Δν$ represents the bandwidth of the light. This requirement is not satisfied, and hence, interference fringes are not observable in the case of ordinary (thermal) light. In the 1950s, Robert Hanbury Brown and Richard Q. Twiss explored interference phenomena using a narrow bandwidth of thermal light. This investigation led to the discovery of the Hanbury-Brown and Twiss effect (or the HBT effect in short), which has since found applications in various fields, particularly stellar observations and quantum optics. This article briefly traces the history of the HBT effect and its applications in various fields, including stellar observations. More importantly, it outlines the basic theoretical framework of this effect, followed by the design and results of the correlation in intensity fluctuation of a pseudo-thermal light in a college laboratory setting (Michelson interferometer).

Submitted 29 April, 2024; originally announced April 2024.

Comments: Submitted to Resonance

arXiv:2404.18291 [pdf, other]

Panoptic Segmentation and Labelling of Lumbar Spine Vertebrae using Modified Attention Unet

Authors: Rikathi Pal, Priya Saha, Somoballi Ghoshal, Amlan Chakrabarti, Susmita Sur-Kolay

Abstract: Segmentation and labeling of vertebrae in MRI images of the spine are critical for the diagnosis of illnesses and abnormalities. These steps are indispensable as MRI technology provides detailed information about the tissue structure of the spine. Both supervised and unsupervised segmentation methods exist, yet acquiring sufficient data remains challenging for achieving high accuracy. In this study, we propose an enhanced approach based on a modified attention U-Net architecture for panoptic segmentation of 3D sliced MRI data of the lumbar spine. Our method achieves an impressive accuracy of 99.5% by incorporating novel masking logic, thus significantly advancing the state-of-the-art in vertebral segmentation and labeling. This contributes to more precise and reliable diagnosis and treatment planning.

Submitted 28 April, 2024; originally announced April 2024.

Comments: 9 pages, 10 figures

arXiv:2404.13332 [pdf, other]

The initial value problem in cosmology: An alternative derivation for beginners

Authors: Kaushik Bhattacharya, Dipanjan Dey, Priyanka Saha

Abstract: In this article we address the puzzle related to the existence of a proper cosmological solution, where the number of equations exceeds the number of unknown functions. Although only two of the various dynamical equations are independent, the solution of those two equations must satisfy all the dynamical equations simultaneously at all instants. We show that this last requirement demands a constraint on the initial condition. When the initial constraint is satisfied, then and only then, one can have proper cosmological dynamics. The whole exercise turns out to be an alternative derivation of the initial value problem in cosmology.

Submitted 20 April, 2024; originally announced April 2024.

Comments: 18 pages

arXiv:2404.08094 [pdf, other]

Microlensing near macro-caustics

Authors: Luke Weisenbach, Timo Anguita, Jordi Miralda-Escudé, Masamune Oguri, Prasenjit Saha, Paul L. Schechter

Abstract: Microlensing near macro-caustics is a complex phenomenon in which swarms of micro-images produced by micro-caustics form on both sides of a macro-critical curve. Recent discoveries of highly magnified images of individual stars in massive galaxy cluster lenses, predicted to be formed by these micro-image swarms, have stimulated studies on this topic. In this Chapter, we explore microlensing near macro-caustics using both simulations and analytic calculations. We show that the mean total magnification of the micro-image swarms follows that of an extended source in the absence of microlensing. Micro-caustics join into a connected network in a region around the macro-critical line of a width proportional to the surface density of microlenses; within this region, the increase of the mean magnification toward the macro-caustic is driven by the increase of the number of micro-images rather than individual magnifications of micro-images. The maximum achievable magnification in micro-caustic crossings decreases with the mass fraction in microlenses. We conclude with a review of applications of this microlensing phenomenon, including limits to the fraction of dark matter in compact objects, and searches of Population III stars and dark matter subhalos. We argue that the discovered highly magnified stars at cosmological distances already imply that less than $\sim$ 10% of the dark matter may be in the form of compact objects with mass above $\sim 10^{-6}\, M_{\odot}$.

Submitted 11 April, 2024; originally announced April 2024.

Comments: 9 pages, 6 figures; to be submitted to Space Science Reviews, Topical Collection "Strong Gravitational Lensing", eds. J. Wambsganss et al

arXiv:2404.04283 [pdf, other]

Translation-based Video-to-Video Synthesis

Authors: Pratim Saha, Chengcui Zhang

Abstract: Translation-based Video Synthesis (TVS) has emerged as a vital research area in computer vision, aiming to facilitate the transformation of videos between distinct domains while preserving both temporal continuity and underlying content features. This technique has found wide-ranging applications, encompassing video super-resolution, colorization, segmentation, and more, by extending the capabilities of traditional image-to-image translation to the temporal domain. One of the principal challenges faced in TVS is the inherent risk of introducing flickering artifacts and inconsistencies between frames during the synthesis process. This is particularly challenging due to the necessity of ensuring smooth and coherent transitions between video frames. Efforts to tackle this challenge have induced the creation of diverse strategies and algorithms aimed at mitigating these unwanted consequences. This comprehensive review extensively examines the latest progress in the realm of TVS. It thoroughly investigates emerging methodologies, shedding light on the fundamental concepts and mechanisms utilized for proficient video synthesis. This survey also illuminates their inherent strengths, limitations, appropriate applications, and potential avenues for future development.

Submitted 3 April, 2024; originally announced April 2024.

Comments: 25 pages, 9 figures

arXiv:2403.14938 [pdf, ps, other]

On Zero-Shot Counterspeech Generation by LLMs

Authors: Punyajoy Saha, Aalok Agrawal, Abhik Jana, Chris Biemann, Animesh Mukherjee

Abstract: With the emergence of numerous Large Language Models (LLMs), the usage of such models in various Natural Language Processing (NLP) applications is increasing extensively. Counterspeech generation is one such key task where efforts are made to develop generative models by fine-tuning LLMs with hatespeech - counterspeech pairs, but none of these attempts explores the intrinsic properties of large language models in zero-shot settings. In this work, we present a comprehensive analysis of the performances of four LLMs, namely GPT-2, DialoGPT, ChatGPT and FlanT5, in zero-shot settings for counterspeech generation, which is the first of its kind. For GPT-2 and DialoGPT, we further investigate the deviation in performance with respect to the sizes (small, medium, large) of the models. In addition, we propose three different prompting strategies for generating different types of counterspeech and analyse the impact of such strategies on the performance of the models. Our analysis shows that there is an improvement in generation quality for two datasets (17%); however, toxicity increases (25%) with increasing model size. Considering the type of model, GPT-2 and FlanT5 are significantly better in terms of counterspeech quality but also have high toxicity compared to DialoGPT. ChatGPT is much better at generating counterspeech than the other models across all metrics. In terms of prompting, we find that our proposed strategies help in improving counterspeech generation across all the models.

Submitted 22 March, 2024; originally announced March 2024.

Comments: 12 pages, 7 tables, accepted at LREC-COLING 2024

arXiv:2403.12161 [pdf]

Effect of Leaders Voice on Financial Market: An Empirical Deep Learning Expedition on NASDAQ, NSE, and Beyond

Authors: Arijit Das, Tanmoy Nandi, Prasanta Saha, Suman Das, Saronyo Mukherjee, Sudip Kumar Naskar, Diganta Saha

Abstract: Financial markets, such as the prices of stocks, shares, gold, oil, and mutual funds, are affected by news and posts on social media. In this work, deep learning-based models are proposed to predict the trend of the financial market based on NLP analysis of the Twitter handles of leaders in different fields. Many models are available that predict the financial market from only the historical data of the financial component, but combining historical data with news and social media posts, such as those on Twitter, is the main objective of the present work. Substantial improvement is shown in the results. The main features of the present work are: a) proposing a completely generalized algorithm that is able to generate models for any Twitter handle and any financial component, b) predicting the time window for a tweet's effect on a stock price, and c) analyzing the effect of multiple Twitter handles for predicting the trend. A detailed survey is done to find the latest work of recent years in similar fields, identify the research gap, and collect the required data for analysis and prediction. A state-of-the-art algorithm is proposed, and a complete implementation with its environment is given. An insightful trend of the result improvement considering the NLP analysis of Twitter data on financial market components is shown. The Indian and USA financial markets are explored in the present work, whereas other markets can be taken up in the future. The socio-economic impact of the present work is discussed in the conclusion.

Submitted 18 March, 2024; originally announced March 2024.

Comments: 20 pages original research

arXiv:2402.14702 [pdf, other]

InfFeed: Influence Functions as a Feedback to Improve the Performance of Subjective Tasks

Authors: Somnath Banerjee, Maulindu Sarkar, Punyajoy Saha, Binny Mathew, Animesh Mukherjee

Abstract: Recently, influence functions present an apparatus for achieving explainability for deep neural models by quantifying the perturbation of individual train instances that might impact a test prediction. Our objectives in this paper are twofold. First, we incorporate influence functions as a feedback into the model to improve its performance. Second, in a dataset extension exercise, we use influence functions to automatically identify data points that have been initially `silver' annotated by some existing method and need to be cross-checked (and corrected) by annotators to improve the model performance. To meet these objectives, in this paper, we introduce InfFeed, which uses influence functions to compute the influential instances for a target instance. Toward the first objective, we adjust the label of the target instance based on its influencer(s) label. In doing this, InfFeed outperforms the state-of-the-art baselines (including LLMs) by a maximum macro F1-score margin of almost 4% for hate speech classification, 3.5% for stance classification, and 3% for irony and 2% for sarcasm detection. Toward the second objective, we show that manually re-annotating only those silver annotated data points in the extension set that have a negative influence can immensely improve the model performance, bringing it very close to the scenario where all the data points in the extension set have gold labels. This allows for a huge reduction in the number of data points that need to be manually annotated since, out of the silver annotated extension dataset, the influence function scheme picks up ~1/1000 points that need manual correction.

Submitted 9 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

Comments: Accepted at LREC-COLING 2024 (Long Paper)

arXiv:2402.12198 [pdf, other]

Zero shot VLMs for hate meme detection: Are we there yet?

Authors: Naquee Rizwan, Paramananda Bhaskar, Mithun Das, Swadhin Satyaprakash Majhi, Punyajoy Saha, Animesh Mukherjee

Abstract: Multimedia content on social media is rapidly evolving, with memes gaining prominence as a distinctive form. Unfortunately, some malicious users exploit memes to target individuals or vulnerable communities, making it imperative to identify and address such instances of hateful memes. Extensive research has been conducted to address this issue by developing hate meme detection models. However, a notable limitation of traditional machine/deep learning models is the requirement for labeled datasets for accurate classification. Recently, the research community has witnessed the emergence of several visual language models that have exhibited outstanding performance across various tasks. In this study, we aim to investigate the efficacy of these visual language models in handling intricate tasks such as hate meme detection. We use various prompt settings to focus on zero-shot classification of hateful/harmful memes. Through our analysis, we observe that large VLMs are still vulnerable for zero-shot hate meme detection.

Submitted 19 February, 2024; originally announced February 2024.

arXiv:2402.10096 [pdf]

Quantum Linear Magnetoresistance and Fermi Liquid Behavior in Kagome Metal Ni3In2S2

Authors: P. Das, P. Saha, M. Singh, P. Kumar, S. Patnaik

Abstract: Kagome metals gain attention as they manifest a spectrum of quantum phenomena, including superconductivity, charge order, frustrated magnetism, and intertwined correlated states of condensed matter. With regard to electronic band structure, several of them exhibit non-trivial topological characteristics. Here, we present a thorough investigation on the growth and the physical properties of single crystals of Ni3In2S2, which is established to be a Dirac nodal line Kagome metal. Extensive characterization is attained through temperature and field-dependent resistivity, angle-dependent magnetoresistance and specific heat measurements. In most metals, the Fermi liquid behaviour is mostly restricted to a narrow range of temperature. In Ni3In2S2, this characteristic feature has been observed for an extensive temperature range of 82 K. This is attributed to the strong electron-electron correlation in the material. Specific heat measurements reveal a high Kadowaki-Woods ratio, which is in good agreement with strongly correlated systems. Almost linear positive magnetoresistance follows the conventional Kohler scaling, which depicts the applicability of semi-classical theories. The angle-dependent magnetoresistance has been explained using the Voigt-Thomson formula. Furthermore, de Haas-van Alphen oscillations are observed in magnetization vs. magnetic field measurements, which shed light on the topological features in the Shandite Ni3In2S2.

Submitted 15 February, 2024; originally announced February 2024.

arXiv:2402.07262 [pdf, other]

Low-Resource Counterspeech Generation for Indic Languages: The Case of Bengali and Hindi

Authors: Mithun Das, Saurabh Kumar Pandey, Shivansh Sethi, Punyajoy Saha, Animesh Mukherjee

Abstract: With the rise of online abuse, the NLP community has begun investigating the use of neural architectures to generate counterspeech that can "counter" the vicious tone of such abusive speech and dilute/ameliorate their rippling effect over the social network. However, most of the efforts so far have been primarily focused on English. To bridge the gap for low-resource languages such as Bengali and Hindi, we create a benchmark dataset of 5,062 abusive speech/counterspeech pairs, of which 2,460 pairs are in Bengali and 2,602 pairs are in Hindi. We implement several baseline models considering various interlingual transfer mechanisms with different configurations to generate suitable counterspeech to set up an effective benchmark. We observe that the monolingual setup yields the best performance. Further, using synthetic transfer, language models can generate counterspeech to some extent; specifically, we notice that transferability is better when languages belong to the same language family.

Submitted 11 February, 2024; originally announced February 2024.

Comments: Accepted to the Findings of the ACL: EACL 2024

arXiv:2402.05294 [pdf, other]

Examining Modality Incongruity in Multimodal Federated Learning for Medical Vision and Language-based Disease Detection

Authors: Pramit Saha, Divyanshu Mishra, Felix Wagner, Konstantinos Kamnitsas, J. Alison Noble

Abstract: Multimodal Federated Learning (MMFL) utilizes multiple modalities in each client to build a more powerful Federated Learning (FL) model than its unimodal counterpart. However, the impact of missing modality in different clients, also called modality incongruity, has been greatly overlooked. This paper, for the first time, analyses the impact of modality incongruity and reveals its connection with data heterogeneity across participating clients. We particularly inspect whether incongruent MMFL with unimodal and multimodal clients is more beneficial than unimodal FL. Furthermore, we examine three potential routes of addressing this issue. Firstly, we study the effectiveness of various self-attention mechanisms towards incongruity-agnostic information fusion in MMFL. Secondly, we introduce a modality imputation network (MIN) pre-trained in a multimodal client for modality translation in unimodal clients and investigate its potential towards mitigating the missing modality problem. Thirdly, we assess the capability of client-level and server-level regularization techniques towards mitigating modality incongruity effects. Experiments are conducted under several MMFL settings on two publicly available real-world datasets, MIMIC-CXR and Open-I, with Chest X-Ray and radiology reports.

Submitted 7 February, 2024; originally announced February 2024.

Comments: 42 pages

arXiv:2402.04755 [pdf, other]

doi 10.1093/mnras/stae697

Performance and first measurements of the MAGIC Stellar Intensity Interferometer

Authors: MAGIC Collaboration, S. Abe, J. Abhir, V. A. Acciari, A. Aguasca-Cabot, I. Agudo, T. Aniello, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, C. Arcaro, M. Artero, K. Asano, A. Babić, A. Baquero, U. Barres de Almeida, J. A. Barrio, I. Batković, A. Bautista, J. Baxter, J. Becerra González, E. Bernardini, M. Bernardos, J. Bernete, A. Berti, et al. (195 additional authors not shown)

Abstract: In recent years, a new generation of optical intensity interferometers has emerged, leveraging the existing infrastructure of Imaging Atmospheric Cherenkov Telescopes (IACTs). The MAGIC telescopes host the MAGIC-SII system (Stellar Intensity Interferometer), implemented to investigate the feasibility and potential of this technique on IACTs. After the first successful measurements in 2019, the system was upgraded and now features a real-time, dead-time-free, 4-channel, GPU-based correlator. These hardware modifications allow seamless transitions between MAGIC's standard very-high-energy gamma-ray observations and optical interferometry measurements within seconds. We establish the feasibility and potential of employing IACTs as competitive optical Intensity Interferometers with minimal hardware adjustments. Measurements of a total of 22 stellar diameters are reported, 9 corresponding to reference stars with previous comparable measurements, and 13 with no prior measurements. A prospective implementation involving telescopes from the forthcoming Cherenkov Telescope Array Observatory's northern hemisphere array, such as the first prototype of its Large-Sized Telescopes, LST-1, is technically viable. This integration would significantly enhance the sensitivity of the current system and broaden the UV-plane coverage. This advancement would enable the system to achieve competitive sensitivity with the current generation of long-baseline optical interferometers over blue wavelengths.

Submitted 7 February, 2024; originally announced February 2024.

Comments: 18 pages, 13 figures, submitted to MNRAS

arXiv:2402.03344 [pdf, other]

First (calibration) experiment using proton beam from FRENA at SINP

Authors: C. Basu, K. Banerjee, T. K. Ghosh, G. Mukherjee, C. Bhattacharya, Shraddha S Desai, R. Shil, A. K. Saha, J. K. Meena, T. Bar, D. Basak, L. K. Sahoo, S. Saha, C. Marick, D. Das, D. Das, D. Das, M. Kujur, S. Roy, S. S. Basu, U. Gond, A. Saha, A. Das, M. Samanta, P. Saha, et al. (1 additional author not shown)

Abstract: This work presents the first calibration experiment of a 3 MV Tandetron accelerator, FRENA, performed in May 2022. The $^7$Li(p,n) reaction threshold was measured to calibrate the terminal voltage measuring device. A LiF target of thickness 175 $μ$g/cm$^2$ was used in the experiment. The measured threshold was 1872$\pm$2.7 keV, indicating 6$-$10 keV energy shift.

Submitted 24 January, 2024; originally announced February 2024.

arXiv:2401.11957 [pdf, other]

Gravitational collapse of matter in the presence of non-minimally coupled Quintessence and Phantom-like scalar fields

Authors: Priyanka Saha, Dipanjan Dey, Kaushik Bhattacharya

Abstract: This paper explores the evolution of the over-dense region of dark matter in the presence of a non-minimally coupled scalar field which is used to model quintessence and phantom-like dark energy. We focus on algebraic coupling, where the interaction Lagrangian is independent of the derivatives of the scalar field. To make our model more relativistic, like the minimal coupling scenario we studied earlier, we consider a spacetime structure that is internally closed Friedmann-Lemaitre-Robertson-Walker (FLRW) spacetime and externally the generalized Vaidya spacetime. This structure allows non-zero matter flux at the boundary of the over-dense region. Our investigation reveals that an increment of the coupling strength causes dark energy to cluster with dark matter at a certain cosmological scale where the influence of dark energy cannot be ignored. This phenomenon arises from the specific nature of the non-minimal coupling considered in this paper. While the evolution of matter's energy density remains unchanged, the scalar field's Klein-Gordon equation is modified, causing dark energy to deviate from its homogeneous state and cluster with dark matter. Similar to minimal coupling scenarios, closed spherical regions do not collapse within certain parameter ranges, exhibiting eternal expansion within the spatially flat FLRW spacetime acting as voids with decreasing matter density. The study extends our understanding of the cosmological scenarios where the virialization of the over-dense regions of dark matter is influenced by the non-minimally coupled dark energy.

Submitted 22 January, 2024; originally announced January 2024.

Comments: 18 pages, 8 figures

arXiv:2401.10509 [pdf, other]

ICECAP: a 3-in-1 integrated cryogenic system for emission, collection and photon-detection from near infrared quantum nanophotonic devices

Authors: Victoria A. Norman, Sridhar Majety, Alex H. Rubin, Pranta Saha, Jeanette Simo, Bradi Palomarez, Liang Li, Pietra B. Curro, Scott Dhuey, Selven Virasawmy, Marina Radulaski

Abstract: Deployment of quantum telecommunication technologies requires single-photon light emission, collection and detection capability at each network node in cryogenic environments. We combine recent technological advancements in single-photon detectors and cryogenics to demonstrate a 3-in-1 system that incorporates superconducting nanowire single-photon detectors into an optical cryostat operating at temperatures below 2 K. Dubbed the ICECAP system, this cryostation cools samples, collects emission, and detects single photons in one efficient environment suitable for a variety of near infrared quantum emitters. We utilize this system to characterize emission from silicon carbide color centers in photoluminescence and time-resolved measurements. Moreover, we demonstrate the first optical characterization of nitrogen-vacancy centers integrated in 4H-SiC nanopillars.

Submitted 19 January, 2024; originally announced January 2024.

arXiv:2401.05733 [pdf, other]

doi 10.1103/PhysRevLett.132.221601

Field theory expansions of string theory amplitudes

Authors: Arnab Priya Saha, Aninda Sinha

Abstract: Motivated by quantum field theory (QFT) considerations, we present new representations of the Euler-Beta function and tree-level string theory amplitudes using a new two-channel, local, crossing symmetric dispersion relation. Unlike standard series representations, the new ones are analytic everywhere except at the poles, sum over poles in all channels and include contact interactions, in the spirit of QFT. This enables us to consider mass-level truncation, which preserves all the features of the original amplitudes. By starting with such expansions for generalized Euler-Beta functions and demanding QFT like features, we single out the open superstring amplitude. We demonstrate the difficulty in deforming away from the string amplitude and show that a class of such deformations can be potentially interesting when there is level truncation. Our considerations also lead to new QFT-inspired, parametric representations of the Zeta function and $π$, which show fast convergence.

Submitted 29 April, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

Comments: v5:12 pages, 7 figures, to appear in Phys. Rev. Lett

Journal ref: Phys. Rev. Lett. 132, 221601 (2024)

arXiv:2401.04165 [pdf, other]

Essentials of strong gravitational lensing

Authors: Prasenjit Saha, Dominique Sluse, Jenny Wagner, Liliya L. R. Williams

Abstract: Of order one in 10^3 quasars and high-redshift galaxies appears in the sky as multiple images as a result of gravitational lensing by unrelated galaxies and clusters that happen to be in the foreground. While the basic phenomenon is a straightforward consequence of general relativity, there are many non-obvious consequences that make multiple-image lensing systems (aka strong gravitational lenses) remarkable astrophysical probes in several different ways. This article is an introduction to the essential concepts and terminology in this area, emphasizing physical insight. The key construct is the Fermat potential or arrival-time surface: from it the standard lens equation, and the notions of image parities, magnification, critical curves, caustics, and degeneracies all follow. The advantages and limitations of the usual simplifying assumptions (geometrical optics, small angles, weak fields, thin lenses) are noted, and to the extent possible briefly, it is explained how to go beyond these. Some less well-known ideas are discussed at length: arguments using wavefronts show that much of the theory carries over unchanged to the regime of strong gravitational fields; saddle-point contours explain how even the most complicated image configurations are made up of just two ingredients. Orders of magnitude, and the question of why strong lensing is most common for objects at cosmological distance, are also discussed. The challenges of lens modeling, and diverse strategies developed to overcome them, are discussed in general terms, without many technical details.

Submitted 8 January, 2024; originally announced January 2024.

Comments: To appear in Space Science Reviews, Topical Collection "Strong Gravitational Lensing", eds. J. Wambsganss et al

arXiv:2312.07571 [pdf, other]

Investigating YOLO Models Towards Outdoor Obstacle Detection For Visually Impaired People

Authors: Chenhao He, Pramit Saha

Abstract: The utilization of deep learning-based object detection is an effective approach to assist visually impaired individuals in avoiding obstacles. In this paper, we implemented seven different YOLO object detection models \textit{viz}., YOLO-NAS (small, medium, large), YOLOv8, YOLOv7, YOLOv6, and YOLOv5 and performed comprehensive evaluation with carefully tuned hyperparameters, to analyze how these models performed on images containing common daily-life objects presented on roads and sidewalks. After a systematic investigation, YOLOv8 was found to be the best model, which reached a precision of $80\%$ and a recall of $68.2\%$ on a well-known Obstacle Dataset which includes images from VOC dataset, COCO dataset, and TT100K dataset along with images collected by the researchers in the field. Despite being the latest model and demonstrating better performance in many other applications, YOLO-NAS was found to be suboptimal for the obstacle detection task.

Submitted 10 December, 2023; originally announced December 2023.

arXiv:2312.06604 [pdf, ps, other]

On the spectral gap of Cayley graphs

Authors: Jyoti Prakash Saha

Abstract: Let $Γ$ be a Cayley graph, or a Cayley sum graph, or a twisted Cayley graph, or a twisted Cayley sum graph, or a vertex-transitive graph. Suppose $Γ$ is undirected and non-bipartite. Let $μ$ (resp. $μ_2$) denote the smallest (resp. the second largest) eigenvalue of the normalized adjacency operator of $Γ$, and $d$ denote the degree of $Γ$. We show that $1+ μ= Ω((1-μ_2)/d)$ holds.

Submitted 11 December, 2023; originally announced December 2023.

arXiv:2312.05717 [pdf, other]

Forecasting Lithium-Ion Battery Longevity with Limited Data Availability: Benchmarking Different Machine Learning Algorithms

Authors: Hudson Hilal, Pramit Saha

Abstract: As the use of Lithium-ion batteries continues to grow, it becomes increasingly important to be able to predict their remaining useful life. This work aims to compare the relative performance of different machine learning algorithms, both traditional machine learning and deep learning, in order to determine the best-performing algorithms for battery cycle life prediction based on minimal data. We investigated 14 different machine learning models that were fed handcrafted features based on statistical data and split into 3 feature groups for testing. For deep learning models, we tested a variety of neural network models including different configurations of standard Recurrent Neural Networks, Gated Recurrent Units, and Long Short Term Memory with and without attention mechanism. Deep learning models were fed multivariate time series signals based on the raw data for each battery across the first 100 cycles. Our experiments revealed that the machine learning algorithms on handcrafted features performed particularly well, resulting in 10-20% average mean absolute percentage error. The best-performing algorithm was the Random Forest Regressor, which gave a minimum 9.8% mean absolute percentage error. Traditional machine learning models excelled due to their capability to comprehend general data set trends. In comparison, deep learning models were observed to perform particularly poorly on raw, limited data. Algorithms like GRU and RNNs that focused on capturing medium-range data dependencies were less adept at recognizing the gradual, slow trends critical for this task. Our investigation reveals that implementing machine learning models with hand-crafted features proves to be more effective than advanced deep learning models for predicting the remaining useful Lithium-ion battery life with limited data availability.

Submitted 9 December, 2023; originally announced December 2023.

arXiv:2312.00931 [pdf, other]

Microlensing of strongly lensed quasars

Authors: G. Vernardos, D. Sluse, D. Pooley, R. W. Schmidt, M. Millon, L. Weisenbach, V. Motta, T. Anguita, P. Saha, M. O'Dowd, A. Peel, P. L. Schechter

Abstract: Strong gravitational lensing of quasars has the potential to unlock the poorly understood physics of these fascinating objects, as well as serve as a probe of the lensing mass distribution and of cosmological parameters. In particular, gravitational microlensing by compact bodies in the lensing galaxy can enable mapping of quasar structure to $< 10^{-6}$ arcsec scales. Some of this potential has been realized over the past few decades; however, the upcoming era of large sky surveys promises to bring this to full fruition. Here we review the theoretical framework of this field, describe the prominent current methods for parameter inference from quasar microlensing data across different observing modalities, and discuss the constraints so far derived on the geometry and physics of quasar inner structure. We also review the application of strong lensing and microlensing to constraining the granularity of the lens potential, i.e. the contribution of the baryonic and dark matter components, and the local mass distribution in the lens, i.e. the stellar mass function. Finally, we discuss the future of the field, including the new possibilities that will be opened by the next generation of large surveys and by new analysis methods now being developed.

Submitted 1 December, 2023; originally announced December 2023.

Comments: To be submitted to Space Science Reviews, Topical Collection "Strong Gravitational Lensing", eds. J. Wambsganss et al

arXiv:2311.17763 [pdf, other]

On-shell functions on the Coulomb branch of $\mathcal{N}=4$ SYM

Authors: Md. Abhishek, Subramanya Hegde, Dileep P. Jatkar, Arnab Priya Saha, Amit Suthar

Abstract: We study on-shell functions in the kinematic space for the Coulomb branch of $\mathcal{N}=4$ SYM. We construct BCFW bridges that help us build bigger on-shell functions. As a consequence, we provide on-shell diagram formulations for BCFW shifts that correspond to various mass configurations. We will use this to calculate the quadruple cut for the one-loop amplitude on the Coulomb branch and maximal cuts for higher-loops. We make preliminary comments on finding the inequivalent set of on-shell functions for the Coulomb branch.

Submitted 30 April, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

Comments: 59 pages, 10 figures, Minor changes: improved presentation, Version to appear in JHEP

arXiv:2311.10264 [pdf, other]

Dynamic Instability of Follower Forced Euler Bernoulli Cantilever Beam With Tip Mass

Authors: Premjit Saha, Tarunraj Singh

Abstract: This work focuses on the stability analysis of an Euler Bernoulli cantilever beam with a tip mass at the free end, subject to a follower force. This can serve as a viable model for analysis of elastic instability occurring due to fluid-structure interaction of structural components submerged in fluids and gases. A linear model with appropriate boundary conditions is developed using the energy formulation. The characteristic equation of the linear model establishes the relationship between the pulsation of the beam and the magnitude of applied follower force. The evolution of temporal eigenvalues with respect to the magnitude of the follower force helps in evaluation of the critical follower forces responsible for different modes of instability. The presented model demonstrates the existence of only dynamic instability in the system. Furthermore, the model predicts that both types of the dynamic instability i.e., flutter and divergence, are possible in the system.

Submitted 16 November, 2023; originally announced November 2023.

arXiv:2311.07845 [pdf, other]

Triangular Cross-Section Beam Splitters in Silicon Carbide for Quantum Information Processing

Authors: Sridhar Majety, Pranta Saha, Zbynka Kekula, Scott Dhuey, Marina Radulaski

Abstract: Triangular cross-section color center photonics in silicon carbide is a leading candidate for scalable implementation of quantum hardware. Within this geometry, we model low-loss beam splitters for applications in key quantum optical operations such as entanglement and single-photon interferometry. We consider triangular cross-section single-mode waveguides for the design of a directional coupler. We optimize parameters for a 50:50 beam splitter. Finally, we test the experimental feasibility of the designs by fabricating triangular waveguides in an ion beam etching process and identify suitable designs for short-term implementation.

Submitted 13 April, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

arXiv:2311.00514 [pdf, other]

How Hard Is Squash? -- Towards Information Theoretic Analysis of Motor Behavior in Squash

Authors: Kavya Anand, Pramit Saha

Abstract: Fitts' law has been widely employed as a research method for analyzing tasks within the domain of Human-Computer Interaction (HCI). However, its application to non-computer tasks has remained limited. This study aims to extend the application of Fitts' law to the realm of sports, specifically focusing on squash. Squash is a high-intensity sport that requires quick movements and precise shots. Our research investigates the effectiveness of utilizing Fitts' law to evaluate the task difficulty and effort level associated with executing and responding to various squash shots. By understanding the effort/information rate required for each shot, we can determine which shots are more effective in making the opponent work harder. Additionally, this knowledge can be valuable for coaches in designing training programs. However, since Fitts' law was primarily developed for human-computer interaction, we adapted it to fit the squash scenario. This paper provides an overview of Fitts' law and its relevance to sports, elucidates the motivation driving this investigation, outlines the methodology employed to explore this novel avenue, and presents the obtained results, concluding with key insights. We conducted experiments with different shots and players, collecting data on shot speed, player movement time, and distance traveled. Using this data, we formulated a modified version of Fitts' law specifically for squash. The results provide insights into the difficulty and effectiveness of various shots, offering valuable information for both players and coaches in the sport of squash.

Submitted 1 November, 2023; originally announced November 2023.

arXiv:2311.00469 [pdf, other]

Dual Conditioned Diffusion Models for Out-Of-Distribution Detection: Application to Fetal Ultrasound Videos

Authors: Divyanshu Mishra, He Zhao, Pramit Saha, Aris T. Papageorghiou, J. Alison Noble

Abstract: Out-of-distribution (OOD) detection is essential to improve the reliability of machine learning models by detecting samples that do not belong to the training distribution. Detecting OOD samples effectively in certain tasks can pose a challenge because of the substantial heterogeneity within the in-distribution (ID), and the high structural similarity between ID and OOD classes. For instance, when detecting heart views in fetal ultrasound videos there is a high structural similarity between the heart and other anatomies such as the abdomen, and large in-distribution variance as a heart has 5 distinct views and structural variations within each view. To detect OOD samples in this context, the resulting model should generalise to the intra-anatomy variations while rejecting similar OOD samples. In this paper, we introduce dual-conditioned diffusion models (DCDM) where we condition the model on in-distribution class information and latent features of the input image for reconstruction-based OOD detection. This constrains the generative manifold of the model to generate images structurally and semantically similar to those within the in-distribution. The proposed model outperforms reference methods with a 12% improvement in accuracy, 22% higher precision, and an 8% better F1 score.

Submitted 1 November, 2023; originally announced November 2023.

Comments: Published in MICCAI 2023

arXiv:2310.19884 [pdf, other]

Free-Form and Hybrid Lens Models for SDSS J1004+4112: Substructure and Central Image Time Delay Constraints

Authors: Derek Perera, Liliya L. R. Williams, Jori Liesenborgs, Agniva Ghosh, Prasenjit Saha

Abstract: SDSS J1004+4112 is a well studied gravitational lens with a recently measured time delay between its first and fourth arriving quasar images. Using this new constraint, we present updated free-form lens reconstructions using the lens inversion method {\tt GRALE}, which only uses multiple image and time delay data as inputs. In addition, we obtain hybrid lens reconstructions by including a model of the brightest cluster galaxy (BCG) as a Sersic lens. For both reconstructions, we use two sets of images as input: one with all identified images, and the other a revised set leaving out images that have been potentially misidentified. We also develop a source position optimization MCMC routine, performed on completed {\tt GRALE} runs, that allows each model to better match observed image positions and time delays. All the reconstructions produce similar mass distributions, with the hybrid models finding a steeper profile in the center. Similarly, all the mass distributions are fit by the Navarro-Frenk-White (NFW) profile, finding results consistent with previous parametric reconstructions and those derived from Chandra X-ray observations. We identify a $\sim 5 \times 10^{11} M_{\odot}$ substructure apparently unaffiliated with any cluster member galaxy and present in all our models, and study its reality. Using our free-form and hybrid models we predict a central quasar image time delay of $\sim 2980 \pm 270$ and $\sim 3280 \pm 215$ days, respectively. A potential future measurement of this time delay will, while being an observational challenge, further constrain the steepness of the central density profile.

Submitted 30 October, 2023; originally announced October 2023.

Comments: 13 pages, 5 figures, MNRAS Accepted

arXiv:2310.18815 [pdf, other]

Rethinking Semi-Supervised Federated Learning: How to co-train fully-labeled and fully-unlabeled client imaging data

Authors: Pramit Saha, Divyanshu Mishra, J. Alison Noble

Abstract: The most challenging, yet practical, setting of semi-supervised federated learning (SSFL) is where a few clients have fully labeled data whereas the other clients have fully unlabeled data. This is particularly common in healthcare settings where collaborating partners (typically hospitals) may have images but not annotations. The bottleneck in this setting is the joint training of labeled and unlabeled clients as the objective function for each client varies based on the availability of labels. This paper investigates an alternative way for effective training with labeled and unlabeled clients in a federated setting. We propose a novel learning scheme specifically designed for SSFL which we call Isolated Federated Learning (IsoFed) that circumvents the problem by avoiding simple averaging of supervised and semi-supervised models together. In particular, our training approach consists of two parts - (a) isolated aggregation of labeled and unlabeled client models, and (b) local self-supervised pretraining of isolated global models in all clients. We evaluate our model performance on medical image datasets of four different modalities publicly available within the biomedical image classification benchmark MedMNIST. We further vary the proportion of labeled clients and the degree of heterogeneity to demonstrate the effectiveness of the proposed method under varied experimental settings.

Submitted 28 October, 2023; originally announced October 2023.

Comments: Published in MICCAI 2023 with early acceptance and selected as 1 of the top 20 poster highlights under the category: Which work has the potential to impact other applications of AI and CV

arXiv:2310.13060 [pdf, other]

Gravitational Wave Symphony from Oscillating Spectator Scalar Fields

Authors: Yanou Cui, Pankaj Saha, Evangelos I. Sfakianakis

Abstract: We investigate a generic source of stochastic gravitational wave background due to the parametric resonance of oscillating scalar fields in the early Universe. By systematically analyzing benchmark models through lattice simulations and considering a wide range of parameters, we demonstrate that such a scenario can lead to detectable signals in gravitational wave detectors over a broad frequency range and potentially address the recent findings by pulsar timing array experiments. Furthermore, these models naturally yield ultralight dark matter candidates or dark radiation detectable by cosmic microwave background observatories.

Submitted 27 June, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

Comments: 15 pages, 3 figs including supplemental material, minor revisions, version accepted for publication in Physical Review Letters

Report number: KEK-TH-2634, KEK-Cosmo-0349

arXiv:2310.12860 [pdf, other]

Probing LLMs for hate speech detection: strengths and vulnerabilities

Authors: Sarthak Roy, Ashish Harshavardhan, Animesh Mukherjee, Punyajoy Saha

Abstract: Recently, efforts have been made by social media platforms as well as researchers to detect hateful or toxic language using large language models. However, none of these works aim to use explanation, additional context and victim community information in the detection process. We utilise different prompt variations and input information, and evaluate large language models in a zero-shot setting (without adding any in-context examples). We select three large language models (GPT-3.5, text-davinci and Flan-T5) and three datasets - HateXplain, implicit hate and ToxicSpans. We find that on average including the target information in the pipeline improves the model performance substantially (~20-30%) over the baseline across the datasets. There is also a considerable effect of adding the rationales/explanations into the pipeline (~10-20%) over the baseline across the datasets. In addition, we further provide a typology of the error cases where these large language models fail to (i) classify and (ii) explain the reason for the decisions they take. Such vulnerable points automatically constitute 'jailbreak' prompts for these models and industry scale safeguard techniques need to be developed to make the models robust against such prompts.

Submitted 28 October, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

Comments: 13 pages, 9 figures, 7 tables, accepted to findings of EMNLP 2023

arXiv:2310.11051 [pdf]

Electromagnetic properties of copper doped lead apatite Pb9Cu(PO4)6O

Authors: M. Singh, P. Saha, K. Kumar, D. Takhar, B. Birajdar, V. P. S. Awana, S. Patnaik

Abstract: We report on the structural, electrical and magnetic measurements in as-grown polycrystalline samples of Pb10-xCux(PO4)6O. This compound has been recently reported to be a room temperature superconductor. Our as-grown specimen has excellent XRD matching with the original submission of Lee et al. This sample has 1.5% of Cu2S as an impurity phase. A resistive transition around 380 K, possibly corresponding to structural transitions of Cu2S, is observed. No evidence of superconducting to normal state transitions in I-V characteristics at room temperature is obtained. Magnetization measurements show linear diamagnetic behavior that cannot be associated with the superconducting state. Hall measurements provide evidence of hole doping through Cu substitution. In summary, we find no evidence for room temperature ambient pressure superconductivity in Cu doped lead apatite Pb9Cu(PO4)6O.

Submitted 17 October, 2023; originally announced October 2023.

arXiv:2310.09901 [pdf, ps, other]

Toric Richardson Varieties

Authors: Mahir Bilen Can, Pinakinath Saha

Abstract: In this article, we provide characterizations of toric Richardson varieties across all types through three distinct approaches: 1) poset theory, 2) root theory, and 3) geometry.

Submitted 15 October, 2023; originally announced October 2023.

arXiv:2310.00739 [pdf, other]

Effect of Spin Fluctuations on Magnetoresistance and Anomalous Hall Effect in the Chiral Magnet Co8Zn8Mn4

Authors: P. Saha, P. Das, M. Singh, R. Rai, S. Patnaik

Abstract: The beta Mn type Co-Zn-Mn alloys have seized significant attention due to their ability to host skyrmions at room temperature. Here we analyse the unconventional magneto-transport properties of Co8Zn8Mn4 single crystals with a Curie temperature of 275 K. A negative magnetoresistance is obtained over a wide temperature range of 50 K to 300 K. The deviation of the isothermal magnetoresistance (MR) curves from linearity to non-linearity as one approaches higher temperatures points towards the transition from the dominance of magnons to spin fluctuations. In the paramagnetic phase, the change in the shape of the MR curve has been explained using the Khosla and Fischer model. The relationship between the anomalous Hall effect (AHE) and longitudinal resistivity reveals the dominance of the skew-scattering mechanism, which is inexplicable based on the theories of semi-classical magneto-transport. We experimentally determine that the spin fluctuation is the source of the skew-scattering mechanism in Co8Zn8Mn4. In general, skew-scattering mechanisms predominate in compounds with high conductivity, but our findings demonstrate that this is not always the case and that other aspects also require equal consideration. Our work throws new light on the predominant scattering mechanism in chiral magnets with a skyrmionic phase at low conductivity.

Submitted 1 October, 2023; originally announced October 2023.

arXiv:2309.12583 [pdf]

doi 10.1145/3571884.3603755

Using ChatGPT in HCI Research -- A Trioethnography

Authors: Smit Desai, Tanusree Sharma, Pratyasha Saha

Abstract: This paper explores the lived experience of using ChatGPT in HCI research through a month-long trioethnography. Our approach combines the expertise of three HCI researchers with diverse research interests to reflect on our daily experience of living and working with ChatGPT. Our findings are presented as three provocations grounded in our collective experiences and HCI theories. Specifically, we examine (1) the emotional impact of using ChatGPT, with a focus on frustration and embarrassment, (2) the absence of accountability and consideration of future implications in design, and raise (3) questions around bias from a Global South perspective. Our work aims to inspire critical discussions about utilizing ChatGPT in HCI research and advance equitable and inclusive technological development.

Submitted 21 September, 2023; originally announced September 2023.

arXiv:2309.11646 [pdf, other]

An Evaluation of Machine Learning Approaches for Early Diagnosis of Autism Spectrum Disorder

Authors: Rownak Ara Rasul, Promy Saha, Diponkor Bala, S M Rakib Ul Karim, Md. Ibrahim Abdullah, Bishwajit Saha

Abstract: Autistic Spectrum Disorder (ASD) is a neurological disease characterized by difficulties with social interaction, communication, and repetitive activities. While its primary origin lies in genetics, early detection is crucial, and leveraging machine learning offers a promising avenue for a faster and more cost-effective diagnosis. This study employs diverse machine learning methods to identify crucial ASD traits, aiming to enhance and automate the diagnostic process. We study eight state-of-the-art classification models to determine their effectiveness in ASD detection. We evaluate the models using accuracy, precision, recall, specificity, F1-score, area under the curve (AUC), kappa, and log loss metrics to find the best classifier for these binary datasets. Among all the classification models, for the children dataset, the SVM and LR models achieve the highest accuracy of 100% and for the adult dataset, the LR model produces the highest accuracy of 97.14%. Our proposed ANN model provides the highest accuracy of 94.24% for the new combined dataset when hyperparameters are precisely tuned for each model. As almost all classification models, which utilize true labels, achieve high accuracy, we become interested in delving into five popular clustering algorithms to understand model behavior in scenarios without true labels. We calculate Normalized Mutual Information (NMI), Adjusted Rand Index (ARI), and Silhouette Coefficient (SC) metrics to select the best clustering models. Our evaluation finds that spectral clustering outperforms all other benchmarking clustering models in terms of NMI and ARI metrics while demonstrating comparability to the optimal SC achieved by k-means. The implemented code is available at GitHub.

Submitted 28 December, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

Comments: 20 pages, 12 figures, 8 tables

arXiv:2309.05696 [pdf, other]

doi 10.21105/astro.2309.05696

What are the parities of photon-ring images near a black hole?

Authors: Ashish Kumar Meena, Prasenjit Saha

Abstract: Light that grazes a black-hole event horizon can loop around one or more times before escaping again, resulting for distant observers in an infinite sequence of ever fainter and more delayed images near the black hole shadow. In the case of the M87 and Sgr A$^*$ black holes, the first of these so-called photon-ring images have now been observed. A question then arises: are such images minima, maxima, or saddle-points in the sense of Fermat's principle in gravitational lensing? Or, more briefly, the title question above. In the theory of lensing by weak gravitational fields, image parities are readily found by considering the time-delay surface (also called the Fermat potential or the arrival-time surface). In this work, we extend the notion of the time delay surface to strong gravitational fields and compute the surface for a Schwarzschild black hole. The time-delay surface is the difference of two wavefronts, one travelling forward from the source and one travelling backwards from the observer. Image parities are read off from the topography of the surface, exactly as in the weak-field regime, but the surface itself is more complicated. Of the images, furthest from the black hole and similar to the weak-field limit, are a minimum and a saddle point. The strong field repeats the pattern, corresponding to light taking one or more loops around the black hole. In between, there are steeply-rising walls in the time-delay surface, which can be interpreted as maxima and saddle points that are infinitely delayed and not observable -- these correspond to light rays taking a U-turn around the black hole.

Submitted 21 December, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

Comments: 11 pages. 7 figures. 3 appendix. Accepted in OJA

arXiv:2308.16735 [pdf, other]

Post-Deployment Adaptation with Access to Source Data via Federated Learning and Source-Target Remote Gradient Alignment

Authors: Felix Wagner, Zeju Li, Pramit Saha, Konstantinos Kamnitsas

Abstract: Deployment of Deep Neural Networks in medical imaging is hindered by distribution shift between training data and data processed after deployment, causing performance degradation. Post-Deployment Adaptation (PDA) addresses this by tailoring a pre-trained, deployed model to the target data distribution using limited labelled or entirely unlabelled target data, while assuming no access to source training data as they cannot be deployed with the model due to privacy concerns and their large size. This makes reliable adaptation challenging due to limited learning signal. This paper challenges this assumption and introduces FedPDA, a novel adaptation framework that brings the utility of learning from remote data from Federated Learning into PDA. FedPDA enables a deployed model to obtain information from source data via remote gradient exchange, while aiming to optimize the model specifically for the target domain. Tailored for FedPDA, we introduce a novel optimization method StarAlign (Source-Target Remote Gradient Alignment) that aligns gradients between source-target domain pairs by maximizing their inner product, to facilitate learning a target-specific model. We demonstrate the method's effectiveness using multi-center databases for the tasks of cancer metastases detection and skin lesion classification, where our method compares favourably to previous work. Code is available at: https://github.com/FelixWag/StarAlign

Submitted 31 August, 2023; originally announced August 2023.

Comments: This version was accepted for the Machine Learning in Medical Imaging (MLMI 2023) workshop at MICCAI 2023

Showing 1–50 of 379 results for author: Saha, P