-
Transport signatures of single and multiple Floquet Majorana modes in one-dimensional Rashba nanowire and Shiba chain
Authors:
Debashish Mondal,
Rekha Kumari,
Tanay Nag,
Arijit Saha
Abstract:
We theoretically investigate the transport signature of single and multiple Floquet Majorana end modes~(FMEMs), appearing in an experimentally feasible setup with Rashba nanowire~(NW) placed in closed proximity to a conventional $s$-wave superconductor, in the presence of an external Zeeman field. Periodic drive causes the anomalous $π$-modes to emerge in addition to the regular $0$-modes in the d…
▽ More
We theoretically investigate the transport signature of single and multiple Floquet Majorana end modes~(FMEMs), appearing in an experimentally feasible setup with Rashba nanowire~(NW) placed in closed proximity to a conventional $s$-wave superconductor, in the presence of an external Zeeman field. Periodic drive causes the anomalous $π$-modes to emerge in addition to the regular $0$-modes in the driven system where the former does not exhibit any static analog. For single $0$- and/or $π$-FMEM, differential conductance exhibits a quantized value of $2e^{2}/h$ while we consider the sum over all the photon sectors, supporting Floquet sum rule. We examine the stability of this summed conductance against random onsite disorder. We further investigate the summed conductance in several cases hosting multiple~(more than one) $0$- or $π$-modes at the end of the NW. In these cases, we obtain quantized values of $n\times 2e^{2}/h$ of summed conductance with $n$ being the number of modes~($0$ / $π$) located at one end of NW. We repeat our analysis for another experimentally realizable model system known as helical Shiba chain. Moreover, we corroborate our results by computing the differential conductance for FMEMs using non-equilibrium Green's function method. Our work opens up the possibility of studying the transport signatures of FMEMs in these realistic models.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
Direct observational evidence of multi-epoch massive star formation in G24.47+0.49
Authors:
Anindya Saha,
Anandmayee Tej,
Hong-Li Liu,
Tie Liu,
Guido Garay,
Paul F. Goldsmith,
Chang Won Lee,
Jinhua He,
Mika Juvela,
Leonardo Bronfman,
Tapas Baug,
Enrique Vazquez-Semadeni,
Patricio Sanhueza,
Shanghuo Li,
James O. Chibueze,
N. K. Bhadari,
Lokesh K. Dewangan,
Swagat Ranjan Das,
Feng-Wei Xu,
Namitha Issac,
Jihye Hwang,
L. Viktor Toth
Abstract:
Using new continuum and molecular line data from the ALMA Three-millimeter Observations of Massive Star-forming Regions (ATOMS) survey and archival VLA, 4.86 GHz data, we present direct observational evidence of hierarchical triggering relating three epochs of massive star formation in a ring-like H II region, G24.47+0.49. We find from radio flux analysis that it is excited by a massive star(s) of…
▽ More
Using new continuum and molecular line data from the ALMA Three-millimeter Observations of Massive Star-forming Regions (ATOMS) survey and archival VLA, 4.86 GHz data, we present direct observational evidence of hierarchical triggering relating three epochs of massive star formation in a ring-like H II region, G24.47+0.49. We find from radio flux analysis that it is excited by a massive star(s) of spectral type O8.5V-O8V from the first epoch of star formation. The swept-up ionized ring structure shows evidence of secondary collapse, and within this ring a burst of massive star formation is observed in different evolutionary phases, which constitutes the second epoch. ATOMS spectral line (e.g., HCO$^+$(1-0)) observations reveal an outer concentric molecular gas ring expanding at a velocity of $\sim$ 9 $\rm km\,s^{-1}$, constituting the direct and unambiguous detection of an expanding molecular ring. It harbors twelve dense molecular cores with surface mass density greater than 0.05 $\rm g\,cm^{-2}$, a threshold typical of massive star formation. Half of them are found to be subvirial, and thus in gravitational collapse, making them third epoch of potential massive star-forming sites.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
Automatic Speech Recognition for Hindi
Authors:
Anish Saha,
A. G. Ramakrishnan
Abstract:
Automatic speech recognition (ASR) is a key area in computational linguistics, focusing on developing technologies that enable computers to convert spoken language into text. This field combines linguistics and machine learning. ASR models, which map speech audio to transcripts through supervised learning, require handling real and unrestricted text. Text-to-speech systems directly work with real…
▽ More
Automatic speech recognition (ASR) is a key area in computational linguistics, focusing on developing technologies that enable computers to convert spoken language into text. This field combines linguistics and machine learning. ASR models, which map speech audio to transcripts through supervised learning, require handling real and unrestricted text. Text-to-speech systems directly work with real text, while ASR systems rely on language models trained on large text corpora. High-quality transcribed data is essential for training predictive models. The research involved two main components: developing a web application and designing a web interface for speech recognition. The web application, created with JavaScript and Node.js, manages large volumes of audio files and their transcriptions, facilitating collaborative human correction of ASR transcripts. It operates in real-time using a client-server architecture. The web interface for speech recognition records 16 kHz mono audio from any device running the web app, performs voice activity detection (VAD), and sends the audio to the recognition engine. VAD detects human speech presence, aiding efficient speech processing and reducing unnecessary processing during non-speech intervals, thus saving computation and network bandwidth in VoIP applications. The final phase of the research tested a neural network for accurately aligning the speech signal to hidden Markov model (HMM) states. This included implementing a novel backpropagation method that utilizes prior statistics of node co-activations.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Gapless dynamic magnetic ground state in the charge-gapped trimer iridate Ba$_4$NbIr$_3$O$_{12}$
Authors:
Abhisek Bandyopadhyay,
S. Lee,
D. T. Adroja,
M. R. Lees,
G. B. G. Stenning,
P. Aich,
Luca Tortora,
C. Meneghini,
G. Cibin,
Adam Berlie,
R. A. Saha,
D. Takegami,
A. Melendez-Sans,
G. Poelchen,
M. Yoshimura,
K. D. Tsuei,
Z. Hu,
Ting-Shan Chan,
S. Chattopadhyay,
G. S. Thakur,
Kwang-Yong Choi
Abstract:
We present an experimental investigation of the magnetic ground state in Ba$_4$NbIr$_3$O$_{12}$, a fractional valent trimer iridate. X-ray absorption and photoemission spectroscopy show that the Ir valence lies between 3+ and 4+ while Nb is pentavalent. Combined dc/ac magnetization, specific heat, and muon spin rotation/relaxation ($μ$SR) measurements reveal no magnetic phase transition down to 0.…
▽ More
We present an experimental investigation of the magnetic ground state in Ba$_4$NbIr$_3$O$_{12}$, a fractional valent trimer iridate. X-ray absorption and photoemission spectroscopy show that the Ir valence lies between 3+ and 4+ while Nb is pentavalent. Combined dc/ac magnetization, specific heat, and muon spin rotation/relaxation ($μ$SR) measurements reveal no magnetic phase transition down to 0.05~K. Despite a significant Weiss temperature ($Θ_{\mathrm{W}} \sim -15$ to $-25$~K) indicating antiferromagnetic correlations, a quantum spin-liquid (QSL) phase emerges and persists down to 0.1~K. This state likely arises from geometric frustration in the edge-sharing equilateral triangle Ir network. Our $μ$SR analysis reveals a two-component depolarization, arising from the coexistence of rapidly (90\%) and slowly (10\%) fluctuating Ir moments. Powder x-ray diffraction and Ir-L$_3$edge x-ray absorption fine structure spectroscopy identify ~8-10\% Nb/Ir site-exchange, reducing frustration within part of the Ir network, and likely leading to the faster muon spin relaxation, while the structurally ordered Ir ions remain highly geometrically frustrated, giving rise to the rapidly spin-fluctuating QSL ground state. At low temperatures, the magnetic specific heat varies as $γT + αT^2$, indicating gapless spinon excitations, and possible Dirac QSL features with linear spinon dispersion, respectively.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Study of Wolf-Rayet stars using uGMRT
Authors:
Anindya Saha,
Anandmayee Tej,
Santiago del Palacio,
Michaël De Becker,
Paula Benaglia,
Ishwara Chandra CH,
Prachi Prajapati
Abstract:
In recent years, systems involving massive stars with large wind kinetic power have been considered as promising sites for investigating relativistic particle acceleration in low radio frequencies. With this aim, we observed two Wolf-Rayet systems, WR 114 and WR 142, using upgraded Giant Meterwave Radio Telescope observations in Band 4 (550-950 MHz) and Band 5 (1050-1450 MHz). None of the targets…
▽ More
In recent years, systems involving massive stars with large wind kinetic power have been considered as promising sites for investigating relativistic particle acceleration in low radio frequencies. With this aim, we observed two Wolf-Rayet systems, WR 114 and WR 142, using upgraded Giant Meterwave Radio Telescope observations in Band 4 (550-950 MHz) and Band 5 (1050-1450 MHz). None of the targets was detected at these frequencies. Based on the non-detection, we report 3$σ$ upper limits to the radio flux densities at 735 and 1260 MHz (123 and 66 $μ$Jy for WR 114, and 111 and 96 $μ$Jy for WR 142, respectively). The plausible scenarios to interpret this non-detection are presented.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Light curve's recovery with Rubin-LSST: II. UnVEiling the darknesS of The gAlactic buLgE (VESTALE) with RR Lyrae
Authors:
M. Di Criscienzo,
S. Leccia,
V. Braga,
I. Musella,
G. Bono,
M. Dall'Ora,
G. Fiorentino,
M. Marconi,
R. Molinaro,
V. Ripepi,
L. Girardi,
A. Mazzi,
G. Pastorelli,
M. Trabucchi,
N. Matsunaga,
M. Monelli,
A. Saha,
K. Vivas,
R. Zanmar Sanchez
Abstract:
This work is part of VESTALE, a project initiated within the Rubin-LSST Cadence Strategy Optimization Process . Its goal is to explore the potential of Rubin-LSST observations aimed at the Galaxy's bulge (Bulge) for studying RR Lyrae stars (RRL). Observation and analysis of RR Lyrae stars in the Bulge are crucial for tracing the old population of the central part of our galaxy and reconstructing t…
▽ More
This work is part of VESTALE, a project initiated within the Rubin-LSST Cadence Strategy Optimization Process . Its goal is to explore the potential of Rubin-LSST observations aimed at the Galaxy's bulge (Bulge) for studying RR Lyrae stars (RRL). Observation and analysis of RR Lyrae stars in the Bulge are crucial for tracing the old population of the central part of our galaxy and reconstructing the history of Bulge formation. Based on observations conducted with CTIO/DECam by Saha et al. 2019 towards the Baade Window, our simulations demonstrate that early Rubin-LSST observations will enable the recovery of RR Lyrae light curves at Galactic center distances with sufficient precision. This will allow us to utilize theoretical relations from Marconi et al. 2022 to determine their distances and/or metallicity, following the REDIME algorithm introduced in Bono et al. 2019. We show how reddening and crowding affect our simulations and highlight the importance of considering these effects when deriving pulsation parameters (luminosity amplitudes, mean magnitudes) based on the light curves especially if the goal is to explore the opposite side of the Bulge through the observation of its RRL. The simulations discussed in this investigation were conducted to support the SCOC's decision to observe this important sky region since it has only recently been decided to include part of the Bulge as a target within the LSST main survey.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
A Generalized Apprenticeship Learning Framework for Modeling Heterogeneous Student Pedagogical Strategies
Authors:
Md Mirajul Islam,
Xi Yang,
John Hostetter,
Adittya Soukarjya Saha,
Min Chi
Abstract:
A key challenge in e-learning environments like Intelligent Tutoring Systems (ITSs) is to induce effective pedagogical policies efficiently. While Deep Reinforcement Learning (DRL) often suffers from sample inefficiency and reward function design difficulty, Apprenticeship Learning(AL) algorithms can overcome them. However, most AL algorithms can not handle heterogeneity as they assume all demonst…
▽ More
A key challenge in e-learning environments like Intelligent Tutoring Systems (ITSs) is to induce effective pedagogical policies efficiently. While Deep Reinforcement Learning (DRL) often suffers from sample inefficiency and reward function design difficulty, Apprenticeship Learning(AL) algorithms can overcome them. However, most AL algorithms can not handle heterogeneity as they assume all demonstrations are generated with a homogeneous policy driven by a single reward function. Still, some AL algorithms which consider heterogeneity, often can not generalize to large continuous state space and only work with discrete states. In this paper, we propose an expectation-maximization(EM)-EDM, a general AL framework to induce effective pedagogical policies from given optimal or near-optimal demonstrations, which are assumed to be driven by heterogeneous reward functions. We compare the effectiveness of the policies induced by our proposed EM-EDM against four AL-based baselines and two policies induced by DRL on two different but related tasks that involve pedagogical action prediction. Our overall results showed that, for both tasks, EM-EDM outperforms the four AL baselines across all performance metrics and the two DRL baselines. This suggests that EM-EDM can effectively model complex student pedagogical decision-making processes through the ability to manage a large, continuous state space and adapt to handle diverse and heterogeneous reward functions with very few given demonstrations.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
Strategic Linear Contextual Bandits
Authors:
Thomas Kleine Buening,
Aadirupa Saha,
Christos Dimitrakakis,
Haifeng Xu
Abstract:
Motivated by the phenomenon of strategic agents gaming a recommender system to maximize the number of times they are recommended to users, we study a strategic variant of the linear contextual bandit problem, where the arms can strategically misreport their privately observed contexts to the learner. We treat the algorithm design problem as one of mechanism design under uncertainty and propose the…
▽ More
Motivated by the phenomenon of strategic agents gaming a recommender system to maximize the number of times they are recommended to users, we study a strategic variant of the linear contextual bandit problem, where the arms can strategically misreport their privately observed contexts to the learner. We treat the algorithm design problem as one of mechanism design under uncertainty and propose the Optimistic Grim Trigger Mechanism (OptGTM) that incentivizes the agents (i.e., arms) to report their contexts truthfully while simultaneously minimizing regret. We also show that failing to account for the strategic nature of the agents results in linear regret. However, a trade-off between mechanism design and regret minimization appears to be unavoidable. More broadly, this work aims to provide insight into the intersection of online learning and mechanism design.
△ Less
Submitted 1 June, 2024;
originally announced June 2024.
-
QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge
Authors:
Hongwei Bran Li,
Fernando Navarro,
Ivan Ezhov,
Amirhossein Bayat,
Dhritiman Das,
Florian Kofler,
Suprosanna Shit,
Diana Waldmannstetter,
Johannes C. Paetzold,
Xiaobin Hu,
Benedikt Wiestler,
Lucas Zimmer,
Tamaz Amiranashvili,
Chinmay Prabhakar,
Christoph Berger,
Jonas Weidner,
Michelle Alonso-Basant,
Arif Rashid,
Ujjwal Baid,
Wesam Adel,
Deniz Ali,
Bhakti Baheti,
Yingbin Bai,
Ishaan Bhatt,
Sabri Can Cetindag
, et al. (55 additional authors not shown)
Abstract:
Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the de…
▽ More
Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the development and evaluation of automated segmentation algorithms. Accurately modeling and quantifying this variability is essential for enhancing the robustness and clinical applicability of these algorithms. We report the set-up and summarize the benchmark results of the Quantification of Uncertainties in Biomedical Image Quantification Challenge (QUBIQ), which was organized in conjunction with International Conferences on Medical Image Computing and Computer-Assisted Intervention (MICCAI) 2020 and 2021. The challenge focuses on the uncertainty quantification of medical image segmentation which considers the omnipresence of inter-rater variability in imaging datasets. The large collection of images with multi-rater annotations features various modalities such as MRI and CT; various organs such as the brain, prostate, kidney, and pancreas; and different image dimensions 2D-vs-3D. A total of 24 teams submitted different solutions to the problem, combining various baseline models, Bayesian neural networks, and ensemble model techniques. The obtained results indicate the importance of the ensemble models, as well as the need for further research to develop efficient 3D methods for uncertainty quantification methods in 3D segmentation tasks.
△ Less
Submitted 24 June, 2024; v1 submitted 19 March, 2024;
originally announced May 2024.
-
Geodesic motion of particles in the vicinity of the $κ$-deformed Schwarzchild Black Hole
Authors:
Dilip Kumar,
Suman Kumar Panja,
Abhisek Saha,
Soma Sanyal
Abstract:
In this study, we investigate the geodesic motion of a test particle around the Schwarzchild black hole in a $κ$-deformed space-time. We compute a modified Lagrangian to obtain the $κ$-deformed effective potential and find the particle trajectories based on the constants of motion. For the same value of angular momentum, we obtain a significant deformation in the orbits of the particles due to the…
▽ More
In this study, we investigate the geodesic motion of a test particle around the Schwarzchild black hole in a $κ$-deformed space-time. We compute a modified Lagrangian to obtain the $κ$-deformed effective potential and find the particle trajectories based on the constants of motion. For the same value of angular momentum, we obtain a significant deformation in the orbits of the particles due to the non-commutativity of the $κ$-deformed space-time. The deformation parameter becomes more significant for higher values of the angular momentum. The radius of the individual trajectories become smaller and their velocities decrease compared to the commutative case. The radius of the innermost stable circular orbit ($r_{ISCO}$) is also found using the modified effective potential. Though the equations get modified due to the non-commutativity of the $κ$-deformed space-time, the $r_{ISCO}$ remains the same. We then study a large number of freely streaming particles moving in this $κ$-deformed space-time and analyze the movement of these particles around the black hole due to the non-commutativity of the space-time. We concentrate on particles with different angular momentum moving around the black hole. We find that the motion of the particles are modified due to the non-commutativity of the space-time. The particles move slower along their respective trajectories in the deformed space-time. So, they remain closer to the black hole for a longer period of time, indicating that the accretion of freely streaming particles around the black hole would be modified by the non-commutativity of the space-time.
△ Less
Submitted 9 May, 2024;
originally announced May 2024.
-
Multiple Topological Phase Transitions Unveiling Gapless Topological Superconductivity in Magnet/Unconventional Superconductor Hybrid Platform
Authors:
Minakshi Subhadarshini,
Amartya Pal,
Pritam Chatterjee,
Arijit Saha
Abstract:
We propose a theoretical framework for generating gapless topological superconductivity (GTSC) hosting Majorana flat edge modes (MFEMs) in the presence of a two-dimensional (2D) array of magnetic adatoms with noncollinear spin texture deposited on top of a unconventional superconductor. Our observations reveal two distinct topological phase transitions within the emergent Shiba band depending on t…
▽ More
We propose a theoretical framework for generating gapless topological superconductivity (GTSC) hosting Majorana flat edge modes (MFEMs) in the presence of a two-dimensional (2D) array of magnetic adatoms with noncollinear spin texture deposited on top of a unconventional superconductor. Our observations reveal two distinct topological phase transitions within the emergent Shiba band depending on the exchange coupling strength ($J$) between magnetic adatom spins and superconducting electrons: the first one designates transition from gapless non-topological to gapless topological phase at lower $J$, while the second one denotes transition from gapless topological to a trivial gapped superconducting phase at higher $J$. The gapless topological superconducting phase survives at intermediate values of $J$, hosting MFEMs. Further, we investigate the nature of the bulk effective pairings which indicate that GTSC appears due to the interplay between pseudo "$s$-wave" and pseudo "$p_{x}+p_y$" types of pairing. Consequently, our study opens a promising avenue for the experimental realization of GTSC in 2D Shiba lattice based on $d$-wave superconductors as a high-temperature platform.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
Mapping electron beam-induced radiolytic damage in molecular crystals
Authors:
Ambarneil Saha,
Matthew Mecklenburg,
Alexander J. Pattison,
Aaron S. Brewster,
Jose A. Rodriguez,
Peter Ercius
Abstract:
Every electron crystallography experiment is fundamentally limited by radiation damage. Nevertheless, little is known about the onset and progression of radiolysis in beam-sensitive molecular crystals. Here we apply ambient-temperature scanning nanobeam electron diffraction to record simultaneous dual-space snapshots of organic and organometallic nanocrystals at sequential stages of beam-induced r…
▽ More
Every electron crystallography experiment is fundamentally limited by radiation damage. Nevertheless, little is known about the onset and progression of radiolysis in beam-sensitive molecular crystals. Here we apply ambient-temperature scanning nanobeam electron diffraction to record simultaneous dual-space snapshots of organic and organometallic nanocrystals at sequential stages of beam-induced radiolytic decay. We show that the underlying mosaic of coherently diffracting zones (CDZs) continuously undergoes spatial reorientation as a function of accumulating electron exposure, causing the intensities of many Bragg reflections to fade nonmonotonically. Furthermore, we demonstrate that repeated irradiation at a single probe position leads to the concentric propagation of delocalized radiolytic damage well beyond the initial point of impact. These results sharpen our understanding of molecular crystals as conglomerates of CDZs whose complex lattice structure deteriorates through a series of dynamic internal changes during exposure to ionizing radiation.
△ Less
Submitted 27 April, 2024;
originally announced April 2024.
-
NTIRE 2024 Quality Assessment of AI-Generated Content Challenge
Authors:
Xiaohong Liu,
Xiongkuo Min,
Guangtao Zhai,
Chunyi Li,
Tengchuan Kou,
Wei Sun,
Haoning Wu,
Yixuan Gao,
Yuqin Cao,
Zicheng Zhang,
Xiele Wu,
Radu Timofte,
Fei Peng,
Huiyuan Fu,
Anlong Ming,
Chuanming Wang,
Huadong Ma,
Shuai He,
Zifei Dou,
Shu Chen,
Huacong Zhang,
Haiyi Xie,
Chengwei Wang,
Baoying Chen,
Jishen Zeng
, et al. (89 additional authors not shown)
Abstract:
This paper reports on the NTIRE 2024 Quality Assessment of AI-Generated Content Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2024. This challenge is to address a major challenge in the field of image and video processing, namely, Image Quality Assessment (IQA) and Video Quality Assessment (VQA) for AI-Generated Conte…
▽ More
This paper reports on the NTIRE 2024 Quality Assessment of AI-Generated Content Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2024. This challenge is to address a major challenge in the field of image and video processing, namely, Image Quality Assessment (IQA) and Video Quality Assessment (VQA) for AI-Generated Content (AIGC). The challenge is divided into the image track and the video track. The image track uses the AIGIQA-20K, which contains 20,000 AI-Generated Images (AIGIs) generated by 15 popular generative models. The image track has a total of 318 registered participants. A total of 1,646 submissions are received in the development phase, and 221 submissions are received in the test phase. Finally, 16 participating teams submitted their models and fact sheets. The video track uses the T2VQA-DB, which contains 10,000 AI-Generated Videos (AIGVs) generated by 9 popular Text-to-Video (T2V) models. A total of 196 participants have registered in the video track. A total of 991 submissions are received in the development phase, and 185 submissions are received in the test phase. Finally, 12 participating teams submitted their models and fact sheets. Some methods have achieved better results than baseline methods, and the winning methods in both tracks have demonstrated superior prediction performance on AIGC.
△ Less
Submitted 7 May, 2024; v1 submitted 25 April, 2024;
originally announced April 2024.
-
Stark localization near Aubry-André criticality
Authors:
Ayan Sahoo,
Aitijhya Saha,
Debraj Rakshit
Abstract:
In this work we investigate the Stark localization near the Aubry-André (AA) critical point. We study system-dependent parameters, such as localization length, inverse participation ratio (IPR), and energy gap between the ground and first excited state, for characterizing the localization-delocalization transition. We show that the scaling exponents possessed by these key descriptors of localizati…
▽ More
In this work we investigate the Stark localization near the Aubry-André (AA) critical point. We study system-dependent parameters, such as localization length, inverse participation ratio (IPR), and energy gap between the ground and first excited state, for characterizing the localization-delocalization transition. We show that the scaling exponents possessed by these key descriptors of localization are quite different from that of a pure AA model or Stark model. Near the critical point of the AA model, inducing Stark field of strength $h$, the localization length $ζ$ scales as $ζ\propto h^{-ν}$ with $ν\approx0.29$ which is different than both the pure AA model ($ν=1$) and Stark model ($ν\approx0.33$). The IPR scales as IPR $\propto h^{s}$ with $s\approx0.096$ which is again significantly different than both the pure AA model ($s\approx0.33$) and Stark model ($s\approx0.33$). The energy gap, $Δ$, scales as $E\propto h^{νz}$, where $z\approx2.37$ which is however same as the pure AA model.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
Deformable MRI Sequence Registration for AI-based Prostate Cancer Diagnosis
Authors:
Alessa Hering,
Sarah de Boer,
Anindo Saha,
Jasper J. Twilt,
Mattias P. Heinrich,
Derya Yakar,
Maarten de Rooij,
Henkjan Huisman,
Joeran S. Bosma
Abstract:
The PI-CAI (Prostate Imaging: Cancer AI) challenge led to expert-level diagnostic algorithms for clinically significant prostate cancer detection. The algorithms receive biparametric MRI scans as input, which consist of T2-weighted and diffusion-weighted scans. These scans can be misaligned due to multiple factors in the scanning process. Image registration can alleviate this issue by predicting t…
▽ More
The PI-CAI (Prostate Imaging: Cancer AI) challenge led to expert-level diagnostic algorithms for clinically significant prostate cancer detection. The algorithms receive biparametric MRI scans as input, which consist of T2-weighted and diffusion-weighted scans. These scans can be misaligned due to multiple factors in the scanning process. Image registration can alleviate this issue by predicting the deformation between the sequences. We investigate the effect of image registration on the diagnostic performance of AI-based prostate cancer diagnosis. First, the image registration algorithm, developed in MeVisLab, is analyzed using a dataset with paired lesion annotations. Second, the effect on diagnosis is evaluated by comparing case-level cancer diagnosis performance between using the original dataset, rigidly aligned diffusion-weighted scans, or deformably aligned diffusion-weighted scans. Rigid registration showed no improvement. Deformable registration demonstrated a substantial improvement in lesion overlap (+10% median Dice score) and a positive yet non-significant improvement in diagnostic performance (+0.3% AUROC, p=0.18). Our investigation shows that a substantial improvement in lesion alignment does not directly lead to a significant improvement in diagnostic performance. Qualitative analysis indicated that jointly developing image registration methods and diagnostic AI algorithms could enhance diagnostic accuracy and patient outcomes.
△ Less
Submitted 28 June, 2024; v1 submitted 15 April, 2024;
originally announced April 2024.
-
Exploring Explainability in Video Action Recognition
Authors:
Avinab Saha,
Shashank Gupta,
Sravan Kumar Ankireddy,
Karl Chahine,
Joydeep Ghosh
Abstract:
Image Classification and Video Action Recognition are perhaps the two most foundational tasks in computer vision. Consequently, explaining the inner workings of trained deep neural networks is of prime importance. While numerous efforts focus on explaining the decisions of trained deep neural networks in image classification, exploration in the domain of its temporal version, video action recognit…
▽ More
Image Classification and Video Action Recognition are perhaps the two most foundational tasks in computer vision. Consequently, explaining the inner workings of trained deep neural networks is of prime importance. While numerous efforts focus on explaining the decisions of trained deep neural networks in image classification, exploration in the domain of its temporal version, video action recognition, has been scant. In this work, we take a deeper look at this problem. We begin by revisiting Grad-CAM, one of the popular feature attribution methods for Image Classification, and its extension to Video Action Recognition tasks and examine the method's limitations. To address these, we introduce Video-TCAV, by building on TCAV for Image Classification tasks, which aims to quantify the importance of specific concepts in the decision-making process of Video Action Recognition models. As the scalable generation of concepts is still an open problem, we propose a machine-assisted approach to generate spatial and spatiotemporal concepts relevant to Video Action Recognition for testing Video-TCAV. We then establish the importance of temporally-varying concepts by demonstrating the superiority of dynamic spatiotemporal concepts over trivial spatial concepts. In conclusion, we introduce a framework for investigating hypotheses in action recognition and quantitatively testing them, thus advancing research in the explainability of deep neural networks used in video action recognition.
△ Less
Submitted 13 April, 2024;
originally announced April 2024.
-
Flow Of Information In a Mechanically Quenched Confined Flock
Authors:
Md. Samsuzzaman,
Mohammad Hasanuzzaman,
Ahmed Sayeed,
Arnab Saha
Abstract:
Living entities in a group communicate and transfer information to one another for a variety of reasons. It might be for foraging food, migration, or escaping threats and obstacles, etc. They do so by interacting with each other and also with the environment. The tools from statistical mechanics and information theory can be useful to analyze the flow of information among the living entities model…
▽ More
Living entities in a group communicate and transfer information to one another for a variety of reasons. It might be for foraging food, migration, or escaping threats and obstacles, etc. They do so by interacting with each other and also with the environment. The tools from statistical mechanics and information theory can be useful to analyze the flow of information among the living entities modelled as active (i.e. self-propelling) particles. Here we consider the active particles confined in a circular trap. The self-organisation of the particles crucially depends on whether the trap boundary is soft or hard. We quench the trap boundary from soft to hard instantaneously. After the mechanical quench, the particles suddenly find themselves in a hard potential. The self-organised cluster of the active particles, which was stable when the boundary was soft, becomes unstable. The cluster undergoes extreme deformation after the quench to find another stable configuration suitable for the hard potential. Together with the structural relaxation, information regarding the quench also flows throughout the deforming cluster. Here, we quantify the flow of information by computing local transfer entropy. We find that the flow spans the whole cluster, propagating ballistically.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
Corner modes in Non-Hermitian long-range model
Authors:
Arnob Kumar Ghosh,
Arijit Saha,
Tanay Nag
Abstract:
We consider non-Hermitian (NH) analog of a second-order topological insulator, protected by chiral symmetry, in the presence of second-nearest neighbor hopping elements to theoretically investigate the interplay between long-range and topological order away from Hermiticity. In addition to the four zero-energy corner modes present in the first nearest neighbor hopping model, we uncover that the se…
▽ More
We consider non-Hermitian (NH) analog of a second-order topological insulator, protected by chiral symmetry, in the presence of second-nearest neighbor hopping elements to theoretically investigate the interplay between long-range and topological order away from Hermiticity. In addition to the four zero-energy corner modes present in the first nearest neighbor hopping model, we uncover that the second nearest neighbor hopping introduces another topological phase with sixteen zero-energy corner modes. Importantly, the NH effects are manifested in altering the Hermitian phase boundaries for both the models. While comparing the complex energy spectrum under open boundary conditions, and bi-orthogonalized quadrupolar winding number (QWN) in real space, we resolve the apparent anomaly in the bulk boundary correspondence of the NH system as compared to the Hermitian counterpart by incorporating the effect of non-Bloch form of momentum into the mass term. The above invariant is also capable of capturing the phase boundaries between the two different topological phases where the degeneracy of the corner modes is evident, as exclusively observed for the second nearest neighbor model.
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion
Authors:
Hossein Souri,
Arpit Bansal,
Hamid Kazemi,
Liam Fowl,
Aniruddha Saha,
Jonas Geiping,
Andrew Gordon Wilson,
Rama Chellappa,
Tom Goldstein,
Micah Goldblum
Abstract:
Modern neural networks are often trained on massive datasets that are web scraped with minimal human inspection. As a result of this insecure curation pipeline, an adversary can poison or backdoor the resulting model by uploading malicious data to the internet and waiting for a victim to scrape and train on it. Existing approaches for creating poisons and backdoors start with randomly sampled clea…
▽ More
Modern neural networks are often trained on massive datasets that are web scraped with minimal human inspection. As a result of this insecure curation pipeline, an adversary can poison or backdoor the resulting model by uploading malicious data to the internet and waiting for a victim to scrape and train on it. Existing approaches for creating poisons and backdoors start with randomly sampled clean data, called base samples, and then modify those samples to craft poisons. However, some base samples may be significantly more amenable to poisoning than others. As a result, we may be able to craft more potent poisons by carefully choosing the base samples. In this work, we use guided diffusion to synthesize base samples from scratch that lead to significantly more potent poisons and backdoors than previous state-of-the-art attacks. Our Guided Diffusion Poisoning (GDP) base samples can be combined with any downstream poisoning or backdoor attack to boost its effectiveness. Our implementation code is publicly available at: https://github.com/hsouri/GDP .
△ Less
Submitted 24 March, 2024;
originally announced March 2024.
-
DP-Dueling: Learning from Preference Feedback without Compromising User Privacy
Authors:
Aadirupa Saha,
Hilal Asi
Abstract:
We consider the well-studied dueling bandit problem, where a learner aims to identify near-optimal actions using pairwise comparisons, under the constraint of differential privacy. We consider a general class of utility-based preference matrices for large (potentially unbounded) decision spaces and give the first differentially private dueling bandit algorithm for active learning with user prefere…
▽ More
We consider the well-studied dueling bandit problem, where a learner aims to identify near-optimal actions using pairwise comparisons, under the constraint of differential privacy. We consider a general class of utility-based preference matrices for large (potentially unbounded) decision spaces and give the first differentially private dueling bandit algorithm for active learning with user preferences. Our proposed algorithms are computationally efficient with near-optimal performance, both in terms of the private and non-private regret bound. More precisely, we show that when the decision space is of finite size $K$, our proposed algorithm yields order optimal $O\Big(\sum_{i = 2}^K\log\frac{KT}{Δ_i} + \frac{K}ε\Big)$ regret bound for pure $ε$-DP, where $Δ_i$ denotes the suboptimality gap of the $i$-th arm. We also present a matching lower bound analysis which proves the optimality of our algorithms. Finally, we extend our results to any general decision space in $d$-dimensions with potentially infinite arms and design an $ε$-DP algorithm with regret $\tilde{O} \left( \frac{d^6}{κε} + \frac{ d\sqrt{T }}κ \right)$, providing privacy for free when $T \gg d$.
△ Less
Submitted 22 March, 2024;
originally announced March 2024.
-
Machine Learning-based Layer-wise Detection of Overheating Anomaly in LPBF using Photodiode Data
Authors:
Nazmul Hasan,
Apurba Kumar Saha,
Andrew Wessman,
Mohammed Shafae
Abstract:
Overheating anomaly detection is essential for the quality and reliability of parts produced by laser powder bed fusion (LPBF) additive manufacturing (AM). In this research, we focus on the detection of overheating anomalies using photodiode sensor data. Photodiode sensors can collect high-frequency data from the melt pool, reflecting the process dynamics and thermal history. Hence, the proposed m…
▽ More
Overheating anomaly detection is essential for the quality and reliability of parts produced by laser powder bed fusion (LPBF) additive manufacturing (AM). In this research, we focus on the detection of overheating anomalies using photodiode sensor data. Photodiode sensors can collect high-frequency data from the melt pool, reflecting the process dynamics and thermal history. Hence, the proposed method offers a machine learning (ML) framework to utilize photodiode sensor data for layer-wise detection of overheating anomalies. In doing so, three sets of features are extracted from the raw photodiode data: MSMM (mean, standard deviation, median, maximum), MSQ (mean, standard deviation, quartiles), and MSD (mean, standard deviation, deciles). These three datasets are used to train several ML classifiers. Cost-sensitive learning is used to handle the class imbalance between the "anomalous" layers (affected by overheating) and "nominal" layers in the benchmark dataset. To boost detection accuracy, our proposed ML framework involves utilizing the majority voting ensemble (MVE) approach. The proposed method is demonstrated using a case study including an open benchmark dataset of photodiode measurements from an LPBF specimen with deliberate overheating anomalies at some layers. The results from the case study demonstrate that the MSD features yield the best performance for all classifiers, and the MVE classifier (with a mean F1-score of 0.8654) surpasses the individual ML classifiers. Moreover, our machine learning methodology achieves superior results (9.66% improvement in mean F1-score) in detecting layer-wise overheating anomalies, surpassing the existing methods in the literature that use the same benchmark dataset.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
Tailoring topological band properties of twisted double bilayer graphene: effects due to spin-orbit coupling
Authors:
Kamalesh Bera,
Priyanka Mohan,
Arijit Saha
Abstract:
Our theoretical study unfolds the topological phase transitions (within bands of the Moiré super-lattice) in small angle twisted double bilayer graphene (tDBLG) under the influence of external gate voltage and intrinsic spin-orbit coupling (SOC) for both AB-AB and AB-BA stacking configurations. Utilizing a low-energy continuum model, we investigate the band structure and perform a comprehensive to…
▽ More
Our theoretical study unfolds the topological phase transitions (within bands of the Moiré super-lattice) in small angle twisted double bilayer graphene (tDBLG) under the influence of external gate voltage and intrinsic spin-orbit coupling (SOC) for both AB-AB and AB-BA stacking configurations. Utilizing a low-energy continuum model, we investigate the band structure and perform a comprehensive topological characterization of the system by analysing the direct band gap closing as well as various Chern numbers. In the absence of SOC, the tDBLG exhibits characteristics of a valley Hall insulator. However, in the presence of SOC, we observe a transition to a quantum spin Hall insulator state and band topology emerges in the parameter spaces of non-topological regime without SOC. Furthermore, we conduct a comparative analysis between untwisted double bilayer graphene and tDBLG to assess the impact of twisting on the system's properties. Our findings reveal the construction of topological phase diagrams that showcase distinct phases arising from changes in the twist angle compared to the untwisted case. These phase diagrams provide valuable insights into the diverse topological phases achievable in tDBLG with SOC. Our findings contribute to the understanding of the interplay between small twist angle, SOC, and external electric field on the topological band properties of twisted multilayer graphene systems.
△ Less
Submitted 18 March, 2024;
originally announced March 2024.
-
Gauss-Bonnet $\boldsymbol{AdS}$ planar and spherical black hole thermodynamics and holography
Authors:
Souvik Paul,
Sunandan Gangopadhyay,
Ashis Saha
Abstract:
In this work, we extend the study in \cite{Bilic:2022psx} incorporating the $AdS$/CFT duality to establish a relationship between the local temperatures of a large ($AdS$) spherical and a ($AdS$) planar Schwarzschild black hole near the $AdS$ boundary considering Gauss-Bonnet curvature correction in the gravitational action. We have shown the finite coupling corrections appear in the local tempera…
▽ More
In this work, we extend the study in \cite{Bilic:2022psx} incorporating the $AdS$/CFT duality to establish a relationship between the local temperatures of a large ($AdS$) spherical and a ($AdS$) planar Schwarzschild black hole near the $AdS$ boundary considering Gauss-Bonnet curvature correction in the gravitational action. We have shown the finite coupling corrections appear in the local temperature relationships due to the inclusion of Gauss-Bonnet term in the bulk. By transforming the metric into Fefferman-Graham form, we have calculated the energy of the conformal fluid at the boundary. Following the results of fluid/gravity duality, the energy of the conformal fluid at the boundary is then compared with the black body radiation energy which eventually leads us to establish the local temperature relationship between spherical and planar black holes in Gauss-Bonnet gravity near the $AdS$ boundary.
△ Less
Submitted 12 March, 2024;
originally announced March 2024.
-
Novel quantum spin liquid ground state in the trimer rhodate Ba$_4$NbRh$_3$O$_{12}$
Authors:
Abhisek Bandyopadhyay,
S. Lee,
D. T. Adroja,
G. B. G. Stenning,
Adam Berlie,
M. R. Lees,
R. A. Saha,
D. Takegami,
A. Melendez-Sans,
G. Poelchen,
M. Yoshimura,
K. D. Tsuei,
Z. Hu,
Cheng-Wei Kao,
Yu-Cheng Huang,
Ting-Shan Chan,
Kwang-Yong Cho
Abstract:
Frustrated magnets offer a plethora of exotic magnetic ground states, including quantum spin liquids (QSLs), in which enhanced quantum fluctuations prevent a long-range magnetic ordering of the strongly correlated spins down to lowest temperature. Here we have investigated the trimer based mixed valence hexagonal rhodate Ba$_4$NbRh$_3$O$_{12}$ using a combination of dc and ac magnetization, electr…
▽ More
Frustrated magnets offer a plethora of exotic magnetic ground states, including quantum spin liquids (QSLs), in which enhanced quantum fluctuations prevent a long-range magnetic ordering of the strongly correlated spins down to lowest temperature. Here we have investigated the trimer based mixed valence hexagonal rhodate Ba$_4$NbRh$_3$O$_{12}$ using a combination of dc and ac magnetization, electrical resistivity, specific heat, and muon spin rotation/relaxation ($μ$SR) measurements. Despite the substantial antiferromagnetic exchange interactions, as evident from the Weiss temperature ($θ_{\mathrm{W}}\sim -35$ to -45 K), among the Rh-local moments, neither long-range magnetic ordering nor spin-freezing is observed down to at least 50 mK, in ac-susceptibility, specific heat and ZF-$μ$SR measurements (down to 0.26 K). We ascribe the absence of any magnetic transition to enhanced quantum fluctuations as a result of geometrical frustration arising out of the edge-sharing equilateral Rh-triangular network in the structure. Our longitudinal-field $μ$SR result evidences persistent spin fluctuations down to 0.26~K, thus stabilizing a dynamic QSL ground state in Ba$_4$NbRh$_3$O$_{12}$. Furthermore, the magnetic specific heat ($C_{\mathrm{m}}$) data at low-$T$ reveal a significant $T$-linear contribution plus a quadratic $T$-dependence. A $T$-linear behavior is evocative of gapless spin excitations, while the $T^2$-term of $C_{\mathrm{m}}$ may indicate the Dirac QSL phenomenology of the spinon excitations with a linear dispersion.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
Time-dependent droplet detachment behaviour from wettability-engineered fibers during fog harvesting
Authors:
Arijit Saha,
Arkadeep Datta,
Arani Mukhopadhyay,
Amitava Datta,
Ranjan Ganguly
Abstract:
Water collection from natural and industrial fogs has recently been viewed as a viable freshwater source. An interesting outgrowth of the relevant research as focused on arresting of the drift losses (un-evaporated and re-condensed water droplets present in the exhaust plume from industrial cooling towers. Such exploits in fog collection have implemented metal and polyester meshes as fog water col…
▽ More
Water collection from natural and industrial fogs has recently been viewed as a viable freshwater source. An interesting outgrowth of the relevant research as focused on arresting of the drift losses (un-evaporated and re-condensed water droplets present in the exhaust plume from industrial cooling towers. Such exploits in fog collection have implemented metal and polyester meshes as fog water collectors (FWC). Fog droplets impinge and deposit on mesh fibers. They coalesce with previously deposited liquid to evolve as larger drops before detaching from the fibers under their own weight, an event largely dependent on the mesh fiber wettability, diameter and its arrangement relative to the fog flow. To better estimate drainage and hence collection from these fibers, the study, focuses on droplet detachment from differently wetted horizontally positioned cylindrical fibers of various diameters, placed orthogonally in the path of an oncoming fog. Droplet detachment volume is found to increase with fiber diameter and fiber surface wettability. Interestingly, in a typical fogging condition, the detachment volume is also found to exhibit a time-dependent behaviour, altering the droplet detachment criteria otherwise predicted from emulation. Our current study sheds light on this unexplored phenomenon.
△ Less
Submitted 9 March, 2024;
originally announced March 2024.
-
ROME/REA: Three-year, Tri-color Timeseries Photometry of the Galactic Bulge
Authors:
R. A. Street,
E. Bachelet,
Y. Tsapras,
M. P. G. Hundertmark,
V. Bozza,
D. M. Bramich,
A. Cassan,
M. Dominik,
R. Figuera Jaimes,
K. Horne,
S. Mao,
A. Saha,
J. Wambsganss,
Weicheng Zang
Abstract:
The ROME/REA (Robotic Observations of Microlensing Events/Reactive Event Assessment) Survey was a Key Project at Las Cumbres Observatory (hereafter LCO) which continuously monitored 20 selected fields (3.76 sq.deg.) in the Galactic Bulge throughout their seasonal visibility window over a three-year period, between March 2017 and March 2020. Observations were made in three optical passbands (SDSS-g…
▽ More
The ROME/REA (Robotic Observations of Microlensing Events/Reactive Event Assessment) Survey was a Key Project at Las Cumbres Observatory (hereafter LCO) which continuously monitored 20 selected fields (3.76 sq.deg.) in the Galactic Bulge throughout their seasonal visibility window over a three-year period, between March 2017 and March 2020. Observations were made in three optical passbands (SDSS-g', -r', -i'), and LCO's multi-site telescope network enabled the survey to achieve a typical cadence of $\sim$10\,hrs in i' and ~15 hrs in g' and r'. In addition, intervals of higher cadence (<1 hr) data were obtained during monitoring of key microlensing events within the fields. This paper describes the Difference Image Analysis data reduction pipeline developed to process these data, and the process for combining the photometry from LCO's three observing sites in the Southern Hemisphere. The full timeseries photometry for all 8 million stars, down to a limiting magnitude of i~18 mag is provided in the data release accompanying this paper, and samples of the data are presented for exemplar microlensing events, illustrating how the tri-band data are used to derive constraints on the microlensing source star parameters, a necessary step in determining the physical properties of the lensing object. The timeseries data also enables a wealth of additional science, for example in characterizing long-timescale stellar variability, and a few examples of the data for known variables are presented.
△ Less
Submitted 9 March, 2024;
originally announced March 2024.
-
Ergonomic Design of Computer Laboratory Furniture: Mismatch Analysis Utilizing Anthropometric Data of University Students
Authors:
Anik Kumar Saha,
Md Abrar Jahin,
Md. Rafiquzzaman,
M. F. Mridha
Abstract:
Many studies have shown how ergonomically designed furniture improves productivity and well-being. As computers have become a part of students' academic lives, they will grow further in the future. We propose anthropometric-based furniture dimensions suitable for university students to improve computer laboratory ergonomics. We collected data from 380 participants and analyzed 11 anthropometric me…
▽ More
Many studies have shown how ergonomically designed furniture improves productivity and well-being. As computers have become a part of students' academic lives, they will grow further in the future. We propose anthropometric-based furniture dimensions suitable for university students to improve computer laboratory ergonomics. We collected data from 380 participants and analyzed 11 anthropometric measurements, correlating them to 11 furniture dimensions. Two types of furniture were studied: a non-adjustable chair with a non-adjustable table and an adjustable chair with a non-adjustable table. The mismatch calculation showed a significant difference between furniture dimensions and anthropometric measurements. The one-way ANOVA test with a significance level of 5% also showed a significant difference between proposed and existing furniture dimensions. The proposed dimensions were found to be more compatible and reduced mismatch percentages for both males and females compared to existing furniture. The proposed dimensions of the furniture set with adjustable seat height showed slightly improved results compared to the non-adjustable furniture set. This suggests that the proposed dimensions can improve comfort levels and reduce the risk of musculoskeletal disorders among students. Further studies on the implementation and long-term effects of these proposed dimensions in real-world computer laboratory settings are recommended.
△ Less
Submitted 15 March, 2024; v1 submitted 4 March, 2024;
originally announced March 2024.
-
Don't Blame the Data, Blame the Model: Understanding Noise and Bias When Learning from Subjective Annotations
Authors:
Abhishek Anand,
Negar Mokhberian,
Prathyusha Naresh Kumar,
Anweasha Saha,
Zihao He,
Ashwin Rao,
Fred Morstatter,
Kristina Lerman
Abstract:
Researchers have raised awareness about the harms of aggregating labels especially in subjective tasks that naturally contain disagreements among human annotators. In this work we show that models that are only provided aggregated labels show low confidence on high-disagreement data instances. While previous studies consider such instances as mislabeled, we argue that the reason the high-disagreem…
▽ More
Researchers have raised awareness about the harms of aggregating labels especially in subjective tasks that naturally contain disagreements among human annotators. In this work we show that models that are only provided aggregated labels show low confidence on high-disagreement data instances. While previous studies consider such instances as mislabeled, we argue that the reason the high-disagreement text instances have been hard-to-learn is that the conventional aggregated models underperform in extracting useful signals from subjective tasks. Inspired by recent studies demonstrating the effectiveness of learning from raw annotations, we investigate classifying using Multiple Ground Truth (Multi-GT) approaches. Our experiments show an improvement of confidence for the high-disagreement instances.
△ Less
Submitted 6 March, 2024;
originally announced March 2024.
-
qPMS Sigma -- An Efficient and Exact Parallel Algorithm for the Planted $(l, d)$ Motif Search Problem
Authors:
Saurav Dhar,
Amlan Saha,
Dhiman Goswami,
Md. Abul Kashem Mia
Abstract:
Motif finding is an important step for the detection of rare events occurring in a set of DNA or protein sequences. Extraction of information about these rare events can lead to new biological discoveries. Motifs are some important patterns that have numerous applications including the identification of transcription factors and their binding sites, composite regulatory patterns, similarity betwee…
▽ More
Motif finding is an important step for the detection of rare events occurring in a set of DNA or protein sequences. Extraction of information about these rare events can lead to new biological discoveries. Motifs are some important patterns that have numerous applications including the identification of transcription factors and their binding sites, composite regulatory patterns, similarity between families of proteins, etc. Although several flavors of motif searching algorithms have been studied in the literature, we study the version known as $ (l, d) $-motif search or Planted Motif Search (PMS). In PMS, given two integers $ l $, $ d $ and $ n $ input sequences we try to find all the patterns of length $ l $ that appear in each of the $ n $ input sequences with at most $ d $ mismatches. We also discuss the quorum version of PMS in our work that finds motifs that are not planted in all the input sequences but at least in $ q $ of the sequences. Our algorithm is mainly based on the algorithms qPMSPrune, qPMS7, TraverStringRef and PMS8. We introduce some techniques to compress the input strings and make faster comparison between strings with bitwise operations. Our algorithm performs a little better than the existing exact algorithms to solve the qPMS problem in DNA sequence. We have also proposed an idea for parallel implementation of our algorithm.
△ Less
Submitted 1 March, 2024;
originally announced March 2024.
-
Stop Relying on No-Choice and Do not Repeat the Moves: Optimal, Efficient and Practical Algorithms for Assortment Optimization
Authors:
Aadirupa Saha,
Pierre Gaillard
Abstract:
We address the problem of active online assortment optimization problem with preference feedback, which is a framework for modeling user choices and subsetwise utility maximization. The framework is useful in various real-world applications including ad placement, online retail, recommender systems, fine-tuning language models, amongst many. The problem, although has been studied in the past, lack…
▽ More
We address the problem of active online assortment optimization problem with preference feedback, which is a framework for modeling user choices and subsetwise utility maximization. The framework is useful in various real-world applications including ad placement, online retail, recommender systems, fine-tuning language models, amongst many. The problem, although has been studied in the past, lacks an intuitive and practical solution approach with simultaneously efficient algorithm and optimal regret guarantee. E.g., popularly used assortment selection algorithms often require the presence of a `strong reference' which is always included in the choice sets, further they are also designed to offer the same assortments repeatedly until the reference item gets selected -- all such requirements are quite unrealistic for practical applications. In this paper, we designed efficient algorithms for the problem of regret minimization in assortment selection with \emph{Plackett Luce} (PL) based user choices. We designed a novel concentration guarantee for estimating the score parameters of the PL model using `\emph{Pairwise Rank-Breaking}', which builds the foundation of our proposed algorithms. Moreover, our methods are practical, provably optimal, and devoid of the aforementioned limitations of the existing methods. Empirical evaluations corroborate our findings and outperform the existing baselines.
△ Less
Submitted 29 February, 2024;
originally announced February 2024.
-
ToDo: Token Downsampling for Efficient Generation of High-Resolution Images
Authors:
Ethan Smith,
Nayan Saxena,
Aninda Saha
Abstract:
Attention mechanism has been crucial for image diffusion models, however, their quadratic computational complexity limits the sizes of images we can process within reasonable time and memory constraints. This paper investigates the importance of dense attention in generative image models, which often contain redundant features, making them suitable for sparser attention mechanisms. We propose a no…
▽ More
Attention mechanism has been crucial for image diffusion models, however, their quadratic computational complexity limits the sizes of images we can process within reasonable time and memory constraints. This paper investigates the importance of dense attention in generative image models, which often contain redundant features, making them suitable for sparser attention mechanisms. We propose a novel training-free method ToDo that relies on token downsampling of key and value tokens to accelerate Stable Diffusion inference by up to 2x for common sizes and up to 4.5x or more for high resolutions like 2048x2048. We demonstrate that our approach outperforms previous methods in balancing efficient throughput and fidelity.
△ Less
Submitted 8 May, 2024; v1 submitted 21 February, 2024;
originally announced February 2024.
-
Fermi arcs mediated transport in inversion symmetry-broken Weyl semimetal nanowire and its hybrid junctions
Authors:
Amartya Pal,
Paramita Dutta,
Arijit Saha
Abstract:
The emergence of gapless surface states, known as Fermi arcs (FAs), is one of the unique properties of the novel topological Weyl semimetal (WSM). However, extracting the signatures of FAs from the bulk states has always been a challenge as both of them are gapless in nature and connected to each other. We capture the signatures of FAs via transport in an inversion symmetry (IS)-broken WSM. We stu…
▽ More
The emergence of gapless surface states, known as Fermi arcs (FAs), is one of the unique properties of the novel topological Weyl semimetal (WSM). However, extracting the signatures of FAs from the bulk states has always been a challenge as both of them are gapless in nature and connected to each other. We capture the signatures of FAs via transport in an inversion symmetry (IS)-broken WSM. We study the band structure and the properties of FAs like shape, spin polarization considering slab and nanowire (NW) geometry, and then compute the two-terminal conductance in WSM NW in terms of the scattering coefficients within the Landauer formalism. We find the FA-mediated conductance to be quantized in units of $2e^2/h$. We extend our study to the transport in WSM/Weyl superconductor (WSC) NW hybrid junction using the Blonder-Tinkham-Klapwijk (BTK) formalism. We show that due to the intricate spin textures, the signatures of the FAs can be captured via Andreev reflection process. We also show that our results of conductance are robust against delta-correlated quenched disorder and thus enhancing the experimental feasibility.
△ Less
Submitted 15 June, 2024; v1 submitted 12 February, 2024;
originally announced February 2024.
-
Cloud-cloud collision and cluster formation in the W5-NW complex
Authors:
Namitha Issac,
Anindya Saha,
Saanika Choudhary,
Aakash Chaudhary,
Anandmayee Tej,
Hong-Li Liu,
Tie Liu,
Maheswar Gopinathan
Abstract:
We present a detailed structural and gas kinematic study of the star-forming complex W5-NW. A cloud-cloud collision scenario unravels with evidences of collision induced star and cluster formation. Various signatures of cloud-cloud collision such as "complementary distribution" and "bridging-features" are explored. At the colliding region, the two clouds have complementary morphologies, where W5-N…
▽ More
We present a detailed structural and gas kinematic study of the star-forming complex W5-NW. A cloud-cloud collision scenario unravels with evidences of collision induced star and cluster formation. Various signatures of cloud-cloud collision such as "complementary distribution" and "bridging-features" are explored. At the colliding region, the two clouds have complementary morphologies, where W5-NWb has a filamentary key-like shape which fits into the U-shaped cavity in W5-NWa that behaves like a keyhole. The interaction region between the two clouds is characterised by bridging features with intermediate velocities connecting the two clouds. A skewed V-shaped bridging feature is also detected at the site of collision. A robust picture of the molecular gas distribution highlighting the bridges is seen in the position-position-velocity diagram obtained using the SCOUSEPY algorithm. Star cluster formation with an over-density of Class I and Class II young stellar objects is also seen towards this cloud complex, likely triggered by the cloud collision event.
△ Less
Submitted 11 February, 2024;
originally announced February 2024.
-
MERP: Metaverse Extended Realtiy Portal
Authors:
Anisha Ghosh,
Aditya Mitra,
Anik Saha,
Sibi Chakkaravarthy Sethuraman,
Anitha Subramanian
Abstract:
A standardized control system called Metaverse Extended Reality Portal (MERP) is presented as a solution to the issues with conventional VR eyewear. The MERP system improves user awareness of the physical world while offering an immersive 3D view of the metaverse by using a shouldermounted projector to display a Heads-Up Display (HUD) in a designated Metaverse Experience Room. To provide natural a…
▽ More
A standardized control system called Metaverse Extended Reality Portal (MERP) is presented as a solution to the issues with conventional VR eyewear. The MERP system improves user awareness of the physical world while offering an immersive 3D view of the metaverse by using a shouldermounted projector to display a Heads-Up Display (HUD) in a designated Metaverse Experience Room. To provide natural and secure interaction inside the metaverse, a compass module and gyroscope integration enable accurate mapping of real-world motions to avatar actions. Through user tests and research, the MERP system shows that it may reduce mishaps brought on by poor spatial awareness, offering an improved metaverse experience and laying the groundwork for future developments in virtual reality technology. MERP, which is compared with existing Virtual Reality (VR) glasses used to traverse the metaverse, is projected to become a seamless, novel and better alternative. Existing VR headsets and AR glasses have well-known drawbacks that making them ineffective for prolonged usage as it causes harm to the eyes.
△ Less
Submitted 8 February, 2024;
originally announced February 2024.
-
First (calibration) experiment using proton beam from FRENA at SINP
Authors:
C. Basu,
K. Banerjee,
T. K. Ghosh,
G. Mukherjee,
C. Bhattacharya,
Shraddha S Desai,
R. Shil,
A. K. Saha,
J. K. Meena,
T. Bar,
D. Basak,
L. K. Sahoo,
S. Saha,
C. Marick,
D. Das,
D. Das,
D. Das,
M. Kujur,
S. Roy,
S. S. Basu,
U. Gond,
A. Saha,
A. Das,
M. Samanta,
P. Saha
, et al. (1 additional authors not shown)
Abstract:
This work presents the first calibration experiment of a 3 MV Tandetron accelerator, FRENA, performed in May 2022. The $^7$Li(p,n) reaction threshold was measured to calibrate the terminal voltage measuring device. A LiF target of thickness 175 $μ$g/cm$^2$ was used in the experiment. The measured threshold was 1872$\pm$2.7 keV, indicating 6$-$10 keV energy shift.
This work presents the first calibration experiment of a 3 MV Tandetron accelerator, FRENA, performed in May 2022. The $^7$Li(p,n) reaction threshold was measured to calibrate the terminal voltage measuring device. A LiF target of thickness 175 $μ$g/cm$^2$ was used in the experiment. The measured threshold was 1872$\pm$2.7 keV, indicating 6$-$10 keV energy shift.
△ Less
Submitted 24 January, 2024;
originally announced February 2024.
-
Constraints on Triton atmospheric evolution from occultations: 1989-2022
Authors:
B. Sicardy,
A. Tej,
A. R. Gomes-Junior,
F. D. Romanov,
T. Bertrand,
N. M. Ashok,
E. Lellouch,
B. E. Morgado,
M. Assafin,
J. Desmars,
J. I. B. Camargo,
Y. Kilic,
J. L. Ortiz,
R. Vieira-Martins,
F. Braga-Ribas,
J. P. Ninan,
B. C. Bhatt,
S. Pramod Kumar,
V. Swain,
S. Sharma,
A. Saha,
D. K. Ojha,
G. Pawar,
S. Deshmukh,
A. Deshpande
, et al. (27 additional authors not shown)
Abstract:
Context - Around the year 2000, Triton's south pole experienced an extreme summer solstice that occurs every about 650 years, when the subsolar latitude reached about 50°. Bracketing this epoch, a few occultations probed Triton's atmosphere in 1989, 1995, 1997, 2008 and 2017. A recent ground-based stellar occultation observed on 6 October 2022 provides a new measurement of Triton's atmospheric pre…
▽ More
Context - Around the year 2000, Triton's south pole experienced an extreme summer solstice that occurs every about 650 years, when the subsolar latitude reached about 50°. Bracketing this epoch, a few occultations probed Triton's atmosphere in 1989, 1995, 1997, 2008 and 2017. A recent ground-based stellar occultation observed on 6 October 2022 provides a new measurement of Triton's atmospheric pressure which is presented here.
Aims- The goal is to constrain the Volatile Transport Models (VTMs) of Triton's atmosphere that is basically in vapor pressure equilibrium with the nitrogen ice at its surface.
Methods - Fits to the occultation light curves yield Triton's atmospheric pressure at the reference radius 1400 km, from which the surface pressure is induced.
Results - The fits provide a pressure p_1400= 1.211 +/- 0.039 microbar at radius 1400 km (47 km altitude), from which a surface pressure of p_surf= 14.54 +/- 0.47 microbar is induced (1-sigma error bars). To within error bars, this is identical to the pressure derived from the previous occultation of 5 October 2017, p_1400 = 1.18 +/- 0.03 microbar and p_surf= 14.1 +/- 0.4 microbar, respectively. Based on recent models of Triton's volatile cycles, the overall evolution over the last 30 years of the surface pressure is consistent with N2 condensation taking place in the northern hemisphere. However, models typically predict a steady decrease in surface pressure for the period 2005-2060, which is not confirmed by this observation. Complex surface-atmosphere interactions, such as ice albedo runaway and formation of local N2 frosts in the equatorial regions of Triton could explain the relatively constant pressure between 2017 and 2022.
△ Less
Submitted 4 February, 2024;
originally announced February 2024.
-
Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text
Authors:
Abhimanyu Hans,
Avi Schwarzschild,
Valeriia Cherepanova,
Hamid Kazemi,
Aniruddha Saha,
Micah Goldblum,
Jonas Geiping,
Tom Goldstein
Abstract:
Detecting text generated by modern large language models is thought to be hard, as both LLMs and humans can exhibit a wide range of complex behaviors. However, we find that a score based on contrasting two closely related language models is highly accurate at separating human-generated and machine-generated text. Based on this mechanism, we propose a novel LLM detector that only requires simple ca…
▽ More
Detecting text generated by modern large language models is thought to be hard, as both LLMs and humans can exhibit a wide range of complex behaviors. However, we find that a score based on contrasting two closely related language models is highly accurate at separating human-generated and machine-generated text. Based on this mechanism, we propose a novel LLM detector that only requires simple calculations using a pair of pre-trained LLMs. The method, called Binoculars, achieves state-of-the-art accuracy without any training data. It is capable of spotting machine text from a range of modern LLMs without any model-specific modifications. We comprehensively evaluate Binoculars on a number of text sources and in varied situations. Over a wide range of document types, Binoculars detects over 90% of generated samples from ChatGPT (and other LLMs) at a false positive rate of 0.01%, despite not being trained on any ChatGPT data.
△ Less
Submitted 1 July, 2024; v1 submitted 22 January, 2024;
originally announced January 2024.
-
AI in Supply Chain Risk Assessment: A Systematic Literature Review and Bibliometric Analysis
Authors:
Md Abrar Jahin,
Saleh Akram Naife,
Anik Kumar Saha,
M. F. Mridha
Abstract:
Supply chain risk assessment (SCRA) has witnessed a profound evolution through the integration of artificial intelligence (AI) and machine learning (ML) techniques, revolutionizing predictive capabilities and risk mitigation strategies. The significance of this evolution stems from the critical role of robust risk management strategies in ensuring operational resilience and continuity within moder…
▽ More
Supply chain risk assessment (SCRA) has witnessed a profound evolution through the integration of artificial intelligence (AI) and machine learning (ML) techniques, revolutionizing predictive capabilities and risk mitigation strategies. The significance of this evolution stems from the critical role of robust risk management strategies in ensuring operational resilience and continuity within modern supply chains. Previous reviews have outlined established methodologies but have overlooked emerging AI/ML techniques, leaving a notable research gap in understanding their practical implications within SCRA. This paper conducts a systematic literature review combined with a comprehensive bibliometric analysis. We meticulously examined 1,717 papers and derived key insights from a select group of 48 articles published between 2014 and 2023. The review fills this research gap by addressing pivotal research questions, and exploring existing AI/ML techniques, methodologies, findings, and future trajectories, thereby providing a more encompassing view of the evolving landscape of SCRA. Our study unveils the transformative impact of AI/ML models, such as Random Forest, XGBoost, and hybrids, in substantially enhancing precision within SCRA. It underscores adaptable post-COVID strategies, advocating for resilient contingency plans and aligning with evolving risk landscapes. Significantly, this review surpasses previous examinations by accentuating emerging AI/ML techniques and their practical implications within SCRA. Furthermore, it highlights the contributions through a comprehensive bibliometric analysis, revealing publication trends, influential authors, and highly cited articles.
△ Less
Submitted 25 January, 2024; v1 submitted 12 December, 2023;
originally announced January 2024.
-
Study of $^{113}$In($α,α$) elastic scattering to determine $α$-optical potential relevant for astrophysical $γ$-process
Authors:
Dipali Basak,
Tanmoy Bar,
Lalit Kumar Sahoo,
Sukhendu Saha,
T. K. Rana,
S. Manna,
C. Bhattacharya,
Samir Kundu,
J. K. Sahoo,
J. K. Meena,
A. K. Saha,
Ashok Kumar Mondal,
Chinmay Basu
Abstract:
The $α$-optical potential is one of the key input parameters used to measure the reaction rate of the ($γ,α$)-process using the Hauser-Feshbach(HF) statistical model and the principle of detailed balance. $α$-elastic scattering experiment on $^{113}$In $p$-nucleus was carried out in the energy range E$_{lab}$=26$-$32 MeV. The vacuum evaporation technique was used to prepare the $^{113}$In target~(…
▽ More
The $α$-optical potential is one of the key input parameters used to measure the reaction rate of the ($γ,α$)-process using the Hauser-Feshbach(HF) statistical model and the principle of detailed balance. $α$-elastic scattering experiment on $^{113}$In $p$-nucleus was carried out in the energy range E$_{lab}$=26$-$32 MeV. The vacuum evaporation technique was used to prepare the $^{113}$In target~($\sim$86 $μ$g/cm$^2$). An energy-dependent local optical potential parameters set was obtained by analysing the experimental elastic scattering angular distribution data. The local potential parameters are extrapolated for lower energies and are used to measure the $^{113}$In($α,γ$) reaction cross-section.
△ Less
Submitted 15 January, 2024;
originally announced January 2024.
-
Quantum chaos in the presence of non-conformality
Authors:
Ashis Saha,
Sunandan Gangopadhyay
Abstract:
The behaviour of a chaotic system and its effect on existing quantum correlation has been holographically studied in presence of non-conformality. Keeping in mind the gauge/gravity duality framework, the non-conformality in the dual field theory has been introduced by considering a Liouville type dilaton potential for the gravitational theory. The resulting black brane solution is associated with…
▽ More
The behaviour of a chaotic system and its effect on existing quantum correlation has been holographically studied in presence of non-conformality. Keeping in mind the gauge/gravity duality framework, the non-conformality in the dual field theory has been introduced by considering a Liouville type dilaton potential for the gravitational theory. The resulting black brane solution is associated with a parameter $η$ which represents the deviation from conformality. The parameters of chaos, namely, the Lyapunov exponent and butterfly velocity are computed by following the well-known shock wave analysis. The obtained results reveal that presence of non-conformality leads to suppression of the chaotic nature of a system. Further, for a particular value of the non-conformal parameter $η$, the system achieves Lyapunov stability resulting from the vanishing of both the Lyapunov exponent and as well as butterfly velocity. Interestingly, this particular value of $η$ matches with the previously given upper bound of $η$ known as Gubser bound in the literature. The effects of chaos and non-conformality on the existing correlation of a thermofield doublet state have been quantified by holographically computing the thermo mutual information in both the presence and absence of the shock wave. Furthermore, the entanglement velocity is also computed and the effect of non-conformality on it has been observed. Finally, the obtained results for the Lyapunov exponent and the butterfly velocity have also been computed from the pole-skipping analysis. The results from the two approaches agree with each other.
△ Less
Submitted 3 July, 2024; v1 submitted 11 January, 2024;
originally announced January 2024.
-
Field theory expansions of string theory amplitudes
Authors:
Arnab Priya Saha,
Aninda Sinha
Abstract:
Motivated by quantum field theory (QFT) considerations, we present new representations of the Euler-Beta function and tree-level string theory amplitudes using a new two-channel, local, crossing symmetric dispersion relation. Unlike standard series representations, the new ones are analytic everywhere except at the poles, sum over poles in all channels and include contact interactions, in the spir…
▽ More
Motivated by quantum field theory (QFT) considerations, we present new representations of the Euler-Beta function and tree-level string theory amplitudes using a new two-channel, local, crossing symmetric dispersion relation. Unlike standard series representations, the new ones are analytic everywhere except at the poles, sum over poles in all channels and include contact interactions, in the spirit of QFT. This enables us to consider mass-level truncation, which preserves all the features of the original amplitudes. By starting with such expansions for generalized Euler-Beta functions and demanding QFT like features, we single out the open superstring amplitude. We demonstrate the difficulty in deforming away from the string amplitude and show that a class of such deformations can be potentially interesting when there is level truncation. Our considerations also lead to new QFT-inspired, parametric representations of the Zeta function and $π$, which show fast convergence.
△ Less
Submitted 29 April, 2024; v1 submitted 11 January, 2024;
originally announced January 2024.
-
Think Before You Duel: Understanding Complexities of Preference Learning under Constrained Resources
Authors:
Rohan Deb,
Aadirupa Saha
Abstract:
We consider the problem of reward maximization in the dueling bandit setup along with constraints on resource consumption. As in the classic dueling bandits, at each round the learner has to choose a pair of items from a set of $K$ items and observe a relative feedback for the current pair. Additionally, for both items, the learner also observes a vector of resource consumptions. The objective of…
▽ More
We consider the problem of reward maximization in the dueling bandit setup along with constraints on resource consumption. As in the classic dueling bandits, at each round the learner has to choose a pair of items from a set of $K$ items and observe a relative feedback for the current pair. Additionally, for both items, the learner also observes a vector of resource consumptions. The objective of the learner is to maximize the cumulative reward, while ensuring that the total consumption of any resource is within the allocated budget. We show that due to the relative nature of the feedback, the problem is more difficult than its bandit counterpart and that without further assumptions the problem is not learnable from a regret minimization perspective. Thereafter, by exploiting assumptions on the available budget, we provide an EXP3 based dueling algorithm that also considers the associated consumptions and show that it achieves an $\tilde{\mathcal{O}}\left({\frac{OPT^{(b)}}{B}}K^{1/3}T^{2/3}\right)$ regret, where $OPT^{(b)}$ is the optimal value and $B$ is the available budget. Finally, we provide numerical simulations to demonstrate the efficacy of our proposed method.
△ Less
Submitted 28 December, 2023;
originally announced December 2023.
-
An EFT origin of Secluded Dark Matter
Authors:
AseshKrishna Datta,
Sourov Roy,
Abhijit Kumar Saha,
Ananya Tapadar
Abstract:
The present study aims to unveil a scenario with a non-minimal secluded dark sector (DS) in an effective field theory (EFT) framework. To explore this, we have examined a suitable extension of the type-X Two Higgs Doublet Model (2HDM) as a potential origin for the secluded DS. The DS comprises a dark matter (DM) candidate and a mediator particle `$a$' and possesses some non-minimal characteristics…
▽ More
The present study aims to unveil a scenario with a non-minimal secluded dark sector (DS) in an effective field theory (EFT) framework. To explore this, we have examined a suitable extension of the type-X Two Higgs Doublet Model (2HDM) as a potential origin for the secluded DS. The DS comprises a dark matter (DM) candidate and a mediator particle `$a$' and possesses some non-minimal characteristics. It becomes non-thermally populated through diverse dim-6 four-Fermi operators, effectively generated by integrating out the heavier Higgs particles. The analysis further focuses on the consequences of the collision processes $\textit{DM}+ a \leftrightarrow a + a$ and $\textit{DM}+ \textit{DM} \leftrightarrow a + a$ occurring within the DS. We have investigated the significance of employing an EFT approach in tracking the temperature evolution of the DS. Within the present framework, the observed relic abundance of the DM can be realized through both dark freeze-out and freeze-in mechanisms. Further, we have delineated the permissible ranges of the relevant parameters, viz., the DM mass ($m_χ\gtrsim 20 \, \text{GeV}$), the portal coupling ($C_τ\lesssim 10^{-14}\, \text{GeV}^{-2}$), and the DS coupling ($λ\lesssim 10^{-6} \,\text{GeV}^{-2}$) by taking into account the perturbativity of the involved couplings while reproducing the observed DM relic and complying with the bounds from a successful Big Bang Nucleosynthesis (BBN) and $γ$-ray searches.
△ Less
Submitted 11 July, 2024; v1 submitted 28 December, 2023;
originally announced December 2023.
-
Variation-Resilient FeFET-Based In-Memory Computing Leveraging Probabilistic Deep Learning
Authors:
Bibhas Manna,
Arnob Saha,
Zhouhang Jiang,
Kai Ni,
Abhronil Sengupta
Abstract:
Reliability issues stemming from device level non-idealities of non-volatile emerging technologies like ferroelectric field-effect transistors (FeFET), especially at scaled dimensions, cause substantial degradation in the accuracy of In-Memory crossbar-based AI systems. In this work, we present a variation-aware design technique to characterize the device level variations and to mitigate their imp…
▽ More
Reliability issues stemming from device level non-idealities of non-volatile emerging technologies like ferroelectric field-effect transistors (FeFET), especially at scaled dimensions, cause substantial degradation in the accuracy of In-Memory crossbar-based AI systems. In this work, we present a variation-aware design technique to characterize the device level variations and to mitigate their impact on hardware accuracy employing a Bayesian Neural Network (BNN) approach. An effective conductance variation model is derived from the experimental measurements of cycle-to-cycle (C2C) and device-to-device (D2D) variations performed on FeFET devices fabricated using 28 nm high-$k$ metal gate technology. The variations were found to be a function of different conductance states within the given programming range, which sharply contrasts earlier efforts where a fixed variation dispersion was considered for all conductance values. Such variation characteristics formulated for three different device sizes at different read voltages were provided as prior variation information to the BNN to yield a more exact and reliable inference. Near-ideal accuracy for shallow networks (MLP5 and LeNet models) on the MNIST dataset and limited accuracy decline by $\sim$3.8-16.1% for deeper AlexNet models on CIFAR10 dataset under a wide range of variations corresponding to different device sizes and read voltages, demonstrates the efficacy of our proposed device-algorithm co-design technique.
△ Less
Submitted 13 March, 2024; v1 submitted 24 December, 2023;
originally announced December 2023.
-
Heterogeneous Transfer Learning for Building High-Dimensional Generalized Linear Models with Disparate Datasets
Authors:
Ruzhang Zhao,
Prosenjit Kundu,
Arkajyoti Saha,
Nilanjan Chatterjee
Abstract:
Development of comprehensive prediction models are often of great interest in many disciplines of science, but datasets with information on all desired features typically have small sample sizes. In this article, we describe a transfer learning approach for building high-dimensional generalized linear models using data from a main study that has detailed information on all predictors, and from one…
▽ More
Development of comprehensive prediction models are often of great interest in many disciplines of science, but datasets with information on all desired features typically have small sample sizes. In this article, we describe a transfer learning approach for building high-dimensional generalized linear models using data from a main study that has detailed information on all predictors, and from one or more external studies that have ascertained a more limited set of predictors. We propose using the external dataset(s) to build reduced model(s) and then transfer the information on underlying parameters for the analysis of the main study through a set of calibration equations, while accounting for the study-specific effects of certain design variables. We then use a generalized method of moment (GMM) with penalization for parameter estimation and develop highly scalable algorithms for fitting models taking advantage of the popular glmnet package. We further show that the use of adaptive-Lasso penalty leads to the oracle property of underlying parameter estimates and thus leads to convenient post-selection inference procedures. We conduct extensive simulation studies to investigate both predictive performance and post-selection inference properties of the proposed method. Finally, we illustrate a timely application of the proposed method for the development of risk prediction models for five common diseases using the UK Biobank study, combining baseline information from all study participants (500K) and recently released high-throughout proteomic data (# protein = 1500) on a subset (50K) of the participants.
△ Less
Submitted 20 December, 2023;
originally announced December 2023.
-
Electromagnetically-induced transparency assists the Raman gradient echo memory at moderate detuning, dependent on gradient order
Authors:
Jesse L. Everett,
Ankit Papneja,
Arindam Saha,
Cameron Trainor,
Aaron D. Tranter,
Ben C. Buchler
Abstract:
Optical quantum memories are essential for quantum communications and photonic quantum technologies. Ensemble optical memories based on 3-level interactions are a popular basis for implementing these memories. All such memories, however, suffer from loss due to scattering. In off-resonant 3-level interactions, such as the Raman gradient echo memory (GEM), scattering loss can be reduced by a large…
▽ More
Optical quantum memories are essential for quantum communications and photonic quantum technologies. Ensemble optical memories based on 3-level interactions are a popular basis for implementing these memories. All such memories, however, suffer from loss due to scattering. In off-resonant 3-level interactions, such as the Raman gradient echo memory (GEM), scattering loss can be reduced by a large detuning from the intermediate state. In this work, we show how electromagnetically induced transparency adjacent to the Raman absorption line plays a crucial role in reducing scattering loss, so that maximum efficiency is in fact achieved at a moderate detuning. Furthermore, the effectiveness of the transparency, and therefore the efficiency of GEM, depends on the order in which gradients are applied to store and recall the light. We provide a theoretical analysis and show experimentally how the efficiency depends on gradient order and detuning.
△ Less
Submitted 13 May, 2024; v1 submitted 19 December, 2023;
originally announced December 2023.
-
Faster Convergence with Multiway Preferences
Authors:
Aadirupa Saha,
Vitaly Feldman,
Tomer Koren,
Yishay Mansour
Abstract:
We address the problem of convex optimization with preference feedback, where the goal is to minimize a convex function given a weaker form of comparison queries. Each query consists of two points and the dueling feedback returns a (noisy) single-bit binary comparison of the function values of the two queried points. Here we consider the sign-function-based comparison feedback model and analyze th…
▽ More
We address the problem of convex optimization with preference feedback, where the goal is to minimize a convex function given a weaker form of comparison queries. Each query consists of two points and the dueling feedback returns a (noisy) single-bit binary comparison of the function values of the two queried points. Here we consider the sign-function-based comparison feedback model and analyze the convergence rates with batched and multiway (argmin of a set queried points) comparisons. Our main goal is to understand the improved convergence rates owing to parallelization in sign-feedback-based optimization problems. Our work is the first to study the problem of convex optimization with multiway preferences and analyze the optimal convergence rates. Our first contribution lies in designing efficient algorithms with a convergence rate of $\smash{\widetilde O}(\frac{d}{\min\{m,d\} ε})$ for $m$-batched preference feedback where the learner can query $m$-pairs in parallel. We next study a $m$-multiway comparison (`battling') feedback, where the learner can get to see the argmin feedback of $m$-subset of queried points and show a convergence rate of $\smash{\widetilde O}(\frac{d}{ \min\{\log m,d\}ε})$. We show further improved convergence rates with an additional assumption of strong convexity. Finally, we also study the convergence lower bounds for batched preferences and multiway feedback optimization showing the optimality of our convergence rates w.r.t. $m$.
△ Less
Submitted 18 December, 2023;
originally announced December 2023.
-
Performance of externally validated machine learning models based on histopathology images for the diagnosis, classification, prognosis, or treatment outcome prediction in female breast cancer: A systematic review
Authors:
Ricardo Gonzalez,
Peyman Nejat,
Ashirbani Saha,
Clinton J. V. Campbell,
Andrew P. Norgan,
Cynthia Lokker
Abstract:
Numerous machine learning (ML) models have been developed for breast cancer using various types of data. Successful external validation (EV) of ML models is important evidence of their generalizability. The aim of this systematic review was to assess the performance of externally validated ML models based on histopathology images for diagnosis, classification, prognosis, or treatment outcome predi…
▽ More
Numerous machine learning (ML) models have been developed for breast cancer using various types of data. Successful external validation (EV) of ML models is important evidence of their generalizability. The aim of this systematic review was to assess the performance of externally validated ML models based on histopathology images for diagnosis, classification, prognosis, or treatment outcome prediction in female breast cancer. A systematic search of MEDLINE, EMBASE, CINAHL, IEEE, MICCAI, and SPIE conferences was performed for studies published between January 2010 and February 2022. The Prediction Model Risk of Bias Assessment Tool (PROBAST) was employed, and the results were narratively described. Of the 2011 non-duplicated citations, 8 journal articles and 2 conference proceedings met inclusion criteria. Three studies externally validated ML models for diagnosis, 4 for classification, 2 for prognosis, and 1 for both classification and prognosis. Most studies used Convolutional Neural Networks and one used logistic regression algorithms. For diagnostic/classification models, the most common performance metrics reported in the EV were accuracy and area under the curve, which were greater than 87% and 90%, respectively, using pathologists' annotations as ground truth. The hazard ratios in the EV of prognostic ML models were between 1.7 (95% CI, 1.2-2.6) and 1.8 (95% CI, 1.3-2.7) to predict distant disease-free survival; 1.91 (95% CI, 1.11-3.29) for recurrence, and between 0.09 (95% CI, 0.01-0.70) and 0.65 (95% CI, 0.43-0.98) for overall survival, using clinical data as ground truth. Despite EV being an important step before the clinical application of a ML model, it hasn't been performed routinely. The large variability in the training/validation datasets, methods, performance metrics, and reported information limited the comparison of the models and the analysis of their results (...)
△ Less
Submitted 9 December, 2023;
originally announced December 2023.
-
Seeing the random forest through the decision trees. Supporting learning health systems from histopathology with machine learning models: Challenges and opportunities
Authors:
Ricardo Gonzalez,
Ashirbani Saha,
Clinton J. V. Campbell,
Peyman Nejat,
Cynthia Lokker,
Andrew P. Norgan
Abstract:
This paper discusses some overlooked challenges faced when working with machine learning models for histopathology and presents a novel opportunity to support "Learning Health Systems" with them. Initially, the authors elaborate on these challenges after separating them according to their mitigation strategies: those that need innovative approaches, time, or future technological capabilities and t…
▽ More
This paper discusses some overlooked challenges faced when working with machine learning models for histopathology and presents a novel opportunity to support "Learning Health Systems" with them. Initially, the authors elaborate on these challenges after separating them according to their mitigation strategies: those that need innovative approaches, time, or future technological capabilities and those that require a conceptual reappraisal from a critical perspective. Then, a novel opportunity to support "Learning Health Systems" by integrating hidden information extracted by ML models from digitalized histopathology slides with other healthcare big data is presented.
△ Less
Submitted 6 December, 2023;
originally announced December 2023.
-
On-shell functions on the Coulomb branch of $\mathcal{N}=4$ SYM
Authors:
Md. Abhishek,
Subramanya Hegde,
Dileep P. Jatkar,
Arnab Priya Saha,
Amit Suthar
Abstract:
We study on-shell functions in the kinematic space for the Coulomb branch of $\mathcal{N}=4$ SYM. We construct BCFW bridges that help us build bigger on-shell functions. As a consequence, we provide on-shell diagram formulations for BCFW shifts that correspond to various mass configurations. We will use this to calculate the quadruple cut for the one-loop amplitude on the Coulomb branch and maxima…
▽ More
We study on-shell functions in the kinematic space for the Coulomb branch of $\mathcal{N}=4$ SYM. We construct BCFW bridges that help us build bigger on-shell functions. As a consequence, we provide on-shell diagram formulations for BCFW shifts that correspond to various mass configurations. We will use this to calculate the quadruple cut for the one-loop amplitude on the Coulomb branch and maximal cuts for higher-loops. We make preliminary comments on finding the inequivalent set of on-shell functions for the Coulomb branch.
△ Less
Submitted 30 April, 2024; v1 submitted 29 November, 2023;
originally announced November 2023.