-
Enhancing Training Efficiency Using Packing with Flash Attention
Authors:
Achintya Kundu,
Rhui Dih Lee,
Laura Wynter,
Raghu Kiran Ganti
Abstract:
Padding is often used in tuning LLM models by adding special tokens to shorter training examples to match the length of the longest sequence in each batch. While this ensures uniformity for batch processing, it introduces inefficiencies by including irrelevant padding tokens in the computation and wastes GPU resources. On the other hand, the Hugging Face SFT trainer offers the option to use packing to combine multiple training examples up to the maximum sequence length. This allows for maximal utilization of GPU resources. However, without proper masking of each packed training example, attention will not be computed correctly when using SFT trainer. We enable and then analyse packing and Flash Attention with proper attention masking of each example and show the benefits of this training paradigm.
Submitted 12 July, 2024;
originally announced July 2024.
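Illustration for the entry above: a minimal sketch of what "proper masking of each packed training example" can mean, assuming a causal language model and a packed row built from examples of known lengths. The function names `packed_causal_mask` and `cumulative_seqlens` are hypothetical and are not taken from the paper or from any library; the abstract does not describe the authors' implementation.

```python
def packed_causal_mask(lengths):
    """Build a block-diagonal causal mask for one packed row.

    mask[i][j] is True when token i may attend to token j, i.e. when both
    tokens belong to the same original example and j is not after i.
    """
    segment = []
    for example_id, n in enumerate(lengths):
        segment.extend([example_id] * n)  # example id of every packed token
    total = len(segment)
    return [
        [segment[i] == segment[j] and j <= i for j in range(total)]
        for i in range(total)
    ]


def cumulative_seqlens(lengths):
    """Return example boundaries as cumulative offsets, e.g. [2, 3] -> [0, 2, 5]."""
    offsets = [0]
    for n in lengths:
        offsets.append(offsets[-1] + n)
    return offsets


# Two examples of lengths 2 and 3 packed into one row of 5 tokens.
mask = packed_causal_mask([2, 3])
offsets = cumulative_seqlens([2, 3])
```

Without such a mask, the first token of the second example (position 2) would attend to the tokens of the first example. The cumulative-offset form carries the same boundary information more compactly than the full mask.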
-
OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects
Authors:
Akshay Krishnan,
Abhijit Kundu,
Kevis-Kokitsi Maninis,
James Hays,
Matthew Brown
Abstract:
We propose OmniNOCS, a large-scale monocular dataset with 3D Normalized Object Coordinate Space (NOCS) maps, object masks, and 3D bounding box annotations for indoor and outdoor scenes. OmniNOCS has 20 times more object classes and 200 times more instances than existing NOCS datasets (NOCS-Real275, Wild6D). We use OmniNOCS to train a novel, transformer-based monocular NOCS prediction model (NOCSformer) that can predict accurate NOCS, instance masks and poses from 2D object detections across diverse classes. It is the first NOCS model that can generalize to a broad range of classes when prompted with 2D boxes. We evaluate our model on the task of 3D oriented bounding box prediction, where it achieves comparable results to state-of-the-art 3D detection methods such as Cube R-CNN. Unlike other 3D detection methods, our model also provides detailed and accurate 3D object shape and segmentation. We propose a novel benchmark for the task of NOCS prediction based on OmniNOCS, which we hope will serve as a useful baseline for future work in this area. Our dataset and code will be at the project website: https://omninocs.github.io.
Submitted 11 July, 2024;
originally announced July 2024.
-
Quantum Dynamics with Stochastic Non-Hermitian Hamiltonians
Authors:
Pablo Martinez-Azcona,
Aritra Kundu,
Avadh Saxena,
Adolfo del Campo,
Aurelia Chenu
Abstract:
We study the quantum dynamics generated by a non-Hermitian Hamiltonian subject to stochastic perturbations in its anti-Hermitian part, describing fluctuating gains and losses. The master equation governing the noise-average dynamics describes a new form of dephasing. We characterize the resulting state evolution and analyze its purity. The novel properties of such dynamics are illustrated in a stochastic dissipative qubit. Our analytical results show that adding noise allows for a rich control of the dynamics, with a greater diversity of steady states and the possibility of state purification.
Submitted 10 July, 2024;
originally announced July 2024.
-
Code Hallucination
Authors:
Mirza Masfiqur Rahman,
Ashish Kundu
Abstract:
Generative models such as large language models are extensively used as code copilots and for whole program generation. However, the programs they generate often have questionable correctness, authenticity and reliability in terms of integration as they might not follow the user requirements, provide incorrect and/or nonsensical outputs, or even contain semantic/syntactic errors - overall known as LLM hallucination. In this work, we present several types of code hallucination. We have generated such hallucinated code manually using large language models. We also present a technique, HallTrigger, to demonstrate efficient ways of generating arbitrary code hallucination. Our method leverages 3 different dynamic attributes of LLMs to craft prompts that can successfully trigger hallucinations from models without the need to access model architecture or parameters. Results from popular blackbox models suggest that HallTrigger is indeed effective and that pervasive LLM hallucination has a significant impact on software development.
Submitted 5 July, 2024;
originally announced July 2024.
-
H.E.S.S. observations of the 2021 periastron passage of PSR B1259-63/LS 2883
Authors:
H. E. S. S. Collaboration,
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
M. Bouyahiaoui,
R. Brose,
A. Brown,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin,
S. Caroff,
S. Casanova
, et al. (119 additional authors not shown)
Abstract:
PSR B1259-63 is a gamma-ray binary system that hosts a pulsar in an eccentric orbit, with a 3.4 year period, around an O9.5Ve star. At orbital phases close to periastron passages, the system radiates bright and variable non-thermal emission. We report on an extensive VHE observation campaign conducted with the High Energy Stereoscopic System, comprised of ~100 hours of data taken from $t_p-24$ days to $t_p+127$ days around the system's 2021 periastron passage. We also present the timing and spectral analyses of the source. The VHE light curve in 2021 is consistent with the stacked light curve of all previous observations. Within the light curve, we report a VHE maximum at times coincident with the third X-ray peak first detected in the 2021 X-ray light curve. In the light curve -- although sparsely sampled in this time period -- we see no VHE enhancement during the second disc crossing. In addition, we see no correspondence to the 2021 GeV flare in the VHE light curve. The VHE spectrum obtained from the analysis of the 2021 dataset is best described by a power law of spectral index $Γ= 2.65 \pm 0.04_{\text{stat}}$ $\pm 0.04_{\text{sys}}$, a value consistent with the previous H.E.S.S. observations of the source. We report spectral variability with a difference of $ΔΓ= 0.56 ~\pm~ 0.18_{\text{stat}}$ $~\pm~0.10_{\text{sys}}$ at 95% c.l., between sub-periods of the 2021 dataset. We also find a linear correlation between contemporaneous flux values of X-ray and TeV datasets, detected mainly after $t_p+25$ days, suggesting a change in the available energy for non-thermal radiation processes. We detect no significant correlation between GeV and TeV flux points, within the uncertainties of the measurements, from $\sim t_p-23$ days to $\sim t_p+126$ days. This suggests that the GeV and TeV emission originate from different electron populations.
Submitted 26 June, 2024;
originally announced June 2024.
-
trARPES and optical transport properties of irradiated twisted bilayer graphene in steady-state
Authors:
Ashutosh Dubey,
Ritajit Kundu,
Arijit Kundu
Abstract:
We theoretically investigate the trARPES spectrum and optical Hall conductivity in periodically driven twisted bilayer graphene, considering both steady-state and "projected" occupations of the Floquet state. In periodically driven pre-thermalized systems, steady-state occupation of Floquet states is predicted to occur when coupled to a bath, while these states have projected occupation instantaneously after the driving starts. We study how these two regimes can give markedly different responses in optical transport properties. In particular, our results show that steady-state occupation leads to near-quantized optical Hall conductivity for a range of driving parameters in twisted bilayer graphene, whereas projected occupation leads to non-quantized values. We discuss the experimental feasibility of probing such non-equilibrium states in twisted bilayer graphene.
Submitted 25 June, 2024;
originally announced June 2024.
-
KANQAS: Kolmogorov Arnold Network for Quantum Architecture Search
Authors:
Akash Kundu,
Aritra Sarkar,
Abhishek Sadhu
Abstract:
Quantum architecture search (QAS) is a promising direction for optimization and automated design of quantum circuits towards quantum advantage. Recent techniques in QAS focus on machine learning-based approaches from reinforcement learning, like deep Q-network. While multi-layer perceptron-based deep Q-networks have been applied for QAS, their interpretability remains challenging due to the high number of parameters. In this work, we evaluate the practicality of KANs in quantum architecture search problems, analyzing their efficiency in terms of the probability of success, frequency of optimal solutions and their dependencies on various degrees of freedom of the network. In a noiseless scenario, the probability of success and the number of optimal quantum circuit configurations to generate the multi-qubit maximally entangled states are significantly higher than with MLPs. Moreover, in noisy scenarios, KAN can achieve a better fidelity in approximating maximally entangled state than MLPs, where the performance of the MLP significantly depends on the choice of activation function. Further investigation reveals that KAN requires a very small number of learnable parameters compared to MLPs; however, the average time of executing each episode for KAN is much higher.
Submitted 25 June, 2024;
originally announced June 2024.
-
YAQQ: Yet Another Quantum Quantizer -- Design Space Exploration of Quantum Gate Sets using Novelty Search
Authors:
Aritra Sarkar,
Akash Kundu,
Matthew Steinberg,
Sibasish Mishra,
Sebastiaan Fauquenot,
Tamal Acharya,
Jarosław A. Miszczak,
Sebastian Feld
Abstract:
In the standard circuit model of quantum computation, the number and quality of the quantum gates composing the circuit influence the runtime and fidelity of the computation. The fidelity of the decomposition of quantum algorithms, represented as unitary matrices, to bounded depth quantum circuits depends strongly on the set of gates available for the decomposition routine. To investigate this dependence, we explore the design space of discrete quantum gate sets and present a software tool for comparative analysis of quantum processing units and control protocols based on their native gates. The evaluation is conditioned on a set of unitary transformations representing target use cases on the quantum processors. The cost function considers three key factors: (i) the statistical distribution of the decomposed circuits' depth, (ii) the statistical distribution of process fidelities for the approximate decomposition, and (iii) the relative novelty of a gate set compared to other gate sets in terms of the aforementioned properties. The developed software, YAQQ (Yet Another Quantum Quantizer), enables the discovery of an optimized set of quantum gates through this tunable joint cost function. To identify these gate sets, we use the novelty search algorithm, circuit decomposition techniques, and stochastic optimization to implement YAQQ within the Qiskit quantum simulator environment. YAQQ exploits reachability tradeoffs conceptually derived from quantum algorithmic information theory. Our results demonstrate the pragmatic application of identifying gate sets that are advantageous to popularly used quantum gate sets in representing quantum algorithms. Consequently, we demonstrate pragmatic use cases of YAQQ in comparing transversal logical gate sets in quantum error correction codes, designing optimal quantum instruction sets, and compiling to specific quantum processors.
Submitted 25 June, 2024;
originally announced June 2024.
-
Low-Energy Electronic Structure in the Unconventional Charge-Ordered State of ScV$_6$Sn$_6$
Authors:
Asish K. Kundu,
Xiong Huang,
Eric Seewald,
Ethan Ritz,
Santanu Pakhira,
Shuai Zhang,
Dihao Sun,
Simon Turkel,
Sara Shabani,
Turgut Yilmaz,
Elio Vescovo,
Cory R. Dean,
David C. Johnston,
Tonica Valla,
Turan Birol,
Dmitri N. Basov,
Rafael M. Fernandes,
Abhay N. Pasupathy
Abstract:
Kagome vanadates {\it A}V$_3$Sb$_5$ display unusual low-temperature electronic properties including charge density waves (CDW), whose microscopic origin remains unsettled. Recently, CDW order has been discovered in a new material ScV$_6$Sn$_6$, providing an opportunity to explore whether the onset of CDW leads to unusual electronic properties. Here, we study this question using angle-resolved photoemission spectroscopy (ARPES) and scanning tunneling microscopy (STM). The ARPES measurements show minimal changes to the electronic structure after the onset of CDW. However, STM quasiparticle interference (QPI) measurements show strong dispersing features related to the CDW ordering vectors. A plausible explanation is the presence of a strong momentum-dependent scattering potential peaked at the CDW wavevector, associated with the existence of competing CDW instabilities. Our STM results further indicate that the bands most affected by the CDW are near vHS, analogous to the case of {\it A}V$_3$Sb$_5$ despite very different CDW wavevectors.
Submitted 17 June, 2024;
originally announced June 2024.
-
Drift-diffusive resetting search process with stochastic returns: speed-up beyond optimal instantaneous return
Authors:
Arup Biswas,
Ashutosh Dubey,
Anupam Kundu,
Arnab Pal
Abstract:
Stochastic resetting has emerged as a useful strategy to reduce the completion time for a broad class of first passage processes. In the canonical setup, one intermittently resets a given system to its initial configuration only to start afresh and continue evolving in time until the target goal is met. This is, however, an instantaneous process and thus less feasible for any practical purposes. A crucial generalization in this regard is to consider a finite-time return process which has significant ramifications to the first passage properties. Intriguingly, it has recently been shown that for diffusive search processes, returning in finite but stochastic time can gain significant speed-up over the instantaneous resetting process. Unlike diffusion which has a diverging mean completion time, in this paper, we ask whether this phenomenon can also be observed for a first passage process with finite mean completion time. To this end, we explore the set-up of a classical drift-diffusive search process in one dimension with stochastic resetting and further assume that the return phase is modulated by a potential $U(x)=λ|x|$ with $λ>0$. For this process, we compute the mean first passage time exactly and underpin its characteristics with respect to the resetting rate and potential strength. We find a unified phase space that allows us to explore and identify the system parameter regions where stochastic return outperforms both the underlying process and the process under instantaneous resetting. Furthermore and quite interestingly, we find that for a range of parameters the mean completion time under stochastic return protocol can be reduced further than the \textit{optimally restarted} instantaneous processes. We thus believe that resetting with stochastic returns can serve as a better optimization strategy owing to its dominance over classical first passage under resetting.
Submitted 13 June, 2024;
originally announced June 2024.
-
Single MoS2-flake as a high TCR non-cryogenic bolometer
Authors:
Saba M. Khan,
Jyoti Saini,
Anirban Kundu,
Renu Rani,
Kiran S. Hazra
Abstract:
The temperature coefficient of resistance (TCR) of a bolometer can be tuned by modifying the thermal conductance of the absorbing material, since a bolometer senses radiation via the temperature change in the absorber. However, the thermal conductance of the absorber can be reduced by engineering the appropriate thermal isolation, which can be an ultimate solution towards making a highly sensitive thermal detector. Here, we have developed an atomically thin 2D bolometer detector made up of a mechanically transferred suspended multilayer-MoS2 flake, eliminating the use of challenging thin-film fabrication process. The strength of our detector lies in two factors: its large surface-to-volume window to absorb the radiation; and the suspended configuration, which prevents heat dissipation through the substrate and therefore reduces the thermal conductance. The bolometric response of the detector is tested in both modes, via the photoresponse and the thermal response. The prototype is found to exhibit a very high TCR ~ -9.5%/K with the least achievable thermal noise-equivalent power (NEP) ~ 0.61 pW Hz$^{-1/2}$, in ambient conditions at 328 K.
Submitted 11 June, 2024;
originally announced June 2024.
-
Harmonically trapped inertial run-and-tumble particle in one dimension
Authors:
Debraj Dutta,
Anupam Kundu,
Sanjib Sabhapandit,
Urna Basu
Abstract:
We study the nonequilibrium stationary state of a one-dimensional inertial run-and-tumble particle (IRTP) trapped in a harmonic potential. We find that the presence of inertia leads to two distinct dynamical scenarios, namely, overdamped and underdamped, characterized by the relative strength of the viscous and the trap time-scales. We also find that the inertial nature of the active dynamics leads to the particle being confined in specific regions of the phase plane in the overdamped and underdamped cases, which we compute analytically. Moreover, the interplay of the inertial and active time-scales gives rise to several sub-regimes, which are characterized by very different behaviour of position and velocity fluctuations of the IRTP. In particular, in the underdamped regime, both the position and velocity undergo transitions from a novel multi-peaked structure in the strongly active limit to a single peaked Gaussian-like distribution in the passive limit. On the other hand, in the overdamped scenario, the position distribution shows a transition from a U-shape to a dome-shape, as activity is decreased. Interestingly, the velocity distribution in the overdamped scenario shows two transitions -- from a single-peaked shape with an algebraic divergence at the origin in the strongly active regime to a double peaked one in the moderately active regime to a dome-shaped one in the passive regime.
Submitted 12 June, 2024; v1 submitted 10 June, 2024;
originally announced June 2024.
-
Moving Mirrors, OTOCs and Scrambling
Authors:
Parthajit Biswas,
Bobby Ezhuthachan,
Arnab Kundu,
Baishali Roy
Abstract:
We explore the physics of scrambling in the moving mirror models, in which a two-dimensional CFT is subjected to a time-dependent boundary condition. It is well-known that by choosing an appropriate mirror profile, one can model quantum aspects of black holes in two-dimensions, ranging from Hawking radiation in an eternal black hole (for an "escaping mirror") to the recent realization of Page curve in evaporating black holes (for a "kink mirror"). We explore a class of OTOCs in the presence of such a boundary and explicitly demonstrate the following primary aspects: First, we show that the dynamical CFT data directly affect an OTOC and maximally chaotic scrambling occurs for the escaping mirror for a large-$c$ CFT with identity block dominance. We further show that the exponential growth of OTOC associated with the physics of scrambling yields a power-law growth in the model for evaporating black holes which demonstrates a unitary dynamics in terms of a Page curve. We also demonstrate that, by tuning a parameter, one can naturally interpolate between an exponential growth associated to scrambling and a power-law growth in unitary dynamics. Our work explicitly exhibits the role of higher-point functions in CFT dynamics as well as the distinction between scrambling and Page curve. We also discuss several future possibilities based on this class of models.
Submitted 9 June, 2024;
originally announced June 2024.
-
RepCNN: Micro-sized, Mighty Models for Wakeword Detection
Authors:
Arnav Kundu,
Prateeth Nayak,
Hywel Richards,
Priyanka Padmanabhan,
Devang Naik
Abstract:
Always-on machine learning models require a very low memory and compute footprint. Their restricted parameter count limits the model's capacity to learn, and the effectiveness of the usual training algorithms to find the best parameters. Here we show that a small convolutional model can be better trained by first refactoring its computation into a larger redundant multi-branched architecture. Then, for inference, we algebraically re-parameterize the trained model into the single-branched form with fewer parameters for a lower memory footprint and compute cost. Using this technique, we show that our always-on wake-word detector model, RepCNN, provides a good trade-off between latency and accuracy during inference. RepCNN re-parameterized models are 43% more accurate than a uni-branch convolutional model while having the same runtime. RepCNN also meets the accuracy of complex architectures like BC-ResNet, while having 2x lower peak memory usage and 10x faster runtime.
Submitted 4 June, 2024;
originally announced June 2024.
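Illustration for the entry above: a minimal sketch of algebraic re-parameterization, assuming two parallel linear 1D convolution branches (a 3-tap kernel and a 1-tap kernel) whose outputs are summed. The branch layout, the kernel values, and the function names `conv1d_same` and `merge_branches` are assumptions for illustration only; the abstract does not specify the RepCNN architecture.

```python
def conv1d_same(x, kernel):
    """Cross-correlate x with an odd-length kernel, zero-padding to keep the length."""
    half = len(kernel) // 2
    padded = [0.0] * half + list(x) + [0.0] * half
    return [
        sum(kernel[k] * padded[i + k] for k in range(len(kernel)))
        for i in range(len(x))
    ]


def merge_branches(k3, k1):
    """Fold a 1-tap branch into a 3-tap branch.

    The 1-tap kernel is equivalent to a 3-tap kernel [0, w, 0], and
    convolution is linear, so the two kernels can simply be added.
    """
    return [k3[0], k3[1] + k1[0], k3[2]]


x = [1.0, 2.0, 3.0, 4.0]
k3 = [0.5, -1.0, 0.25]  # hypothetical trained 3-tap branch
k1 = [2.0]              # hypothetical trained 1-tap branch

# Training-time form: two branches evaluated separately, then summed.
two_branch = [a + b for a, b in zip(conv1d_same(x, k3), conv1d_same(x, k1))]

# Inference-time form: one merged kernel, one convolution.
one_branch = conv1d_same(x, merge_branches(k3, k1))
```

Both forms compute the same output, but the merged form stores and evaluates a single kernel. This is the general reason a multi-branch model can be collapsed for inference when its branches are linear.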
-
Superconducting magic-angle twisted trilayer graphene hosts competing magnetic order and moiré inhomogeneities
Authors:
Ayshi Mukherjee,
Surat Layek,
Subhajit Sinha,
Ritajit Kundu,
Alisha H. Marchawala,
Mahesh Hingankar,
Joydip Sarkar,
L. D. Varma Sangani,
Heena Agarwal,
Sanat Ghosh,
Aya Batoul Tazi,
Kenji Watanabe,
Takashi Taniguchi,
Abhay N. Pasupathy,
Arijit Kundu,
Mandar M. Deshmukh
Abstract:
The microscopic mechanism of superconductivity in the magic-angle twisted graphene family, including magic-angle twisted trilayer graphene (MATTG), is poorly understood. Properties of MATTG, like Pauli limit violation, suggest unconventional superconductivity. Theoretical studies propose proximal magnetic states in the phase diagram, but direct experimental evidence is lacking. We show direct evidence for an in-plane magnetic order proximal to the superconducting state using two complementary electrical transport measurements. First, we probe the superconducting phase by using statistically significant switching events from superconducting to the dissipative state of MATTG. The system behaves like a network of Josephson junctions due to lattice relaxation-induced moiré inhomogeneity in the system. We observe non-monotonic and hysteretic responses in the switching distributions as a function of temperature and in-plane magnetic field. Second, in normal regions doped slightly away from the superconducting regime, we observe hysteresis in magnetoresistance with an in-plane magnetic field; showing evidence for in-plane magnetic order that vanishes $\sim$900 mK. Additionally, we show a broadened Berezinskii-Kosterlitz-Thouless transition due to relaxation-induced moiré inhomogeneity. We find superfluid stiffness $J_{\mathrm{s}}$$\sim$0.15 K with strong temperature dependence. Theoretically, the magnetic and superconducting order arising from the magnetic order's fluctuations have been proposed - we show direct evidence for both. Our observation that the hysteretic magnetoresistance is sensitive to the in-plane field may constrain possible intervalley-coherent magnetic orders and the resulting superconductivity that arises from its fluctuations.
Submitted 4 June, 2024;
originally announced June 2024.
-
Influence of joint measurement bases on sharing network nonlocality
Authors:
Amit Kundu,
Debasis Sarkar
Abstract:
Sharing network nonlocality in an extended quantum network scenario is the new paradigm in the development of quantum theory. In this paper, we investigate the influence of Elegant joint measurement (in short, EJM) bases in an extended bilocal scenario on sharing network nonlocality via sequential measurement. The work is essentially based on the newly introduced [Phys. Rev. Lett. 126, 220401 (2021)] bilocal inequality with ternary inputs for end parties and EJM as joint measurement bases in the $Alice_n-Bob-Charlie_m$ scenario. Here, we are able to capture all simultaneous violations of this inequality for the $(n,m)\in \{(2,1),(1,2),(1,1),(2,2)\}$ cases. We further observe the criteria for sharing network nonlocality, where we also find the dependence of the sharing on the amount of entanglement of the joint bases. The effect of the nonlinearity in this inequality is also captured in our results with the symmetrical and asymmetrical violation in this extended scenario. The work will further the realization of quantum correlations in network scenarios.
Submitted 2 June, 2024;
originally announced June 2024.
-
LMO-DP: Optimizing the Randomization Mechanism for Differentially Private Fine-Tuning (Large) Language Models
Authors:
Qin Yang,
Meisam Mohammad,
Han Wang,
Ali Payani,
Ashish Kundu,
Kai Shu,
Yan Yan,
Yuan Hong
Abstract:
Differentially Private Stochastic Gradient Descent (DP-SGD) and its variants have been proposed to ensure rigorous privacy for fine-tuning large-scale pre-trained language models. However, they rely heavily on the Gaussian mechanism, which may overly perturb the gradients and degrade the accuracy, especially in stronger privacy regimes (e.g., the privacy budget $ε< 3$). To address such limitations, we propose a novel Language Model-based Optimal Differential Privacy (LMO-DP) mechanism, which takes the first step to enable the tight composition of accurately fine-tuning (large) language models with a sub-optimal DP mechanism, even in strong privacy regimes (e.g., $0.1\leq ε<3$). Furthermore, we propose a novel offline optimal noise search method to efficiently derive the sub-optimal DP that significantly reduces the noise magnitude. For instance, fine-tuning RoBERTa-large (with 300M parameters) on the SST-2 dataset can achieve an accuracy of 92.20% (given $ε=0.3$, $δ=10^{-10}$) by drastically outperforming the Gaussian mechanism (e.g., $\sim 50\%$ for small $ε$ and $δ$). We also draw similar findings on the text generation tasks on GPT-2. Finally, to our best knowledge, LMO-DP is also the first solution to accurately fine-tune Llama-2 with strong differential privacy guarantees. The code will be released soon and available upon request.
Submitted 29 May, 2024;
originally announced May 2024.
-
Shallow core levels, or how to determine the doping and $T_c$ of Bi$_2$Sr$_2$CaCu$_2$O$_{8+δ}$ and Bi$_{2}$Sr$_2$CuO$_{6+δ}$ without cooling
Authors:
Tonica Valla,
Asish K. Kundu,
Petar Pervan,
Ivo Pletikosić,
Ilya K. Drozdov,
Zebin Wu,
Genda D. Gu
Abstract:
Determining the doping level in high-temperature cuprate superconductors is crucial for understanding the origin of superconductivity in these materials and for unlocking their full potential. However, accurately determining the doping level remains a significant challenge due to a complex interplay of factors and limitations in various measurement techniques. In particular, in Bi$_{2}$Sr$_2$CuO$_{6+δ}$ and Bi$_2$Sr$_2$CaCu$_2$O$_{8+δ}$, where the mobile carriers are introduced by non-stoichiometric oxygen $δ$, the determination has been extremely problematic. Here, we study the doping dependence of the electronic structure of these materials in angle-resolved photoemission and find that both the doping level, $p$, and the superconducting transition temperature, $T_c$, can be precisely determined from the binding energy of the Bi $5d$ core-levels. The measurements can be performed at room temperature, enabling the determination of $p$ and $T_c$ without cooling the samples. This should be very helpful for further studies of these materials.
Submitted 28 May, 2024;
originally announced May 2024.
-
Generalized hydrodynamics and approach to Generalized Gibbs equilibrium for a classical harmonic chain
Authors:
Saurav Pandey,
Abhishek Dhar,
Anupam Kundu
Abstract:
We study the evolution of a classical harmonic chain with nearest-neighbor interactions starting from domain wall initial conditions. The initial state is taken to be either a product of two Gibbs Ensembles (GEs) with unequal temperatures on the two halves of the chain or a product of two Generalized Gibbs Ensembles (GGEs) with different parameters in the two halves. For this system, we construct the Wigner function and demonstrate that its evolution defines the Generalized Hydrodynamics (GHD) describing the evolution of the conserved quantities. We solve the GHD for both finite and infinite chains and compute the evolution of conserved densities and currents. For a finite chain with fixed boundaries, we show that these quantities relax as $\sim 1/\sqrt{t}$ to their respective steady-state values given by the final expected GE or GGE state, depending on the initial conditions. Exact expressions for the Lagrange multipliers of the final expected GGE state are obtained in terms of the steady state densities. In the case of an infinite chain, we find that the conserved densities and currents at any finite time exhibit ballistic scaling while, at infinite time, any finite segment of the system can be described by a current-carrying non-equilibrium steady state (NESS). We compute the scaling functions analytically and show that the relaxation to the NESS occurs as $\sim 1/t$ for the densities and as $\sim 1/t^2$ for the currents. We compare the analytic results from hydrodynamics with those from exact microscopic numerics and find excellent agreement.
Submitted 27 May, 2024;
originally announced May 2024.
-
The ORT and the uGMRT Pulsar Monitoring Program : Pulsar Timing Irregularities & the Gaussian Process Realization
Authors:
Himanshu Grover,
Bhal Chandra Joshi,
Jaikhomba Singha,
Erbil Gügercinoğlu,
Paramasivan Arumugam,
Debades Bandyopadhyay,
James O. Chibueze,
Shantanu Desai,
Innocent O. Eya,
Anu Kundu,
Johnson O. Urama
Abstract:
The spin-down law of pulsars is generally perturbed by two types of timing irregularities: glitches and timing noise. Glitches are sudden changes in the rotational frequency of pulsars, while timing noise is a discernible stochastic wandering in the phase, period, or spin-down rate of a pulsar. We present the timing results of a sample of glitching pulsars observed using the Ooty Radio Telescope (ORT) and the upgraded Giant Metrewave Radio Telescope (uGMRT). Our findings include timing noise analysis for 17 pulsars, with seven being reported for the first time. We detected five glitches in four pulsars and a glitch-like event in J1825-0935. The frequency evolution of glitch in pulsars, J0742-2822 and J1740-3015, is presented for the first time. Additionally, we report timing noise results for three glitching pulsars. The timing noise was analyzed separately in the pre-glitch region and post-glitch regions. We observed an increase in the red noise parameters in the post-glitch regions, where exponential recovery was considered in the noise analysis. Timing noise can introduce ambiguities in the correct evaluation of glitch observations. Hence, it is important to consider timing noise in glitch analysis. We propose an innovative glitch verification approach designed to discern between a glitch and strong timing noise. The novel glitch analysis technique is also demonstrated using the observed data.
Submitted 23 May, 2024;
originally announced May 2024.
-
In-beam $γ-$spectroscopy of the transitional nucleus $^{217}$Ac
Authors:
Dhananjaya Sahoo,
A. Y. Deo,
Madhu,
Khamosh Yadav,
S. S. Tiwary,
P. C. Srivastava,
R. Palit,
S. K. Tandel,
Anil Kumar,
P. Dey,
Biswajit Das,
Vishal Malik,
A. Kundu,
A. Sindhu,
S. V. Jadhav,
B. S. Naidu,
A. V. Thomas
Abstract:
High-spin states in the transitional $^{217}$Ac nucleus are established up to 3.8 MeV excitation energy and $I^π =$ 41/2$^+$ with the addition of around 20 new transitions. The structure of the yrast and near-yrast states below the 29/2$^+$ isomer is revisited. The inconsistencies in the level schemes reported earlier are resolved. The level structure above the 29/2$^+$ isomer is established for the first time. Large-basis shell-model calculations with the KHPE interaction are performed to compare the experimentally observed level energies with the theoretical predictions. A comparison with the systematics of the N = 128 isotones suggests that the yrast structures result from a weak coupling of the odd proton to the even-even $^{216}$Ra core, which is consistent with the shell-model configurations. Furthermore, alpha decay of the 29/2$^+$ isomer is revisited and the decay scheme established from this work is discussed in the framework of the shell model.
Submitted 22 May, 2024;
originally announced May 2024.
-
Measurement dependence can enhance security in a quantum network
Authors:
Amit Kundu,
Debasis Sarkar
Abstract:
Network Nonlocality is an advanced study of quantum nonlocality that comprises network structure beyond Bell's theorem. The development of quantum networks has the potential to bring many technological applications in several quantum information processing tasks. Here, we focus on the role of the independence of the measurement choices of the end parties in a network and how it can be used to enhance the security of a quantum network. In both the three-party, two-source bilocal network and the four-party, three-source star network scenarios, we show a practical way to understand how relaxing the assumptions can enhance a real security protocol if someone wants to breach network communications. Theoretically, we prove that by relaxing the independence of the measurement choices of only one end party we can create Standard Network Nonlocality (SNN) and the stronger Full Network Nonlocality (FNN), and that we can get maximum quantum violation by the classical no-signalling local model. We are able to distinguish between the two types of network nonlocality in the sense that FNN is stronger than SNN, i.e., FNN states that all the sources in a network need to distribute nonlocal resources.
Submitted 20 May, 2024;
originally announced May 2024.
-
About: "Float stacked graphene PMMA laminate"
Authors:
Anirban Kundu,
Won Kyung Seong,
S. Kamal Jalali,
Nicola M. Pugno,
Rodney S. Ruoff
Abstract:
We report the scientific and technical queries regarding the article reported by Kim et al.1 on the mechanical properties of graphene-poly(methyl methacrylate) (PMMA) composites. Our analysis finds that the current experimental data is insufficient to fully support the conclusions presented in the article. We suggest that the enhancement in Young's modulus and strength of the graphene-PMMA laminates (GPL) samples is mainly due to the heat treatment of the polymer rather than the incorporation of graphene. The Raman spectroscopy data (as per our analysis) for the GPL samples indicates that large cracks and defects were introduced during the hot rolling process used to fabricate the graphene-PMMA composite. We believe that the queries will aid the audience in better understanding the mechanical response of graphene-PMMA composites.
Submitted 27 May, 2024; v1 submitted 20 May, 2024;
originally announced May 2024.
-
Nuclear Quantum Effects on the Electronic Structure of Water and Ice
Authors:
Margaret Berrens,
Arpan Kundu,
Marcos F. Calegari Andrade,
Tuan Anh Pham,
Giulia Galli,
Davide Donadio
Abstract:
The electronic properties and optical response of ice and water are intricately shaped by their molecular structure, including the quantum mechanical nature of hydrogen atoms. Despite numerous studies over the decades, a comprehensive understanding of the effect of the nuclear quantum motion on the electronic structure of water and ice at finite temperatures remains elusive. Here, we utilize molecular simulations that harness the efficiency of machine-learning potentials and many-body perturbation theory to assess the impact of nuclear quantum effects on the electronic structure of water and hexagonal ice. By comparing the results of path-integral and classical simulations, we find that including nuclear quantum effects leads to a larger renormalization of the fundamental gap of ice, compared to that of water, eventually leading to a comparable gap in the two systems, consistent with experimental estimates. Our calculations suggest that the quantum fluctuations responsible for an increased delocalization of protons in ice, relative to water, are a key factor leading to the enhancement of nuclear quantum effects on the electronic structure of ice.
Submitted 9 May, 2024;
originally announced May 2024.
-
Josephson junction of minimally twisted bilayer graphene
Authors:
Ritajit Kundu,
Arijit Kundu
Abstract:
We theoretically investigate the transport properties of Josephson junctions composed of superconductor/minimally twisted bilayer graphene/superconductor structures. In the presence of an out-of-plane electric field, the low energy physics is best described by a network of chiral domain-wall states. Depending on system parameters, they lead to the emergence of zig-zag or pseudo-Landau level modes with distinct transport characteristics. Specifically, we find zig-zag modes feature linear dispersion of Andreev bound states, resulting in a $4π$-periodic Josephson current. In contrast, pseudo-Landau level modes exhibit flat Andreev bound states and, consequently, a vanishing bulk Josephson current. Interestingly, edge states can give rise to $4π$-periodic Josephson response in the pseudo-Landau level regime. We also discuss experimental signatures of such responses.
Submitted 6 May, 2024;
originally announced May 2024.
-
Tuning irreversibility of mesoscopic processes using hydrodynamic interactions
Authors:
Biswajit Das,
Sreekanth K Manikandan,
Shuvojit Paul,
Avijit Kundu,
Supriya Krishnamurthy,
Ayan Banerjee
Abstract:
Optically confined colloidal particles, when placed in close proximity, form a dissipatively coupled system through hydrodynamic interactions. Here, we demonstrate that these interactions can be harnessed to design systems with non-trivial and highly tunable non-equilibrium characteristics, directly quantifiable from experimental data. Furthermore, we clarify that such interactions do not modify the underlying potential energy function, nor do they violate the energy balance at the level of individual trajectories, as was believed earlier. Moreover, they offer new opportunities for tailored control and design of mesoscale systems with emergent and targeted nonequilibrium properties.
Submitted 17 June, 2024; v1 submitted 1 May, 2024;
originally announced May 2024.
-
Spin-dependent $π$$π^{\ast}$ gap in graphene on a magnetic substrate
Authors:
P. M. Sheverdyaeva,
G. Bihlmayer,
E. Cappelluti,
D. Pacilé,
F. Mazzola,
N. Atodiresei,
M. Jugovac,
I. Grimaldi,
G. Contini,
A. K. Kundu,
I. Vobornik,
J. Fujii,
P. Moras,
C. Carbone,
L. Ferrari
Abstract:
We present a detailed analysis of the electronic properties of graphene/Eu/Ni(111). By using angle and spin-resolved photoemission spectroscopy and ab initio calculations, we show that the Eu-intercalation of graphene/Ni(111) restores the nearly freestanding dispersion of the $ππ^\ast$ Dirac cones at the K point with an additional lifting of the spin degeneracy due to the mixing of graphene and Eu states. The interaction with the magnetic substrate results in a large spin-dependent gap in the Dirac cones with a topological nature characterized by a large Berry curvature, and a spin-polarized van Hove singularity, whose closeness to the Fermi level gives rise to a polaronic band.
Submitted 27 April, 2024;
originally announced April 2024.
-
Giant Rashba-Splitting of One-Dimensional Metallic States in Bi Dimer Lines on InAs(100)
Authors:
Polina M. Sheverdyaeva,
Gustav Bihlmayer,
Silvio Modesti,
Vitaliy Feyer,
Matteo Jugovac,
Giovanni Zamborlini,
Christian Tusche,
Ying-Jiun Chen,
Xin Liang Tan,
Kenta Hagiwara,
Luca Petaccia,
Sangeeta Thakur,
Asish K. Kundu,
Carlo Carbone,
Paolo Moras
Abstract:
Bismuth produces different types of ordered superstructures on the InAs(100) surface, depending on the growth procedure and coverage. The (2x1) phase forms at completion of a Bi monolayer and consists of a uniformly oriented array of parallel lines of Bi dimers. Scanning tunneling and core level spectroscopies demonstrate its metallic character, in contrast with the semiconducting properties expected on the basis of the electron counting principle. The weak electronic coupling among neighboring lines gives rise to quasi one-dimensional Bi-derived bands with open contours at the Fermi level. Spin- and angle-resolved photoelectron spectroscopy reveals a giant Rashba splitting of these bands, in good agreement with ab-initio electronic structure calculations. The very high density of the dimer lines, the metallic and quasi one-dimensional band dispersion and the Rashba-like spin texture make the Bi/InAs(100)-(2x1) phase an intriguing system, where novel transport regimes can be studied.
Submitted 20 April, 2024;
originally announced April 2024.
-
Gaga: Group Any Gaussians via 3D-aware Memory Bank
Authors:
Weijie Lyu,
Xueting Li,
Abhijit Kundu,
Yi-Hsuan Tsai,
Ming-Hsuan Yang
Abstract:
We introduce Gaga, a framework that reconstructs and segments open-world 3D scenes by leveraging inconsistent 2D masks predicted by zero-shot segmentation models. Contrasted to prior 3D scene segmentation approaches that heavily rely on video object tracking, Gaga utilizes spatial information and effectively associates object masks across diverse camera poses. By eliminating the assumption of continuous view changes in training images, Gaga demonstrates robustness to variations in camera poses, particularly beneficial for sparsely sampled images, ensuring precise mask label consistency. Furthermore, Gaga accommodates 2D segmentation masks from diverse sources and demonstrates robust performance with different open-world zero-shot segmentation models, enhancing its versatility. Extensive qualitative and quantitative evaluations demonstrate that Gaga performs favorably against state-of-the-art methods, emphasizing its potential for real-world applications such as scene understanding and manipulation.
Submitted 11 April, 2024;
originally announced April 2024.
-
Gersten's Injectivity for Smooth Algebras over Valuation Rings
Authors:
Arnab Kundu
Abstract:
Gersten's injectivity conjecture for a functor $F$ of ``motivic type'', predicts that given a semilocal, ``non-singular'', integral domain $R$ with a fraction field $K$, the restriction morphism induces an injection of $F(R)$ inside $F(K)$. We prove two new cases of this conjecture for smooth algebras over valuation rings. Namely, we show that the higher algebraic $K$-groups of a semilocal, integral domain that is an essentially smooth algebra over an equicharacteristic valuation ring inject inside the same of its fraction field. Secondly, we show that Gersten's injectivity is true for smooth algebras over, possibly of mixed-characteristic, valuation rings in the case of torsors under tori and also in the case of the Brauer group.
Submitted 9 April, 2024;
originally announced April 2024.
-
A quantum information theoretic analysis of reinforcement learning-assisted quantum architecture search
Authors:
Abhishek Sadhu,
Aritra Sarkar,
Akash Kundu
Abstract:
In the field of quantum computing, variational quantum algorithms (VQAs) represent a pivotal category of quantum solutions across a broad spectrum of applications. These algorithms demonstrate significant potential for realising quantum computational advantage. A fundamental aspect of VQAs involves formulating expressive and efficient quantum circuits (namely ansatz); automating the search for such ansatz is known as quantum architecture search (QAS). RL-QAS involves optimising QAS using reinforcement learning techniques. This study investigates RL-QAS for crafting ansatzes tailored to the variational quantum state diagonalisation problem. Our investigation includes a comprehensive analysis of various dimensions, such as the entanglement thresholds of the resultant states, the impact of initial conditions on the performance of the RL-agent, the phase change behaviour of correlation in concurrence bounds, and the discrete contributions of qubits in deducing eigenvalues through conditional entropy metrics. We leverage these insights to devise entanglement-guided admissible ansatz in QAS to diagonalise random quantum states using optimal resources. Furthermore, the methodologies presented herein offer a generalised framework for constructing reward functions within RL-QAS applicable to variational quantum algorithms.
Submitted 15 April, 2024; v1 submitted 9 April, 2024;
originally announced April 2024.
-
Stabilizing switched nonlinear systems under restricted but arbitrary switching signals
Authors:
Atreyee Kundu
Abstract:
This paper deals with input/output-to-state stability (IOSS) of switched nonlinear systems whose switching signals obey pre-specified restrictions on admissible switches between the subsystems and admissible dwell times on the subsystems. We present sufficient conditions on the subsystems, admissible switches between them and admissible dwell times on them, such that a switched system generated under all switching signals obeying the given restrictions is IOSS. Multiple Lyapunov-like functions and graph theory are the key apparatuses for our analysis. A numerical example is presented to demonstrate our results.
Submitted 3 April, 2024;
originally announced April 2024.
-
Efficiently Distilling LLMs for Edge Applications
Authors:
Achintya Kundu,
Fabian Lim,
Aaron Chew,
Laura Wynter,
Penny Chong,
Rhui Dih Lee
Abstract:
Supernet training of LLMs is of great interest in industrial applications as it confers the ability to produce a palette of smaller models at constant cost, regardless of the number of models (of different size / latency) produced. We propose a new method called Multistage Low-rank Fine-tuning of Super-transformers (MLFS) for parameter-efficient supernet training. We show that it is possible to obtain high-quality encoder models that are suitable for commercial edge applications, and that while decoder-only models are resistant to a comparable degree of compression, decoders can be effectively sliced for a significant reduction in training time.
Submitted 1 April, 2024;
originally announced April 2024.
-
Design, Fabrication and Evaluation of a Stretchable High-Density Electromyography Array
Authors:
Rejin John Varghese,
Matteo Pizzi,
Aritra Kundu,
Agnese Grison,
Etienne Burdet,
Dario Farina
Abstract:
The adoption of high-density electrode systems for human-machine interfaces in real-life applications has been impeded by practical and technical challenges, including noise interference, motion artifacts and the lack of compact electrode interfaces. To overcome some of these challenges, we introduce a wearable and stretchable electromyography (EMG) array, and present its design, fabrication methodology, characterisation, and comprehensive evaluation. Our proposed solution comprises dry-electrodes on flexible printed circuit board (PCB) substrates, eliminating the need for time-consuming skin preparation. The proposed fabrication method allows the manufacturing of stretchable sleeves, with consistent and standardised coverage across subjects. We thoroughly tested our developed prototype, evaluating its potential for application in both research and real-world environments. The results of our study showed that the developed stretchable array matches or outperforms traditional EMG grids and holds promise in furthering the real-world translation of high-density EMG for human-machine interfaces.
Submitted 29 March, 2024;
originally announced March 2024.
-
RAW: A Robust and Agile Plug-and-Play Watermark Framework for AI-Generated Images with Provable Guarantees
Authors:
Xun Xian,
Ganghua Wang,
Xuan Bi,
Jayanth Srinivasa,
Ashish Kundu,
Mingyi Hong,
Jie Ding
Abstract:
Safeguarding intellectual property and preventing potential misuse of AI-generated images are of paramount importance. This paper introduces a robust and agile plug-and-play watermark detection framework, dubbed as RAW. As a departure from traditional encoder-decoder methods, which incorporate fixed binary codes as watermarks within latent representations, our approach introduces learnable watermarks directly into the original image data. Subsequently, we employ a classifier that is jointly trained with the watermark to detect the presence of the watermark. The proposed framework is compatible with various generative architectures and supports on-the-fly watermark injection after training. By incorporating state-of-the-art smoothing techniques, we show that the framework provides provable guarantees regarding the false positive rate for misclassifying a watermarked image, even in the presence of certain adversarial attacks targeting watermark removal. Experiments on a diverse range of images generated by state-of-the-art diffusion models reveal substantial performance enhancements compared to existing approaches. For instance, our method demonstrates a notable increase in AUROC, from 0.48 to 0.82, when compared to state-of-the-art approaches in detecting watermarked images under adversarial attacks, while maintaining image quality, as indicated by closely aligned FID and CLIP scores.
Submitted 23 January, 2024;
originally announced March 2024.
-
Full counting statistics of 1d short-range Riesz gases in confinement
Authors:
Jitendra Kethepalli,
Manas Kulkarni,
Anupam Kundu,
Satya N. Majumdar,
David Mukamel,
Grégory Schehr
Abstract:
We investigate the full counting statistics (FCS) of a harmonically confined 1d short-range Riesz gas consisting of $N$ particles in equilibrium at finite temperature. The particles interact with each other through a repulsive power-law interaction with an exponent $k>1$ which includes the Calogero-Moser model for $k=2$. We examine the probability distribution of the number of particles in a finite domain $[-W, W]$ called number distribution, denoted by $\mathcal{N}(W, N)$. We analyze the probability distribution of $\mathcal{N}(W, N)$ and show that it exhibits a large deviation form for large $N$ characterised by a speed $N^{\frac{3k+2}{k+2}}$ and by a large deviation function of the fraction $c = \mathcal{N}(W, N)/N$ of the particles inside the domain and $W$. We show that the density profiles that create the large deviations display interesting shape transitions as one varies $c$ and $W$. This is manifested by a third-order phase transition exhibited by the large deviation function that has discontinuous third derivatives. Monte-Carlo (MC) simulations show good agreement with our analytical expressions for the corresponding density profiles. We find that the typical fluctuations of $\mathcal{N}(W, N)$, obtained from our field theoretic calculations are Gaussian distributed with a variance that scales as $N^{ν_k}$, with $ν_k = (2-k)/(2+k)$. We also present some numerical findings on the mean and the variance. Furthermore, we adapt our formalism to study the index distribution (where the domain is semi-infinite, $(-\infty, W]$), linear statistics (the variance), thermodynamic pressure and bulk modulus.
Submitted 27 March, 2024;
originally announced March 2024.
-
The Flying Sidekick Traveling Salesman Problem with Multiple Drops: A Simple and Effective Heuristic Approach
Authors:
Sarah K. Schaumann,
Abhishake Kundu,
Juan C. Pina-Pardo,
Matthias Winkenbach,
Ricardo A. Gatica,
Stephan M. Wagner,
Timothy I. Matis
Abstract:
We study the Flying Sidekick Traveling Salesman Problem with Multiple Drops (FSTSP-MD), a multi-modal last-mile delivery model where a single truck and a single drone cooperatively deliver customer packages. In the FSTSP-MD, the drone can be launched from the truck to deliver multiple packages before it returns to the truck for a new delivery operation. The FSTSP-MD aims to find the synchronized truck and drone delivery routes that minimize the completion time of the delivery process. We develop a simple and effective heuristic approach based on an order-first, split-second scheme. This heuristic combines standard local search and diversification techniques with a novel shortest-path problem that finds FSTSP-MD solutions in polynomial time. We show that our heuristic consistently outperforms state-of-the-art heuristics developed for the FSTSP-MD and the FSTSP (i.e., the single-drop case) through extensive numerical experiments. We also show that the FSTSP-MD substantially reduces completion times compared to a traditional truck-only delivery system. Several managerial insights are described regarding the effects of drone capacity, drone speed, drone flight endurance, and customer distribution.
Submitted 26 March, 2024;
originally announced March 2024.
-
Unveiling extended gamma-ray emission around HESS J1813-178
Authors:
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
A. Baktash,
V. Barbosa Martins,
J. Barnard,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
M. Bouyahiaoui,
M. Breuhaus,
R. Brose,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin
, et al. (126 additional authors not shown)
Abstract:
HESS J1813$-$178 is a very-high-energy $γ$-ray source spatially coincident with the young and energetic pulsar PSR J1813$-$1749 and thought to be associated with its pulsar wind nebula (PWN). Recently, evidence for extended high-energy emission in the vicinity of the pulsar has been revealed in the Fermi Large Area Telescope (LAT) data. This motivates revisiting the HESS J1813$-$178 region, taking advantage of improved analysis methods and an extended data set. Using data taken by the High Energy Stereoscopic System (H.E.S.S.) experiment and the Fermi-LAT, we aim to describe the $γ$-ray emission in the region with a consistent model, to provide insights into its origin. We performed a likelihood-based analysis on 32 hours of H.E.S.S. data and 12 years of Fermi-LAT data and fit a spectro-morphological model to the combined datasets. These results allowed us to develop a physical model for the origin of the observed $γ$-ray emission in the region. In addition to the compact very-high-energy $γ$-ray emission centered on the pulsar, we find a significant yet previously undetected component along the Galactic plane. With Fermi-LAT data, we confirm extended high-energy emission consistent with the position and elongation of the extended emission observed with H.E.S.S. These results establish a consistent description of the emission in the region from GeV energies to several tens of TeV. This study suggests that HESS J1813$-$178 is associated with a $γ$-ray PWN powered by PSR J1813$-$1749. A possible origin of the extended emission component is inverse Compton emission from electrons and positrons that have escaped the confines of the pulsar and form a halo around the PWN.
Submitted 25 March, 2024;
originally announced March 2024.
-
Spectrum and extension of the inverse-Compton emission of the Crab Nebula from a combined Fermi-LAT and H.E.S.S. analysis
Authors:
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
A. Baktash,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
F. Bradascio,
M. Breuhaus,
R. Brose,
A. Brown,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin
, et al. (137 additional authors not shown)
Abstract:
The Crab Nebula is a unique laboratory for studying the acceleration of electrons and positrons through their non-thermal radiation. Observations of very-high-energy $γ$ rays from the Crab Nebula have provided important constraints for modelling its broadband emission. We present the first fully self-consistent analysis of the Crab Nebula's $γ$-ray emission between 1 GeV and $\sim$100 TeV, that is, over five orders of magnitude in energy. Using the open-source software package Gammapy, we combined 11.4 yr of data from the Fermi Large Area Telescope and 80 h of High Energy Stereoscopic System (H.E.S.S.) data at the event level and provide a measurement of the spatial extension of the nebula and its energy spectrum. We find evidence for a shrinking of the nebula with increasing $γ$-ray energy. Furthermore, we fitted several phenomenological models to the measured data, finding that none of them can fully describe the spatial extension and the spectral energy distribution at the same time. Especially the extension measured at TeV energies appears too large when compared to the X-ray emission. Our measurements probe the structure of the magnetic field between the pulsar wind termination shock and the dust torus, and we conclude that the magnetic field strength decreases with increasing distance from the pulsar. We complement our study with a careful assessment of systematic uncertainties.
Submitted 21 March, 2024; v1 submitted 19 March, 2024;
originally announced March 2024.
-
An Extreme Ultra-Compact X-ray Binary in a Globular Cluster: Multi-Wavelength Observations of RZ2109 Explored in a Triple System Framework
Authors:
Kristen C. Dage,
Arash Bahramian,
Smadar Naoz,
Alexey Bobrick,
Wasundara Athukoralalage,
McKinley C. Brumback,
Daryl Haggard,
Arunav Kundu,
Stephen E. Zepf
Abstract:
The globular cluster ultraluminous X-ray source, RZ2109, is a complex and unique system which has been detected at X-ray, ultra-violet, and optical wavelengths. Based on almost 20 years of Chandra and XMM-Newton observations, the X-ray luminosity exhibits order-of-magnitude variability, with the peak flux lasting on the order of a few hours. We perform robust time series analysis on the archival X-ray observations and find that this variability is periodic on a timescale of 1.3 $\pm 0.04$ days. The source also demonstrates broad [OIII] 5007 Angstrom emission, which has been observed since 2004, suggesting a white dwarf donor and therefore an ultra-compact X-ray binary. We present new spectra from 2020 and 2022, marking eighteen years of observed [OIII] emission from this source. Meanwhile, we find that the globular cluster counterpart is unusually bright in the NUV/UVW2 band. Finally, we discuss RZ2109 in the context of the eccentric Kozai Lidov mechanism and show that the observed 1.3 day periodicity can be used to place constraints on the tertiary configuration, ranging from 20 minutes (for a 0.1 ${\rm M}_\odot$ companion) to approximately 95 minutes (for a 1 ${\rm M}_\odot$ companion), if the eccentric Kozai Lidov mechanism is at the origin of the periodic variability.
Submitted 14 March, 2024;
originally announced March 2024.
-
Transfer Learning for Security: Challenges and Future Directions
Authors:
Adrian Shuai Li,
Arun Iyengar,
Ashish Kundu,
Elisa Bertino
Abstract:
Many machine learning and data mining algorithms rely on the assumption that the training and testing data share the same feature space and distribution. However, this assumption may not always hold. For instance, there are situations where we need to classify data in one domain, but we only have sufficient training data available from a different domain. The latter data may follow a distinct distribution. In such cases, successfully transferring knowledge across domains can significantly improve learning performance and reduce the need for extensive data labeling efforts. Transfer learning (TL) has thus emerged as a promising framework to tackle this challenge, particularly in security-related tasks. This paper aims to review the current advancements in utilizing TL techniques for security. The paper includes a discussion of the existing research gaps in applying TL in the security domain, as well as exploring potential future research directions and issues that arise in the context of TL-assisted security solutions.
Submitted 1 March, 2024;
originally announced March 2024.
-
Reinforcement learning-assisted quantum architecture search for variational quantum algorithms
Authors:
Akash Kundu
Abstract:
A significant hurdle in the noisy intermediate-scale quantum (NISQ) era is identifying functional quantum circuits. These circuits must also adhere to the constraints imposed by current quantum hardware limitations. Variational quantum algorithms (VQAs), a class of quantum-classical optimization algorithms, were developed to address these challenges in the currently available quantum devices. However, the overall performance of VQAs depends on the initialization strategy of the variational circuit, the structure of the circuit (also known as ansatz), and the configuration of the cost function. Focusing on the structure of the circuit, in this thesis, we improve the performance of VQAs by automating the search for an optimal structure for the variational circuits using reinforcement learning (RL). Within the thesis, the optimality of a circuit is determined by evaluating its depth, the overall count of gates and parameters, and its accuracy in solving the given problem. The task of automating the search for optimal quantum circuits is known as quantum architecture search (QAS). The majority of research in QAS is primarily focused on a noiseless scenario, and the impact of noise on QAS remains inadequately explored. In this thesis, we tackle the issue by introducing a tensor-based quantum circuit encoding, restrictions on environment dynamics to explore the search space of possible circuits efficiently, an episode halting scheme to steer the agent to find shorter circuits, and a double deep Q-network (DDQN) with an $ε$-greedy policy for better stability. The numerical experiments on noiseless and noisy quantum hardware show that in dealing with various VQAs, our RL-based QAS outperforms existing QAS. Meanwhile, the methods we propose in the thesis can be readily adapted to address a wide range of other VQAs.
Submitted 7 March, 2024; v1 submitted 21 February, 2024;
originally announced February 2024.
-
Curvature in the very-high energy gamma-ray spectrum of M87
Authors:
H. E. S. S. Collaboration,
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
F. Bradascio,
R. Brose,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin,
T. Bylund,
S. Casanova,
R. Cecil,
J. Celic,
M. Cerruti
, et al. (110 additional authors not shown)
Abstract:
The radio galaxy M87 is a variable very-high energy (VHE) gamma-ray source, exhibiting three major flares reported in 2005, 2008, and 2010. Despite extensive studies, the origin of the VHE gamma-ray emission is yet to be understood. In this study, we investigate the VHE gamma-ray spectrum of M87 during states of high gamma-ray activity, utilizing 20.2$\,$hours of H.E.S.S. observations. Our findings indicate a preference for a curved spectrum, characterized by a log-parabola model with extra-galactic background light (EBL) model above 0.3$\,$TeV at the 4$σ$ level, compared to a power-law spectrum with EBL. We investigate the degeneracy between the absorption feature and the EBL normalization and derive upper limits on EBL models mainly sensitive in the wavelength range 12.4$\,$$μ$m - 40$\,$$μ$m.
Submitted 25 April, 2024; v1 submitted 20 February, 2024;
originally announced February 2024.
-
Sign of the $hZZ$ coupling and implication for new physics
Authors:
Dipankar Das,
Anirban Kundu,
Miguel Levy,
Anugrah M. Prasad,
Ipsita Saha,
Agnivo Sarkar
Abstract:
The magnitudes of the couplings of the scalar resonance at 125 GeV with the SM particles are found to be consistent with those of the SM Higgs boson. However, the signs are not experimentally determined in most of the cases, a prime example being that with the $Z$-boson pair. In other words, $κ_Z^h$, the ratio of the couplings of the actual 125 GeV resonance with $ZZ$ and that of the SM Higgs boson with the same, is consistent with both $+1$ and $-1$, the latter being the `wrong-sign'. We argue that the wrong-sign $hZZ$ coupling will necessitate the intervention of new physics below $\mathcal{O}\left(620\right)$ GeV to safeguard the underlying theory from unitarity violation. The strength of the new nonstandard couplings can be derived from the unitarity sum rules, which are comparable to the SM-Higgs couplings in magnitude. Thus the strong limits from the direct searches at the LHC can help us rule out the existence of such nonstandard particles with unusually large couplings thereby disfavoring the possibility of a wrong-sign $hZZ$ coupling.
Submitted 14 February, 2024;
originally announced February 2024.
-
Stochastic Comparisons of Random Extremes from non-identical Random Variables
Authors:
Amarjit Kundu,
Shovan Chowdhury,
Bidhan Modok
Abstract:
We propose some new results on the comparison of the minimum or maximum order statistic from a random number of non-identical random variables. Under the non-identical set-up, with certain conditions, we prove that the random minimum (maximum) of one system dominates that of the other in the hazard rate (reversed hazard rate) order. Further, we prove the variation diminishing property (Karlin [8]) for all possible restrictions to derive the new results.
Submitted 7 March, 2024; v1 submitted 14 February, 2024;
originally announced February 2024.
-
Universal distribution of the number of minima for random walks and Lévy flights
Authors:
Anupam Kundu,
Satya N. Majumdar,
Gregory Schehr
Abstract:
We compute exactly the full distribution of the number $m$ of local minima in a one-dimensional landscape generated by a random walk or a Lévy flight. We consider two different ensembles of landscapes, one with a fixed number of steps $N$ and the other running until the first-passage time of the random walk to the origin. We show that the distribution of $m$ is drastically different in the two ensembles (Gaussian in the former case, while having a power-law tail $m^{-3/2}$ in the latter case). However, the most striking aspect of our results is that, in each case, the distribution is completely universal for all $m$ (and not just for large $m$), i.e., independent of the jump distribution in the random walk. This means that the distributions are exactly identical for Lévy flights and random walks with finite jump variance. Our analytical results are in excellent agreement with our numerical simulations.
Submitted 14 February, 2024; v1 submitted 6 February, 2024;
originally announced February 2024.
-
Curriculum reinforcement learning for quantum architecture search under hardware errors
Authors:
Yash J. Patel,
Akash Kundu,
Mateusz Ostaszewski,
Xavier Bonet-Monroig,
Vedran Dunjko,
Onur Danaci
Abstract:
The key challenge in the noisy intermediate-scale quantum era is finding useful circuits compatible with current device limitations. Variational quantum algorithms (VQAs) offer a potential solution by fixing the circuit architecture and optimizing individual gate parameters in an external loop. However, parameter optimization can become intractable, and the overall performance of the algorithm depends heavily on the initially chosen circuit architecture. Several quantum architecture search (QAS) algorithms have been developed to design useful circuit architectures automatically. In the case of parameter optimization alone, noise effects have been observed to dramatically influence the performance of the optimizer and final outcomes, which is a key line of study. However, the effects of noise on the architecture search, which could be just as critical, are poorly understood. This work addresses this gap by introducing a curriculum-based reinforcement learning QAS (CRLQAS) algorithm designed to tackle challenges in realistic VQA deployment. The algorithm incorporates (i) a 3D architecture encoding and restrictions on environment dynamics to explore the search space of possible circuits efficiently, (ii) an episode halting scheme to steer the agent to find shorter circuits, and (iii) a novel variant of simultaneous perturbation stochastic approximation as an optimizer for faster convergence. To facilitate studies, we developed an optimized simulator for our algorithm, significantly improving computational efficiency in simulating noisy quantum circuits by employing the Pauli-transfer matrix formalism in the Pauli-Liouville basis. Numerical experiments focusing on quantum chemistry tasks demonstrate that CRLQAS outperforms existing QAS algorithms across several metrics in both noiseless and noisy environments.
Submitted 5 February, 2024;
originally announced February 2024.
-
Second-order charge and spin transport in LaO/STO system in the presence of cubic Rashba spin orbit couplings
Authors:
Zhuo Bin Siu,
Anirban Kundu,
Mansoor B. A. Jalil
Abstract:
Certain non-centrosymmetric materials with broken time-reversal symmetry may exhibit non-reciprocal transport behavior under an applied electric field in which the charge and spin currents contain components that are second order in the electric field. In this study, we investigate the second-order spin accumulation and charge and spin responses in the LaAlO$_3$/SrTiO$_3$ (LaO/STO) system with magnetic dopants under the influence of linear and cubic Rashba spin-orbit coupling (RSOC) terms. We explain the physical origin of the second-order response and perform a symmetry analysis of the first and second-order responses for different dopant magnetization directions relative to the applied electric field. We then numerically solve the Boltzmann transport equation by extending the approach of Schliemann and Loss [Phys. Rev. B 68, 165311] to higher orders in the electric field. We show that the sign of the second-order responses can be switched by varying the magnetization direction of the magnetic dopants or relative strengths of the two cubic RSOC terms and explain these trends by considering the Fermi surfaces of the respective systems. These findings provide insights into the interplay of multiple SOC effects in a LaO/STO system and how the resulting first- and second-order charge and spin responses can be engineered by exploiting the symmetries of the system.
Submitted 4 February, 2024;
originally announced February 2024.
-
Trust and ethical considerations in a multi-modal, explainable AI-driven chatbot tutoring system: The case of collaboratively solving Rubik's Cube
Authors:
Kausik Lakkaraju,
Vedant Khandelwal,
Biplav Srivastava,
Forest Agostinelli,
Hengtao Tang,
Prathamjeet Singh,
Dezhi Wu,
Matt Irvin,
Ashish Kundu
Abstract:
Artificial intelligence (AI) has the potential to transform education with its power of uncovering insights from massive data about student learning patterns. However, ethical and trustworthiness concerns about AI have been raised but remain unresolved. Prominent ethical issues in high school AI education include data privacy, information leakage, abusive language, and fairness. This paper describes technological components that were built to address ethical and trustworthiness concerns in a multi-modal collaborative platform (called ALLURE chatbot) for high school students to collaborate with AI to solve the Rubik's cube. In data privacy, we want to ensure that the informed consent of children, parents, and teachers is at the center of any data that is managed. Since children are involved, we want to ensure that language, whether textual, audio, or visual, is acceptable from both users and the AI, and that the system can steer interaction away from dangerous situations. In information management, we also want to ensure that the system, while learning to improve over time, does not leak information about users from one group to another.
Submitted 30 January, 2024;
originally announced February 2024.
-
Acceleration and transport of relativistic electrons in the jets of the microquasar SS 433
Authors:
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
M. Bouyahiaoui,
M. Breuhaus,
R. Brose,
A. M. Brown,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin,
S. Caroff
, et al. (140 additional authors not shown)
Abstract:
SS 433 is a microquasar, a stellar binary system with collimated relativistic jets. We observed SS 433 in gamma rays using the High Energy Stereoscopic System (H.E.S.S.), finding an energy-dependent shift in the apparent position of the gamma-ray emission of the parsec-scale jets. These observations trace the energetic electron population and indicate the gamma rays are produced by inverse-Compton scattering. Modelling of the energy-dependent gamma-ray morphology constrains the location of particle acceleration and requires an abrupt deceleration of the jet flow. We infer the presence of shocks on either side of the binary system at distances of 25 to 30 parsecs and conclude that self-collimation of the precessing jets forms the shocks, which then efficiently accelerate electrons.
Submitted 29 January, 2024;
originally announced January 2024.