-
Centrality dependence of Lévy-stable two-pion Bose-Einstein correlations in $\sqrt{s_{_{NN}}}=200$ GeV Au$+$Au collisions
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
H. Al-Ta'ani,
J. Alexander,
A. Angerami,
K. Aoki,
N. Apadula,
Y. Aramaki,
H. Asano,
E. C. Aschenauer,
E. T. Atomssa,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
B. Bannier,
K. N. Barish,
B. Bassalleck,
S. Bathe
, et al. (377 additional authors not shown)
Abstract:
The PHENIX experiment measured the centrality dependence of two-pion Bose-Einstein correlation functions in $\sqrt{s_{_{NN}}}=200$~GeV Au$+$Au collisions at the Relativistic Heavy Ion Collider at Brookhaven National Laboratory. The data are well represented by Lévy-stable source distributions. The extracted source parameters are the correlation-strength parameter $λ$, the Lévy index of stability $α$, and the Lévy-scale parameter $R$ as a function of transverse mass $m_T$ and centrality. The $λ(m_T)$ parameter is constant at larger values of $m_T$, but decreases as $m_T$ decreases. The Lévy scale parameter $R(m_T)$ decreases with $m_T$ and exhibits proportionality to the length scale of the nuclear overlap region. The Lévy exponent $α(m_T)$ is independent of $m_T$ within uncertainties in each investigated centrality bin, but shows a clear centrality dependence. At all centralities, the Lévy exponent $α$ is significantly different from that of Gaussian ($α=2$) or Cauchy ($α=1$) source distributions. Comparisons to the predictions of Monte-Carlo simulations of resonance-decay chains show that in all but the most peripheral centrality class (50%-60%), the obtained results are inconsistent with the measurements, unless a significant reduction of the in-medium mass of the $η'$ meson is included. In each centrality class, the best value of the in-medium $η'$ mass is compared to the mass of the $η$ meson, as well as to several theoretical predictions that consider restoration of $U_A(1)$ symmetry in hot hadronic matter.
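For orientation, Lévy-stable source fits of this kind are commonly written, when final-state (Coulomb) corrections are neglected, as $C(q) = 1 + λ\,\exp[-(qR/\hbar c)^{α}]$, which reduces to a Gaussian shape for $α=2$ and an exponential shape for $α=1$. The sketch below evaluates only this simplified form. The function name, argument names, and the numerical values in the usage note are illustrative assumptions, not the collaboration's fitting code or results.

```python
import math

HBARC_GEV_FM = 0.1973269804  # hbar*c in GeV*fm; makes q*R dimensionless


def levy_correlation(q_gev, lam, r_fm, alpha):
    """Simplified Levy-type correlation: 1 + lam * exp(-(q*R/hbar c)**alpha).

    q_gev : relative momentum in GeV/c (illustrative naming)
    lam   : correlation-strength parameter lambda
    r_fm  : Levy scale parameter R in fm
    alpha : Levy index of stability, expected in (0, 2]
    """
    if not 0.0 < alpha <= 2.0:
        raise ValueError("Levy index alpha must lie in (0, 2]")
    x = abs(q_gev) * r_fm / HBARC_GEV_FM
    return 1.0 + lam * math.exp(-(x ** alpha))
```

With made-up inputs, `levy_correlation(0.0, 0.8, 8.0, 1.2)` returns `1 + λ = 1.8` at zero relative momentum, and the function falls monotonically towards 1 as `q_gev` grows.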
Submitted 11 July, 2024;
originally announced July 2024.
-
XANE Background Acoustic Embeddings: Ablation and Clustering Analysis
Authors:
Dushyant Sharma,
James Fosburgh,
Sri Harsha Dumpala,
Chandramouli Shama Sastri,
Stanislav Yu. Kruchinin,
Patrick A. Naylor
Abstract:
We explore the recently proposed explainable acoustic neural embedding (XANE) system, which models the background acoustics of a speech signal in a non-intrusive manner. The XANE embeddings are used to estimate specific parameters related to the background acoustic properties of the signal, which allows the embeddings to be explainable in terms of those parameters. We perform ablation studies on the XANE system and show that estimating all acoustic parameters jointly has an overall positive effect. Furthermore, we illustrate the value of XANE embeddings by performing clustering experiments on unseen test data and show that the proposed embeddings achieve a mean F1 score of 92% for three different tasks, significantly outperforming the WavLM-based signal embeddings, and are complementary to speaker embeddings.
Submitted 8 July, 2024;
originally announced July 2024.
-
IMC 2024 Methods & Solutions Review
Authors:
Shyam Gupta,
Dhanisha Sharma,
Songling Huang
Abstract:
For the past three years, Kaggle has been hosting the Image Matching Challenge, which focuses on solving a 3D image reconstruction problem using a collection of 2D images. Each year, this competition fosters the development of innovative and effective methodologies by its participants. In this paper, we introduce an advanced ensemble technique that we developed, achieving a score of 0.153449 on the private leaderboard and securing the 160th position out of over 1,000 participants. Additionally, we conduct a comprehensive review of existing methods and techniques employed by top-performing teams in the competition. Our solution, alongside the insights gathered from other leading approaches, contributes to the ongoing advancement in the field of 3D image reconstruction. This research provides valuable knowledge for future participants and researchers aiming to excel in similar image matching and reconstruction challenges.
Submitted 3 July, 2024;
originally announced July 2024.
-
AutoSplat: Constrained Gaussian Splatting for Autonomous Driving Scene Reconstruction
Authors:
Mustafa Khan,
Hamidreza Fazlali,
Dhruv Sharma,
Tongtong Cao,
Dongfeng Bai,
Yuan Ren,
Bingbing Liu
Abstract:
Realistic scene reconstruction and view synthesis are essential for advancing autonomous driving systems by simulating safety-critical scenarios. 3D Gaussian Splatting excels in real-time rendering and static scene reconstructions but struggles with modeling driving scenarios due to complex backgrounds, dynamic objects, and sparse views. We propose AutoSplat, a framework employing Gaussian splatting to achieve highly realistic reconstructions of autonomous driving scenes. By imposing geometric constraints on Gaussians representing the road and sky regions, our method enables multi-view consistent simulation of challenging scenarios including lane changes. Leveraging 3D templates, we introduce a reflected Gaussian consistency constraint to supervise both the visible and unseen side of foreground objects. Moreover, to model the dynamic appearance of foreground objects, we estimate residual spherical harmonics for each foreground Gaussian. Extensive experiments on Pandaset and KITTI demonstrate that AutoSplat outperforms state-of-the-art methods in scene reconstruction and novel view synthesis across diverse driving scenarios. Visit our project page at https://autosplat.github.io/.
Submitted 3 July, 2024; v1 submitted 2 July, 2024;
originally announced July 2024.
-
Advantages of quantum support vector machine in cross-domain classification of quantum states
Authors:
Diksha Sharma,
Vivek Balasaheb Sabale,
Parvinder Singh,
Atul Kumar
Abstract:
In this study, we use cross-domain classification with quantum machine learning, seeking quantum advantages, to address the entanglement versus separability paradigm. We further demonstrate the efficient classification of Bell diagonal states into zero and non-zero discord classes. The inherited structure of quantum states and its relation with a particular class of quantum states are exploited to intuitively approach the classification of testing states from a different domain, referred to here as cross-domain classification. In addition, we extend our analysis to evaluate the robustness of our model for the analyzed problem using random unitary transformations. Using numerical analysis, our results clearly demonstrate the potential of QSVM for classifying quantum states across the multidimensional Hilbert space.
Submitted 30 June, 2024;
originally announced July 2024.
-
Terahertz crystal-field transitions and quasi ferromagnetic magnon excitations in a noncollinear magnet for hybrid spin-wave computation
Authors:
Gaurav Dubey,
Brijesh Singh Mehra,
Sanjeev Kumar,
Ayyappan Shyam,
Karan Datt Sharma,
Megha Vagadia,
Dhanvir Singh Rana
Abstract:
The complexity of interactions between the crystal field and the unusual non-collinear spin arrangement in non-trivial magnets demands novel tools to unravel the underlying physics. In this work, we study the interaction dynamics of crystal-field excitations (CFE) and low-energy magnetic excitations in the orthochromite TmCrO3, controlled by temperature and magnetic field, using high-resolution magneto-terahertz (THz) time-domain spectroscopy. The THz energy spectrum spanning 0.5-10 meV possesses a low-frequency spin-excitation (magnon) mode and a multitude of CFE modes at 10 K, all of which uniquely embody a range of phenomena. For the magnon mode, a temperature dependence of the peak frequency is induced by magnetic interactions between the Tm and Cr subsystems. While a change from blue- to red-shift of the peak frequency of this mode marks the magnetization reversal transition, the spin reorientation temperature and the change of magnetic anisotropy are reflected in different features of the field- and temperature-dependent peak frequency dynamics. The modes corresponding to CFE are robust and carry a multitude of sub-modes, which are attributes of non-trivial interactions across different transitions. These modes are suppressed only upon substitution of Tb3+ at the Tm3+ site, which suggests a dominant role of single-ion anisotropy in controlling the entire THz excitation spectrum. Overall, this remarkable range of phenomena seen through the unique lens of all-optical THz tools provides deeper insights into the origin of magnetic phases in systems with complex interactions between rare-earth and transition-metal ions, and provides a multitude of novel combinations of closely spaced modes for emerging hybrid spin-wave computation.
Submitted 28 June, 2024;
originally announced June 2024.
-
GATSBI: An Online GTSP-Based Algorithm for Targeted Surface Bridge Inspection and Defect Detection
Authors:
Harnaik Dhami,
Charith Reddy,
Vishnu Dutt Sharma,
Troi Williams,
Pratap Tokekar
Abstract:
We study the problem of visual surface inspection of infrastructure for defects using an Unmanned Aerial Vehicle (UAV). We do not assume that the geometric model of the infrastructure is known beforehand. Our planner, termed GATSBI, plans a path in a receding horizon fashion to inspect all points on the surface of the infrastructure. The input to GATSBI consists of a 3D occupancy map created online with 3D pointclouds. Occupied voxels corresponding to the infrastructure in this map are semantically segmented and used to create an infrastructure-only occupancy map. Inspecting an infrastructure voxel requires the UAV to take images from a desired viewing angle and distance. We then create a Generalized Traveling Salesperson Problem (GTSP) instance to cluster candidate viewpoints for inspecting the infrastructure voxels and use an off-the-shelf GTSP solver to find the optimal path for the given instance. As the algorithm sees more parts of the environment over time, it replans the path to inspect uninspected parts of the infrastructure while avoiding obstacles. We evaluate the performance of our algorithm through high-fidelity simulations conducted in AirSim and real-world experiments. We compare the performance of GATSBI with a baseline inspection algorithm where the map is known a priori. Our evaluation reveals that targeting the inspection to only the segmented infrastructure voxels and planning carefully using a GTSP solver leads to a more efficient and thorough inspection than the baseline inspection algorithm.
Submitted 24 June, 2024;
originally announced June 2024.
-
Impact of Loss Mechanisms on Linear Spectra of Excitonic and Polaritonic Aggregates
Authors:
Devansh Sharma,
Amartya Bose
Abstract:
The presence of loss mechanisms governed by empirical time-scales affects the dynamics and spectra of systems in profound ways. However, incorporating these effects and their interplay with the thermal dissipative environments interacting with the system proves to be challenging. We have recently developed the path integral Lindblad dynamics (PILD) method to combine numerically rigorous path integral simulations with Lindblad dynamics to account for such empirical loss mechanisms. In this work, we utilize the PILD method to study the absorption and circular dichroism (CD) spectra of chiral molecular aggregates and excitonic polaritons. We demonstrate that the effect of loss on particular states in both systems can differ not just on the basis of the symmetries of the state but also on the basis of complicated "interactions" of the system and the loss mechanism with the dissipative environments. We present probably the first numerical exploration of the CD spectrum of chiral molecular aggregates confined in a cavity. While the CD spectrum of the excitonic aggregates alone is not amenable to simplistic understanding like the exciton chirality (EC) rule, the CD spectrum of polaritonic molecules is even more complex. Additionally, the impact of empirical loss on the polaritonic CD spectrum seems to be highly site-dependent. The impact of a lossy cavity is qualitatively different from the impact of a molecule that leaks the excitation. We explore some of those effects in depth, leveraging the framework of path integral Lindblad dynamics.
Submitted 24 June, 2024;
originally announced June 2024.
-
Bone Fracture Classification using Transfer Learning
Authors:
Shyam Gupta,
Dhanisha Sharma
Abstract:
The manual examination of X-ray images for fractures is a time-consuming process that is prone to human error. In this work, we introduce a robust yet simple training loop for the classification of fractures, which significantly outperforms existing methods. Our method achieves superior performance in less than ten epochs and utilizes the latest dataset to deliver the best-performing model for this task. We emphasize the importance of training deep learning models responsibly and efficiently, as well as the critical role of selecting high-quality datasets.
Submitted 22 June, 2024;
originally announced June 2024.
-
Miniature fluorescence sensor for quantitative detection of brain tumour
Authors:
Jean Pierre Ndabakuranye,
James Belcourt,
Deepak Sharma,
Cathal D. O'Connell,
Victor Mondal,
Sanjay K. Srivastava,
Alastair Stacey,
Sam Long,
Bobbi Fleiss,
Arman Ahnood
Abstract:
Fluorescence-guided surgery has emerged as a vital tool for tumour resection procedures. As well as intraoperative tumour visualisation, 5-ALA-induced PpIX provides an avenue for quantitative tumour identification based on ratiometric fluorescence measurement. To this end, fluorescence imaging and fibre-based probes have enabled more precise demarcation between the cancerous and healthy tissues. These sensing approaches, which rely on collecting the fluorescence light from the tumour resection site and its remote spectral sensing, introduce challenges associated with optical losses. In this work, we demonstrate the viability of tumour detection at the resection site using a miniature fluorescence measurement system. Unlike the current bulky systems, which necessitate remote measurement, we have adopted a millimetre-sized spectral sensor chip for quantitative fluorescence measurements. A reliable measurement at the resection site requires a stable optical window between the tissue and the optoelectronic system. This is achieved using an antifouling diamond window, which provides stable optical transparency. The system achieved a sensitivity of 92.3% and specificity of 98.3% in detecting a surrogate tumour at a resolution of 1 × 1 mm². As well as addressing losses associated with collecting and coupling fluorescence light in the current remote sensing approaches, the small size of the system introduced in this work paves the way for its direct integration with the tumour resection tools with the aim of more accurate intraoperative tumour identification.
Submitted 20 June, 2024;
originally announced June 2024.
-
Jet modification via $π^0$-hadron correlations in Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
A. Adare,
S. Afanasiev,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
H. Al-Bataineh,
J. Alexander,
M. Alfred,
K. Aoki,
N. Apadula,
L. Aphecetche,
J. Asai,
H. Asano,
E. T. Atomssa,
R. Averbeck,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
G. Baksay,
L. Baksay,
A. Baldisseri
, et al. (510 additional authors not shown)
Abstract:
High-momentum two-particle correlations are a useful tool for studying jet-quenching effects in the quark-gluon plasma. Angular correlations between neutral-pion triggers and charged hadrons with transverse momenta in the range 4--12~GeV/$c$ and 0.5--7~GeV/$c$, respectively, have been measured by the PHENIX experiment in 2014 for Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV. Suppression is observed in the yield of high-momentum jet fragments opposite the trigger particle, which indicates jet suppression stemming from in-medium partonic energy loss, while enhancement is observed for low-momentum particles. The ratio and differences between the yield in Au$+$Au collisions and $p$$+$$p$ collisions, $I_{AA}$ and $Δ_{AA}$, as a function of the trigger-hadron azimuthal separation, $Δφ$, are measured for the first time at the Relativistic Heavy Ion Collider. These results better quantify how the yield of low-$p_T$ associated hadrons is enhanced at wide angle, which is crucial for studying energy loss as well as medium-response effects.
Submitted 12 June, 2024;
originally announced June 2024.
-
Study on Kelvin Helmholtz shear flows subjected to differential rotation
Authors:
Prince Kumar,
Devendra Sharma
Abstract:
A numerical simulation of Kelvin-Helmholtz Instability (KHI) in parallel shear flows subjected to external rotation is carried out using a pseudo-spectral technique. The Coriolis force, arising in a rotating frame under the beta-plane approximation, tends to suppress the growth of KHI modes. The numerical results show a close qualitative agreement with the analytical results obtained for a step-wise shear flow profile. Experimental evidence demonstrates that particles in a rotating frame experience the Coriolis force, which is mathematically equivalent to the Lorentz force. Therefore, the Coriolis force affects fluid dynamics in a manner similar to the Lorentz force in magnetized shear flows. This paper exploits the analogy between the magnetic field and rotation to study effects equivalent to a magnetic field on KHI in a rotating frame. Similar to the magnetic field case, the Coriolis force suppresses KHI and tends to form compressed and elongated KH vortex structures. However, the magnetic field and Coriolis force act on different scales, with the latter suppressing long-wavelength mode perturbations. More vortices are observed in the presence of rotation than in non-rotating cases.
Submitted 10 June, 2024;
originally announced June 2024.
-
XANE: eXplainable Acoustic Neural Embeddings
Authors:
Sri Harsha Dumpala,
Dushyant Sharma,
Chandramouli Shama Sastri,
Stanislav Kruchinin,
James Fosburgh,
Patrick A. Naylor
Abstract:
We present a novel method for extracting neural embeddings that model the background acoustics of a speech signal. The extracted embeddings are used to estimate specific parameters related to the background acoustic properties of the signal in a non-intrusive manner, which allows the embeddings to be explainable in terms of those parameters. We illustrate the value of these embeddings by performing clustering experiments on unseen test data and show that the proposed embeddings achieve a mean F1 score of 95.2% for three different tasks, significantly outperforming the WavLM-based signal embeddings. We also show that the proposed method can explain the embeddings by estimating 14 acoustic parameters characterizing the background acoustics, including reverberation and noise levels, overlapped speech detection, CODEC type detection and noise type detection, with high accuracy and a real-time factor 17 times lower than an external baseline method.
Submitted 7 June, 2024;
originally announced June 2024.
-
Candidate strongly-lensed Type Ia supernovae in the Zwicky Transient Facility archive
Authors:
A. Townsend,
J. Nordin,
A. Sagués Carracedo,
M. Kowalski,
N. Arendse,
S. Dhawan,
A. Goobar,
J. Johansson,
E. Mörtsell,
S. Schulze,
I. Andreoni,
E. Fernández,
A. G. Kim,
P. E. Nugent,
F. Prada,
M. Rigault,
N. Sarin,
D. Sharma,
E. C. Bellm,
M. W. Coughlin,
R. Dekany,
S. L. Groom,
L. Lacroix,
R. R. Laher,
R. Riddle
, et al. (39 additional authors not shown)
Abstract:
Gravitationally lensed Type Ia supernovae (glSNe Ia) are unique astronomical tools for studying cosmological parameters, distributions of dark matter, the astrophysics of the supernovae and the intervening lensing galaxies themselves. Only a few highly magnified glSNe Ia have been discovered by ground-based telescopes, such as the Zwicky Transient Facility (ZTF), but simulations predict the existence of a fainter, undetected population. We present a systematic search in the ZTF archive of alerts from 1 June 2019 to 1 September 2022. Using the AMPEL platform, we developed a pipeline that distinguishes candidate glSNe Ia from other variable sources. Initial cuts were applied to the ZTF alert photometry before forced photometry was obtained for the remaining candidates. Additional cuts were applied to refine the candidates based on their light curve colours, lens galaxy colours, and the resulting parameters from fits to the SALT2 SN Ia template. Candidates were also cross-matched with the DESI spectroscopic catalogue. Seven transients passed all the cuts and had an associated galaxy DESI redshift, which we present as glSN Ia candidates. While superluminous supernovae (SLSNe) cannot be fully rejected, two events, ZTF19abpjicm and ZTF22aahmovu, are significantly different from typical SLSNe and their light curves can be modelled as two-image glSN Ia systems. From this two-image modelling, we estimate time delays of 22 $\pm$ 3 and 34 $\pm$ 1 days for the two events, respectively, which suggests that we have uncovered a population with longer time delays. The pipeline is efficient and sensitive enough to parse full alert streams. It is currently being applied to the live ZTF alert stream to identify and follow-up future candidates while active. This pipeline could be the foundation for glSNe Ia searches in future surveys, like the Vera C. Rubin Observatory's Legacy Survey of Space and Time.
Submitted 28 May, 2024;
originally announced May 2024.
-
Learning accurate and interpretable decision trees
Authors:
Maria-Florina Balcan,
Dravyansh Sharma
Abstract:
Decision trees are a popular tool in machine learning and yield easy-to-understand models. Several techniques have been proposed in the literature for learning a decision tree classifier, with different techniques working well for data from different domains. In this work, we develop approaches to design decision tree learning algorithms given repeated access to data from the same domain. We propose novel parameterized classes of node splitting criteria in top-down algorithms, which interpolate between popularly used entropy and Gini impurity based criteria, and provide theoretical bounds on the number of samples needed to learn the splitting function appropriate for the data at hand. We also study the sample complexity of tuning prior parameters in Bayesian decision tree learning, and extend our results to decision tree regression. We further consider the problem of tuning hyperparameters in pruning the decision tree for classical pruning algorithms including min-cost complexity pruning. We also study the interpretability of the learned decision trees and introduce a data-driven approach for optimizing the explainability versus accuracy trade-off using decision trees. Finally, we demonstrate the significance of our approach on real world datasets by learning data-specific decision trees which are simultaneously more accurate and interpretable.
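To make the idea of a splitting criterion that interpolates between entropy and Gini impurity concrete, the sketch below uses the one-parameter Tsallis entropy, a well-known family that gives Gini impurity at `q = 2` and Shannon entropy (in nats) as `q → 1`. This is a stand-in chosen for illustration; the abstract does not specify the authors' parameterized classes, and the function names here are hypothetical.

```python
import math


def tsallis_impurity(probs, q):
    """Tsallis entropy of a class distribution (assumes q > 0).

    q = 2 gives Gini impurity, 1 - sum(p^2); q -> 1 gives Shannon entropy in nats.
    """
    if abs(q - 1.0) < 1e-12:
        # Limiting case: Shannon entropy, skipping zero-probability classes.
        return -sum(p * math.log(p) for p in probs if p > 0.0)
    return (1.0 - sum(p ** q for p in probs)) / (q - 1.0)


def split_gain(parent, left, right, q):
    """Impurity reduction of a binary split; arguments are per-class count lists."""
    def impurity(counts):
        n = sum(counts)
        return tsallis_impurity([c / n for c in counts], q) if n else 0.0

    n = sum(parent)
    weighted_children = (sum(left) / n) * impurity(left) + (sum(right) / n) * impurity(right)
    return impurity(parent) - weighted_children
```

A top-down learner would evaluate `split_gain` for each candidate split and keep the best one; treating `q` as a tunable parameter is what lets the criterion be adapted to data from a given domain.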
Submitted 24 May, 2024;
originally announced May 2024.
-
Analysis of Decentralized Stochastic Successive Convex Approximation for composite non-convex problems
Authors:
Basil M. Idrees,
Shivangi Dubey Sharma,
Ketan Rajawat
Abstract:
This work considers the decentralized successive convex approximation (SCA) method for minimizing stochastic non-convex objectives subject to convex constraints, along with possibly non-smooth convex regularizers. Although SCA has been widely applied in decentralized settings, its stochastic first order (SFO) complexity is unknown, and it is thought to be slower than the centralized momentum-enhanced SCA variants. In this work, we advance the state-of-the-art for SCA methods by proposing an accelerated variant, namely the Decentralized Momentum-based Stochastic SCA (D-MSSCA), and analyze its SFO complexity. The proposed algorithm entails creating a stochastic surrogate of the objective at every iteration, which is minimized at each node separately. Remarkably, the D-MSSCA achieves an SFO complexity of $\mathcal{O}(ε^{-3/2})$ to reach an $ε$-stationary point, which is on par with the SFO complexity lower bound for unconstrained stochastic non-convex optimization in the centralized setting.
Submitted 27 May, 2024; v1 submitted 11 May, 2024;
originally announced May 2024.
-
PLLM-CS: Pre-trained Large Language Model (LLM) for Cyber Threat Detection in Satellite Networks
Authors:
Mohammed Hassanin,
Marwa Keshk,
Sara Salim,
Majid Alsubaie,
Dharmendra Sharma
Abstract:
Satellite networks are vital in facilitating communication services for various critical infrastructures. These networks can seamlessly integrate with a diverse array of systems. However, some of these systems are vulnerable due to the absence of effective intrusion detection systems, which can be attributed to limited research and the high costs associated with deploying, fine-tuning, monitoring, and responding to security breaches. To address these challenges, we propose a pre-trained Large Language Model for Cyber Security (PLLM-CS), a variant of pre-trained Transformers [1] that includes a specialized module for transforming network data into contextually suitable inputs. This transformation enables the proposed LLM to encode contextual information within the cyber data. To validate the efficacy of the proposed method, we conducted empirical experiments using two publicly available network datasets, UNSW_NB 15 and TON_IoT, both providing Internet of Things (IoT)-based traffic data. Our experiments demonstrate that the proposed LLM method outperforms state-of-the-art techniques such as BiLSTM, GRU, and CNN. Notably, the PLLM-CS method achieves an outstanding accuracy level of 100% on the UNSW_NB 15 dataset, setting a new standard for benchmark performance in this domain.
Submitted 8 May, 2024;
originally announced May 2024.
-
Fine-tuning Pre-trained Named Entity Recognition Models For Indian Languages
Authors:
Sankalp Bahad,
Pruthwik Mishra,
Karunesh Arora,
Rakesh Chandra Balabantaray,
Dipti Misra Sharma,
Parameswari Krishnamurthy
Abstract:
Named Entity Recognition (NER) is a useful component in Natural Language Processing (NLP) applications. It is used in various tasks such as Machine Translation, Summarization, Information Retrieval, and Question-Answering systems. The research on NER is centered around English and some other major languages, whereas limited attention has been given to Indian languages. We analyze the challenges and propose techniques that can be tailored for Multilingual Named Entity Recognition for Indian Languages. We present a human-annotated named entity corpus of 40K sentences for 4 Indian languages from two of the major Indian language families. Additionally, we present a multilingual model fine-tuned on our dataset, which achieves an average F1 score of 0.80 on our dataset. We achieve comparable performance on completely unseen benchmark datasets for Indian languages, which affirms the usability of our model.
Submitted 10 May, 2024; v1 submitted 8 May, 2024;
originally announced May 2024.
-
Analysis of a Modular Autonomous Driving Architecture: The Top Submission to CARLA Leaderboard 2.0 Challenge
Authors:
Weize Zhang,
Mohammed Elmahgiubi,
Kasra Rezaee,
Behzad Khamidehi,
Hamidreza Mirkhani,
Fazel Arasteh,
Chunlin Li,
Muhammad Ahsan Kaleem,
Eduardo R. Corral-Soto,
Dhruv Sharma,
Tongtong Cao
Abstract:
In this paper, we present the architecture of the Kyber-E2E submission to the map track of the CARLA Leaderboard 2.0 Autonomous Driving (AD) challenge 2023, which achieved first place. Our solution employs a modular architecture consisting of five main components: sensing, localization, perception, tracking/prediction, and planning/control. Our solution leverages state-of-the-art language-assisted perception models to help our planner perform more reliably in highly challenging traffic scenarios. We use open-source driving datasets in conjunction with Inverse Reinforcement Learning (IRL) to enhance the performance of our motion planner. We provide insight into our design choices and the trade-offs made to achieve this solution. We also explore the impact of each component on the overall performance of our solution, with the intent of providing a guideline on where allocation of resources can have the greatest impact.
Submitted 21 March, 2024;
originally announced May 2024.
-
Robust $μ$-distortion constraints on primordial supermassive black holes from non-Gaussian perturbations
Authors:
Christian T. Byrnes,
Julien Lesgourgues,
Devanshu Sharma
Abstract:
Explaining supermassive black holes via a primordial origin is severely challenged by the tight spectral distortion constraints on the amplitude of the primordial perturbations. Following the first calculation of how the $μ$ constraints are modified by non-Gaussianity in a companion paper, we here place the first robust constraints on primordial black hole formation under large non-Gaussianity. Even the infinite $f_{\rm NL}$ limit is insufficiently non-Gaussian, but much higher-order non-Gaussianity of the form ${\cal R}={\cal R}_{\rm G}^5$ may allow the formation of primordial black holes of any mass without conflicting with distortion constraints. We caution that such extreme models face other challenges.
Submitted 29 April, 2024;
originally announced April 2024.
-
Spectral distortions from acoustic dissipation with non-Gaussian (or not) perturbations
Authors:
Devanshu Sharma,
Julien Lesgourgues,
Christian T. Byrnes
Abstract:
A well-known route to form primordial black holes in the early universe relies on the existence of unusually large primordial curvature fluctuations, confined to a narrow range of wavelengths that would be too small to be constrained by Cosmic Microwave Background (CMB) anisotropies. This scenario would, however, boost the generation of $μ$-type spectral distortions in the CMB due to an enhanced dissipation of acoustic waves. Previous studies of $μ$-distortion bounds on the primordial spectrum were based on the assumption of Gaussian primordial fluctuations. In this work, we push the calculation of $μ$-distortions to one higher order in photon anisotropies. We discuss how to derive bounds on primordial spectrum peaks obeying non-Gaussian statistics under the assumption of local (perturbative or not) non-Gaussianity. We find that, depending on the value of the peak scale, the bounds may either remain stable or get tighter by several orders of magnitude, but only when the departure from Gaussian statistics is very strong. Our results are translated into bounds on the primordial supermassive black hole mass in a companion paper.
Submitted 29 April, 2024;
originally announced April 2024.
-
Size-induced Exchange Bias in Single-phase CoO Nanoparticles
Authors:
Vikash Sharma,
Sudip Pal,
Divya Sharma,
Dinesh Kumar Shukla,
Ram Janay Chaudhary,
Gunadhor Singh Okram
Abstract:
We report exchange bias (EB) in single-phase CoO nanoparticles, where two magnetic phases naturally emerge as the crystallite size decreases from 34.6 to 10.8 nm. The Néel temperature (TN) associated with antiferromagnetic ordering decreases monotonically with the reduction in crystallite size, highlighting the significant influence of size effects. The 34.6 nm nanoparticles exhibit magnetization irreversibility between zero-field-cooled (ZFC) and field-cooled (FC) states below TN. This irreversibility appears well above TN with further reduction in size, resulting in the absence of a true paramagnetic regime, which indicates the occurrence of an additional magnetic phase. The frequency-dependent ac-susceptibility in 10.8 nm nanoparticles suggests slow dynamics of disordered surface spins above TN, coinciding with the establishment of long-range order in the core. The thermoremanent magnetization (TRM) and isothermoremanent magnetization (IRM) curves suggest a core-shell structure: the core is antiferromagnetic, and the shell consists of disordered surface spins causing ferromagnetic interaction. Hence, the exchange bias in these CoO nanoparticles results from the exchange coupling between an antiferromagnetic core and a disordered shell that exhibits unconventional surface spin characteristics.
Submitted 12 April, 2024; v1 submitted 10 April, 2024;
originally announced April 2024.
-
Towards Large Language Model driven Reference-less Translation Evaluation for English and Indian Languages
Authors:
Vandan Mujadia,
Pruthwik Mishra,
Arafat Ahsan,
Dipti Misra Sharma
Abstract:
With the primary focus on evaluating the effectiveness of large language models for automatic reference-less translation assessment, this work presents our experiments on mimicking human direct assessment to evaluate the quality of translations in English and Indian languages. We constructed a translation evaluation task where we performed zero-shot learning, in-context example-driven learning, and fine-tuning of large language models to provide a score out of 100, where 100 represents a perfect translation and 1 represents a poor translation. We compared the performance of our trained systems with existing methods such as COMET, BERT-Scorer, and LABSE, and found that the LLM-based evaluator (LLaMA-2-13B) achieves a comparable or higher overall correlation with human judgments for the considered Indian language pairs.
Submitted 3 April, 2024;
originally announced April 2024.
-
HCiM: ADC-Less Hybrid Analog-Digital Compute in Memory Accelerator for Deep Learning Workloads
Authors:
Shubham Negi,
Utkarsh Saxena,
Deepika Sharma,
Kaushik Roy
Abstract:
Analog Compute-in-Memory (CiM) accelerators are increasingly recognized for their efficiency in accelerating Deep Neural Networks (DNNs). However, their dependence on Analog-to-Digital Converters (ADCs) for accumulating partial sums from crossbars leads to substantial power and area overhead. Moreover, the high area overhead of ADCs constrains the throughput due to the limited number of ADCs that can be integrated per crossbar. An approach to mitigate this issue involves the adoption of extreme low-precision quantization (binary or ternary) for partial sums. Training based on such an approach eliminates the need for ADCs. While this strategy effectively reduces ADC costs, it introduces the challenge of managing numerous floating-point scale factors, which are trainable parameters like DNN weights. These scale factors must be multiplied with the binary or ternary outputs at the columns of the crossbar to ensure system accuracy. To that effect, we propose an algorithm-hardware co-design approach, where DNNs are first trained with quantization-aware training. Subsequently, we introduce HCiM, an ADC-Less Hybrid Analog-Digital CiM accelerator. HCiM uses analog CiM crossbars for performing Matrix-Vector Multiplication operations coupled with a digital CiM array dedicated to processing scale factors. This digital CiM array can execute both addition and subtraction operations within the memory array, thus enhancing processing speed. Additionally, it exploits the inherent sparsity in ternary quantization to achieve further energy savings. Compared to an analog CiM baseline architecture using 7-bit and 4-bit ADCs, HCiM achieves energy reductions of up to 28% and 12%, respectively.
Submitted 20 March, 2024;
originally announced March 2024.
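The HCiM abstract above notes that once partial sums are ternary, applying a scale factor reduces to adding it, subtracting it, or skipping it. The following is a minimal software sketch of that idea only, not the authors' hardware design: the symmetric `threshold`, the function names, and the array shapes are assumptions made for illustration.

```python
import numpy as np

def ternarize(partial_sums, threshold):
    """Quantize partial sums to {-1, 0, +1}.

    The symmetric threshold is a hypothetical choice for this sketch.
    """
    return np.where(partial_sums > threshold, 1,
                    np.where(partial_sums < -threshold, -1, 0))

def accumulate_scaled(ternary_outputs, scale_factors):
    """Combine ternary column outputs with scale factors using add/subtract only.

    Both arguments have shape (num_crossbars, num_columns).
    """
    acc = np.zeros(ternary_outputs.shape[1], dtype=float)
    for t_row, s_row in zip(ternary_outputs, scale_factors):
        acc += np.where(t_row == 1, s_row, 0.0)   # +1 output: add the scale factor
        acc -= np.where(t_row == -1, s_row, 0.0)  # -1 output: subtract the scale factor
        # 0 output: no operation, which is where ternary sparsity saves work
    return acc
```

The result equals the element-wise product of ternary outputs and scale factors summed over crossbars, but no multiplication by the ternary value is ever needed.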
-
LAVA: Long-horizon Visual Action based Food Acquisition
Authors:
Amisha Bhaskar,
Rui Liu,
Vishnu D. Sharma,
Guangyao Shi,
Pratap Tokekar
Abstract:
Robotic Assisted Feeding (RAF) addresses the fundamental need for individuals with mobility impairments to regain autonomy in feeding themselves. The goal of RAF is to use a robot arm to acquire and transfer food to individuals from the table. Existing RAF methods primarily focus on solid foods, leaving a gap in manipulation strategies for semi-solid and deformable foods. This study introduces Long-horizon Visual Action (LAVA) based food acquisition of liquid, semisolid, and deformable foods. Long-horizon refers to the goal of "clearing the bowl" by sequentially acquiring the food from the bowl. LAVA employs a hierarchical policy for long-horizon food acquisition tasks. The framework uses a high-level policy to determine primitives by leveraging ScoopNet. At the mid-level, LAVA finds parameters for primitives using vision. To carry out sequential plans in the real world, LAVA delegates action execution to a low-level policy that uses parameters received from the mid-level policy and behavior cloning to ensure precise trajectory execution. We validate our approach on complex real-world acquisition trials involving granular, liquid, semisolid, and deformable food types along with fruit chunks and soup acquisition. Across 46 bowls, LAVA acquires much more efficiently than baselines with a success rate of 89 +/- 4% and generalizes across realistic plate variations such as different positions, varieties, and amounts of food in the bowl. Code, datasets, videos, and supplementary materials can be found on our website.
Submitted 19 March, 2024;
originally announced March 2024.
-
Memristive control of plasmon-mediated nonlinear photoluminescence in Au nanowires
Authors:
Deepak K Sharma,
Adrian Agreda,
Florian DellOva,
Konstantin Malchow,
Gérard Colas-des-Francs,
Erik Dujardin,
Alexandre Bouhelier
Abstract:
Nonlinear photoluminescence (N-PL) is a broadband photon emission arising from a non-equilibrium electron distribution generated at the surface of metallic nanostructures by ultrafast pulsed laser illumination. N-PL is sensitive to surface morphology, local electromagnetic field strength, and electronic band structure, making it relevant for probing optically excited nanoscale plasmonic systems. It has also been key to accessing the complex multiscale time dynamics ruling electron thermalization. Here, we show that the surface-plasmon-mediated N-PL emitted by a gold nanowire can be modified by an electrical architecture featuring a nanogap. Upon voltage activation, we observe that N-PL becomes dependent on the electrical transport dynamics and can thus be locally modulated. This finding brings an electrical leverage to externally control the photoluminescence generated from metal nanostructures, and constitutes an asset for the development of emerging nanoscale interface devices managing photons and electrons.
Submitted 7 March, 2024;
originally announced March 2024.
-
A comparative study of cosmological constraints from weak lensing using Convolutional Neural Networks
Authors:
Divij Sharma,
Biwei Dai,
Uros Seljak
Abstract:
Weak Lensing (WL) surveys are reaching unprecedented depths, enabling the investigation of very small angular scales. At these scales, nonlinear gravitational effects lead to higher-order correlations, making the matter distribution highly non-Gaussian. Extracting this information using traditional statistics has proven difficult, and Machine Learning based summary statistics have emerged as a powerful alternative. We explore the capabilities of a discriminative, Convolutional Neural Network (CNN) based approach, focusing on parameter constraints in the ($Ω_m$, $σ_8$) cosmological parameter space. Leveraging novel training loss functions and network representations on WL mock datasets without baryons, we show that our models achieve $\sim 5$ times stronger constraints than the power spectrum, $\sim 3$ times stronger constraints than peak counts, and $\sim 2$ times stronger constraints than previous CNN-learned summary statistics and scattering transforms, for noise levels relevant to Rubin or Euclid. For WL convergence maps with baryonic physics, our models achieve $\sim 2.3$ times stronger constraining power than the power spectrum at these noise levels, also outperforming previous summary statistics. To further explore the possibilities of CNNs for this task, we also discuss transfer learning, where we adapt pre-trained models, trained on different tasks or datasets, for cosmological inference, finding that these do not improve the performance.
Submitted 6 March, 2024;
originally announced March 2024.
-
Statistical and Machine Learning Models for Predicting Fire and Other Emergency Events
Authors:
Dilli Prasad Sharma,
Nasim Beigi-Mohammadi,
Hongxiang Geng,
Dawn Dixon,
Rob Madro,
Phil Emmenegger,
Carlos Tobar,
Jeff Li,
Alberto Leon-Garcia
Abstract:
Emergency events in a city cause considerable economic loss to individuals, their families, and the community. Accurate and timely prediction of events can help the emergency fire and rescue services in preparing for and mitigating the consequences of emergency events. In this paper, we present a systematic development of predictive models for various types of emergency events in the City of Edmonton, Canada. We present methods for (i) data collection and dataset development; (ii) descriptive analysis of each event type and its characteristics at different spatiotemporal levels; (iii) feature analysis and selection based on correlation coefficient analysis and feature importance analysis; and (iv) development of prediction models for the likelihood of occurrence of each event type at different temporal and spatial resolutions. We analyze the association of event types with socioeconomic and demographic data at the neighborhood level, identify a set of predictors for each event type, and develop predictive models with negative binomial regression. We conduct evaluations at neighborhood and fire station service area levels. Our results show that the models perform well for most of the event types with acceptable prediction errors for weekly and monthly periods. The evaluation shows that the prediction accuracy is consistent at the level of the fire station, so the predictions can be used in management by fire rescue service departments for planning resource allocation for these time periods. We also examine the impact of the COVID-19 pandemic on the occurrence of events and on the accuracy of event predictor models. Our findings show that COVID-19 had a significant impact on the performance of the event prediction models.
Submitted 14 February, 2024;
originally announced February 2024.
-
A field-level emulator for modeling baryonic effects across hydrodynamic simulations
Authors:
Divij Sharma,
Biwei Dai,
Francisco Villaescusa-Navarro,
Uros Seljak
Abstract:
We develop a new and simple method to model baryonic effects at the field level relevant for weak lensing analyses. We analyze thousands of state-of-the-art hydrodynamic simulations from the CAMELS project, each with different cosmology and strength of feedback, and we find that the cross-correlation coefficient between full hydrodynamic and N-body simulations is very close to 1 down to $k\sim10~h{\rm Mpc}^{-1}$. This suggests that modeling baryonic effects at the field level down to these scales only requires N-body simulations plus a correction to the mode's amplitude given by: $\sqrt{P_{\rm hydro}(k)/P_{\rm nbody}(k)}$. In this paper, we build an emulator for this quantity, using Gaussian processes, that is flexible enough to reproduce results from thousands of hydrodynamic simulations that have different cosmologies, astrophysics, subgrid physics, volumes, resolutions, and at different redshifts. Our emulator is accurate at the percent level and exhibits a range of validation superior to previous studies. This method and our emulator enable field-level simulation-based inference analyses and accounting for baryonic effects in weak lensing analyses.
Submitted 29 January, 2024;
originally announced January 2024.
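The correction described in the abstract above, rescaling each Fourier mode of an N-body field by $\sqrt{P_{\rm hydro}(k)/P_{\rm nbody}(k)}$, can be sketched in a few lines of NumPy. This is an illustrative sketch under stated assumptions, not the authors' emulator: `apply_baryon_correction` and `ratio_fn` are hypothetical names, and `ratio_fn` stands in for whatever supplies the power-spectrum ratio (in the paper, a Gaussian-process emulator).

```python
import numpy as np

def apply_baryon_correction(delta_nbody, box_size, ratio_fn):
    """Rescale Fourier amplitudes of a cubic N-body overdensity field.

    delta_nbody: real array of shape (n, n, n)
    box_size:    side length of the box (sets the units of k)
    ratio_fn:    hypothetical callable returning P_hydro(k) / P_nbody(k)
                 for an array of wavenumber magnitudes
    """
    n = delta_nbody.shape[0]
    spacing = box_size / n
    # Angular wavenumbers along each axis; the last axis is halved by rfftn
    k_full = 2.0 * np.pi * np.fft.fftfreq(n, d=spacing)
    k_half = 2.0 * np.pi * np.fft.rfftfreq(n, d=spacing)
    kx, ky, kz = np.meshgrid(k_full, k_full, k_half, indexing="ij")
    k_mag = np.sqrt(kx**2 + ky**2 + kz**2)

    delta_k = np.fft.rfftn(delta_nbody)
    # Only the amplitude is changed; the phases of the N-body modes are kept
    delta_k = delta_k * np.sqrt(ratio_fn(k_mag))
    return np.fft.irfftn(delta_k, s=delta_nbody.shape)
```

Because the multiplier is real and non-negative, the phases of the N-body field are untouched, which is consistent with the abstract's observation that the cross-correlation coefficient between the hydrodynamic and N-body fields stays close to 1.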
-
ANFIS and metaheuristics for green supply chain with inspection and rework
Authors:
Nidhi Sharma,
Madhu Jain,
Dinesh Sharma
Abstract:
The focus of the present article is to investigate a supply chain inventory model of deteriorating items with inspection and stock-dependent demand, using green technology to reduce carbon emissions. Products that are decaying have a high sensitivity to the environment in terms of temperature, carbon emission, humidity, waste disposal, etc. This study develops a profit maximization model in the presence of deterioration, preservation, imperfect production, inspection error, rework, and stock- and price-dependent demand. Three carbon emission strategies are proposed to reduce the expenses in different carbon emission scenarios. The suggested approach may be used to determine the optimal production period, preservation investment, and level of green investment. The solution of the proposed non-linear constrained optimization problem is obtained by using a penalty method in metaheuristic approaches. A numerical example is presented in order to conduct a sensitivity analysis for the essential model parameters. The results produced by differential evolution (DE) and particle swarm optimization (PSO) are compared with the results obtained by the Adaptive Neuro-Fuzzy Inference System (ANFIS) technique.
Submitted 12 July, 2024; v1 submitted 17 January, 2024;
originally announced January 2024.
-
Auxiliary Network-Enabled Attack Detection and Resilient Control of Islanded AC Microgrid
Authors:
Vaibhav Vaishnav,
Anoop Jain,
Dushyant Sharma
Abstract:
This paper proposes a cyber-resilient distributed control strategy equipped with attack detection capabilities for islanded AC microgrids in the presence of bounded stealthy cyber attacks affecting both frequency and power information exchanged among neighboring distributed generators (DGs). The proposed control methodology relies on the construction of an auxiliary layer and the establishment of effective inter-layer cooperation between the actual DGs in the control layer and the virtual DGs in the auxiliary layer. This cooperation aims to achieve robust frequency restoration and proportional active power-sharing. It is shown that the in situ presence of a concealed auxiliary layer not only guarantees resilience against stealthy bounded attacks on both frequency and power-sharing but also facilitates a network-enabled attack identification mechanism. The paper provides rigorous proof of the stability of the closed-loop system and derives bounds for frequency and power deviations under attack conditions, offering insights into the impact of the attack signal, control and pinning gains, and network connectivity on the system's convergence properties. The performance of the proposed controllers is illustrated by simulating a networked islanded AC microgrid in a Simulink environment showcasing both attributes of attack resilience and attack detection.
Submitted 30 December, 2023;
originally announced January 2024.
-
Automatic Data Retrieval for Cross Lingual Summarization
Authors:
Nikhilesh Bhatnagar,
Ashok Urlana,
Vandan Mujadia,
Pruthwik Mishra,
Dipti Misra Sharma
Abstract:
Cross-lingual summarization involves the summarization of text written in one language into a different one. There is a body of research addressing cross-lingual summarization from English to other European languages. In this work, we aim to perform cross-lingual summarization from English to Hindi. We propose that pairing up the coverage of newsworthy events in textual and video formats can prove helpful for data acquisition for cross-lingual summarization. We analyze the data and propose methods to match articles to video descriptions that serve as document and summary pairs. We also outline filtering methods over reasonable thresholds to ensure the correctness of the summaries. Further, we make available 28,583 mono and cross-lingual article-summary pairs at https://github.com/tingc9/Cross-Sum-News-Aligned. We also build and analyze multiple baselines on the collected data and report error analysis.
Submitted 22 December, 2023;
originally announced December 2023.
-
Gemini: A Family of Highly Capable Multimodal Models
Authors:
Gemini Team,
Rohan Anil,
Sebastian Borgeaud,
Jean-Baptiste Alayrac,
Jiahui Yu,
Radu Soricut,
Johan Schalkwyk,
Andrew M. Dai,
Anja Hauth,
Katie Millican,
David Silver,
Melvin Johnson,
Ioannis Antonoglou,
Julian Schrittwieser,
Amelia Glaese,
Jilin Chen,
Emily Pitler,
Timothy Lillicrap,
Angeliki Lazaridou,
Orhan Firat,
James Molloy,
Michael Isard,
Paul R. Barham,
Tom Hennigan,
Benjamin Lee
, et al. (1325 additional authors not shown)
Abstract:
This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultra model advances the state of the art in 30 of 32 of these benchmarks - notably being the first model to achieve human-expert performance on the well-studied exam benchmark MMLU, and improving the state of the art in every one of the 20 multimodal benchmarks we examined. We believe that the new capabilities of the Gemini family in cross-modal reasoning and language understanding will enable a wide variety of use cases. We discuss our approach toward post-training and deploying Gemini models responsibly to users through services including Gemini, Gemini Advanced, Google AI Studio, and Cloud Vertex AI.
Submitted 17 June, 2024; v1 submitted 18 December, 2023;
originally announced December 2023.
-
Verb Categorisation for Hindi Word Problem Solving
Authors:
Harshita Sharma,
Pruthwik Mishra,
Dipti Misra Sharma
Abstract:
Word problem solving is a challenging NLP task that deals with solving mathematical problems described in natural language. Recently, there has been renewed interest in developing word problem solvers for Indian languages. As part of this paper, we have built a Hindi arithmetic word problem solver which makes use of verbs. Additionally, we have created verb categorization data for Hindi. Verbs are very important for solving word problems with addition/subtraction operations, as they help us identify the set of operations required to solve the word problems. We propose a rule-based solver that uses verb categorisation to identify the operations in a word problem and generate its answer. To perform verb categorisation, we explore several approaches and present a comparative study.
Submitted 18 December, 2023;
originally announced December 2023.
-
Identified charged-hadron production in $p$$+$Al, $^3$He$+$Au, and Cu$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV and in U$+$U collisions at $\sqrt{s_{_{NN}}}=193$ GeV
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
J. Alexander,
M. Alfred,
V. Andrieux,
K. Aoki,
N. Apadula,
H. Asano,
E. T. Atomssa,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
X. Bai,
N. S. Bandara,
B. Bannier,
K. N. Barish,
S. Bathe,
V. Baublis
, et al. (456 additional authors not shown)
Abstract:
The PHENIX experiment has performed a systematic study of identified charged-hadron ($π^\pm$, $K^\pm$, $p$, $\bar{p}$) production at midrapidity in $p$$+$Al, $^3$He$+$Au, Cu$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV and U$+$U collisions at $\sqrt{s_{_{NN}}}=193$ GeV. Identified charged-hadron invariant transverse-momentum ($p_T$) and transverse-mass ($m_T$) spectra are presented and interpreted in terms of radially expanding thermalized systems. The particle ratios of $K/π$ and $p/π$ have been measured in different centrality ranges of large (Cu$+$Au, U$+$U) and small ($p$$+$Al, $^3$He$+$Au) collision systems. The values of $K/π$ ratios measured in all considered collision systems were found to be consistent with those measured in $p$$+$$p$ collisions. However, the values of $p/π$ ratios measured in large collision systems reach the values of $\approx0.6$, which is $\approx2$ times larger than in $p$$+$$p$ collisions. These results can be qualitatively understood in terms of the baryon enhancement expected from hadronization by recombination. Identified charged-hadron nuclear-modification factors ($R_{AB}$) are also presented. Enhancement of proton $R_{AB}$ values over meson $R_{AB}$ values was observed in central $^3$He$+$Au, Cu$+$Au, and U$+$U collisions. The proton $R_{AB}$ values measured in the $p$$+$Al collision system were found to be consistent with $R_{AB}$ values of $φ$, $π^\pm$, $K^\pm$, and $π^0$ mesons, which may indicate that the size of the system produced in $p$$+$Al collisions is too small for recombination to cause a noticeable increase in proton production.
Submitted 22 May, 2024; v1 submitted 14 December, 2023;
originally announced December 2023.
-
Exploring Answer Information Methods for Question Generation with Transformers
Authors:
Talha Chafekar,
Aafiya Hussain,
Grishma Sharma,
Deepak Sharma
Abstract:
There has been a lot of work in question generation in which different methods of providing target answers as input have been employed. This experimentation has been mostly carried out for RNN-based models. We use three different methods and their combinations for incorporating answer information and explore their effect on several automatic evaluation metrics. The methods that are used are answer prompting, using a custom product method using answer embeddings and encoder outputs, choosing sentences from the input paragraph that have answer-related information, and using a separate cross-attention block in the decoder which attends to the answer. We observe that answer prompting without any additional modes obtains the best ROUGE and METEOR scores. Additionally, we use a custom metric to calculate how many of the generated questions have the same answer as the one used to generate them.
Submitted 6 December, 2023;
originally announced December 2023.
-
On the representation of an imaginary quadratic integer in two different bases
Authors:
Divyum Sharma
Abstract:
Let $(α,\mathcal{N}_α)$ and $(β,\mathcal{N}_β)$ be two canonical number systems for an imaginary quadratic number field $K$ such that $α$ and $β$ are multiplicatively independent. We provide an effective lower bound for the sum of the number of non-zero digits in the $α$-adic and $β$-adic expansions of an algebraic integer $γ\in\mathcal{O}_K$ which is an increasing function of $|γ|$. This is an analogue of an earlier result due to Stewart on integer representations.
Submitted 28 November, 2023;
originally announced November 2023.
-
Assessing Translation capabilities of Large Language Models involving English and Indian Languages
Authors:
Vandan Mujadia,
Ashok Urlana,
Yash Bhaskar,
Penumalla Aditya Pavani,
Kukkapalli Shravya,
Parameswari Krishnamurthy,
Dipti Misra Sharma
Abstract:
Generative Large Language Models (LLMs) have achieved remarkable advancements in various NLP tasks. In this work, our aim is to explore the multilingual capabilities of large language models by using machine translation as a task involving English and 22 Indian languages. We first investigate the translation capabilities of raw large language models, followed by exploring the in-context learning capabilities of the same raw models. We fine-tune these large language models using parameter efficient fine-tuning methods such as LoRA and additionally with full fine-tuning. Through our study, we have identified the best performing large language model for the translation task involving LLMs, which is based on LLaMA.
Our results demonstrate significant progress, with average BLEU scores of 13.42, 15.93, 12.13, 12.30, and 12.07, as well as chrF scores of 43.98, 46.99, 42.55, 42.42, and 45.39, respectively, using 2-stage fine-tuned LLaMA-13b for English to Indian languages on IN22 (conversational), IN22 (general), flores200-dev, flores200-devtest, and newstest2019 testsets. Similarly, for Indian languages to English, we achieved average BLEU scores of 14.03, 16.65, 16.17, 15.35 and 12.55 along with chrF scores of 36.71, 40.44, 40.26, 39.51, and 36.20, respectively, using fine-tuned LLaMA-13b on IN22 (conversational), IN22 (general), flores200-dev, flores200-devtest, and newstest2019 testsets. Overall, our findings highlight the potential and strength of large language models for machine translation capabilities, including for languages that are currently underrepresented in LLMs.
Submitted 15 November, 2023;
originally announced November 2023.
-
A Multi-Agent Reinforcement Learning Framework for Evaluating the U.S. Ending the HIV Epidemic Plan
Authors:
Dinesh Sharma,
Ankit Shah,
Chaitra Gopalappa
Abstract:
Human immunodeficiency virus (HIV) is a major public health concern in the United States, with about 1.2 million people living with HIV and 35,000 newly infected each year. There are considerable geographical disparities in HIV burden and care access across the U.S. The 2019 Ending the HIV Epidemic (EHE) initiative aims to reduce new infections by 90% by 2030, by improving coverage of diagnoses, treatment, and prevention interventions and prioritizing jurisdictions with high HIV prevalence. Identifying optimal scale-up of intervention combinations will help inform resource allocation. Existing HIV decision analytic models either evaluate specific cities or the overall national population, thus overlooking jurisdictional interactions or differences. In this paper, we propose a multi-agent reinforcement learning (MARL) model that enables jurisdiction-specific decision analyses in an environment with cross-jurisdictional epidemiological interactions. In experimental analyses, conducted on jurisdictions within California and Florida, optimal policies from MARL were significantly different from those generated from single-agent RL, highlighting the influence of jurisdictional variations and interactions. By using comprehensive modeling of HIV and formulations of state space, action space, and reward functions, this work helps demonstrate the strengths and applicability of MARL for informing public health policies, and provides a framework for expanding to the national level to inform the EHE.
Submitted 6 November, 2023; v1 submitted 1 November, 2023;
originally announced November 2023.
-
Knowledge-based in silico models and dataset for the comparative evaluation of mammography AI for a range of breast characteristics, lesion conspicuities and doses
Authors:
Elena Sizikova,
Niloufar Saharkhiz,
Diksha Sharma,
Miguel Lago,
Berkman Sahiner,
Jana G. Delfino,
Aldo Badano
Abstract:
To generate evidence regarding the safety and efficacy of artificial intelligence (AI) enabled medical devices, AI models need to be evaluated on a diverse population of patient cases, some of which may not be readily available. We propose an evaluation approach for testing medical imaging AI models that relies on in silico imaging pipelines in which stochastic digital models of human anatomy (in object space) with and without pathology are imaged using a digital replica imaging acquisition system to generate realistic synthetic image datasets. Here, we release M-SYNTH, a dataset of cohorts with four breast fibroglandular density distributions imaged at different exposure levels using Monte Carlo x-ray simulations with the publicly available Virtual Imaging Clinical Trial for Regulatory Evaluation (VICTRE) toolkit. We utilize the synthetic dataset to analyze AI model performance and find that model performance decreases with increasing breast density and increases with higher mass density, as expected. As exposure levels decrease, AI model performance drops with the highest performance achieved at exposure levels lower than the nominal recommended dose for the breast type.
Submitted 27 October, 2023;
originally announced October 2023.
-
Quantum-inspired attribute selection algorithm: A Fidelity-based Quantum Decision Tree
Authors:
Diksha Sharma,
Parvinder Singh,
Atul Kumar
Abstract:
A classical decision tree is completely based on splitting measures, which utilize the occurrence of random events in correspondence to its class labels in order to optimally segregate datasets. However, the splitting measures are based on a greedy strategy, which leads to the construction of an imbalanced tree and hence decreases the prediction accuracy of the classical decision tree algorithm. An intriguing approach is to utilize the foundational aspects of quantum computing for enhancing the decision tree algorithm. Therefore, in this work, we propose to use fidelity as a quantum splitting criterion to construct an efficient and balanced quantum decision tree. For this, we construct a quantum state using the occurrence of random events in a feature and its corresponding class. The quantum state is further utilized to compute fidelity for determining the splitting attribute among all features. Using numerical analysis, our results clearly demonstrate that the proposed algorithm cooperatively ensures the construction of a balanced tree. We further compared the efficiency of our proposed quantum splitting criterion to different classical splitting criteria on balanced and imbalanced datasets. Our simulation results show that the proposed splitting criterion exceeds all classical splitting criteria for all possible evaluation metrics.
Submitted 27 October, 2023;
originally announced October 2023.
-
Pre-Trained Masked Image Model for Mobile Robot Navigation
Authors:
Vishnu Dutt Sharma,
Anukriti Singh,
Pratap Tokekar
Abstract:
2D top-down maps are commonly used for the navigation and exploration of mobile robots through unknown areas. Typically, the robot builds the navigation maps incrementally from local observations using onboard sensors. Recent works have shown that predicting the structural patterns in the environment through learning-based approaches can greatly enhance task efficiency. While many such works build task-specific networks using limited datasets, we show that the existing foundational vision networks can accomplish the same without any fine-tuning. Specifically, we use Masked Autoencoders, pre-trained on street images, to present novel applications for field-of-view expansion, single-agent topological exploration, and multi-agent exploration for indoor mapping, across different input modalities. Our work motivates the use of foundational vision models for generalized structure prediction-driven applications, especially in the dearth of training data. For more qualitative results see https://raaslab.org/projects/MIM4Robots.
Submitted 25 March, 2024; v1 submitted 10 October, 2023;
originally announced October 2023.
-
Crystal-GFN: sampling crystals with desirable properties and constraints
Authors:
Mila AI4Science,
Alex Hernandez-Garcia,
Alexandre Duval,
Alexandra Volokhova,
Yoshua Bengio,
Divya Sharma,
Pierre Luc Carrier,
Yasmine Benabed,
Michał Koziarski,
Victor Schmidt
Abstract:
Accelerating material discovery holds the potential to greatly help mitigate the climate crisis. Discovering new solid-state materials such as electrocatalysts, super-ionic conductors or photovoltaic materials can have a crucial impact, for instance, in improving the efficiency of renewable energy production and storage. In this paper, we introduce Crystal-GFN, a generative model of crystal structures that sequentially samples structural properties of crystalline materials, namely the space group, composition and lattice parameters. This domain-inspired approach enables the flexible incorporation of physical and structural hard constraints, as well as the use of any available predictive model of a desired physicochemical property as an objective function. To design stable materials, one must target the candidates with the lowest formation energy. Here, we use as objective the formation energy per atom of a crystal structure predicted by a new proxy machine learning model trained on MatBench. The results demonstrate that Crystal-GFN is able to sample highly diverse crystals with low (median -3.1 eV/atom) predicted formation energy.
Submitted 13 December, 2023; v1 submitted 7 October, 2023;
originally announced October 2023.
-
Quantifying Outlierness of Funds from their Categories using Supervised Similarity
Authors:
Dhruv Desai,
Ashmita Dhiman,
Tushar Sharma,
Deepika Sharma,
Dhagash Mehta,
Stefano Pasquali
Abstract:
Mutual fund categorization has become a standard tool for the investment management industry and is extensively used by allocators for portfolio construction and manager selection, as well as by fund managers for peer analysis and competitive positioning. As a result, a (unintended) miscategorization or lack of precision can significantly impact allocation decisions and investment fund managers. Here, we aim to quantify the effect of miscategorization of funds utilizing a machine learning based approach. We formulate the problem of miscategorization of funds as a distance-based outlier detection problem, where the outliers are the data-points that are far from the rest of the data-points in the given feature space. We implement and employ a Random Forest (RF) based method of distance metric learning, and compute the so-called class-wise outlier measures for each data-point to identify outliers in the data. We test our implementation on various publicly available data sets, and then apply it to mutual fund data. We show that there is a strong relationship between the outlier measures of the funds and their future returns and discuss the implications of our findings.
Submitted 13 August, 2023;
originally announced August 2023.
-
MAP-NBV: Multi-agent Prediction-guided Next-Best-View Planning for Active 3D Object Reconstruction
Authors:
Harnaik Dhami,
Vishnu D. Sharma,
Pratap Tokekar
Abstract:
Next-Best View (NBV) planning is the long-standing problem of determining from where a robot viewing an object should obtain its next best view of that object. There are a number of methods for choosing NBV based on the observed part of the object. In this paper, we investigate how predicting the unobserved part helps with the efficiency of reconstructing the object. We present Multi-Agent Prediction-Guided NBV (MAP-NBV), a decentralized coordination algorithm for active 3D reconstruction with multi-agent systems. Prediction-based approaches have shown great improvement in active perception tasks by learning the cues about structures in the environment from data. However, these methods primarily focus on single-agent systems. We design a decentralized next-best-view approach that utilizes geometric measures over the predictions and jointly optimizes the information gain and control effort for efficient collaborative 3D reconstruction of the object. Our method achieves a 19% improvement over the non-predictive multi-agent approach in simulations using AirSim and ShapeNet. We make our code publicly available through our project website: http://raaslab.org/projects/MAPNBV/.
Submitted 24 June, 2024; v1 submitted 8 July, 2023;
originally announced July 2023.
-
Kolam Simulation using Angles at Lattice Points
Authors:
Tulasi Bharathi,
Shailaja D Sharma,
Nithin Nagaraj
Abstract:
Kolam is a ritual art form practised by people in South India and consists of rule-bound geometric patterns of dots and lines. Single loop Kolams are mathematical closed loop patterns drawn over a grid of dots and conforming to certain heuristics. In this work, we propose a novel encoding scheme where we map the angular movements of Kolam at lattice points into sequences containing $4$ distinct symbols. This is then used to simulate single loop Kolam procedure via turtle moves in accordance with the desired angular direction at specific points. We thus obtain sequential codes for Kolams, unique up to cyclic permutations. We specify the requirements for the algorithm and indicate the general methodology. We demonstrate a sample of Kolams using our algorithm with a software implementation in Python.
Submitted 5 July, 2023;
originally announced July 2023.
-
Large electro-opto-mechanical coupling in VO2 neuristors
Authors:
Upanya Khandelwal,
Rama Satya Sandilya,
Rajeev Kumar Rai,
Deepak Sharma,
Smruti Rekha Mahapatra,
Debasish Mondal,
Navakanta Bhat,
Naga Phani Aetkuri,
Sushobhan Avasthi,
Saurabh Chandorkar,
Pavan Nukala
Abstract:
Biological neurons are electro-mechanical systems, where the generation and propagation of an action potential is coupled to generation and transmission of an acoustic wave. Neuristors, such as VO2, characterized by insulator-metal transition (IMT) and negative differential resistance, can be engineered as self-oscillators, which are good approximations of biological neurons in the domain of electrical signals. In this study, we show that these self-oscillators are coupled electro-opto-mechanical systems, with better energy conversion coefficients than the conventional electromechanical or electrooptical materials. This is due to the significant contrast in the material's resistance, optical refractive index and density across the induced temperature range in a Joule heating driven IMT. We carried out laser interferometry to measure the opto-mechanical response while simultaneously driving the devices electrically into self-oscillations of different kinds. We analyzed films of various thicknesses, engineered device geometry and performed analytical modelling to decouple the effects of refractive index change vis-a-vis mechanical strain in the interferometry signal. We show that the effective piezoelectric coefficient (d13*) for our neuristor devices is 660 pm/V, making them viable alternatives to Pb-based piezoelectrics for MEMS applications. Furthermore, we show that the effective electro-optic coefficient (r13*) is ~22 nm/V, which is much larger than that in thin-film and bulk Pockels materials.
Submitted 25 June, 2023;
originally announced June 2023.
-
Numerical Simulation of Thermal Energy Storage using Phase Change Material
Authors:
Abhishek Rai,
N. S Thakur,
Deepak Sharma
Abstract:
This paper presents a study on the design optimization of Thermal Energy Storage (TES) using a cylindrical cavity and Gallium as a Phase Change Material (PCM). The objective is to improve the time span of charging and discharging, as well as minimize heat loss during storage. Five different models with varying geometries and heat source configurations were designed and analyzed using CFD simulation in ANSYS Fluent. The results indicate that models with fins on the heat source surface outperform those without fins, due to increased heat transfer surface area. Comparing the models, Model 4 with three heat sources performs similarly to Model 2 with four heat sources, suggesting an optimal design. However, Model 5 demonstrates less desirable results as the charging time of the PCM increases. Overall, this study highlights the effectiveness of the optimized design in Model 4 with three heat sources for efficient Thermal Energy Storage.
Submitted 21 June, 2023; v1 submitted 20 June, 2023;
originally announced June 2023.
-
An Introduction to the Compute Express Link (CXL) Interconnect
Authors:
Debendra Das Sharma,
Robert Blankenship,
Daniel S. Berger
Abstract:
The Compute Express Link (CXL) is an open industry-standard interconnect between processors and devices such as accelerators, memory buffers, smart network interfaces, persistent memory, and solid-state drives. CXL offers coherency and memory semantics with bandwidth that scales with PCIe bandwidth while achieving significantly lower latency than PCIe. All major CPU vendors, device vendors, and datacenter operators have adopted CXL as a common standard. This enables an inter-operable ecosystem that supports key computing use cases including highly efficient accelerators, server memory bandwidth and capacity expansion, multi-server resource pooling and sharing, and efficient peer-to-peer communication. This survey provides an introduction to CXL covering the standards CXL 1.0, CXL 2.0, and CXL 3.0. We further survey CXL implementations, discuss CXL's impact on the datacenter landscape, and future directions.
Submitted 7 May, 2024; v1 submitted 19 June, 2023;
originally announced June 2023.
-
Efficiently Learning the Graph for Semi-supervised Learning
Authors:
Dravyansh Sharma,
Maxwell Jones
Abstract:
Computational efficiency is a major bottleneck in using classic graph-based approaches for semi-supervised learning on datasets with a large number of unlabeled examples. Known techniques to improve efficiency typically involve an approximation of the graph regularization objective, but suffer from two major drawbacks: first, the graph is assumed to be known or constructed with heuristic hyperparameter values, and second, they do not provide a principled approximation guarantee for learning over the full unlabeled dataset. Building on recent work on learning graphs for semi-supervised learning from multiple datasets for problems from the same domain, and leveraging techniques for fast approximations for solving linear systems in the graph Laplacian matrix, we propose algorithms that overcome both of the above limitations.
We show a formal separation in the learning-theoretic complexity of sparse and dense graph families. We further show how to approximately learn the best graphs from the sparse families efficiently using the conjugate gradient method.
Our approach can also be used to learn the graph efficiently online with sub-linear regret, under mild smoothness assumptions. Our online learning results are stated generally, and may be useful for approximate and efficient parameter tuning in other problems. We implement our approach and demonstrate significant ($\sim$10-100x) speedups over prior work on semi-supervised learning with learned graphs on benchmark datasets.
Submitted 12 June, 2023;
originally announced June 2023.