-
Local aspects of topological quantization and the Wu-Yang Monopoles
Authors:
Aayush Verma
Abstract:
In this paper, we review how local potentials arise in the Wu-Yang topological quantization. We also discuss the isomorphism between the de Rham cohomology classes and Cech cohomology classes in such topological quantization. We also emphasize the importance and application of local and global information in gauge theories.
In this paper, we review how local potentials arise in the Wu-Yang topological quantization. We also discuss the isomorphism between the de Rham cohomology classes and Cech cohomology classes in such topological quantization. We also emphasize the importance and application of local and global information in gauge theories.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Data-Centric AI in the Age of Large Language Models
Authors:
Xinyi Xu,
Zhaoxuan Wu,
Rui Qiao,
Arun Verma,
Yao Shu,
Jingtan Wang,
Xinyuan Niu,
Zhenfeng He,
Jiangwei Chen,
Zijian Zhou,
Gregory Kang Ruey Lau,
Hieu Dao,
Lucas Agussurja,
Rachael Hwee Ling Sim,
Xiaoqiang Lin,
Wenyang Hu,
Zhongxiang Dai,
Pang Wei Koh,
Bryan Kian Hsiang Low
Abstract:
This position paper proposes a data-centric viewpoint of AI research, focusing on large language models (LLMs). We start by making the key observation that data is instrumental in the developmental (e.g., pretraining and fine-tuning) and inferential stages (e.g., in-context learning) of LLMs, and yet it receives disproportionally low attention from the research community. We identify four specific…
▽ More
This position paper proposes a data-centric viewpoint of AI research, focusing on large language models (LLMs). We start by making the key observation that data is instrumental in the developmental (e.g., pretraining and fine-tuning) and inferential stages (e.g., in-context learning) of LLMs, and yet it receives disproportionally low attention from the research community. We identify four specific scenarios centered around data, covering data-centric benchmarks and data curation, data attribution, knowledge transfer, and inference contextualization. In each scenario, we underscore the importance of data, highlight promising research directions, and articulate the potential impacts on the research community and, where applicable, the society as a whole. For instance, we advocate for a suite of data-centric benchmarks tailored to the scale and complexity of data for LLMs. These benchmarks can be used to develop new data curation methods and document research efforts and results, which can help promote openness and transparency in AI and LLM research.
△ Less
Submitted 20 June, 2024;
originally announced June 2024.
-
Gravitational-wave background in bouncing models from semi-classical, quantum and string gravity
Authors:
Ido Ben-Dayan,
Gianluca Calcagni,
Maurizio Gasperini,
Anupam Mazumdar,
Eliseo Pavone,
Udaykrishna Thattarampilly,
Amresh Verma
Abstract:
We study the primordial spectra and the gravitational-wave background (GWB) of three models of semi-classical, quantum or string gravity where the big bang is replaced by a bounce and the tensor spectrum is blue-tilted: ekpyrotic universe with fast-rolling Galileons, string-gas cosmology with Atick-Witten conjecture and pre-big-bang cosmology. We find that the ekpyrotic scenario does not produce a…
▽ More
We study the primordial spectra and the gravitational-wave background (GWB) of three models of semi-classical, quantum or string gravity where the big bang is replaced by a bounce and the tensor spectrum is blue-tilted: ekpyrotic universe with fast-rolling Galileons, string-gas cosmology with Atick-Witten conjecture and pre-big-bang cosmology. We find that the ekpyrotic scenario does not produce a GWB amplitude detectable by present or third-generation interferometers, while the string-gas model is ruled out for producing too large a signal. In contrast, the GWB of the pre-big-bang scenario falls within the sensitivity window of both LISA and Einstein Telescope, where it takes the form of a single or a broken power law depending on the choice of parameters. The latter will be tightly constrained by both detectors.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Overlay Space-Air-Ground Integrated Networks with SWIPT-Empowered Aerial Communications
Authors:
Anuradha Verma,
Pankaj Kumar Sharma,
Pawan Kumar,
Dong In Kim
Abstract:
In this article, we consider overlay space-air-ground integrated networks (OSAGINs) where a low earth orbit (LEO) satellite communicates with ground users (GUs) with the assistance of an energy-constrained coexisting air-to-air (A2A) network. Particularly, a non-linear energy harvester with a hybrid SWIPT utilizing both power-splitting and time-switching energy harvesting (EH) techniques is employ…
▽ More
In this article, we consider overlay space-air-ground integrated networks (OSAGINs) where a low earth orbit (LEO) satellite communicates with ground users (GUs) with the assistance of an energy-constrained coexisting air-to-air (A2A) network. Particularly, a non-linear energy harvester with a hybrid SWIPT utilizing both power-splitting and time-switching energy harvesting (EH) techniques is employed at the aerial transmitter. Specifically, we take the random locations of the satellite, ground and aerial receivers to investigate the outage performance of both the satellite-to-ground and aerial networks leveraging the stochastic tools. By taking into account the Shadowed-Rician fading for satellite link, the Nakagami-\emph{m} for ground link, and the Rician fading for aerial link, we derive analytical expressions for the outage probability of these networks. For a comprehensive analysis of aerial network, we consider both the perfect and imperfect successive interference cancellation (SIC) scenarios. Through our analysis, we illustrate that, unlike linear EH, the implementation of non-linear EH provides accurate figures for any target rate, underscoring the significance of using non-linear EH models. Additionally, the influence of key parameters is emphasized, providing guidelines for the practical design of an energy-efficient as well as spectrum-efficient future non-terrestrial networks. Monte Carlo simulations validate the accuracy of our theoretical developments.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Investigating the Star-forming Sites in the Outer Galactic Arm
Authors:
Aayushi Verma,
Saurabh Sharma,
Lokesh K. Dewangan,
Devendra K. Ojha,
Kshitiz Mallick,
Ram Kesh Yadav,
Harmeen Kaur,
Tarak Chand,
Mamta Agarwal,
Archana Gupta
Abstract:
We aim to investigate the global star formation scenario in star-forming sites AFGL 5157, [FSR2007] 0807 (hereafter FSR0807), [HKS2019] E70 (hereafter E70), [KPS2012] MWSC 0620 (hereafter KPS0620), and IRAS 05331+3115 in the outer galactic arm. The distribution of young stellar objects in these sites coincides with a higher extinction and H2 column density, which agrees with the notion that star f…
▽ More
We aim to investigate the global star formation scenario in star-forming sites AFGL 5157, [FSR2007] 0807 (hereafter FSR0807), [HKS2019] E70 (hereafter E70), [KPS2012] MWSC 0620 (hereafter KPS0620), and IRAS 05331+3115 in the outer galactic arm. The distribution of young stellar objects in these sites coincides with a higher extinction and H2 column density, which agrees with the notion that star formation occurs inside the dense molecular cloud cores. We have found two molecular structures at different velocities in this direction; one contains AFGL 5157 and FSR0807, and the other contains E70, [KPS2012] MWSC 0620, and IRAS 05331+3115. All these clusters in our target region are in different evolutionary stages and might form stars through different mechanisms. The E70 cluster seems to be the oldest in our sample; AFGL 5157 and FSR0807 formed later, and KPS0620 and IRAS 05331+3115 are the youngest sites. AFGL 5157 and FSR0807 are physically connected and have cold filamentary structures and dense hub regions. Additionally, the near-infrared photometric analysis shows signatures of massive star formation in these sites. KPS0620 also seems to have cold filamentary structures with the central hub but lacks signatures of massive stars. Our analysis suggests molecular gas flow and the hub filamentary star formation scenario in these regions. IRAS 05331+3115 is a single clump of molecular gas favoring low-mass star formation. Our study suggests that the selected area is a menagerie of star-forming sites where the formation of the stars happens through different processes.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Nonclassicality in Two-Mode New Generalized Binomial State
Authors:
Kathakali Mandal,
Anjali Jatwani,
Amit Verma
Abstract:
The study of nonclassical properties of two-mode quantum states is particularly useful in quantum information theory because of the possibilities of obtaining entanglement and other two-mode quantum correlations in these states. Here we have investigated the possibilities of the existence of nonclassicality in a two-mode New generalized binomial state (TMNGBS). Specifically two-mode antibunching,…
▽ More
The study of nonclassical properties of two-mode quantum states is particularly useful in quantum information theory because of the possibilities of obtaining entanglement and other two-mode quantum correlations in these states. Here we have investigated the possibilities of the existence of nonclassicality in a two-mode New generalized binomial state (TMNGBS). Specifically two-mode antibunching, Quadrature squeezing, sum \& difference squeezing, and various entanglement criteria e.g Shchukin-Vogel entanglement criterion, the uncertainty relation of SU(1,1) Algebra and EPR entanglement criterion are explored in two mode particular example of quantum state named as New generalized binomial state. Earlier we studied nonclassicality in single-mode NGBS, here we are extending our study toward the two-mode version of a quantum state. Here we provide the general expressions of moments for a two-mode quantum state (Fock basis) and explore the quantification in a particular example NGBS. It is found that antibunching, squeezing, and SV entanglement are possible with different limits of depending parameters but the entanglement criteria (EPR, SU (1,1) algebra and Cauchy - Schwarz inequality based)for NGBS are not possible. This study opens up the possibility of exploring the two-mode nonclassicality in other states too.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Strong gravitational lenses from the Vera C. Rubin Observatory
Authors:
Anowar J. Shajib,
Graham P. Smith,
Simon Birrer,
Aprajita Verma,
Nikki Arendse,
Thomas E. Collett
Abstract:
Like many areas of astrophysics and cosmology, the Vera C. Rubin Observatory will be transformational for almost all the applications of strong lensing, thanks to the dramatic increase in the number of known strong lenses by two orders of magnitude or more and the readily available time-domain data for the lenses with transient sources. In this article, we provide an overview of the forecasted num…
▽ More
Like many areas of astrophysics and cosmology, the Vera C. Rubin Observatory will be transformational for almost all the applications of strong lensing, thanks to the dramatic increase in the number of known strong lenses by two orders of magnitude or more and the readily available time-domain data for the lenses with transient sources. In this article, we provide an overview of the forecasted number of discovered lenses of different types and describe the primary science cases these large lens samples will enable. We provide an updated forecast on the joint constraint for the dark energy equation-of-state parameters, $w_0$ and $w_a$, from combining all strong lensing probes of dark energy. We update the previous forecast from the Rubin Observatory Dark Energy Science Collaboration's Science Review Document by adding two new crucial strong lensing samples: lensed Type Ia supernovae and single-deflector lenses with measured stellar kinematics. Finally, we describe the current and near-future activities and collaborative efforts within the strong lensing community in preparation for the arrival of the first real dataset from Rubin in early 2026.
△ Less
Submitted 2 July, 2024; v1 submitted 13 June, 2024;
originally announced June 2024.
-
Four-qubit photonic system for publicly verifiable quantum random numbers and generation of public and private key
Authors:
Mayalakshmi Kolangatt,
Anirudh Verma,
Sujai Matta,
Kanad Sengupta,
C. M. Chandrashekar
Abstract:
We theoretically propose and experimentally demonstrate the use of a configurable four-qubit photonic system to generate a publicly verifiable quantum random numbers, to perform entanglement verification, and to generate a secure public and private key. Quantum circuits, to generate the desired four-qubit states and its experimental realization in the photonic architecture is carried out using pho…
▽ More
We theoretically propose and experimentally demonstrate the use of a configurable four-qubit photonic system to generate a publicly verifiable quantum random numbers, to perform entanglement verification, and to generate a secure public and private key. Quantum circuits, to generate the desired four-qubit states and its experimental realization in the photonic architecture is carried out using photon pairs entangled in polarization and path degree of freedom. By performing measurements on the four-qubit system and accessing partial information of the four-qubit state for public verification, we generate publicly verified and purely secured random bits at the rate of 370 kbps. When the system is used for generating public and private keys, an equal number of public and private keys are generated simultaneously. We also record about 97.9\% of sampled bits from four-qubit states passing entanglement verification. The theoretical model of noise on the four-qubit state and its effect on the generation rate of verified and secured bits are in perfect agreement with the experimental results. This demonstrates the practical use of the small-scale multi-qubit photonic system for quantum-safe applications by providing the option for real-time verification of the security feature of the quantum system.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
AI-Driven Predictive Analytics Approach for Early Prognosis of Chronic Kidney Disease Using Ensemble Learning and Explainable AI
Authors:
K M Tawsik Jawad,
Anusha Verma,
Fathi Amsaad
Abstract:
Chronic Kidney Disease (CKD) is one of the widespread Chronic diseases with no known ultimo cure and high morbidity. Research demonstrates that progressive Chronic Kidney Disease (CKD) is a heterogeneous disorder that significantly impacts kidney structure and functions, eventually leading to kidney failure. With the progression of time, chronic kidney disease has moved from a life-threatening dis…
▽ More
Chronic Kidney Disease (CKD) is one of the widespread Chronic diseases with no known ultimo cure and high morbidity. Research demonstrates that progressive Chronic Kidney Disease (CKD) is a heterogeneous disorder that significantly impacts kidney structure and functions, eventually leading to kidney failure. With the progression of time, chronic kidney disease has moved from a life-threatening disease affecting few people to a common disorder of varying severity. The goal of this research is to visualize dominating features, feature scores, and values exhibited for early prognosis and detection of CKD using ensemble learning and explainable AI. For that, an AI-driven predictive analytics approach is proposed to aid clinical practitioners in prescribing lifestyle modifications for individual patients to reduce the rate of progression of this disease. Our dataset is collected on body vitals from individuals with CKD and healthy subjects to develop our proposed AI-driven solution accurately. In this regard, blood and urine test results are provided, and ensemble tree-based machine-learning models are applied to predict unseen cases of CKD. Our research findings are validated after lengthy consultations with nephrologists. Our experiments and interpretation results are compared with existing explainable AI applications in various healthcare domains, including CKD. The comparison shows that our developed AI models, particularly the Random Forest model, have identified more features as significant contributors than XgBoost. Interpretability (I), which measures the ratio of important to masked features, indicates that our XgBoost model achieved a higher score, specifically a Fidelity of 98\%, in this metric and naturally in the FII index compared to competing models.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
Enhancing Sign Language Detection through Mediapipe and Convolutional Neural Networks (CNN)
Authors:
Aditya Raj Verma,
Gagandeep Singh,
Karnim Meghwal,
Banawath Ramji,
Praveen Kumar Dadheech
Abstract:
This research combines MediaPipe and CNNs for the efficient and accurate interpretation of ASL dataset for the real-time detection of sign language. The system presented here captures and processes hands' gestures in real time. the intended purpose was to create a very easy, accurate, and fast way of entering commands without the necessity of touching something.MediaPipe supports one of the powerf…
▽ More
This research combines MediaPipe and CNNs for the efficient and accurate interpretation of ASL dataset for the real-time detection of sign language. The system presented here captures and processes hands' gestures in real time. the intended purpose was to create a very easy, accurate, and fast way of entering commands without the necessity of touching something.MediaPipe supports one of the powerful frameworks in real-time hand tracking capabilities for the ability to capture and preprocess hand movements, which increases the accuracy of the gesture recognition system. Actually, the integration of CNN with the MediaPipe results in higher efficiency in using the model of real-time processing.The accuracy achieved by the model on ASL datasets is 99.12\%.The model was tested using American Sign Language (ASL) datasets. The results were then compared to those of existing methods to evaluate how well it performed, using established evaluation techniques. The system will have applications in the communication, education, and accessibility domains. Making systems such as described in this paper even better will assist people with hearing impairment and make things accessible to them. We tested the recognition and translation performance on an ASL dataset and achieved better accuracy over previous models.It is meant to the research is to identify the characters that American signs recognize using hand images taken from a web camera by based on mediapipe and CNNs
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Prompt Optimization with Human Feedback
Authors:
Xiaoqiang Lin,
Zhongxiang Dai,
Arun Verma,
See-Kiong Ng,
Patrick Jaillet,
Bryan Kian Hsiang Low
Abstract:
Large language models (LLMs) have demonstrated remarkable performances in various tasks. However, the performance of LLMs heavily depends on the input prompt, which has given rise to a number of recent works on prompt optimization. However, previous works often require the availability of a numeric score to assess the quality of every prompt. Unfortunately, when a human user interacts with a black…
▽ More
Large language models (LLMs) have demonstrated remarkable performances in various tasks. However, the performance of LLMs heavily depends on the input prompt, which has given rise to a number of recent works on prompt optimization. However, previous works often require the availability of a numeric score to assess the quality of every prompt. Unfortunately, when a human user interacts with a black-box LLM, attaining such a score is often infeasible and unreliable. Instead, it is usually significantly easier and more reliable to obtain preference feedback from a human user, i.e., showing the user the responses generated from a pair of prompts and asking the user which one is preferred. Therefore, in this paper, we study the problem of prompt optimization with human feedback (POHF), in which we aim to optimize the prompt for a black-box LLM using only human preference feedback. Drawing inspiration from dueling bandits, we design a theoretically principled strategy to select a pair of prompts to query for preference feedback in every iteration, and hence introduce our algorithm named automated POHF (APOHF). We apply our APOHF algorithm to various tasks, including optimizing user instructions, prompt optimization for text-to-image generative models, and response optimization with human feedback (i.e., further refining the response using a variant of our APOHF). The results demonstrate that our APOHF can efficiently find a good prompt using a small number of preference feedback instances. Our code can be found at \url{https://github.com/xqlin98/APOHF}.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Euclid. I. Overview of the Euclid mission
Authors:
Euclid Collaboration,
Y. Mellier,
Abdurro'uf,
J. A. Acevedo Barroso,
A. Achúcarro,
J. Adamek,
R. Adam,
G. E. Addison,
N. Aghanim,
M. Aguena,
V. Ajani,
Y. Akrami,
A. Al-Bahlawan,
A. Alavi,
I. S. Albuquerque,
G. Alestas,
G. Alguero,
A. Allaoui,
S. W. Allen,
V. Allevato,
A. V. Alonso-Tetilla,
B. Altieri,
A. Alvarez-Candal,
A. Amara,
L. Amendola
, et al. (1086 additional authors not shown)
Abstract:
The current standard model of cosmology successfully describes a variety of measurements, but the nature of its main ingredients, dark matter and dark energy, remains unknown. Euclid is a medium-class mission in the Cosmic Vision 2015-2025 programme of the European Space Agency (ESA) that will provide high-resolution optical imaging, as well as near-infrared imaging and spectroscopy, over about 14…
▽ More
The current standard model of cosmology successfully describes a variety of measurements, but the nature of its main ingredients, dark matter and dark energy, remains unknown. Euclid is a medium-class mission in the Cosmic Vision 2015-2025 programme of the European Space Agency (ESA) that will provide high-resolution optical imaging, as well as near-infrared imaging and spectroscopy, over about 14,000 deg^2 of extragalactic sky. In addition to accurate weak lensing and clustering measurements that probe structure formation over half of the age of the Universe, its primary probes for cosmology, these exquisite data will enable a wide range of science. This paper provides a high-level overview of the mission, summarising the survey characteristics, the various data-processing steps, and data products. We also highlight the main science objectives and expected performance.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
U.S. Election Hardens Hate Universe
Authors:
Akshay Verma,
Richard Sear,
Neil F. Johnson
Abstract:
Local or national politics can trigger potentially dangerous hate in someone. But with a third of the world's population eligible to vote in elections in 2024 alone, we lack understanding of how individual-level hate multiplies up to hate behavior at the collective global scale. Here we show, based on the most recent U.S. election, that offline events are associated with a rapid adaptation of the…
▽ More
Local or national politics can trigger potentially dangerous hate in someone. But with a third of the world's population eligible to vote in elections in 2024 alone, we lack understanding of how individual-level hate multiplies up to hate behavior at the collective global scale. Here we show, based on the most recent U.S. election, that offline events are associated with a rapid adaptation of the global online hate universe that hardens (strengthens) both its network-of-networks structure and the 'flavors' of hate content that it collectively produces. Approximately 50 million potential voters in hate communities are drawn closer to each other and to the broad mainstream of approximately 2 billion others. It triggers new hate content at scale around immigration, ethnicity, and antisemitism that aligns with conspiracy theories about Jewish-led replacement before blending in hate around gender identity/sexual orientation, and religion. Telegram acts as a key hardening agent - yet is overlooked by U.S. Congressional hearings and new E.U. legislation. Because the hate universe has remained robust since 2020, anti-hate messaging surrounding not only upcoming elections but also other events like the war in Gaza, should pivot to blending multiple hate 'flavors' while targeting previously untouched social media structures.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
Using LLMs in Software Requirements Specifications: An Empirical Evaluation
Authors:
Madhava Krishna,
Bhagesh Gaur,
Arsh Verma,
Pankaj Jalote
Abstract:
The creation of a Software Requirements Specification (SRS) document is important for any software development project. Given the recent prowess of Large Language Models (LLMs) in answering natural language queries and generating sophisticated textual outputs, our study explores their capability to produce accurate, coherent, and structured drafts of these documents to accelerate the software deve…
▽ More
The creation of a Software Requirements Specification (SRS) document is important for any software development project. Given the recent prowess of Large Language Models (LLMs) in answering natural language queries and generating sophisticated textual outputs, our study explores their capability to produce accurate, coherent, and structured drafts of these documents to accelerate the software development lifecycle. We assess the performance of GPT-4 and CodeLlama in drafting an SRS for a university club management system and compare it against human benchmarks using eight distinct criteria. Our results suggest that LLMs can match the output quality of an entry-level software engineer to generate an SRS, delivering complete and consistent drafts. We also evaluate the capabilities of LLMs to identify and rectify problems in a given requirements document. Our experiments indicate that GPT-4 is capable of identifying issues and giving constructive feedback for rectifying them, while CodeLlama's results for validation were not as encouraging. We repeated the generation exercise for four distinct use cases to study the time saved by employing LLMs for SRS generation. The experiment demonstrates that LLMs may facilitate a significant reduction in development time for entry-level software engineers. Hence, we conclude that the LLMs can be gainfully used by software engineers to increase productivity by saving time and effort in generating, validating and rectifying software requirements.
△ Less
Submitted 27 April, 2024;
originally announced April 2024.
-
Fast photon-mediated entanglement of continuously-cooled trapped ions for quantum networking
Authors:
Jameson O'Reilly,
George Toh,
Isabella Goetting,
Sagnik Saha,
Mikhail Shalaev,
Allison Carter,
Andrew Risinger,
Ashish Kalakuntla,
Tingguang Li,
Ashrit Verma,
Christopher Monroe
Abstract:
We entangle two co-trapped atomic barium ion qubits by collecting single visible photons from each ion through in-vacuo 0.8 NA objectives, interfering them through an integrated fiber-beamsplitter and detecting them in coincidence. This projects the qubits into an entangled Bell state with an observed fidelity lower bound of F > 94%. We also introduce an ytterbium ion for sympathetic cooling to re…
▽ More
We entangle two co-trapped atomic barium ion qubits by collecting single visible photons from each ion through in-vacuo 0.8 NA objectives, interfering them through an integrated fiber-beamsplitter and detecting them in coincidence. This projects the qubits into an entangled Bell state with an observed fidelity lower bound of F > 94%. We also introduce an ytterbium ion for sympathetic cooling to remove the need for recooling interruptions and achieve a continuous entanglement rate of 250 1/s.
△ Less
Submitted 2 July, 2024; v1 submitted 24 April, 2024;
originally announced April 2024.
-
Extraction of YSO Cores and Active Regions near Star-forming Site AFGL 5157
Authors:
Aayushi Verma,
Saurabh Sharma,
Lokesh Dewangan
Abstract:
We have carried out a quantitative analysis of the $1^{\circ} \times 1^{\circ}$ region near star-forming site AFGL 5157 using 'Minimal Spanning Tree' (MST). The analysis reveals that this region consists of five major clusters. The cluster radii of the cores and active regions were found to be varying between 0.75-2.62 pc and 2.77-4.58 pc, respectively, for these regions, while the aspect ratio va…
▽ More
We have carried out a quantitative analysis of the $1^{\circ} \times 1^{\circ}$ region near star-forming site AFGL 5157 using 'Minimal Spanning Tree' (MST). The analysis reveals that this region consists of five major clusters. The cluster radii of the cores and active regions were found to be varying between 0.75-2.62 pc and 2.77-4.58 pc, respectively, for these regions, while the aspect ratio varies between 0.71 to 7.17. This hints towards the clumpy as well as elongated clusters in the region. We calculated structure parameter Q for each region which varies between 0.41-0.62 and 0.23-0.81 for the cores and ARs, respectively. This shows the existence of fractal distribution in all the cores and ARs except the core of the [HKS2019] E70 bubble.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
Context-Enhanced Language Models for Generating Multi-Paper Citations
Authors:
Avinash Anand,
Kritarth Prasad,
Ujjwal Goel,
Mohit Gupta,
Naman Lal,
Astha Verma,
Rajiv Ratn Shah
Abstract:
Citation text plays a pivotal role in elucidating the connection between scientific documents, demanding an in-depth comprehension of the cited paper. Constructing citations is often time-consuming, requiring researchers to delve into extensive literature and grapple with articulating relevant content. To address this challenge, the field of citation text generation (CTG) has emerged. However, whi…
▽ More
Citation text plays a pivotal role in elucidating the connection between scientific documents, demanding an in-depth comprehension of the cited paper. Constructing citations is often time-consuming, requiring researchers to delve into extensive literature and grapple with articulating relevant content. To address this challenge, the field of citation text generation (CTG) has emerged. However, while earlier methods have primarily centered on creating single-sentence citations, practical scenarios frequently necessitate citing multiple papers within a single paragraph. To bridge this gap, we propose a method that leverages Large Language Models (LLMs) to generate multi-citation sentences. Our approach involves a single source paper and a collection of target papers, culminating in a coherent paragraph containing multi-sentence citation text. Furthermore, we introduce a curated dataset named MCG-S2ORC, composed of English-language academic research papers in Computer Science, showcasing multiple citation instances. In our experiments, we evaluate three LLMs LLaMA, Alpaca, and Vicuna to ascertain the most effective model for this endeavor. Additionally, we exhibit enhanced performance by integrating knowledge graphs from target papers into the prompts for generating citation text. This research underscores the potential of harnessing LLMs for citation generation, opening a compelling avenue for exploring the intricate connections between scientific documents.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
MM-PhyRLHF: Reinforcement Learning Framework for Multimodal Physics Question-Answering
Authors:
Avinash Anand,
Janak Kapuriya,
Chhavi Kirtani,
Apoorv Singh,
Jay Saraf,
Naman Lal,
Jatin Kumar,
Adarsh Raj Shivam,
Astha Verma,
Rajiv Ratn Shah,
Roger Zimmermann
Abstract:
Recent advancements in LLMs have shown their significant potential in tasks like text summarization and generation. Yet, they often encounter difficulty while solving complex physics problems that require arithmetic calculation and a good understanding of concepts. Moreover, many physics problems include images that contain important details required to understand the problem's context. We propose…
▽ More
Recent advancements in LLMs have shown their significant potential in tasks like text summarization and generation. Yet, they often encounter difficulty while solving complex physics problems that require arithmetic calculation and a good understanding of concepts. Moreover, many physics problems include images that contain important details required to understand the problem's context. We propose an LMM-based chatbot to answer multimodal physics MCQs. For domain adaptation, we utilize the MM-PhyQA dataset comprising Indian high school-level multimodal physics problems. To improve the LMM's performance, we experiment with two techniques, RLHF (Reinforcement Learning from Human Feedback) and Image Captioning. In image captioning, we add a detailed explanation of the diagram in each image, minimizing hallucinations and image processing errors. We further explore the integration of Reinforcement Learning from Human Feedback (RLHF) methodology inspired by the ranking approach in RLHF to enhance the human-like problem-solving abilities of the models. The RLHF approach incorporates human feedback into the learning process of LLMs, improving the model's problem-solving skills, truthfulness, and reasoning capabilities, minimizing the hallucinations in the answers, and improving the quality instead of using vanilla-supervised fine-tuned models. We employ the LLaVA open-source model to answer multimodal physics MCQs and compare the performance with and without using RLHF.
△ Less
Submitted 19 April, 2024;
originally announced April 2024.
-
LTL-Constrained Policy Optimization with Cycle Experience Replay
Authors:
Ameesh Shah,
Cameron Voloshin,
Chenxi Yang,
Abhinav Verma,
Swarat Chaudhuri,
Sanjit A. Seshia
Abstract:
Linear Temporal Logic (LTL) offers a precise means for constraining the behavior of reinforcement learning agents. However, in many tasks, LTL is insufficient for task specification; LTL-constrained policy optimization, where the goal is to optimize a scalar reward under LTL constraints, is needed. Prior methods for this constrained problem are restricted to finite state spaces. In this work, we p…
▽ More
Linear Temporal Logic (LTL) offers a precise means for constraining the behavior of reinforcement learning agents. However, in many tasks, LTL is insufficient for task specification; LTL-constrained policy optimization, where the goal is to optimize a scalar reward under LTL constraints, is needed. Prior methods for this constrained problem are restricted to finite state spaces. In this work, we present Cycle Experience Replay (CyclER), a reward-shaping approach to this problem that allows continuous state and action spaces and the use of function approximations. CyclER guides a policy towards satisfaction by encouraging partial behaviors compliant with the LTL constraint, using the structure of the constraint. In doing so, it addresses the optimization challenges stemming from the sparse nature of LTL satisfaction. We evaluate CyclER in three continuous control domains. On these tasks, CyclER outperforms existing reward-shaping methods at finding performant and LTL-satisfying policies.
△ Less
Submitted 24 May, 2024; v1 submitted 17 April, 2024;
originally announced April 2024.
-
SEVD: Synthetic Event-based Vision Dataset for Ego and Fixed Traffic Perception
Authors:
Manideep Reddy Aliminati,
Bharatesh Chakravarthi,
Aayush Atul Verma,
Arpitsinh Vaghela,
Hua Wei,
Xuesong Zhou,
Yezhou Yang
Abstract:
Recently, event-based vision sensors have gained attention for autonomous driving applications, as conventional RGB cameras face limitations in handling challenging dynamic conditions. However, the availability of real-world and synthetic event-based vision datasets remains limited. In response to this gap, we present SEVD, a first-of-its-kind multi-view ego, and fixed perception synthetic event-b…
▽ More
Recently, event-based vision sensors have gained attention for autonomous driving applications, as conventional RGB cameras face limitations in handling challenging dynamic conditions. However, the availability of real-world and synthetic event-based vision datasets remains limited. In response to this gap, we present SEVD, a first-of-its-kind multi-view ego, and fixed perception synthetic event-based dataset using multiple dynamic vision sensors within the CARLA simulator. Data sequences are recorded across diverse lighting (noon, nighttime, twilight) and weather conditions (clear, cloudy, wet, rainy, foggy) with domain shifts (discrete and continuous). SEVD spans urban, suburban, rural, and highway scenes featuring various classes of objects (car, truck, van, bicycle, motorcycle, and pedestrian). Alongside event data, SEVD includes RGB imagery, depth maps, optical flow, semantic, and instance segmentation, facilitating a comprehensive understanding of the scene. Furthermore, we evaluate the dataset using state-of-the-art event-based (RED, RVT) and frame-based (YOLOv8) methods for traffic participant detection tasks and provide baseline benchmarks for assessment. Additionally, we conduct experiments to assess the synthetic event-based dataset's generalization capabilities. The dataset is available at https://eventbasedvision.github.io/SEVD
△ Less
Submitted 19 April, 2024; v1 submitted 12 April, 2024;
originally announced April 2024.
-
KG-CTG: Citation Generation through Knowledge Graph-guided Large Language Models
Authors:
Avinash Anand,
Mohit Gupta,
Kritarth Prasad,
Ujjwal Goel,
Naman Lal,
Astha Verma,
Rajiv Ratn Shah
Abstract:
Citation Text Generation (CTG) is a task in natural language processing (NLP) that aims to produce text that accurately cites or references a cited document within a source document. In CTG, the generated text draws upon contextual cues from both the source document and the cited paper, ensuring accurate and relevant citation information is provided. Previous work in the field of citation generati…
▽ More
Citation Text Generation (CTG) is a task in natural language processing (NLP) that aims to produce text that accurately cites or references a cited document within a source document. In CTG, the generated text draws upon contextual cues from both the source document and the cited paper, ensuring accurate and relevant citation information is provided. Previous work in the field of citation generation is mainly based on the text summarization of documents. Following this, this paper presents a framework, and a comparative study to demonstrate the use of Large Language Models (LLMs) for the task of citation generation. Also, we have shown the improvement in the results of citation generation by incorporating the knowledge graph relations of the papers in the prompt for the LLM to better learn the relationship between the papers. To assess how well our model is performing, we have used a subset of standard S2ORC dataset, which only consists of computer science academic research papers in the English Language. Vicuna performs best for this task with 14.15 Meteor, 12.88 Rouge-1, 1.52 Rouge-2, and 10.94 Rouge-L. Also, Alpaca performs best, and improves the performance by 36.98% in Rouge-1, and 33.14% in Meteor by including knowledge graphs.
△ Less
Submitted 15 April, 2024;
originally announced April 2024.
-
Initial results of our spectro-photometric monitoring of XZ Tau
Authors:
Arpan Ghosh,
Saurabh Sharma,
Joe Philip Ninan,
Devendra K. Ojha,
Aayushi Verma,
Tarak Chand Sahu,
Rakesh Pandey,
Koshvendra Singh
Abstract:
We present here initial results of our spectro-photometric monitoring of XZ Tau. During our monitoring period, XZ Tau exhibited several episodes of brightness variations in timescales of months at optical wavelengths in contrast to the mid-infrared wavelengths. The color evolution of XZ Tau during this period suggest that the brightness variations are driven by changes in accretion from the disc.…
▽ More
We present here initial results of our spectro-photometric monitoring of XZ Tau. During our monitoring period, XZ Tau exhibited several episodes of brightness variations in timescales of months at optical wavelengths in contrast to the mid-infrared wavelengths. The color evolution of XZ Tau during this period suggest that the brightness variations are driven by changes in accretion from the disc. The mid-infrared light curve shows an overall decline in brightness by $\sim$ 0.5 and 0.7 magnitude respectively in WISE W1 (3.4 $μ$m) and W2 (4.6 $μ$m) bands. The emission profile of the hydrogen recombination lines along with that of Ca II IRT lines points towards magnetospheric accretion of XZ Tau. We have detected P Cygni profile in H$β$ indicating of outflowing winds from regions close to accretion. Forbidden transitions of oxygen are also detected, likely indicating of jets originating around the central pre-main sequence star.
△ Less
Submitted 14 April, 2024;
originally announced April 2024.
-
MM-PhyQA: Multimodal Physics Question-Answering With Multi-Image CoT Prompting
Authors:
Avinash Anand,
Janak Kapuriya,
Apoorv Singh,
Jay Saraf,
Naman Lal,
Astha Verma,
Rushali Gupta,
Rajiv Shah
Abstract:
While Large Language Models (LLMs) can achieve human-level performance in various tasks, they continue to face challenges when it comes to effectively tackling multi-step physics reasoning tasks. To identify the shortcomings of existing models and facilitate further research in this area, we curated a novel dataset, MM-PhyQA, which comprises well-constructed, high schoollevel multimodal physics pr…
▽ More
While Large Language Models (LLMs) can achieve human-level performance in various tasks, they continue to face challenges when it comes to effectively tackling multi-step physics reasoning tasks. To identify the shortcomings of existing models and facilitate further research in this area, we curated a novel dataset, MM-PhyQA, which comprises well-constructed, high schoollevel multimodal physics problems. By evaluating the performance of contemporary LLMs that are publicly available, both with and without the incorporation of multimodal elements in these problems, we aim to shed light on their capabilities. For generating answers for questions consisting of multimodal input (in this case, images and text) we employed Zero-shot prediction using GPT-4 and utilized LLaVA (LLaVA and LLaVA-1.5), the latter of which were fine-tuned on our dataset. For evaluating the performance of LLMs consisting solely of textual input, we tested the performance of the base and fine-tuned versions of the Mistral-7B and LLaMA2-7b models. We also showcased the performance of the novel Multi-Image Chain-of-Thought (MI-CoT) Prompting technique, which when used to train LLaVA-1.5 13b yielded the best results when tested on our dataset, with superior scores in most metrics and the highest accuracy of 71.65% on the test set.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
A Lightweight Security Solution for Mitigation of Hatchetman Attack in RPL-based 6LoWPAN
Authors:
Girish Sharma,
Jyoti Grover,
Abhishek Verma
Abstract:
In recent times, the Internet of Things (IoT) has a significant rise in industries, and we live in the era of Industry 4.0, where each device is connected to the Internet from small to big. These devices are Artificial Intelligence (AI) enabled and are capable of perspective analytics. By 2023, it's anticipated that over 14 billion smart devices will be available on the Internet. These application…
▽ More
In recent times, the Internet of Things (IoT) has a significant rise in industries, and we live in the era of Industry 4.0, where each device is connected to the Internet from small to big. These devices are Artificial Intelligence (AI) enabled and are capable of perspective analytics. By 2023, it's anticipated that over 14 billion smart devices will be available on the Internet. These applications operate in a wireless environment where memory, power, and other resource limitations apply to the nodes. In addition, the conventional routing method is ineffective in networks with limited resource devices, lossy links, and slow data rates. Routing Protocol for Low Power and Lossy Networks (RPL), a new routing protocol for such networks, was proposed by the IETF's ROLL group. RPL operates in two modes: Storing and Non-Storing. In Storing mode, each node have the information to reach to other node. In Non-Storing mode, the routing information lies with the root node only. The attacker may exploit the Non-Storing feature of the RPL. When the root node transmits User Datagram Protocol~(UDP) or control message packet to the child nodes, the routing information is stored in the extended header of the IPv6 packet. The attacker may modify the address from the source routing header which leads to Denial of Service (DoS) attack. This attack is RPL specific which is known as Hatchetman attack. This paper shows significant degradation in terms of network performance when an attacker exploits this feature. We also propose a lightweight mitigation of Hatchetman attack using game theoretic approach to detect the Hatchetman attack in IoT.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
The Canvas of Holography in (A)dS/CFT
Authors:
Vaibhav Kalvakota,
Aayush Verma
Abstract:
The dynamic of holography between anti-de Sitter space holography and de Sitter holography is a very fascinating comparison, which provides many key insights into what we expect from holography in general. In this Essay, we highlight this dynamic with three examples: first, when taking Wheeler-DeWitt states to the asymptotic boundary, the dual interpretation is unclear in de Sitter. Second, what w…
▽ More
The dynamic of holography between anti-de Sitter space holography and de Sitter holography is a very fascinating comparison, which provides many key insights into what we expect from holography in general. In this Essay, we highlight this dynamic with three examples: first, when taking Wheeler-DeWitt states to the asymptotic boundary, the dual interpretation is unclear in de Sitter. Second, what we make of bulk reconstruction and subregion duality in AdS/CFT is not trivially reflected in the dS/CFT scenario. Third, a way of formulating emergence and subregion-subalgebra duality in de Sitter space does not yet exist. With these examples, we provide some musings on this canvas of holography in the settings of (A)dS/CFT.
△ Less
Submitted 14 April, 2024; v1 submitted 28 March, 2024;
originally announced April 2024.
-
eTraM: Event-based Traffic Monitoring Dataset
Authors:
Aayush Atul Verma,
Bharatesh Chakravarthi,
Arpitsinh Vaghela,
Hua Wei,
Yezhou Yang
Abstract:
Event cameras, with their high temporal and dynamic range and minimal memory usage, have found applications in various fields. However, their potential in static traffic monitoring remains largely unexplored. To facilitate this exploration, we present eTraM - a first-of-its-kind, fully event-based traffic monitoring dataset. eTraM offers 10 hr of data from different traffic scenarios in various li…
▽ More
Event cameras, with their high temporal and dynamic range and minimal memory usage, have found applications in various fields. However, their potential in static traffic monitoring remains largely unexplored. To facilitate this exploration, we present eTraM - a first-of-its-kind, fully event-based traffic monitoring dataset. eTraM offers 10 hr of data from different traffic scenarios in various lighting and weather conditions, providing a comprehensive overview of real-world situations. Providing 2M bounding box annotations, it covers eight distinct classes of traffic participants, ranging from vehicles to pedestrians and micro-mobility. eTraM's utility has been assessed using state-of-the-art methods for traffic participant detection, including RVT, RED, and YOLOv8. We quantitatively evaluate the ability of event-based models to generalize on nighttime and unseen scenes. Our findings substantiate the compelling potential of leveraging event cameras for traffic monitoring, opening new avenues for research and application. eTraM is available at https://eventbasedvision.github.io/eTraM
△ Less
Submitted 2 April, 2024; v1 submitted 29 March, 2024;
originally announced March 2024.
-
A Sociotechnical Readiness Level Framework for the Development of Advanced Nuclear Technologies
Authors:
Aditi Verma,
Todd Allen
Abstract:
The Technology Readiness Level (TRL) scale was initially developed by NASA in the 1970s and is now widely used in space, nuclear, and other complex technology sectors in the US and beyond. The TRL scale is particularly useful for determining where extrapolation of untested sub-systems or features could produce technical risk, cause expensive redesigns, or act as a roadblock to technology developme…
▽ More
The Technology Readiness Level (TRL) scale was initially developed by NASA in the 1970s and is now widely used in space, nuclear, and other complex technology sectors in the US and beyond. The TRL scale is particularly useful for determining where extrapolation of untested sub-systems or features could produce technical risk, cause expensive redesigns, or act as a roadblock to technology development. In this paper, we propose the development of a sociotechnical readiness level (SRL), premised on the understanding that the successful development and eventual use of a technology requires achieving not only full technological readiness but also anticipating, prioritizing, and addressing societal concerns that may arise during the course of development of a technology. Failures to anticipate and address societal factors in the early stages of technology development have led to high-profile delays and, in some cases, ultimate failures of nuclear technology projects. The sociotechnical readiness scale, which conceptually draws on the design research and science and technology studies scholarship, centers on principles of equity and environmental justice in technology design, and emphasizes the need for social engagement during the process of technology development. Nowhere is such an approach to technology development more vital or needed than for the long-term management of spent nuclear fuel.
△ Less
Submitted 26 March, 2024;
originally announced March 2024.
-
Reckoning with the wicked problems of nuclear technology: Philosophy, design, and pedagogical method underlying a course on Nuclear Technology, Policy, and Society
Authors:
Aditi Verma
Abstract:
This paper describes the underlying philosophy, design, and implementation of a course on "Nuclear Technology, Policy, and Society" taught in the Department of Nuclear Engineering and Radiological Sciences at the University of Michigan. The course explores some of nuclear technology's most pressing challenges or its 'wicked problems'. Through this course students explore the origins of these probl…
▽ More
This paper describes the underlying philosophy, design, and implementation of a course on "Nuclear Technology, Policy, and Society" taught in the Department of Nuclear Engineering and Radiological Sciences at the University of Michigan. The course explores some of nuclear technology's most pressing challenges or its 'wicked problems'. Through this course students explore the origins of these problems be they social or technical and, they are offered tools, both conceptual and methodological to make sense of these problems, and guided through a semester-long exploration of how scientists engineers can work towards their resolution, and to what degree these problems can be solved through institutional transformation or a transformation in our own practices and norms as a field. The underlying pedagogical philosophy, implementation, and response to the course are described here for other instructors who might wish to create a similar course, or for non-academic nuclear scientists and engineers, who might perhaps, in these pages, find a vocabulary for articulating and reflecting on the nature of these problems as encountered in their praxis.
△ Less
Submitted 17 March, 2024;
originally announced March 2024.
-
Fusion energy commercialization requires solving social and environmental challenges
Authors:
Stephanie Diem,
Laila El-Guebaly,
Aditi Verma
Abstract:
Fusion energy, the process that uses the same reaction that powers the sun and the stars, offers the promise of virtually unlimited, carbon-free energy and is approaching reality. Recently, there's been a dramatic global increase in the investment and research focused on addressing the hurdles to commercialize fusion energy. While a majority of the effort has been focused on gaps in technology, li…
▽ More
Fusion energy, the process that uses the same reaction that powers the sun and the stars, offers the promise of virtually unlimited, carbon-free energy and is approaching reality. Recently, there's been a dramatic global increase in the investment and research focused on addressing the hurdles to commercialize fusion energy. While a majority of the effort has been focused on gaps in technology, little work has been done to address the societal and environmental impacts of this technology. Three community- and environmentally-focused research priorities are identified for commercializing fusion energy: 1) understanding the environmental impacts of fusion energy across the technology lifecycle, 2) developing risk and safety assessment methodologies for fusion power plant technologies, and 3) creating a community-based socially engaged approach for fusion technology design and development. This approach will benefit private companies who wish to deploy future fusion power plants as concerns about the technology will be addressed early in the design process, thus minimizing delays in deployment that may result in increased costs for developers. Community engagement around fusion technology development must be evidence-based in order to build trust between communities and technology developers. Such an approach is grounded in informed consent is vital for the sustainable development and use of fusion technologies.
△ Less
Submitted 9 March, 2024;
originally announced March 2024.
-
Hierarchical energy signatures using machine learning for operational visibility and diagnostics in automotive manufacturing
Authors:
Ankur Verma,
Seog-Chan Oh,
Jorge Arinez,
Soundar Kumara
Abstract:
Manufacturing energy consumption data contains important process signatures required for operational visibility and diagnostics. These signatures may be of different temporal scales, ranging from monthly to sub-second resolutions. We introduce a hierarchical machine learning approach to identify automotive process signatures from paint shop electricity consumption data at varying temporal scales (…
▽ More
Manufacturing energy consumption data contains important process signatures required for operational visibility and diagnostics. These signatures may be of different temporal scales, ranging from monthly to sub-second resolutions. We introduce a hierarchical machine learning approach to identify automotive process signatures from paint shop electricity consumption data at varying temporal scales (weekly and daily). A Multi-Layer Perceptron (MLP), a Convolutional Neural Network (CNN), and Principal Component Analysis (PCA) combined with Logistic Regression (LR) are used for the analysis. We validate the utility of the developed algorithms with subject matter experts for (i) better operational visibility, and (ii) identifying energy saving opportunities.
△ Less
Submitted 24 February, 2024;
originally announced February 2024.
-
Mask-up: Investigating Biases in Face Re-identification for Masked Faces
Authors:
Siddharth D Jaiswal,
Ankit Kr. Verma,
Animesh Mukherjee
Abstract:
AI based Face Recognition Systems (FRSs) are now widely distributed and deployed as MLaaS solutions all over the world, moreso since the COVID-19 pandemic for tasks ranging from validating individuals' faces while buying SIM cards to surveillance of citizens. Extensive biases have been reported against marginalized groups in these systems and have led to highly discriminatory outcomes. The post-pa…
▽ More
AI based Face Recognition Systems (FRSs) are now widely distributed and deployed as MLaaS solutions all over the world, moreso since the COVID-19 pandemic for tasks ranging from validating individuals' faces while buying SIM cards to surveillance of citizens. Extensive biases have been reported against marginalized groups in these systems and have led to highly discriminatory outcomes. The post-pandemic world has normalized wearing face masks but FRSs have not kept up with the changing times. As a result, these systems are susceptible to mask based face occlusion. In this study, we audit four commercial and nine open-source FRSs for the task of face re-identification between different varieties of masked and unmasked images across five benchmark datasets (total 14,722 images). These simulate a realistic validation/surveillance task as deployed in all major countries around the world. Three of the commercial and five of the open-source FRSs are highly inaccurate; they further perpetuate biases against non-White individuals, with the lowest accuracy being 0%. A survey for the same task with 85 human participants also results in a low accuracy of 40%. Thus a human-in-the-loop moderation in the pipeline does not alleviate the concerns, as has been frequently hypothesized in literature. Our large-scale study shows that developers, lawmakers and users of such services need to rethink the design principles behind FRSs, especially for the task of face re-identification, taking cognizance of observed biases.
△ Less
Submitted 21 February, 2024;
originally announced February 2024.
-
A mathematical survey on Fourier type integral transform and their offshoots: windowed Fourier transform, wavelet transform and Stockwell transform
Authors:
Bivek Gupta,
Amit K. Verma
Abstract:
This comprehensive review paper delves into the intricacies of advanced Fourier type integral transforms and their mathematical properties, with a particular focus on fractional Fourier transform (FrFT), linear canonical transform (LCT), quadratic phase Fourier transform (QPFT), and their associated offshoots: windowed Fourier transform, wavelet transform, and Stockwell transform. In the pursuit o…
▽ More
This comprehensive review paper delves into the intricacies of advanced Fourier type integral transforms and their mathematical properties, with a particular focus on fractional Fourier transform (FrFT), linear canonical transform (LCT), quadratic phase Fourier transform (QPFT), and their associated offshoots: windowed Fourier transform, wavelet transform, and Stockwell transform. In the pursuit of a deeper understanding of these transformations, we explore their convolution properties, shedding light on their capacity to define windowed, wavelet and Stockwell transforms in the realm of Fourier, fractional Fourier and quadratic phase Fourier transforms. This review also expands its purview to the realm of uncertainty principles. Several uncertainty principles, like Heisenberg, logarithmic, local, Rényi uncertainty principles, etc., within the context of fractional Fourier, linear canonical, and quadratic phase Fourier transforms, as well as their derivative offshoots are presented in the paper both for the functions of complex as well as quatenrion valued. In particular, the counterpart of several important inequalities of classical Fourier transform are also presented in details for the quaternion case. This article also reviews that multiresolution analysis that has been developed in the literature so far.
△ Less
Submitted 1 February, 2024;
originally announced February 2024.
-
Ultrafast Nuclear Dynamics in Double-Core Ionized Water Molecules
Authors:
Iyas Ismail,
Ludger Inhester,
Tatiana Marchenko,
Florian Trinter,
Abhishek Verma,
Alberto De Fanis,
Anthony Ferte,
Daniel E. Rivas,
Dawei Peng,
Dimitris Koulentianos,
Edwin Kukk,
Francis Penent,
Gilles Doumy,
Giuseppe Sansone,
John D. Bozek,
Kai Li,
Linda Young,
Markus Ilchen,
Maria Novella Piancastelli,
Michael Meyer,
Nicolas Velasquez,
Oksana Travnikova,
Rebecca Boll,
Renaud Guillemin,
Reinhard Dorner
, et al. (8 additional authors not shown)
Abstract:
Double-core-hole (DCH) states in isolated water and heavy water molecules, resulting from the sequential absorption of two x-ray photons, have been investigated. A comparison of the subsequent Auger emission spectra from the two isotopes provides direct evidence of ultrafast nuclear motion during the 1.5 fs lifetime of these DCH states. Our numerical results align well with the experimental data,…
▽ More
Double-core-hole (DCH) states in isolated water and heavy water molecules, resulting from the sequential absorption of two x-ray photons, have been investigated. A comparison of the subsequent Auger emission spectra from the two isotopes provides direct evidence of ultrafast nuclear motion during the 1.5 fs lifetime of these DCH states. Our numerical results align well with the experimental data, providing for various DCH states an in-depth study of the dynamics responsible of the observed isotope effect.
△ Less
Submitted 11 March, 2024; v1 submitted 5 February, 2024;
originally announced February 2024.
-
Decentralised, Collaborative, and Privacy-preserving Machine Learning for Multi-Hospital Data
Authors:
Congyu Fang,
Adam Dziedzic,
Lin Zhang,
Laura Oliva,
Amol Verma,
Fahad Razak,
Nicolas Papernot,
Bo Wang
Abstract:
Machine Learning (ML) has demonstrated its great potential on medical data analysis. Large datasets collected from diverse sources and settings are essential for ML models in healthcare to achieve better accuracy and generalizability. Sharing data across different healthcare institutions is challenging because of complex and varying privacy and regulatory requirements. Hence, it is hard but crucia…
▽ More
Machine Learning (ML) has demonstrated its great potential on medical data analysis. Large datasets collected from diverse sources and settings are essential for ML models in healthcare to achieve better accuracy and generalizability. Sharing data across different healthcare institutions is challenging because of complex and varying privacy and regulatory requirements. Hence, it is hard but crucial to allow multiple parties to collaboratively train an ML model leveraging the private datasets available at each party without the need for direct sharing of those datasets or compromising the privacy of the datasets through collaboration. In this paper, we address this challenge by proposing Decentralized, Collaborative, and Privacy-preserving ML for Multi-Hospital Data (DeCaPH). It offers the following key benefits: (1) it allows different parties to collaboratively train an ML model without transferring their private datasets; (2) it safeguards patient privacy by limiting the potential privacy leakage arising from any contents shared across the parties during the training process; and (3) it facilitates the ML model training without relying on a centralized server. We demonstrate the generalizability and power of DeCaPH on three distinct tasks using real-world distributed medical datasets: patient mortality prediction using electronic health records, cell-type classification using single-cell human genomes, and pathology identification using chest radiology images. We demonstrate that the ML models trained with DeCaPH framework have an improved utility-privacy trade-off, showing it enables the models to have good performance while preserving the privacy of the training data points. In addition, the ML models trained with DeCaPH framework in general outperform those trained solely with the private datasets from individual parties, showing that DeCaPH enhances the model generalizability.
△ Less
Submitted 28 April, 2024; v1 submitted 31 January, 2024;
originally announced February 2024.
-
Meme-ingful Analysis: Enhanced Understanding of Cyberbullying in Memes Through Multimodal Explanations
Authors:
Prince Jha,
Krishanu Maity,
Raghav Jain,
Apoorv Verma,
Sriparna Saha,
Pushpak Bhattacharyya
Abstract:
Internet memes have gained significant influence in communicating political, psychological, and sociocultural ideas. While memes are often humorous, there has been a rise in the use of memes for trolling and cyberbullying. Although a wide variety of effective deep learning-based models have been developed for detecting offensive multimodal memes, only a few works have been done on explainability a…
▽ More
Internet memes have gained significant influence in communicating political, psychological, and sociocultural ideas. While memes are often humorous, there has been a rise in the use of memes for trolling and cyberbullying. Although a wide variety of effective deep learning-based models have been developed for detecting offensive multimodal memes, only a few works have been done on explainability aspect. Recent laws like "right to explanations" of General Data Protection Regulation, have spurred research in developing interpretable models rather than only focusing on performance. Motivated by this, we introduce {\em MultiBully-Ex}, the first benchmark dataset for multimodal explanation from code-mixed cyberbullying memes. Here, both visual and textual modalities are highlighted to explain why a given meme is cyberbullying. A Contrastive Language-Image Pretraining (CLIP) projection-based multimodal shared-private multitask approach has been proposed for visual and textual explanation of a meme. Experimental results demonstrate that training with multimodal explanations improves performance in generating textual justifications and more accurately identifying the visual evidence supporting a decision with reliable performance improvements.
△ Less
Submitted 18 January, 2024;
originally announced January 2024.
-
Effect of relativistic equation of state and diffusion coefficient on diffusive shock acceleration
Authors:
Anshuman Verma,
Saksham Chandna,
Ritam Mallick
Abstract:
Diffusive Shock Acceleration, resulting from first-order Fermi acceleration occurring near Magnetohydrodynamic shock waves, is essential in explaining the power law spectrum in various astrophysical radiation and cosmic rays. We perform Monte Carlo simulations to model the stochastic scattering process in Fermi acceleration, capturing the confinement of particles around the shock within the ambien…
▽ More
Diffusive Shock Acceleration, resulting from first-order Fermi acceleration occurring near Magnetohydrodynamic shock waves, is essential in explaining the power law spectrum in various astrophysical radiation and cosmic rays. We perform Monte Carlo simulations to model the stochastic scattering process in Fermi acceleration, capturing the confinement of particles around the shock within the ambient fluid. The model is tested and validated by comparing it with the spectral index obtained with analytical calculation. Assuming a relativistic EoS, we calculate the power-law spectral index for different diffusion coefficients. With constant diffusion co-efficient and stiffer EoS, the observed range of the spectral index is very narrow; however, as the EoS becomes softer, the range increases. With varying diffusion co-efficient stiffer EoS fails to give a well-defined spectral index (no linear spectrum); however, as the EoS becomes softer, the spectral index lies between $2-4$. For ultra-relativistic shocks, we consistently obtained a linear spectrum; however, the spectral index range varied considerably with the diffusion coefficient.
△ Less
Submitted 16 January, 2024;
originally announced January 2024.
-
Dynamic Multi Color Switching using Ultrathin Vanadium Oxide on Aluminium based Asymmetric Fabry-Perot Resonant Structure
Authors:
Shubhangi Saini,
Ashok P,
Amit Verma
Abstract:
Vanadium dioxide ($VO_{2}$) exhibits strong infrared optical switching due to its insulator-metal phase-transition property. However, in the visible wavelengths, it's intrinsic optical switching is quite low. Current research explores solutions like multilayering, intricate structural patterning, high thermal budget processes and costly metals for improved color switching. Nonetheless, the color g…
▽ More
Vanadium dioxide ($VO_{2}$) exhibits strong infrared optical switching due to its insulator-metal phase-transition property. However, in the visible wavelengths, it's intrinsic optical switching is quite low. Current research explores solutions like multilayering, intricate structural patterning, high thermal budget processes and costly metals for improved color switching. Nonetheless, the color gamut coverage with these methodologies remains notably limited. This work overcomes these limitations and demonstrates dynamic multi-colour switching covering a large color gamut using a simple, unpatterned, ultrathin ($\sim$ $\fracλ{14}$, where wavelength $λ$ is taken as 575 nm at the center of visible spectrum) asymmetric Fabry-Pérot structure of $VO_{2}$ on Aluminium (Al). We use the transfer matrix method to design the $VO_{2}/Aluminium\,(Al)/Sapphire$ structure for maximum visible reflectance switching. $VO_{2}$ films are synthesized using a simple, low thermal budget atmospheric oxidation of Vanadium (V). With varying oxidation durations, different colors of the oxidized samples are observed. Consistent and reversible color-switching is observed visibly and in reflectance measurements with the change in temperature from low (RT $\sim$ 30$^{\circ}$C) to high (HT $\sim$ 100$^{\circ}$C) or vice versa due to the phase transition property of the $VO_{2}$ layer in the structure. Compared to the existing studies, this work shows a significant change in chromaticities and covers a large color gamut when plotted on the CIE chromaticity diagram. This work has potential applications in the fields of display, thermochromic structures, and visible camouflage.
△ Less
Submitted 7 January, 2024;
originally announced January 2024.
-
Non-standard neutrino interactions mediated by a light scalar at DUNE
Authors:
Bhaskar Dutta,
Sumit Ghosh,
Kevin J. Kelly,
Tianjun Li,
Adrian Thompson,
Ankur Verma
Abstract:
We investigate the effect on neutrino oscillations generated by beyond-the-standard-model interactions between neutrinos and matter. Specifically, we focus on scalar-mediated non-standard interactions (NSI) whose impact fundamentally differs from that of vector-mediated NSI. Scalar NSI contribute as corrections to the neutrino mass matrix rather than the matter potential and thereby predict distin…
▽ More
We investigate the effect on neutrino oscillations generated by beyond-the-standard-model interactions between neutrinos and matter. Specifically, we focus on scalar-mediated non-standard interactions (NSI) whose impact fundamentally differs from that of vector-mediated NSI. Scalar NSI contribute as corrections to the neutrino mass matrix rather than the matter potential and thereby predict distinct phenomenology from the vector-mediated ones. Similar to vector-type NSI, the presence of scalar-mediated neutrino NSI can influence measurements of oscillation parameters in long-baseline neutrino oscillation experiments, with a notable impact on CP measurement in the case of DUNE. Our study focuses on the effect of scalar NSI on neutrino oscillations, using DUNE as an example. We introduce a model-independent parameterization procedure that enables the examination of the impact of all non-zero scalar NSI parameters simultaneously. Subsequently, we convert DUNE's sensitivity to the NSI parameters into projected sensitivity concerning the parameters of a light scalar model. We compare these results with existing non-oscillation probes. Our findings reveal that the region of the light scalar parameter space sensitive to DUNE is predominantly excluded by non-oscillation probes, except for scenarios with very light mediator mass.
△ Less
Submitted 4 January, 2024;
originally announced January 2024.
-
Cluster Formation in a Filamentary Cloud: The Case of the Stellar Cluster NGC 2316
Authors:
Saurabh Sharma,
Aayushi Verma,
Kshitiz Mallick,
Lokesh K. Dewangan,
Harmeen Kaur,
Ram Kesh Yadav,
Neelam Panwar,
Devendra K. Ojha,
Tarak Chand,
Mamta Agarwal
Abstract:
We present a multi-wavelength analysis of the star cluster NGC 2316 and its surroundings. We estimated the physical parameters of the NGC 2316 cluster, including its shape (elongated), size (Rcluster = 0.4 pc), distance (1.3 +/- 0.3 kpc), and minimum reddening (AV = 1.55 mag). We discovered two massive stars (B2.0V-B1.5V, age ~12 Myr) embedded (AV = 4 mag) within this cluster. The cluster region s…
▽ More
We present a multi-wavelength analysis of the star cluster NGC 2316 and its surroundings. We estimated the physical parameters of the NGC 2316 cluster, including its shape (elongated), size (Rcluster = 0.4 pc), distance (1.3 +/- 0.3 kpc), and minimum reddening (AV = 1.55 mag). We discovered two massive stars (B2.0V-B1.5V, age ~12 Myr) embedded (AV = 4 mag) within this cluster. The cluster region still forms young stars even though the most massive star was born ~12 Myr ago. We also found evidence of positive feedback from these massive stars. We identified a cold gas/dust lane extending westward from the cluster. The western end of the dust lane seems to favor low-mass star formation, whereas the cluster's end favors bit massive star formation, which seems to have started earlier than the western end. We found an elongated molecular cloud in this region, characterized by numerous filamentary structures. The morphology of the filaments, along with position-velocity (pv) maps, velocity dispersion maps, channel maps, etc., indicate a coalescence of filaments and a potential longitudinal flow of matter toward the cluster through the western end of the gas/dust lane. This entire region seems to be a Hub-filamentary system (HFS), in which the NGC 2316 cluster is probably the hub and the dark lane is the main filamentary structure. Being the gravity well of this HFS, star formation started first in the NGC 2316 region and went on to the other filamentary nodes.
△ Less
Submitted 29 December, 2023;
originally announced December 2023.
-
Bilayer Vanadium Dioxide Thin Film with Elevated Transition Temperatures and High Resistance Switching
Authors:
Achintya Dutta,
Ashok P,
Amit Verma
Abstract:
Despite widespread interest in the phase-change applications of vanadium dioxide (VO$_2$), the fabrication of high-quality VO$_2$ thin films with elevated transition temperatures (TIMT) and high Insulator-Metal-Transition resistance switching still remains a challenge. This study introduces a two-step atmospheric oxidation approach to fabricate bilayer VO$_{2-x}$/VO$_2$ films on a c-plane sapphire…
▽ More
Despite widespread interest in the phase-change applications of vanadium dioxide (VO$_2$), the fabrication of high-quality VO$_2$ thin films with elevated transition temperatures (TIMT) and high Insulator-Metal-Transition resistance switching still remains a challenge. This study introduces a two-step atmospheric oxidation approach to fabricate bilayer VO$_{2-x}$/VO$_2$ films on a c-plane sapphire substrate. To quantify the impact of the VO$_2$ buffer layer, a single-layer VO$_2$ film of the same thickness was also fabricated. The bilayer VO$_{2-x}$/VO$_2$ films wherein the top VO$_{2-x}$ film was under-oxidized demonstrated an elevation in TIMT reaching ~97 $^\circ$C, one of the highest reported to date for VO$_2$ films and is achieved in a doping-free manner. Our results also reveal a one-order increase in resistance switching, with the optimum bilayer VO$_2$/VO$_2$ film exhibiting ~3.6 orders of switching from 25 $^\circ$C to 110 $^\circ$C, compared to the optimum single-layer VO$_2$ reference film. This is accompanied by a one-order decrease in the on-state resistance in its metallic phase. The elevation in TIMT, coupled with increased strain extracted from the XRD characterization of the bilayer film, suggests the possibility of compressive strain along the c-axis. These VO$_{2-x}$/VO$_2$ films also demonstrate a significant change in the slope of their resistance vs temperature curves contrary to the conventional smooth transition. This feature was ascribed to the rutile/monoclinic quasi-heterostructure formed due to the top VO$_{2-x}$ film having a reduced TIMT. Our findings carry significant implications for both the lucid fabrication of VO$_2$ thin film devices as well as the study of phase transitions in correlated oxides.
△ Less
Submitted 28 December, 2023;
originally announced December 2023.
-
Efficacy of Machine-Generated Instructions
Authors:
Samaksh Gulati,
Anshit Verma,
Manoj Parmar,
Palash Chaudhary
Abstract:
Large "instruction-tuned" language models (i.e., finetuned to respond to instructions) have demonstrated a remarkable ability to generalize zero-shot to new tasks. Nevertheless, they depend heavily on human-written instruction data that is often limited in quantity, diversity, and creativity, therefore hindering the generality of the tuned model. We conducted a quantitative study to figure out the…
▽ More
Large "instruction-tuned" language models (i.e., finetuned to respond to instructions) have demonstrated a remarkable ability to generalize zero-shot to new tasks. Nevertheless, they depend heavily on human-written instruction data that is often limited in quantity, diversity, and creativity, therefore hindering the generality of the tuned model. We conducted a quantitative study to figure out the efficacy of machine-generated annotations, where we compare the results of a fine-tuned BERT model with human v/s machine-generated annotations. Applying our methods to the vanilla GPT-3 model, we saw that machine generated annotations were 78.54% correct and the fine-tuned model achieved a 96.01% model performance compared to the performance with human-labelled annotations. This result shows that machine-generated annotations are a resource and cost effective way to fine-tune down-stream models.
△ Less
Submitted 21 December, 2023;
originally announced December 2023.
-
Compositional Policy Learning in Stochastic Control Systems with Formal Guarantees
Authors:
Đorđe Žikelić,
Mathias Lechner,
Abhinav Verma,
Krishnendu Chatterjee,
Thomas A. Henzinger
Abstract:
Reinforcement learning has shown promising results in learning neural network policies for complicated control tasks. However, the lack of formal guarantees about the behavior of such policies remains an impediment to their deployment. We propose a novel method for learning a composition of neural network policies in stochastic environments, along with a formal certificate which guarantees that a…
▽ More
Reinforcement learning has shown promising results in learning neural network policies for complicated control tasks. However, the lack of formal guarantees about the behavior of such policies remains an impediment to their deployment. We propose a novel method for learning a composition of neural network policies in stochastic environments, along with a formal certificate which guarantees that a specification over the policy's behavior is satisfied with the desired probability. Unlike prior work on verifiable RL, our approach leverages the compositional nature of logical specifications provided in SpectRL, to learn over graphs of probabilistic reach-avoid specifications. The formal guarantees are provided by learning neural network policies together with reach-avoid supermartingales (RASM) for the graph's sub-tasks and then composing them into a global policy. We also derive a tighter lower bound compared to previous work on the probability of reach-avoidance implied by a RASM, which is required to find a compositional policy with an acceptable probabilistic threshold for complex tasks with multiple edge policies. We implement a prototype of our approach and evaluate it on a Stochastic Nine Rooms environment.
△ Less
Submitted 3 December, 2023;
originally announced December 2023.
-
A Comparative Analysis of Text-to-Image Generative AI Models in Scientific Contexts: A Case Study on Nuclear Power
Authors:
Veda Joynt,
Jacob Cooper,
Naman Bhargava,
Katie Vu,
O Hwang Kwon,
Todd R. Allen,
Aditi Verma,
Majdi I. Radaideh
Abstract:
In this work, we propose and assess the potential of generative artificial intelligence (AI) to generate public engagement around potential clean energy sources. Such an application could increase energy literacy -- an awareness of low-carbon energy sources among the public therefore leading to increased participation in decision-making about the future of energy systems. We explore the use of gen…
▽ More
In this work, we propose and assess the potential of generative artificial intelligence (AI) to generate public engagement around potential clean energy sources. Such an application could increase energy literacy -- an awareness of low-carbon energy sources among the public therefore leading to increased participation in decision-making about the future of energy systems. We explore the use of generative AI to communicate technical information about low-carbon energy sources to the general public, specifically in the realm of nuclear energy. We explored 20 AI-powered text-to-image generators and compared their individual performances on general and scientific nuclear-related prompts. Of these models, DALL-E, DreamStudio, and Craiyon demonstrated promising performance in generating relevant images from general-level text related to nuclear topics. However, these models fall short in three crucial ways: (1) they fail to accurately represent technical details of energy systems; (2) they reproduce existing biases surrounding gender and work in the energy sector; and (3) they fail to accurately represent indigenous landscapes -- which have historically been sites of resource extraction and waste deposition for energy industries. This work is performed to motivate the development of specialized generative tools and their captions to improve energy literacy and effectively engage the public with low-carbon energy sources.
△ Less
Submitted 2 December, 2023;
originally announced December 2023.
-
Galaxy Formation and Symbiotic Evolution with the Inter-Galactic Medium in the Age of ELT-ANDES
Authors:
Valentina D'Odorico,
James S. Bolton,
Lise Christensen,
Annalisa De Cia,
Erik Zackrisson,
Aron Kordt,
Luca Izzo,
Jiangtao Li,
Roberto Maiolino,
Alessandro Marconi,
Philipp Richter,
Andrea Saccardi,
Stefania Salvadori,
Irene Vanni,
Chiara Feruglio,
Michele Fumagalli,
Johan P. U. Fynbo,
Pasquier Noterdaeme,
Polychronis Papaderos,
Celine Peroux,
Aprajita Verma,
Paolo Di Marcantonio,
Livia Origlia,
Alessio Zanutta
Abstract:
High-resolution absorption spectroscopy toward bright background sources has had a paramount role in understanding early galaxy formation, the evolution of the intergalactic medium and the reionisation of the Universe. However, these studies are now approaching the boundaries of what can be achieved at ground-based 8-10m class telescopes. The identification of primeval systems at the highest redsh…
▽ More
High-resolution absorption spectroscopy toward bright background sources has had a paramount role in understanding early galaxy formation, the evolution of the intergalactic medium and the reionisation of the Universe. However, these studies are now approaching the boundaries of what can be achieved at ground-based 8-10m class telescopes. The identification of primeval systems at the highest redshifts, within the reionisation epoch and even into the dark ages, and of the products of the first generation of stars and the chemical enrichment of the early Universe, requires observing very faint targets with a signal-to-noise ratio high enough to detect very faint spectral signatures. In this paper, we describe the giant leap forward that will be enabled by ANDES, the high-resolution spectrograph for the ELT, in these key science fields, together with a brief, non-exhaustive overview of other extragalactic research topics that will be pursued by this instrument, and its synergistic use with other facilities that will become available in the early 2030s.
△ Less
Submitted 28 November, 2023;
originally announced November 2023.
-
A Bayesian Approach to Strong Lens Finding in the Era of Wide-area Surveys
Authors:
Philip Holloway,
Philip J. Marshall,
Aprajita Verma,
Anupreeta More,
Raoul Cañameras,
Anton T. Jaelani,
Yuichiro Ishida,
Kenneth C. Wong
Abstract:
The arrival of the Vera C. Rubin Observatory's Legacy Survey of Space and Time (LSST), Euclid-Wide and Roman wide area sensitive surveys will herald a new era in strong lens science in which the number of strong lenses known is expected to rise from $\mathcal{O}(10^3)$ to $\mathcal{O}(10^5)$. However, current lens-finding methods still require time-consuming follow-up visual inspection by strong-l…
▽ More
The arrival of the Vera C. Rubin Observatory's Legacy Survey of Space and Time (LSST), Euclid-Wide and Roman wide area sensitive surveys will herald a new era in strong lens science in which the number of strong lenses known is expected to rise from $\mathcal{O}(10^3)$ to $\mathcal{O}(10^5)$. However, current lens-finding methods still require time-consuming follow-up visual inspection by strong-lens experts to remove false positives which is only set to increase with these surveys. In this work we demonstrate a range of methods to produce calibrated probabilities to help determine the veracity of any given lens candidate. To do this we use the classifications from citizen science and multiple neural networks for galaxies selected from the Hyper Suprime-Cam (HSC) survey. Our methodology is not restricted to particular classifier types and could be applied to any strong lens classifier which produces quantitative scores. Using these calibrated probabilities, we generate an ensemble classifier, combining citizen science and neural network lens finders. We find such an ensemble can provide improved classification over the individual classifiers. We find a false positive rate of $10^{-3}$ can be achieved with a completeness of $46\%$, compared to $34\%$ for the best individual classifier. Given the large number of galaxy-galaxy strong lenses anticipated in LSST, such improvement would still produce significant numbers of false positives, in which case using calibrated probabilities will be essential for population analysis of large populations of lenses.
△ Less
Submitted 17 April, 2024; v1 submitted 13 November, 2023;
originally announced November 2023.
-
Exploiting Correlated Auxiliary Feedback in Parameterized Bandits
Authors:
Arun Verma,
Zhongxiang Dai,
Yao Shu,
Bryan Kian Hsiang Low
Abstract:
We study a novel variant of the parameterized bandits problem in which the learner can observe additional auxiliary feedback that is correlated with the observed reward. The auxiliary feedback is readily available in many real-life applications, e.g., an online platform that wants to recommend the best-rated services to its users can observe the user's rating of service (rewards) and collect addit…
▽ More
We study a novel variant of the parameterized bandits problem in which the learner can observe additional auxiliary feedback that is correlated with the observed reward. The auxiliary feedback is readily available in many real-life applications, e.g., an online platform that wants to recommend the best-rated services to its users can observe the user's rating of service (rewards) and collect additional information like service delivery time (auxiliary feedback). In this paper, we first develop a method that exploits auxiliary feedback to build a reward estimator with tight confidence bounds, leading to a smaller regret. We then characterize the regret reduction in terms of the correlation coefficient between reward and its auxiliary feedback. Experimental results in different settings also verify the performance gain achieved by our proposed method.
△ Less
Submitted 5 November, 2023;
originally announced November 2023.
-
Gravity- and temperature-driven phase transitions in a model for collapsed axionic condensates
Authors:
Sanjay Shukla,
Akhilesh Kumar Verma,
Marc E. Brachet,
Rahul Pandit
Abstract:
We show how to use the cubic-quintic Gross-Pitaevskii-Poisson equation (cq-GPPE) and the cubic-quintic Stochastic Ginzburg-Landau-Poisson equation (cq-SGLPE) to investigate the gravitational collapse of a tenuous axionic gas into a collapsed axionic condensate for both zero and finite temperature $T$. At $T=0$, we use a Gaussian Ansatz for a spherically symmetric density to obtain parameter regime…
▽ More
We show how to use the cubic-quintic Gross-Pitaevskii-Poisson equation (cq-GPPE) and the cubic-quintic Stochastic Ginzburg-Landau-Poisson equation (cq-SGLPE) to investigate the gravitational collapse of a tenuous axionic gas into a collapsed axionic condensate for both zero and finite temperature $T$. At $T=0$, we use a Gaussian Ansatz for a spherically symmetric density to obtain parameter regimes in which we might expect to find compact axionic condensates. We then go beyond this Ansatz, by using the cq-SGLPE to investigate the dependence of the axionic condensate on the gravitational strength $G$ at $T = 0$. We demonstrate that, as $G$ increases, the equilibrium configuration goes from a tenuous axionic gas, to flat sheets or $\textit{Zeldovich pancakes}$, cylindrical structures, and finally a spherical axionic condensate. By varying $G$, we show that there are first-order phase transitions, as the system goes from one of these structures to the next one; we find hysteresis loops that are associated with these transitions. We examine these states and the transitions between these states via the Fourier truncated cq-GPPE; and we also obtain the thermalized $T > 0$ states from the cq-SGLPE; the transitions between these states yield thermally driven first-order phase transitions and their associated hysteresis loops. Finally, we discuss how our cq-GPPE approach can be used to follow the spatiotemporal evolution of a rotating axionic condensate and also a rotating binary-axionic-condensate system; in particular, we demonstrate, in the former, the emergence of vortices at large angular speeds $Ω$ and, in the latter, the rich dynamics of the mergers of the components of this binary system, which can yield vortices in the process of merging.
△ Less
Submitted 3 November, 2023;
originally announced November 2023.
-
Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge
Authors:
Gregory Holste,
Yiliang Zhou,
Song Wang,
Ajay Jaiswal,
Mingquan Lin,
Sherry Zhuge,
Yuzhe Yang,
Dongkyun Kim,
Trong-Hieu Nguyen-Mau,
Minh-Triet Tran,
Jaehyup Jeong,
Wongi Park,
Jongbin Ryu,
Feng Hong,
Arsh Verma,
Yosuke Yamagishi,
Changhyun Kim,
Hyeryeong Seo,
Myungjoo Kang,
Leo Anthony Celi,
Zhiyong Lu,
Ronald M. Summers,
George Shih,
Zhangyang Wang,
Yifan Peng
Abstract:
Many real-world image recognition problems, such as diagnostic medical imaging exams, are "long-tailed" $\unicode{x2013}$ there are a few common findings followed by many more relatively rare conditions. In chest radiography, diagnosis is both a long-tailed and multi-label problem, as patients often present with multiple findings simultaneously. While researchers have begun to study the problem of…
▽ More
Many real-world image recognition problems, such as diagnostic medical imaging exams, are "long-tailed" $\unicode{x2013}$ there are a few common findings followed by many more relatively rare conditions. In chest radiography, diagnosis is both a long-tailed and multi-label problem, as patients often present with multiple findings simultaneously. While researchers have begun to study the problem of long-tailed learning in medical image recognition, few have studied the interaction of label imbalance and label co-occurrence posed by long-tailed, multi-label disease classification. To engage with the research community on this emerging topic, we conducted an open challenge, CXR-LT, on long-tailed, multi-label thorax disease classification from chest X-rays (CXRs). We publicly release a large-scale benchmark dataset of over 350,000 CXRs, each labeled with at least one of 26 clinical findings following a long-tailed distribution. We synthesize common themes of top-performing solutions, providing practical recommendations for long-tailed, multi-label medical image classification. Finally, we use these insights to propose a path forward involving vision-language foundation models for few- and zero-shot disease classification.
△ Less
Submitted 1 April, 2024; v1 submitted 24 October, 2023;
originally announced October 2023.
-
Anti-Black racism workshop during the Vera C. Rubin Observatory virtual 2021 Project and Community Workshop
Authors:
Andrés A. Plazas Malagón,
Federica Bianco,
Ranpal Gill,
Robert D. Blum,
Rosaria,
Bonito,
Wil O'Mullane,
Alsyha Shugart,
Rachel Street,
Aprajita Verma
Abstract:
Systemic racism is a ubiquitous theme in societies worldwide and plays a central role in shaping our economic, social, and academic institutions. The Vera C. Rubin Observatory is a major US ground-based facility based in Chile with international participation. The Observatory is an example of excellence and will deliver the largest survey of the sky ever attempted. Rubin's full scientific and soci…
▽ More
Systemic racism is a ubiquitous theme in societies worldwide and plays a central role in shaping our economic, social, and academic institutions. The Vera C. Rubin Observatory is a major US ground-based facility based in Chile with international participation. The Observatory is an example of excellence and will deliver the largest survey of the sky ever attempted. Rubin's full scientific and social potential can not be attained without addressing systemic racism and associated barriers to equity, diversity, and inclusion (EDI). During Rubin's 2021 virtual Project and Community Workshop (PCW), the annual Rubin community-based meeting, an anti-Black racism workshop took place, facilitated by 'The BIPOC Project' organization. About 60 members from different parts of the Rubin ecosystem participated. We describe the motivation, organization, challenges, outcomes, and near- and long-term goals of this workshop.
△ Less
Submitted 16 October, 2023;
originally announced October 2023.
-
Answering open questions in biology using spatial genomics and structured methods
Authors:
Siddhartha G Jena,
Archit Verma,
Barbara E Engelhardt
Abstract:
Genomics methods have uncovered patterns in a range of biological systems, but obscure important aspects of cell behavior: the shape, relative locations of, movement of, and interactions between cells in space. Spatial technologies that collect genomic or epigenomic data while preserving spatial information have begun to overcome these limitations. These new data promise a deeper understanding of…
▽ More
Genomics methods have uncovered patterns in a range of biological systems, but obscure important aspects of cell behavior: the shape, relative locations of, movement of, and interactions between cells in space. Spatial technologies that collect genomic or epigenomic data while preserving spatial information have begun to overcome these limitations. These new data promise a deeper understanding of the factors that affect cellular behavior, and in particular the ability to directly test existing theories about cell state and variation in the context of morphology, location, motility, and signaling that could not be tested before. Rapid advancements in resolution, ease-of-use, and scale of spatial genomics technologies to address these questions also require an updated toolkit of statistical methods with which to interrogate these data. We present four open biological questions that can now be answered using spatial genomics data paired with methods for analysis. We outline spatial data modalities for each that may yield specific insight, discuss how conflicting theories may be tested by comparing the data to conceptual models of biological behavior, and highlight statistical and machine learning-based tools that may prove particularly helpful to recover biological insight.
△ Less
Submitted 14 October, 2023;
originally announced October 2023.