subscribe to arXiv mailings

Perspective on Non-Hermitian Elastodynamics

Authors: Johan Christensen, Michael R. Haberman, Ankit Srivastava, Guoliang Huang, Gal Shmuel

Abstract: The manipulation of mechanical waves is a long-standing challenge for scientists and engineers, as numerous devices require their control. The current forefront of research in the control of classical waves has emerged from a seemingly unrelated field, namely, non-Hermitian quantum mechanics. By drawing analogies between this theory and those of classical systems, researchers have discovered pheno… ▽ More The manipulation of mechanical waves is a long-standing challenge for scientists and engineers, as numerous devices require their control. The current forefront of research in the control of classical waves has emerged from a seemingly unrelated field, namely, non-Hermitian quantum mechanics. By drawing analogies between this theory and those of classical systems, researchers have discovered phenomena that defy conventional intuition and have exploited them to control light, sound, and elastic waves. Here, we provide a brief perspective on recent developments, challenges and intricacies that distinguish non-Hermitian elastodynamics from optics and acoustics. We close this perspective with an outlook on potential directions such as topological phases in non-Hermitian elastodynamics and broken Hermitian symmetry in materials with electromomentum couplings. △ Less

Submitted 21 June, 2024; originally announced July 2024.

arXiv:2407.00464 [pdf, other]

doi 10.1145/3673422.3674896

To Switch or Not to Switch to TCP Prague? Incentives for Adoption in a Partial L4S Deployment

Authors: Fatih Berkay Sarpkaya, Ashutosh Srivastava, Fraida Fund, Shivendra Panwar

Abstract: The Low Latency, Low Loss, Scalable Throughput (L4S) architecture has the potential to reduce queuing delay when it is deployed at endpoints and routers throughout the Internet. However, it is not clear how TCP Prague, a prototype scalable congestion control for L4S, behaves when L4S is not yet universally deployed. Specifically, we consider the question: in a partial L4S deployment, will a user b… ▽ More The Low Latency, Low Loss, Scalable Throughput (L4S) architecture has the potential to reduce queuing delay when it is deployed at endpoints and routers throughout the Internet. However, it is not clear how TCP Prague, a prototype scalable congestion control for L4S, behaves when L4S is not yet universally deployed. Specifically, we consider the question: in a partial L4S deployment, will a user benefit by unilaterally switching from the status quo TCP to TCP Prague? To address this question, we evaluate the performance of a TCP Prague flow when sharing an L4S or non-L4S bottleneck queue with a non-L4S flow. Our findings suggest that the L4S congestion control, TCP Prague, has less favorable throughput or fairness properties than TCP Cubic or BBR in some coexistence scenarios, which may hinder adoption. △ Less

Submitted 29 June, 2024; originally announced July 2024.

Comments: Accepted to ACM Applied Networking Research Workshop (ANRW), 2024

arXiv:2406.17235 [pdf, other]

Task-Agnostic Federated Learning

Authors: Zhengtao Yao, Hong Nguyen, Ajitesh Srivastava, Jose Luis Ambite

Abstract: In the realm of medical imaging, leveraging large-scale datasets from various institutions is crucial for developing precise deep learning models, yet privacy concerns frequently impede data sharing. federated learning (FL) emerges as a prominent solution for preserving privacy while facilitating collaborative learning. However, its application in real-world scenarios faces several obstacles, such… ▽ More In the realm of medical imaging, leveraging large-scale datasets from various institutions is crucial for developing precise deep learning models, yet privacy concerns frequently impede data sharing. federated learning (FL) emerges as a prominent solution for preserving privacy while facilitating collaborative learning. However, its application in real-world scenarios faces several obstacles, such as task & data heterogeneity, label scarcity, non-identically distributed (non-IID) data, computational vaiation, etc. In real-world, medical institutions may not want to disclose their tasks to FL server and generalization challenge of out-of-network institutions with un-seen task want to join the on-going federated system. This study address task-agnostic and generalization problem on un-seen tasks by adapting self-supervised FL framework. Utilizing Vision Transformer (ViT) as consensus feature encoder for self-supervised pre-training, no initial labels required, the framework enabling effective representation learning across diverse datasets and tasks. Our extensive evaluations, using various real-world non-IID medical imaging datasets, validate our approach's efficacy, retaining 90\% of F1 accuracy with only 5\% of the training data typically required for centralized approaches and exhibiting superior adaptability to out-of-distribution task. The result indicate that federated learning architecture can be a potential approach toward multi-task foundation modeling. △ Less

Submitted 24 June, 2024; originally announced June 2024.

arXiv:2406.14861 [pdf, other]

Resilience of the Electric Grid through Trustable IoT-Coordinated Assets

Authors: Vineet J. Nair, Venkatesh Venkataramanan, Priyank Srivastava, Partha S. Sarker, Anurag Srivastava, Laurentiu D. Marinovici, Jun Zha, Christopher Irwin, Prateek Mittal, John Williams, H. Vincent Poor, Anuradha M. Annaswamy

Abstract: The electricity grid has evolved from a physical system to a cyber-physical system with digital devices that perform measurement, control, communication, computation, and actuation. The increased penetration of distributed energy resources (DERs) that include renewable generation, flexible loads, and storage provides extraordinary opportunities for improvements in efficiency and sustainability. Ho… ▽ More The electricity grid has evolved from a physical system to a cyber-physical system with digital devices that perform measurement, control, communication, computation, and actuation. The increased penetration of distributed energy resources (DERs) that include renewable generation, flexible loads, and storage provides extraordinary opportunities for improvements in efficiency and sustainability. However, they can introduce new vulnerabilities in the form of cyberattacks, which can cause significant challenges in ensuring grid resilience. %, i.e. the ability to rapidly restore grid services in the face of severe disruptions. We propose a framework in this paper for achieving grid resilience through suitably coordinated assets including a network of Internet of Things (IoT) devices. A local electricity market is proposed to identify trustable assets and carry out this coordination. Situational Awareness (SA) of locally available DERs with the ability to inject power or reduce consumption is enabled by the market, together with a monitoring procedure for their trustability and commitment. With this SA, we show that a variety of cyberattacks can be mitigated using local trustable resources without stressing the bulk grid. The demonstrations are carried out using a variety of platforms with a high-fidelity co-simulation platform, real-time hardware-in-the-loop validation, and a utility-friendly simulator. △ Less

Submitted 21 June, 2024; originally announced June 2024.

Comments: Submitted to the Proceedings of the National Academy of Sciences (PNAS), under review

arXiv:2406.14172 [pdf, ps, other]

Dynamics of Phase Transition in Quark-Gluon Plasma Droplet Formation under Magnetic Field

Authors: Agam K. Jha, Aviral Srivastava

Abstract: Pre-existing density of states for a Quark-Gluon Phase, based on Thomas-Fermi and Bethe mode, is expanded by incorporation of new variables. Results from recent study indicate that perturbations in the form of a finite non-zero chemical potential T, B, dynamic thermal masses M and of course Temperature T are indeed vital to fully comprehend the formation and dynamics of QGP. Simulations depict an… ▽ More Pre-existing density of states for a Quark-Gluon Phase, based on Thomas-Fermi and Bethe mode, is expanded by incorporation of new variables. Results from recent study indicate that perturbations in the form of a finite non-zero chemical potential T, B, dynamic thermal masses M and of course Temperature T are indeed vital to fully comprehend the formation and dynamics of QGP. Simulations depict an overall increase in the stability of QGP in the paradigm of the statistical model. On the top of Free Energy, Entropy and heat capacity are calculated for the phase transition. The overall qualitative behavior, of entropy or Heat Capacity determines the order of phase transition of the QGP. Investigation of order of phase transition is carried out in this study through Monte-Carlo based differential element, which ensures the inclusion of the randomness of the collisions at the particle colliders. △ Less

Submitted 20 June, 2024; originally announced June 2024.

arXiv:2406.13232 [pdf]

Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and Metrics for Open Domain Question Answering in the Era of Large Language Models

Authors: Akchay Srivastava, Atif Memon

Abstract: Open Domain Question Answering (ODQA) within natural language processing involves building systems that answer factual questions using large-scale knowledge corpora. Recent advances stem from the confluence of several factors, such as large-scale training datasets, deep learning techniques, and the rise of large language models. High-quality datasets are used to train models on realistic scenarios… ▽ More Open Domain Question Answering (ODQA) within natural language processing involves building systems that answer factual questions using large-scale knowledge corpora. Recent advances stem from the confluence of several factors, such as large-scale training datasets, deep learning techniques, and the rise of large language models. High-quality datasets are used to train models on realistic scenarios and enable the evaluation of the system on potentially unseen data. Standardized metrics facilitate comparisons between different ODQA systems, allowing researchers to objectively track advancements in the field. Our study presents a thorough examination of the current landscape of ODQA benchmarking by reviewing 52 datasets and 20 evaluation techniques across textual and multimodal modalities. We introduce a novel taxonomy for ODQA datasets that incorporates both the modality and difficulty of the question types. Additionally, we present a structured organization of ODQA evaluation metrics along with a critical analysis of their inherent trade-offs. Our study aims to empower researchers by providing a framework for the robust evaluation of modern question-answering systems. We conclude by identifying the current challenges and outlining promising avenues for future research and development. △ Less

Submitted 19 June, 2024; originally announced June 2024.

Comments: 22 pages, 13 tables, 7 figures

arXiv:2406.12458 [pdf, other]

Planning Using Schrödinger Bridge Diffusion Models

Authors: Adarsh Srivastava

Abstract: Offline planning often struggles with poor sampling efficiency as it tries to learn policies from scratch. Especially with diffusion models, such cold start practices mean that both training and sampling become very expensive. We hypothesize that certain environment constraint priors or cheaply available policies make it unnecessary to learn from scratch, and explore a way to incorporate such prio… ▽ More Offline planning often struggles with poor sampling efficiency as it tries to learn policies from scratch. Especially with diffusion models, such cold start practices mean that both training and sampling become very expensive. We hypothesize that certain environment constraint priors or cheaply available policies make it unnecessary to learn from scratch, and explore a way to incorporate such priors in the learning process. To achieve that, we borrow a variation of the Schrödinger bridge formulation from the image-to-image setting and apply it to planning tasks. We study the performance on some planning tasks and compare the performance against the DDPM formulation. The code for this work is available at https://github.com/adrshsrvstv/bridge_diffusion_planning. △ Less

Submitted 18 June, 2024; originally announced June 2024.

arXiv:2406.12308 [pdf, other]

Status of Astronomy Education in India: A Baseline Survey

Authors: Moupiya Maji, Surhud More, Aniket Sule, Vishaak Balasubramanya, Ankit Bhandari, Hum Chand, Kshitij Chavan, Avik Dasgupta, Anindya De, Jayant Gangopadhyay, Mamta Gulati, Priya Hasan, Syed Ishtiyaq, Meraj Madani, Kuntal Misra, Amoghavarsha N, Divya Oberoi, Subhendu Pattnaik, Mayuri Patwardhan, Niruj Mohan Ramanujam, Pritesh Ranadive, Disha Sawant, Paryag Sharma, Twinkle Sharma, Sai Shetye , et al. (6 additional authors not shown)

Abstract: We present the results of a nation-wide baseline survey, conducted by us, for the status of Astronomy education among secondary school students in India. The survey was administered in 10 different languages to over 2000 students from diverse backgrounds, and it explored multiple facets of their perspectives on astronomy. The topics included students' views on the incorporation of astronomy in cur… ▽ More We present the results of a nation-wide baseline survey, conducted by us, for the status of Astronomy education among secondary school students in India. The survey was administered in 10 different languages to over 2000 students from diverse backgrounds, and it explored multiple facets of their perspectives on astronomy. The topics included students' views on the incorporation of astronomy in curricula, their grasp of fundamental astronomical concepts, access to educational resources, cultural connections to astronomy, and their levels of interest and aspirations in the subject. We find notable deficiencies in students' knowledge of basic astronomical principles, with only a minority demonstrating proficiency in key areas such as celestial sizes, distances, and lunar phases. Furthermore, access to resources such as telescopes and planetariums remain limited across the country. Despite these challenges, a significant majority of students expressed a keen interest in astronomy. We further analyze the data along socioeconomic and gender lines. Particularly striking were the socioeconomic disparities, with students from resource-poor backgrounds often having lower levels of access and proficiency. Some differences were observed between genders, although not very pronounced. The insights gleaned from this study hold valuable implications for the development of a more robust astronomy curriculum and the design of effective teacher training programs in the future. △ Less

Submitted 18 June, 2024; originally announced June 2024.

Comments: 15 pages, 19 figures

arXiv:2406.08304 [pdf, other]

NIRPS first light and early science: breaking the 1 m/s RV precision barrier at infrared wavelengths

Authors: Étienne Artigau, François Bouchy, René Doyon, Frédérique Baron, Lison Malo, François Wildi, Franceso Pepe, Neil J. Cook, Simon Thibault, Vladimir Reshetov, Xavier Dumusque, Christophe Lovis, Danuta Sosnowska, Bruno L. Canto Martins, Jose Renan De Medeiros, Xavier Delfosse, Nuno Santos, Rafael Rebolo, Manuel Abreu, Guillaume Allain, Romain Allart, Hugues Auger, Susana Barros, Luc Bazinet, Nicolas Blind , et al. (89 additional authors not shown)

Abstract: The Near-InfraRed Planet Searcher or NIRPS is a precision radial velocity spectrograph developed through collaborative efforts among laboratories in Switzerland, Canada, Brazil, France, Portugal and Spain. NIRPS extends to the 0.98-1.8 $μ$m domain of the pioneering HARPS instrument at the La Silla 3.6-m telescope in Chile and it has achieved unparalleled precision, measuring stellar radial velocit… ▽ More The Near-InfraRed Planet Searcher or NIRPS is a precision radial velocity spectrograph developed through collaborative efforts among laboratories in Switzerland, Canada, Brazil, France, Portugal and Spain. NIRPS extends to the 0.98-1.8 $μ$m domain of the pioneering HARPS instrument at the La Silla 3.6-m telescope in Chile and it has achieved unparalleled precision, measuring stellar radial velocities in the infrared with accuracy better than 1 m/s. NIRPS can be used either stand-alone or simultaneously with HARPS. Commissioned in late 2022 and early 2023, NIRPS embarked on a 5-year Guaranteed Time Observation (GTO) program in April 2023, spanning 720 observing nights. This program focuses on planetary systems around M dwarfs, encompassing both the immediate solar vicinity and transit follow-ups, alongside transit and emission spectroscopy observations. We highlight NIRPS's current performances and the insights gained during its deployment at the telescope. The lessons learned and successes achieved contribute to the ongoing advancement of precision radial velocity measurements and high spectral fidelity, further solidifying NIRPS' role in the forefront of the field of exoplanets. △ Less

Submitted 13 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

Comments: Proceeding at the SPIE Astronomical Telescopes + Instrumentation conference [Yokohama,Japan; June 2024]

arXiv:2406.07840 [pdf, other]

SynthForge: Synthesizing High-Quality Face Dataset with Controllable 3D Generative Models

Authors: Abhay Rawat, Shubham Dokania, Astitva Srivastava, Shuaib Ahmed, Haiwen Feng, Rahul Tallamraju

Abstract: Recent advancements in generative models have unlocked the capabilities to render photo-realistic data in a controllable fashion. Trained on the real data, these generative models are capable of producing realistic samples with minimal to no domain gap, as compared to the traditional graphics rendering. However, using the data generated using such models for training downstream tasks remains under… ▽ More Recent advancements in generative models have unlocked the capabilities to render photo-realistic data in a controllable fashion. Trained on the real data, these generative models are capable of producing realistic samples with minimal to no domain gap, as compared to the traditional graphics rendering. However, using the data generated using such models for training downstream tasks remains under-explored, mainly due to the lack of 3D consistent annotations. Moreover, controllable generative models are learned from massive data and their latent space is often too vast to obtain meaningful sample distributions for downstream task with limited generation. To overcome these challenges, we extract 3D consistent annotations from an existing controllable generative model, making the data useful for downstream tasks. Our experiments show competitive performance against state-of-the-art models using only generated synthetic data, demonstrating potential for solving downstream tasks. Project page: https://synth-forge.github.io △ Less

Submitted 11 June, 2024; originally announced June 2024.

Comments: 11 pages, 4 figures, 3 tables. Under Review

arXiv:2406.06608 [pdf, other]

The Prompt Report: A Systematic Survey of Prompting Techniques

Authors: Sander Schulhoff, Michael Ilie, Nishant Balepur, Konstantine Kahadze, Amanda Liu, Chenglei Si, Yinheng Li, Aayush Gupta, HyoJung Han, Sevien Schulhoff, Pranav Sandeep Dulepet, Saurav Vidyadhara, Dayeon Ki, Sweta Agrawal, Chau Pham, Gerson Kroiz, Feileen Li, Hudson Tao, Ashay Srivastava, Hevander Da Costa, Saloni Gupta, Megan L. Rogers, Inna Goncearenco, Giuseppe Sarli, Igor Galynker , et al. (6 additional authors not shown)

Abstract: Generative Artificial Intelligence (GenAI) systems are being increasingly deployed across all parts of industry and research settings. Developers and end users interact with these systems through the use of prompting or prompt engineering. While prompting is a widespread and highly researched concept, there exists conflicting terminology and a poor ontological understanding of what constitutes a p… ▽ More Generative Artificial Intelligence (GenAI) systems are being increasingly deployed across all parts of industry and research settings. Developers and end users interact with these systems through the use of prompting or prompt engineering. While prompting is a widespread and highly researched concept, there exists conflicting terminology and a poor ontological understanding of what constitutes a prompt due to the area's nascency. This paper establishes a structured understanding of prompts, by assembling a taxonomy of prompting techniques and analyzing their use. We present a comprehensive vocabulary of 33 vocabulary terms, a taxonomy of 58 text-only prompting techniques, and 40 techniques for other modalities. We further present a meta-analysis of the entire literature on natural language prefix-prompting. △ Less

Submitted 16 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

arXiv:2405.21050 [pdf, other]

Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models

Authors: Xinxi Zhang, Song Wen, Ligong Han, Felix Juefei-Xu, Akash Srivastava, Junzhou Huang, Hao Wang, Molei Tao, Dimitris N. Metaxas

Abstract: Adapting large-scale pre-trained generative models in a parameter-efficient manner is gaining traction. Traditional methods like low rank adaptation achieve parameter efficiency by imposing constraints but may not be optimal for tasks requiring high representation capacity. We propose a novel spectrum-aware adaptation framework for generative models. Our method adjusts both singular values and the… ▽ More Adapting large-scale pre-trained generative models in a parameter-efficient manner is gaining traction. Traditional methods like low rank adaptation achieve parameter efficiency by imposing constraints but may not be optimal for tasks requiring high representation capacity. We propose a novel spectrum-aware adaptation framework for generative models. Our method adjusts both singular values and their basis vectors of pretrained weights. Using the Kronecker product and efficient Stiefel optimizers, we achieve parameter-efficient adaptation of orthogonal matrices. We introduce Spectral Orthogonal Decomposition Adaptation (SODA), which balances computational efficiency and representation capacity. Extensive evaluations on text-to-image diffusion models demonstrate SODA's effectiveness, offering a spectrum-aware alternative to existing fine-tuning methods. △ Less

Submitted 31 May, 2024; originally announced May 2024.

arXiv:2405.20592 [pdf, other]

LInK: Learning Joint Representations of Design and Performance Spaces through Contrastive Learning for Mechanism Synthesis

Authors: Amin Heyrani Nobari, Akash Srivastava, Dan Gutfreund, Kai Xu, Faez Ahmed

Abstract: In this paper, we introduce LInK, a novel framework that integrates contrastive learning of performance and design space with optimization techniques for solving complex inverse problems in engineering design with discrete and continuous variables. We focus on the path synthesis problem for planar linkage mechanisms. By leveraging a multi-modal and transformation-invariant contrastive learning fra… ▽ More In this paper, we introduce LInK, a novel framework that integrates contrastive learning of performance and design space with optimization techniques for solving complex inverse problems in engineering design with discrete and continuous variables. We focus on the path synthesis problem for planar linkage mechanisms. By leveraging a multi-modal and transformation-invariant contrastive learning framework, LInK learns a joint representation that captures complex physics and design representations of mechanisms, enabling rapid retrieval from a vast dataset of over 10 million mechanisms. This approach improves precision through the warm start of a hierarchical unconstrained nonlinear optimization algorithm, combining the robustness of traditional optimization with the speed and adaptability of modern deep learning methods. Our results on an existing benchmark demonstrate that LInK outperforms existing methods with 28 times less error compared to a state-of-the-art approach while taking 20 times less time on an existing benchmark. Moreover, we introduce a significantly more challenging benchmark, named LINK-ABC, which involves synthesizing linkages that trace the trajectories of English capital alphabets - an inverse design benchmark task that existing methods struggle with due to large non-linearities and tiny feasible space. Our results demonstrate that LInK not only advances the field of mechanism design but also broadens the applicability of contrastive learning and optimization to other areas of engineering. △ Less

Submitted 30 May, 2024; originally announced May 2024.

arXiv:2405.19310 [pdf, other]

Network Connectivity--Information Freshness Tradeoff in Information Dissemination Over Networks

Authors: Arunabh Srivastava, Sennur Ulukus

Abstract: We consider a gossip network consisting of a source generating updates and $n$ nodes connected according to a given graph structure. The source keeps updates of a process, that might be generated or observed, and shares them with the gossiping network. The nodes in the network communicate with their neighbors and disseminate these version updates using a push-style gossip strategy. We use the vers… ▽ More We consider a gossip network consisting of a source generating updates and $n$ nodes connected according to a given graph structure. The source keeps updates of a process, that might be generated or observed, and shares them with the gossiping network. The nodes in the network communicate with their neighbors and disseminate these version updates using a push-style gossip strategy. We use the version age metric to quantify the timeliness of information at the nodes. We first find an upper bound for the average version age for a set of nodes in a general network. Using this, we find the average version age scaling of a node in several network graph structures, such as two-dimensional grids, generalized rings and hyper-cubes. Prior to our work, it was known that when $n$ nodes are connected on a ring the version age scales as $O(n^{\frac{1}{2}})$, and when they are connected on a fully-connected graph the version age scales as $O(\log n)$. Ours is the first work to show an age scaling result for a connectivity structure other than the ring and the fully-connected network, which constitute the two extremes of network connectivity. Our work helps fill the gap between these two extremes by analyzing a large variety of graphs with intermediate connectivity, thus providing insight into the relationship between the connectivity structure of the network and the version age, and uncovering a network connectivity--information freshness tradeoff. △ Less

Submitted 29 May, 2024; originally announced May 2024.

Comments: arXiv admin note: text overlap with arXiv:2307.08670, arXiv:2308.12266

arXiv:2405.18670 [pdf, other]

Adapting Differentially Private Synthetic Data to Relational Databases

Authors: Kaveh Alimohammadi, Hao Wang, Ojas Gulati, Akash Srivastava, Navid Azizan

Abstract: Existing differentially private (DP) synthetic data generation mechanisms typically assume a single-source table. In practice, data is often distributed across multiple tables with relationships across tables. In this paper, we introduce the first-of-its-kind algorithm that can be combined with any existing DP mechanisms to generate synthetic relational databases. Our algorithm iteratively refines… ▽ More Existing differentially private (DP) synthetic data generation mechanisms typically assume a single-source table. In practice, data is often distributed across multiple tables with relationships across tables. In this paper, we introduce the first-of-its-kind algorithm that can be combined with any existing DP mechanisms to generate synthetic relational databases. Our algorithm iteratively refines the relationship between individual synthetic tables to minimize their approximation errors in terms of low-order marginal distributions while maintaining referential integrity. Finally, we provide both DP and theoretical utility guarantees for our algorithm. △ Less

Submitted 28 May, 2024; originally announced May 2024.

arXiv:2405.13132 [pdf]

doi 10.1007/s40192-024-00357-3

Illustrating an Effective Workflow for Accelerated Materials Discovery

Authors: Mrinalini Mulukutla, A. Nicole Person, Sven Voigt, Lindsey Kuettner, Branden Kappes, Danial Khatamsaz, Robert Robinson, Daniel Salas, Wenle Xu, Daniel Lewis, Hongkyu Eoh, Kailu Xiao, Haoren Wang, Jaskaran Singh Saini, Raj Mahat, Trevor Hastings, Matthew Skokan, Vahid Attari, Michael Elverud, James D. Paramore, Brady Butler, Kenneth Vecchio, Surya R. Kalidindi, Douglas Allaire, Ibrahim Karaman , et al. (4 additional authors not shown)

Abstract: Algorithmic materials discovery is a multi-disciplinary domain that integrates insights from specialists in alloy design, synthesis, characterization, experimental methodologies, computational modeling, and optimization. Central to this effort is a robust data management system paired with an interactive work platform. This platform should empower users to not only access others data but also inte… ▽ More Algorithmic materials discovery is a multi-disciplinary domain that integrates insights from specialists in alloy design, synthesis, characterization, experimental methodologies, computational modeling, and optimization. Central to this effort is a robust data management system paired with an interactive work platform. This platform should empower users to not only access others data but also integrate their analyses, paving the way for sophisticated data pipelines. To realize this vision, there is a need for an integrative collaboration platform, streamlined data sharing and analysis tools, and efficient communication channels. Such a collaborative mechanism should transcend geographical barriers, facilitating remote interaction and fostering a challenge-response dynamic. In this paper, we present our ongoing efforts in addressing the critical challenges related to an accelerated Materials Discovery Framework as a part of the High-Throughput Materials Discovery for Extreme Conditions Initiative. Our BIRDSHOT Center has successfully harnessed various tools and strategies, including the utilization of cloud-based storage, a standardized sample naming convention, a structured file system, the implementation of sample travelers, a robust sample tracking method, and the incorporation of knowledge graphs for efficient data management. Additionally, we present the development of a data collection platform, reinforcing seamless collaboration among our team members. In summary, this paper provides an illustration and insight into the various elements of an efficient and effective workflow within an accelerated materials discovery framework while highlighting the dynamic and adaptable nature of the data management tools and sharing platforms. △ Less

Submitted 21 May, 2024; originally announced May 2024.

Comments: 28 pages, 9 figures, 2 tables, with appendix that has 8 pages, accepted for publication at IMMI

arXiv:2405.12052 [pdf, other]

Parallelization of the K-Means Algorithm with Applications to Big Data Clustering

Authors: Ashish Srivastava, Mohammed Nawfal

Abstract: The K-Means clustering using LLoyd's algorithm is an iterative approach to partition the given dataset into K different clusters. The algorithm assigns each point to the cluster based on the following objective function \[\ \min Σ_{i=1}^{n}||x_i-μ_{x_i}||^2\] The serial algorithm involves iterative steps where we compute the distance of each datapoint from the centroids and assign the datapoint… ▽ More The K-Means clustering using LLoyd's algorithm is an iterative approach to partition the given dataset into K different clusters. The algorithm assigns each point to the cluster based on the following objective function \[\ \min Σ_{i=1}^{n}||x_i-μ_{x_i}||^2\] The serial algorithm involves iterative steps where we compute the distance of each datapoint from the centroids and assign the datapoint to the nearest centroid. This approach is essentially known as the expectation-maximization step. Clustering involves extensive computations to calculate distances at each iteration, which increases as the number of data points increases. This provides scope for parallelism. However, we must ensure that in a parallel process, each thread has access to the updated centroid value and no racing condition exists on any centroid values. We will compare two different approaches in this project. The first approach is an OpenMP flat synchronous method where all processes are run in parallel, and we use synchronization to ensure safe updates of clusters. The second approach we adopt is a GPU based parallelization approach using OpenACC wherein we will try to make use of GPU architecture to parallelize chunks of the algorithm to observe decreased computation time. We will analyze metrics such as speed up, efficiency,time taken with varying data points, and number of processes to compare the two approaches and understand the relative performance improvement we can get. △ Less

Submitted 20 May, 2024; originally announced May 2024.

Comments: 7 Pages, 5 tables, 12 figures

arXiv:2405.08900 [pdf]

An Interoperable Multi Objective Batch Bayesian Optimization Framework for High Throughput Materials Discovery

Authors: Trevor Hastings, Mrinalini Mulukutla, Danial Khatamsaz, Daniel Salas, Wenle Xu, Daniel Lewis, Nicole Person, Matthew Skokan, Braden Miller, James Paramore, Brady Butler, Douglas Allaire, Ibrahim Karaman, George Pharr, Ankit Srivastava, Raymundo Arroyave

Abstract: In this study, we introduce a groundbreaking framework for materials discovery, we efficiently navigate a vast phase space of material compositions by leveraging Batch Bayesian statistics in order to achieve specific performance objectives. This approach addresses the challenge of identifying optimal materials from an untenably large array of possibilities in a reasonable timeframe with high confi… ▽ More In this study, we introduce a groundbreaking framework for materials discovery, we efficiently navigate a vast phase space of material compositions by leveraging Batch Bayesian statistics in order to achieve specific performance objectives. This approach addresses the challenge of identifying optimal materials from an untenably large array of possibilities in a reasonable timeframe with high confidence. Crucially, our batchwise methods align seamlessly with existing material processing infrastructure for synthesizing and characterizing materials. By applying this framework to a specific high entropy alloy system, we demonstrate its versatility and robustness in optimizing properties like strain hardening, hardness, and strain rate sensitivity. The fact that the Bayesian model is adept in refining and expanding the property Pareto front highlights its broad applicability across various materials, including steels, shape memory alloys, ceramics, and composites. This study advances the field of materials science and sets a new benchmark for material discovery methodologies. By proving the effectiveness of Bayesian optimization, we showcase its potential to redefine the landscape of materials discovery. △ Less

Submitted 14 May, 2024; originally announced May 2024.

Comments: 12 pages, 6 figures, with Supplementary Appendix that has 17 pages, 9 figures

arXiv:2405.06639 [pdf, other]

Value Augmented Sampling for Language Model Alignment and Personalization

Authors: Seungwook Han, Idan Shenfeld, Akash Srivastava, Yoon Kim, Pulkit Agrawal

Abstract: Aligning Large Language Models (LLMs) to cater to different human preferences, learning new skills, and unlearning harmful behavior is an important problem. Search-based methods, such as Best-of-N or Monte-Carlo Tree Search, are performant, but impractical for LLM adaptation due to their high inference cost. On the other hand, using Reinforcement Learning (RL) for adaptation is computationally eff… ▽ More Aligning Large Language Models (LLMs) to cater to different human preferences, learning new skills, and unlearning harmful behavior is an important problem. Search-based methods, such as Best-of-N or Monte-Carlo Tree Search, are performant, but impractical for LLM adaptation due to their high inference cost. On the other hand, using Reinforcement Learning (RL) for adaptation is computationally efficient, but performs worse due to the optimization challenges in co-training the value function and the policy. We present a new framework for reward optimization, Value Augmented Sampling (VAS), that can maximize different reward functions using data sampled from only the initial, frozen LLM. VAS solves for the optimal reward-maximizing policy without co-training the policy and the value function, making the optimization stable, outperforming established baselines, such as PPO and DPO, on standard benchmarks, and achieving comparable results to Best-of-128 with lower inference cost. Unlike existing RL methods that require changing the weights of the LLM, VAS does not require access to the weights of the pre-trained LLM. Thus, it can even adapt LLMs (e.g., ChatGPT), which are available only as APIs. In addition, our algorithm unlocks the new capability of composing several rewards and controlling the extent of each one during deployment time, paving the road ahead for the future of aligned, personalized LLMs. △ Less

Submitted 10 May, 2024; originally announced May 2024.

Comments: Website: https://sites.google.com/view/llm-vas

arXiv:2405.03736 [pdf, ps, other]

doi 10.1016/j.physletb.2024.138683

Percolating Cosmic String loops from evaporating primordial black holes

Authors: Ajit M. Srivastava

Abstract: The Pulsar timing data from NANOGrav Collaboration has regenerated interest in the possibility of observing stochastic gravitational wave background arising from cosmic strings. In the standard theory, the cosmic string network forms during spontaneous symmetry breaking (SSB) phase transition in the whole universe via the so called Kibble mechanism. This scenario would not be possible, e.g., in mo… ▽ More The Pulsar timing data from NANOGrav Collaboration has regenerated interest in the possibility of observing stochastic gravitational wave background arising from cosmic strings. In the standard theory, the cosmic string network forms during spontaneous symmetry breaking (SSB) phase transition in the whole universe via the so called Kibble mechanism. This scenario would not be possible, e.g., in models of low energy inflation, where the reheat temperature is much lower than the energy scale of cosmic strings. We point out a very different possibility, where a network of even high energy scale cosmic strings can form when the temperature of the Universe is much lower. We consider local heating of plasma in the early universe by evaporating primordial black holes (PBHs). It is known that for suitable masses of PBHs, their Hawking radiation may re-heat the surrounding plasma to high temperatures, restoring certain symmetries {\it locally} which are broken at the ambient temperature at that stage. Expansion of the hot plasma cools it so that the {\it locally restored symmetry} is spontaneously broken again. If this SSB supports formation of cosmic strings, then string loops will form in this region around the PBH. Further, resulting temperature gradients lead to pressure gradients such that plasma develops radial flow with the string loops getting stretched as they get dragged by the flow. For a finite density of PBHs of suitable masses, one will get local hot spots, each one contributing to expanding cosmic string loops. For suitable PBH density, the loops from different regions may intersect. Intercommutation of strings can then lead to percolation, leading to the possibility of formation of infinite string network, even when the entire universe never goes through the respective SSB phase transition. △ Less

Submitted 6 May, 2024; originally announced May 2024.

Comments: 8 pages. arXiv admin note: substantial text overlap with arXiv:hep-ph/0611253

Journal ref: Phys. Lett. B Volume 853, June 2024, 138683

arXiv:2404.14153 [pdf, other]

185 mW, 1 MHz, 15 fs carrier-envelope phase-stable pulse generation via polarization-optimized down-conversion from gas-filled hollow-core fiber

Authors: Anchit Srivastava, Kilian Scheffter, Soyeon Jun, Andreas Herbst, Hanieh Fattahi

Abstract: Gas-filled hollow core fibers allow the generation of single-cycle pulses at megahertz repetition rates. When coupled with difference frequency generation, they can be an ideal driver for the generation of carrier-envelope phase stable, octave-spanning pulses in the short-wavelength infrared. In this work, we investigate the dependence of the polarization state in gas-filled hollow-core fibers on… ▽ More Gas-filled hollow core fibers allow the generation of single-cycle pulses at megahertz repetition rates. When coupled with difference frequency generation, they can be an ideal driver for the generation of carrier-envelope phase stable, octave-spanning pulses in the short-wavelength infrared. In this work, we investigate the dependence of the polarization state in gas-filled hollow-core fibers on the subsequent difference frequency generation stage. We show that by adjusting the input polarization state of light in geometrically symmetric systems, such as hollow-core fibers, one can achieve precise control over the polarization state of the output pulses. Importantly, this manipulation preserves the temporal characteristics of the ultrashort pulses generated, especially when operating near the single-cycle regime. We leverage this property to boost the down-conversion efficiency of these pulses in a type I difference frequency generation stage. Our technique overcomes the bandwidth and dispersion constraints of the previous methods that rely on broadband waveplates or adjustment of crystal axes relative to the laboratory frame. This advancement is crucial for experiments demanding pure polarization states in the eigenmodes of the laboratory frame. △ Less

Submitted 22 April, 2024; originally announced April 2024.

arXiv:2404.12643 [pdf]

AipanVR: A Virtual Reality Experience for Preserving Uttarakhand's Traditional Art Form

Authors: Nishant Chaudhary, Mihir Raj, Richik Bhattacharjee, Anmol Srivastava, Rakesh Sah, Pankaj Badoni

Abstract: This paper presents a demonstration of the developed prototype showcasing a way to preserve the Intangible Cultural Heritage of Uttarakhand, India. Aipan is a traditional art form practiced in the Kumaon region in the state of Uttarakhand. It is typically used to decorate floors and walls at places of worship or entrances of homes and is considered auspicious to begin any work or event. This art i… ▽ More This paper presents a demonstration of the developed prototype showcasing a way to preserve the Intangible Cultural Heritage of Uttarakhand, India. Aipan is a traditional art form practiced in the Kumaon region in the state of Uttarakhand. It is typically used to decorate floors and walls at places of worship or entrances of homes and is considered auspicious to begin any work or event. This art is associated with a great degree of social, cultural as well as religious significance and is passed from generation to generation. However, in the present era of modernization and technological advancements, this art form now stands on the verge of depletion. This study presents a humble attempt to preserve this vanishing art form through the use of Virtual Reality (VR). Ethnographic studies were conducted in Almora, Nainital, and Haldwani regions of Uttarakhand to trace the origins as well as to gain a deeper understanding of this art form. A total of ten (N =10) Aipan designers were interviewed. Several interesting insights are revealed through these studies that show the potential to be incorporated as a VR experience. △ Less

Submitted 19 April, 2024; originally announced April 2024.

Comments: Demonstrated at ISMAR 2020

arXiv:2404.12261 [pdf]

Design And Flight Testing Of LQRi Attitude Control For Quadcopter UAV

Authors: Astik Srivastava, S. Indu, Richa Sharma

Abstract: This paper presents the design, implementation, and flight test results of linear quadratic integral regulator (LQRi) based attitude control for a quadcopter UAV. We present the derivation of the mathematical model for the kinematics and dynamics of the UAV, along with the linearized state space representation of the system about hover conditions. LQR and LQRi controllers are then designed to stab… ▽ More This paper presents the design, implementation, and flight test results of linear quadratic integral regulator (LQRi) based attitude control for a quadcopter UAV. We present the derivation of the mathematical model for the kinematics and dynamics of the UAV, along with the linearized state space representation of the system about hover conditions. LQR and LQRi controllers are then designed to stabilize the UAV in hover conditions and to track desired attitude commands. The controllers are then implemented onboard the Pixhawk flight controller and flight test results are discussed. Finally, the code related to this paper has been published open-source for replication and further research △ Less

Submitted 18 April, 2024; originally announced April 2024.

Comments: This research is still work under progress. The paper has been posted here for wider review by community

arXiv:2404.07304 [pdf, other]

We're Calling an Intervention: Exploring the Fundamental Hurdles in Adapting Language Models to Nonstandard Text

Authors: Aarohi Srivastava, David Chiang

Abstract: We present a suite of experiments that allow us to understand the underlying challenges of language model adaptation to nonstandard text. We do so by designing interventions that approximate several types of linguistic variation and their interactions with existing biases of language models. Applying our interventions during language model adaptation with varying size and nature of training data,… ▽ More We present a suite of experiments that allow us to understand the underlying challenges of language model adaptation to nonstandard text. We do so by designing interventions that approximate several types of linguistic variation and their interactions with existing biases of language models. Applying our interventions during language model adaptation with varying size and nature of training data, we gain important insights into when knowledge transfer can be successful, as well as the aspects of linguistic variation that are particularly difficult for language models to deal with. For instance, on text with character-level variation, performance improves with even a few training examples but approaches a plateau, suggesting that more data is not the solution. In contrast, on text with variation involving new words or meanings, far more data is needed, but it leads to a massive breakthrough in performance. Our findings reveal that existing models lack the necessary infrastructure to handle diverse forms of nonstandard text and linguistic variation, guiding the development of more resilient language modeling techniques for the future. We make the code for our interventions, which can be applied to any English text data, publicly available. △ Less

Submitted 15 June, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

Comments: Preprint

arXiv:2404.05379 [pdf, other]

Logic-dependent emergence of multistability, hysteresis, and biphasic dynamics in a minimal positive feedback network with an autoloop

Authors: Akriti Srivastava, Mubasher Rashid

Abstract: Cellular decision-making (CDM) is a dynamic phenomenon often controlled by regulatory networks defining interactions between genes and transcription factor proteins. Traditional studies have focussed on molecular switches such as positive feedback circuits that exhibit at most bistability. However, higher-order dynamics such as tristability is also prominent in many biological processes. It is thu… ▽ More Cellular decision-making (CDM) is a dynamic phenomenon often controlled by regulatory networks defining interactions between genes and transcription factor proteins. Traditional studies have focussed on molecular switches such as positive feedback circuits that exhibit at most bistability. However, higher-order dynamics such as tristability is also prominent in many biological processes. It is thus imperative to identify a minimal circuit that can alone explain mono, bi, and tristable dynamics. In this work, we consider a two-component positive feedback network with an autoloop and explore these regimes of stability for different degrees of multimerization and the choice of Boolean logic functions. We report that this network can exhibit numerous dynamical scenarios such as bi-and tristability, hysteresis, and biphasic kinetics, explaining the possibilities of abrupt cell state transitions and the smooth state swap without a step-like switch. Specifically, while with monomeric regulation and competitive OR logic, the circuit exhibits mono-and bistability and biphasic dynamics, with non-competitive AND and OR logics only monostability can be achieved. To obtain bistability in the latter cases, we show that the autoloop must have (at least) dimeric regulation. In pursuit of higher-order stability, we show that tristability occurs with higher degrees of multimerization and with non-competitive OR logic only. Our results, backed by rigorous analytical calculations and numerical examples, thus explain the association between multistability, multimerization, and logic in this minimal circuit. Since this circuit underlies various biological processes, including epithelial-mesenchymal transition which often drives carcinoma metastasis, these results can thus offer crucial inputs to control cell state transition by manipulating multimerization and the logic of regulation in cells. △ Less

Submitted 8 April, 2024; originally announced April 2024.

arXiv:2404.03843 [pdf, other]

Scaling Motion Forecasting Models with Ensemble Distillation

Authors: Scott Ettinger, Kratarth Goel, Avikalp Srivastava, Rami Al-Rfou

Abstract: Motion forecasting has become an increasingly critical component of autonomous robotic systems. Onboard compute budgets typically limit the accuracy of real-time systems. In this work we propose methods of improving motion forecasting systems subject to limited compute budgets by combining model ensemble and distillation techniques. The use of ensembles of deep neural networks has been shown to im… ▽ More Motion forecasting has become an increasingly critical component of autonomous robotic systems. Onboard compute budgets typically limit the accuracy of real-time systems. In this work we propose methods of improving motion forecasting systems subject to limited compute budgets by combining model ensemble and distillation techniques. The use of ensembles of deep neural networks has been shown to improve generalization accuracy in many application domains. We first demonstrate significant performance gains by creating a large ensemble of optimized single models. We then develop a generalized framework to distill motion forecasting model ensembles into small student models which retain high performance with a fraction of the computing cost. For this study we focus on the task of motion forecasting using real world data from autonomous driving systems. We develop ensemble models that are very competitive on the Waymo Open Motion Dataset (WOMD) and Argoverse leaderboards. From these ensembles, we train distilled student models which have high performance at a fraction of the compute costs. These experiments demonstrate distillation from ensembles as an effective method for improving accuracy of predictive models for robotic systems with limited compute budgets. △ Less

Submitted 13 May, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

Comments: 11 pages, 14 figures

arXiv:2403.20027 [pdf, ps, other]

Interplay between Negation of a Probability Distribution and Jensen Inequality

Authors: Amit Srivastava

Abstract: Yager[5] proposed a transformation for opposing(negating) the occurence of an event that is not certain using the idea that one can oppose the occurence of any uncertain event by allocating its probability among the other outcomes in the sample space without preference to any particular outcome \textit{i.e.} the probability of every event in the sample space is redistributed equally among the othe… ▽ More Yager[5] proposed a transformation for opposing(negating) the occurence of an event that is not certain using the idea that one can oppose the occurence of any uncertain event by allocating its probability among the other outcomes in the sample space without preference to any particular outcome \textit{i.e.} the probability of every event in the sample space is redistributed equally among the other outcomes in the sample space. However this redistribution increases the uncertainty associated with the occurence of events. In the present work, we have established bounds on the uncertainty associated with negation of a probability distribution using well known Jensen inequality. The obtained results are validated with the help of various numerical examples. Finally a dissimilarity function between a probability distribution and its negation has been developed. △ Less

Submitted 29 March, 2024; originally announced March 2024.

arXiv:2403.17541 [pdf, other]

WordRobe: Text-Guided Generation of Textured 3D Garments

Authors: Astitva Srivastava, Pranav Manu, Amit Raj, Varun Jampani, Avinash Sharma

Abstract: In this paper, we tackle a new and challenging problem of text-driven generation of 3D garments with high-quality textures. We propose "WordRobe", a novel framework for the generation of unposed & textured 3D garment meshes from user-friendly text prompts. We achieve this by first learning a latent representation of 3D garments using a novel coarse-to-fine training strategy and a loss for latent d… ▽ More In this paper, we tackle a new and challenging problem of text-driven generation of 3D garments with high-quality textures. We propose "WordRobe", a novel framework for the generation of unposed & textured 3D garment meshes from user-friendly text prompts. We achieve this by first learning a latent representation of 3D garments using a novel coarse-to-fine training strategy and a loss for latent disentanglement, promoting better latent interpolation. Subsequently, we align the garment latent space to the CLIP embedding space in a weakly supervised manner, enabling text-driven 3D garment generation and editing. For appearance modeling, we leverage the zero-shot generation capability of ControlNet to synthesize view-consistent texture maps in a single feed-forward inference step, thereby drastically decreasing the generation time as compared to existing methods. We demonstrate superior performance over current SOTAs for learning 3D garment latent space, garment interpolation, and text-driven texture synthesis, supported by quantitative evaluation and qualitative user study. The unposed 3D garment meshes generated using WordRobe can be directly fed to standard cloth simulation & animation pipelines without any post-processing. △ Less

Submitted 26 March, 2024; originally announced March 2024.

arXiv:2403.14603 [pdf, other]

Alfvén Pulse Driven Spicule-like Jets in the Presence of Thermal Conduction and Ion-Neutral Collision in Two-Fluid Regime

Authors: A. K. Srivastava, Anshika Singh, Balveer Singh, K. Murawski, T. V. Zaqarashvili, D. Yuan, E. Scullion, Sudheer K. Mishra, B. N. Dwivedi

Abstract: We present the formation of quasi-periodic cool spicule-like jets in the solar atmosphere using 2.5-D numerical simulation in two-fluid regime (ions+neutrals) under the presence of thermal conduction and ion-neutral collision. The non-linear, impulsive Alfvénic perturbations at the top of the photosphere trigger field aligned magnetoacoustic perturbations due to ponderomotive force. The transport… ▽ More We present the formation of quasi-periodic cool spicule-like jets in the solar atmosphere using 2.5-D numerical simulation in two-fluid regime (ions+neutrals) under the presence of thermal conduction and ion-neutral collision. The non-linear, impulsive Alfvénic perturbations at the top of the photosphere trigger field aligned magnetoacoustic perturbations due to ponderomotive force. The transport of energy from Alfvén pulse to such vertical velocity perturbations due to ponderomotive force is considered as an initial trigger mechanism. Thereafter, these velocity perturbations steepen into the shocks followed by quasi-periodic rise and fall of the cool jets transporting mass in the overlying corona. △ Less

Submitted 21 March, 2024; originally announced March 2024.

Comments: In Press; Philosophical Transactions of the Royal Society A; 21 Pages; 06 Figures. Philosophical Transactions is the oldest English science journal in the world, which has been published continuously since March 1665 as launched by Henry Oldenburg

arXiv:2403.11539 [pdf, ps, other]

Detecting superfluid transition in the pulsar core

Authors: Partha Bagchi, Biswanath Layek, Dheeraj Saini, Anjishnu Sarkar, Ajit M. Srivastava, Deepthi Godaba Venkata

Abstract: It is believed that the core of a neutron star can be host to various novel phases of matter, from nucleon superfluid phase to exotic high baryon density QCD phases. Different observational signals for such phase transitions have been discussed in the literature. Here, we point out a unique phenomenon associated with phase transition to a superfluid phase, which may be the nucleon superfluid phase… ▽ More It is believed that the core of a neutron star can be host to various novel phases of matter, from nucleon superfluid phase to exotic high baryon density QCD phases. Different observational signals for such phase transitions have been discussed in the literature. Here, we point out a unique phenomenon associated with phase transition to a superfluid phase, which may be the nucleon superfluid phase or a phase like the CFL phase, allowing for superfluid vortices. In any superfluid phase transition, a random network of vortices forms via the so-called Kibble-Zurek mechanism, which eventually mostly decays away, finally leaving primarily vortices arising from the initial angular momentum of the core. This transient, random vortex network can have a non-zero net angular momentum for the superfluid component, which will generally be oriented in an arbitrary direction. This is in contrast to the final vortices, which arise from initial rotation and hence have the initial angular momentum of the neutron star. The angular momentum of the random vortex network is balanced by an equal and opposite angular momentum in the normal fluid due to the conservation of angular momentum, thereby imparting an arbitrarily oriented angular momentum component to the outer shell of the neutron star. This will affect the pulse timing and pulse profile of a pulsar. These changes in the pulses will decay away in a characteristic manner as the random vortex network decays, obeying specific scaling laws leading to universal features for the detection of superfluid transitions occurring in a pulsar core. △ Less

Submitted 18 March, 2024; originally announced March 2024.

Comments: 9 pages, no figures

arXiv:2403.11009 [pdf, other]

DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages

Authors: Fahim Faisal, Orevaoghene Ahia, Aarohi Srivastava, Kabir Ahuja, David Chiang, Yulia Tsvetkov, Antonios Anastasopoulos

Abstract: Language technologies should be judged on their usefulness in real-world use cases. An often overlooked aspect in natural language processing (NLP) research and evaluation is language variation in the form of non-standard dialects or language varieties (hereafter, varieties). Most NLP benchmarks are limited to standard language varieties. To fill this gap, we propose DIALECTBENCH, the first-ever l… ▽ More Language technologies should be judged on their usefulness in real-world use cases. An often overlooked aspect in natural language processing (NLP) research and evaluation is language variation in the form of non-standard dialects or language varieties (hereafter, varieties). Most NLP benchmarks are limited to standard language varieties. To fill this gap, we propose DIALECTBENCH, the first-ever large-scale benchmark for NLP on varieties, which aggregates an extensive set of task-varied variety datasets (10 text-level tasks covering 281 varieties). This allows for a comprehensive evaluation of NLP system performance on different language varieties. We provide substantial evidence of performance disparities between standard and non-standard language varieties, and we also identify language clusters with large performance divergence across tasks. We believe DIALECTBENCH provides a comprehensive view of the current state of NLP for language varieties and one step towards advancing it further. Code/data: https://github.com/ffaisal93/DialectBench △ Less

Submitted 7 July, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

Comments: Equal contribution: Fahim Faisal, Orevaoghene Ahia

arXiv:2403.10392 [pdf, other]

Statistical investigation of wave propagation in the quiet-Sun using IRIS spectroscopic observations

Authors: Kartika Sangal, A. K. Srivastava, P. Kayshap, Ding Yuan, E. Scullion

Abstract: In the current analysis, we use spectroscopic observations of the quiet-Sun made by IRIS instrument, and investigate wave propagation. We analyze various spectral lines formed in different atmospheric layers such as the photosphere, chromosphere, and transition region. We examine Doppler velocity time-series at various locations in the quiet-Sun to determine the dominant oscillation periods. Our r… ▽ More In the current analysis, we use spectroscopic observations of the quiet-Sun made by IRIS instrument, and investigate wave propagation. We analyze various spectral lines formed in different atmospheric layers such as the photosphere, chromosphere, and transition region. We examine Doppler velocity time-series at various locations in the quiet-Sun to determine the dominant oscillation periods. Our results executing statistical analysis resemble those of the classical physical scenario, indicating that the photosphere is mainly characterized by the dominant 5-minute period, while the chromosphere is primarily associated with the 3-minute oscillation period. In the transition region, we observe a variety of oscillation periods, with dominant periods of 3, 8, and 12 minutes. We estimate the cut-off frequency by deducing phase difference between two Doppler velocity time-series obtained from spectral line pairs in different atmospheric layers formed at different temperatures. It reveals a significant correlation between 3-minute periods in TR and photospheric oscillations, suggesting that these oscillations in the TR might propagate from the photosphere. Additionally, we analyze the phase difference between chromospheric oscillations and photospheric oscillations, demonstrating that only the 3-minute oscillations propagate upwards. Based on the statistical analyses, we suggest the presence of magnetoacoustic waves in the solar atmosphere in which some are propagating from the lower solar atmosphere upward, while some others are propagating downward. TR carries both long-period oscillations generated in situ, and some photospheric oscillations which are also able to reach there from below. △ Less

Submitted 15 March, 2024; originally announced March 2024.

Comments: 18 pages, 10 figures, accepted for publication in ApJ

arXiv:2403.03004 [pdf, other]

Ultralight vector dark matter search using data from the KAGRA O3GK run

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi , et al. (1778 additional authors not shown)

Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese… ▽ More Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we present the result of a search for $U(1)_{B-L}$ gauge boson DM using the KAGRA data from auxiliary length channels during the first joint observation run together with GEO600. By applying our search pipeline, which takes into account the stochastic nature of ultralight DM, upper bounds on the coupling strength between the $U(1)_{B-L}$ gauge boson and ordinary matter are obtained for a range of DM masses. While our constraints are less stringent than those derived from previous experiments, this study demonstrates the applicability of our method to the lower-mass vector DM search, which is made difficult in this measurement by the short observation time compared to the auto-correlation time scale of DM. △ Less

Submitted 5 March, 2024; originally announced March 2024.

Comments: 20 pages, 5 figures

Report number: LIGO-P2300250

arXiv:2403.02292 [pdf, other]

A Decade of Privacy-Relevant Android App Reviews: Large Scale Trends

Authors: Omer Akgul, Sai Teja Peddinti, Nina Taft, Michelle L. Mazurek, Hamza Harkous, Animesh Srivastava, Benoit Seguin

Abstract: We present an analysis of 12 million instances of privacy-relevant reviews publicly visible on the Google Play Store that span a 10 year period. By leveraging state of the art NLP techniques, we examine what users have been writing about privacy along multiple dimensions: time, countries, app types, diverse privacy topics, and even across a spectrum of emotions. We find consistent growth of privac… ▽ More We present an analysis of 12 million instances of privacy-relevant reviews publicly visible on the Google Play Store that span a 10 year period. By leveraging state of the art NLP techniques, we examine what users have been writing about privacy along multiple dimensions: time, countries, app types, diverse privacy topics, and even across a spectrum of emotions. We find consistent growth of privacy-relevant reviews, and explore topics that are trending (such as Data Deletion and Data Theft), as well as those on the decline (such as privacy-relevant reviews on sensitive permissions). We find that although privacy reviews come from more than 200 countries, 33 countries provide 90% of privacy reviews. We conduct a comparison across countries by examining the distribution of privacy topics a country's users write about, and find that geographic proximity is not a reliable indicator that nearby countries have similar privacy perspectives. We uncover some countries with unique patterns and explore those herein. Surprisingly, we uncover that it is not uncommon for reviews that discuss privacy to be positive (32%); many users express pleasure about privacy features within apps or privacy-focused apps. We also uncover some unexpected behaviors, such as the use of reviews to deliver privacy disclaimers to developers. Finally, we demonstrate the value of analyzing app reviews with our approach as a complement to existing methods for understanding users' perspectives about privacy △ Less

Submitted 15 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

Comments: This is the extended version of the paper accepted to USENIX Security 2024

arXiv:2403.02088 [pdf]

Realization of a Spin Glass in a two-dimensional van der Waals material

Authors: Banabir Pal, Ajesh K. Gopi, Yicheng Guan, Anirban Chakraborty, Kajal Tiwari, Anagha Mathew, Abhay K. Srivastava, Wenjie Zhang, Binoy K. Hazra, Holger Meyerheim, Stuart S. P. Parkin

Abstract: Recent advances in van der Waals (vdW) materials have sparked renewed interest in the impact of dimensionality on magnetic phase transitions. While ordered magnetic phases have been demonstrated to survive in the two-dimensional (2D) limit, the quest for a spin-glass with quenched magnetic disorder in lower dimensions has proven elusive. Here we show evidence of a spin-glass emerging from randomly… ▽ More Recent advances in van der Waals (vdW) materials have sparked renewed interest in the impact of dimensionality on magnetic phase transitions. While ordered magnetic phases have been demonstrated to survive in the two-dimensional (2D) limit, the quest for a spin-glass with quenched magnetic disorder in lower dimensions has proven elusive. Here we show evidence of a spin-glass emerging from randomly distributed Fe atoms in Fe3GeTe2, the first time such a state has been reported in a vdW material. AC magnetic susceptibility displays a strong frequency dependence indicative of slow spin dynamics. Additional distinctive phenomena, including ageing, chaos, and memory effects, further substantiate the existence of a glassy state. Remarkably, we find that this state persists even in single-unit-cell thick Fe3GeTe2, thereby confirming the existence of a 2D spin-glass. The formation of spin-glass states via intercalation in vdW systems allows for highly tunable spin-glass states that are otherwise difficult to realize. △ Less

Submitted 4 March, 2024; originally announced March 2024.

arXiv:2403.01081 [pdf, other]

LAB: Large-Scale Alignment for ChatBots

Authors: Shivchander Sudalairaj, Abhishek Bhandwaldar, Aldo Pareja, Kai Xu, David D. Cox, Akash Srivastava

Abstract: This work introduces LAB (Large-scale Alignment for chatBots), a novel methodology designed to overcome the scalability challenges in the instruction-tuning phase of large language model (LLM) training. Leveraging a taxonomy-guided synthetic data generation process and a multi-phase tuning framework, LAB significantly reduces reliance on expensive human annotations and proprietary models like GPT-… ▽ More This work introduces LAB (Large-scale Alignment for chatBots), a novel methodology designed to overcome the scalability challenges in the instruction-tuning phase of large language model (LLM) training. Leveraging a taxonomy-guided synthetic data generation process and a multi-phase tuning framework, LAB significantly reduces reliance on expensive human annotations and proprietary models like GPT-4. We demonstrate that LAB-trained models can achieve competitive performance across several benchmarks compared to models trained with traditional human-annotated or GPT-4 generated synthetic data. Thus offering a scalable, cost-effective solution for enhancing LLM capabilities and instruction-following behaviors without the drawbacks of catastrophic forgetting, marking a step forward in the efficient training of LLMs for a wide range of applications. △ Less

Submitted 29 April, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

Comments: Corresponding Author: Akash Srivastava. Equal Contribution: Shivchander Sudalairaj, Abhishek Bhandwaldar, Aldo Pareja, Akash Srivastava, Code: https://github.com/instructlab

arXiv:2403.00026 [pdf, other]

Learning to Deliver: a Foundation Model for the Montreal Capacitated Vehicle Routing Problem

Authors: Samuel J. K. Chin, Matthias Winkenbach, Akash Srivastava

Abstract: In this paper, we present the Foundation Model for the Montreal Capacitated Vehicle Routing Problem (FM-MCVRP), a novel Deep Learning (DL) model that approximates high-quality solutions to a variant of the Capacitated Vehicle Routing Problem (CVRP) that characterizes many real-world applications. The so-called Montreal Capacitated Vehicle Routing Problem (MCVRP), first formally described by Bengio… ▽ More In this paper, we present the Foundation Model for the Montreal Capacitated Vehicle Routing Problem (FM-MCVRP), a novel Deep Learning (DL) model that approximates high-quality solutions to a variant of the Capacitated Vehicle Routing Problem (CVRP) that characterizes many real-world applications. The so-called Montreal Capacitated Vehicle Routing Problem (MCVRP), first formally described by Bengio et al. (2021), is defined on a fixed and finite graph, which is analogous to a city. Each MCVRP instance is essentially the sub-graph connecting a randomly sampled subset of the nodes in the fixed graph, which represent a set of potential addresses in a real-world delivery problem on a given day. Our work exploits this problem structure to frame the MCVRP as an analogous Natural Language Processing (NLP) task. Specifically, we leverage a Transformer architecture embedded in a Large Language Model (LLM) framework to train our model in a supervised manner on computationally inexpensive, sub-optimal MCVRP solutions obtained algorithmically. Through comprehensive computational experiments, we show that FM-MCVRP produces better MCVRP solutions than the training data and generalizes to larger sized problem instances not seen during training. Even when compared to near-optimal solutions from state-of-the-art heuristics, FM-MCVRP yields competitive results despite being trained on inferior data. For instance, for 400-customer problems, FM-MCVRP solutions on average fall within 2% of the benchmark. Our results further demonstrate that unlike prior works in the literature, FM-MCVRP is a unified model, which performs consistently and reliably on a range of problem instance sizes and parameter values such as the vehicle capacity. △ Less

Submitted 28 February, 2024; originally announced March 2024.

arXiv:2402.19464 [pdf, other]

Curiosity-driven Red-teaming for Large Language Models

Authors: Zhang-Wei Hong, Idan Shenfeld, Tsun-Hsuan Wang, Yung-Sung Chuang, Aldo Pareja, James Glass, Akash Srivastava, Pulkit Agrawal

Abstract: Large language models (LLMs) hold great potential for many natural language applications but risk generating incorrect or toxic content. To probe when an LLM generates unwanted content, the current paradigm is to recruit a \textit{red team} of human testers to design input prompts (i.e., test cases) that elicit undesirable responses from LLMs. However, relying solely on human testers is expensive… ▽ More Large language models (LLMs) hold great potential for many natural language applications but risk generating incorrect or toxic content. To probe when an LLM generates unwanted content, the current paradigm is to recruit a \textit{red team} of human testers to design input prompts (i.e., test cases) that elicit undesirable responses from LLMs. However, relying solely on human testers is expensive and time-consuming. Recent works automate red teaming by training a separate red team LLM with reinforcement learning (RL) to generate test cases that maximize the chance of eliciting undesirable responses from the target LLM. However, current RL methods are only able to generate a small number of effective test cases resulting in a low coverage of the span of prompts that elicit undesirable responses from the target LLM. To overcome this limitation, we draw a connection between the problem of increasing the coverage of generated test cases and the well-studied approach of curiosity-driven exploration that optimizes for novelty. Our method of curiosity-driven red teaming (CRT) achieves greater coverage of test cases while mantaining or increasing their effectiveness compared to existing methods. Our method, CRT successfully provokes toxic responses from LLaMA2 model that has been heavily fine-tuned using human preferences to avoid toxic outputs. Code is available at \url{https://github.com/Improbable-AI/curiosity_redteam} △ Less

Submitted 29 February, 2024; originally announced February 2024.

Comments: Published at ICLR 2024

arXiv:2402.19052 [pdf]

Exploring the Efficacy of Large Language Models in Summarizing Mental Health Counseling Sessions: A Benchmark Study

Authors: Prottay Kumar Adhikary, Aseem Srivastava, Shivani Kumar, Salam Michael Singh, Puneet Manuja, Jini K Gopinath, Vijay Krishnan, Swati Kedia, Koushik Sinha Deb, Tanmoy Chakraborty

Abstract: Comprehensive summaries of sessions enable an effective continuity in mental health counseling, facilitating informed therapy planning. Yet, manual summarization presents a significant challenge, diverting experts' attention from the core counseling process. This study evaluates the effectiveness of state-of-the-art Large Language Models (LLMs) in selectively summarizing various components of ther… ▽ More Comprehensive summaries of sessions enable an effective continuity in mental health counseling, facilitating informed therapy planning. Yet, manual summarization presents a significant challenge, diverting experts' attention from the core counseling process. This study evaluates the effectiveness of state-of-the-art Large Language Models (LLMs) in selectively summarizing various components of therapy sessions through aspect-based summarization, aiming to benchmark their performance. We introduce MentalCLOUDS, a counseling-component guided summarization dataset consisting of 191 counseling sessions with summaries focused on three distinct counseling components (aka counseling aspects). Additionally, we assess the capabilities of 11 state-of-the-art LLMs in addressing the task of component-guided summarization in counseling. The generated summaries are evaluated quantitatively using standard summarization metrics and verified qualitatively by mental health professionals. Our findings demonstrate the superior performance of task-specific LLMs such as MentalLlama, Mistral, and MentalBART in terms of standard quantitative metrics such as Rouge-1, Rouge-2, Rouge-L, and BERTScore across all aspects of counseling components. Further, expert evaluation reveals that Mistral supersedes both MentalLlama and MentalBART based on six parameters -- affective attitude, burden, ethicality, coherence, opportunity costs, and perceived effectiveness. However, these models share the same weakness by demonstrating a potential for improvement in the opportunity costs and perceived effectiveness metrics. △ Less

Submitted 29 February, 2024; originally announced February 2024.

arXiv:2402.09523 [pdf, other]

Introduction to quantum entanglement in many-body systems

Authors: Anubhav Kumar Srivastava, Guillem Müller-Rigat, Maciej Lewenstein, Grzegorz Rajchel-Mieldzioć

Abstract: The quantum mechanics formalism introduced new revolutionary concepts challenging our everyday perceptions. Arguably, quantum entanglement, which explains correlations that cannot be reproduced classically, is the most notable of them. Besides its fundamental aspect, entanglement is also a resource, fueling emergent technologies such as quantum simulators and computers. The purpose of this chapter… ▽ More The quantum mechanics formalism introduced new revolutionary concepts challenging our everyday perceptions. Arguably, quantum entanglement, which explains correlations that cannot be reproduced classically, is the most notable of them. Besides its fundamental aspect, entanglement is also a resource, fueling emergent technologies such as quantum simulators and computers. The purpose of this chapter is to give a pedagogical introduction to the topic with a special emphasis on the multipartite scenario, i.e., entanglement distributed among many degrees of freedom. Due to the combinatorial complexity of this setting, particles can interact and become entangled in a plethora of ways, which we characterize here. We start by providing the necessary mathematical tools and elementary concepts from entanglement theory. A part of this chapter will be devoted to classifying and ordering entangled states. Then, we focus on various entanglement structures useful in condensed-matter theory such as tensor-network states or symmetric states useful for quantum-enhanced sensing. Finally, we discuss state-of-the-art methods to detect and certify such correlations in experiments, with some relevant illustrative examples. △ Less

Submitted 14 February, 2024; originally announced February 2024.

Comments: 59 pages, 8 figures. Comments are welcome

arXiv:2402.07173 [pdf, other]

INSITE: labelling medical images using submodular functions and semi-supervised data programming

Authors: Akshat Gautam, Anurag Shandilya, Akshit Srivastava, Venkatapathy Subramanian, Ganesh Ramakrishnan, Kshitij Jadhav

Abstract: The necessity of large amounts of labeled data to train deep models, especially in medical imaging creates an implementation bottleneck in resource-constrained settings. In Insite (labelINg medical imageS usIng submodular funcTions and sEmi-supervised data programming) we apply informed subset selection to identify a small number of most representative or diverse images from a huge pool of unlabel… ▽ More The necessity of large amounts of labeled data to train deep models, especially in medical imaging creates an implementation bottleneck in resource-constrained settings. In Insite (labelINg medical imageS usIng submodular funcTions and sEmi-supervised data programming) we apply informed subset selection to identify a small number of most representative or diverse images from a huge pool of unlabelled data subsequently annotated by a domain expert. The newly annotated images are then used as exemplars to develop several data programming-driven labeling functions. These labelling functions output a predicted-label and a similarity score when given an unlabelled image as an input. A consensus is brought amongst the outputs of these labeling functions by using a label aggregator function to assign the final predicted label to each unlabelled data point. We demonstrate that informed subset selection followed by semi-supervised data programming methods using these images as exemplars perform better than other state-of-the-art semi-supervised methods. Further, for the first time we demonstrate that this can be achieved through a small set of images used as exemplars. △ Less

Submitted 11 February, 2024; originally announced February 2024.

arXiv:2402.06377 [pdf, other]

High-Precision Geosteering via Reinforcement Learning and Particle Filters

Authors: Ressi Bonti Muhammad, Apoorv Srivastava, Sergey Alyaev, Reidar Brumer Bratvold, Daniel M. Tartakovsky

Abstract: Geosteering, a key component of drilling operations, traditionally involves manual interpretation of various data sources such as well-log data. This introduces subjective biases and inconsistent procedures. Academic attempts to solve geosteering decision optimization with greedy optimization and Approximate Dynamic Programming (ADP) showed promise but lacked adaptivity to realistic diverse scenar… ▽ More Geosteering, a key component of drilling operations, traditionally involves manual interpretation of various data sources such as well-log data. This introduces subjective biases and inconsistent procedures. Academic attempts to solve geosteering decision optimization with greedy optimization and Approximate Dynamic Programming (ADP) showed promise but lacked adaptivity to realistic diverse scenarios. Reinforcement learning (RL) offers a solution to these challenges, facilitating optimal decision-making through reward-based iterative learning. State estimation methods, e.g., particle filter (PF), provide a complementary strategy for geosteering decision-making based on online information. We integrate an RL-based geosteering with PF to address realistic geosteering scenarios. Our framework deploys PF to process real-time well-log data to estimate the location of the well relative to the stratigraphic layers, which then informs the RL-based decision-making process. We compare our method's performance with that of using solely either RL or PF. Our findings indicate a synergy between RL and PF in yielding optimized geosteering decisions. △ Less

Submitted 9 February, 2024; originally announced February 2024.

Comments: 40 pages

arXiv:2401.07048 [pdf, other]

2.5-D MHD Simulation of the Formation and Evolution of Plasmoids in Coronal Current Sheets

Authors: Sripan Mondal, Abhishek K Srivastava, David I. Pontin, Ding Yuan, Eric R. Priest

Abstract: In the present paper, using MPI-AMRVAC, we perform a 2.5-D numerical MHD simulation of the dynamics and associated thermodynamical evolution of an initially force-free Harris current sheet subjected to an external velocity perturbation under the condition of uniform resistivity. The amplitude of the magnetic field is taken to be 10 Gauss, typical of the solar corona. We impose a Gaussian velocity… ▽ More In the present paper, using MPI-AMRVAC, we perform a 2.5-D numerical MHD simulation of the dynamics and associated thermodynamical evolution of an initially force-free Harris current sheet subjected to an external velocity perturbation under the condition of uniform resistivity. The amplitude of the magnetic field is taken to be 10 Gauss, typical of the solar corona. We impose a Gaussian velocity pulse across this current sheet mimicking the interaction of fast magnetoacoustic waves with a current sheet in corona. This leads to a variety of dynamics and plasma processes in the current sheet, which is initially quasi-static. The initial pulse interacts with the current sheet and splits into a pair of counter-propagating wavefronts, which forms a rarefied region and leads to inflow and a thinning of the current sheet. The thinning results in Petschek-type magnetic reconnection followed by tearing instability and plasmoid formation. The reconnection outflows containing outward-moving plasmoids have accelerated motions with velocities ranging from 105-303 km/s. The average temperature and density of the plasmoids are found to be 8 MK and twice the background density of the solar corona, respectively. These estimates of velocity, temperature and density of plasmoids are similar to values reported from various solar coronal observations. Therefore, we infer that the external triggering of a quasi-static current sheet by a single velocity pulse is capable of initiating magnetic reconnection and plasmoid formation in the absence of a localized enhancement of resistivity in the solar corona. △ Less

Submitted 13 January, 2024; originally announced January 2024.

Comments: 20 pages, 10 figures, Accepted for publication in The Astrophysical Journal

arXiv:2401.06295 [pdf, ps, other]

Linear and nonlinear Granger causality analysis of turbulent duct flows

Authors: Barbara Lopez-Doriga, Marco Atzori, Ricardo Vinuesa, H. Jane Bae, Ankit Srivastava, Scott T. M. Dawson

Abstract: This research focuses on the identification and causality analysis of coherent structures that arise in turbulent flows in square and rectangular ducts. Coherent structures are first identified from direct numerical simulation data via proper orthogonal decomposition (POD), both by using all velocity components, and after separating the streamwise and secondary components of the flow. The causal r… ▽ More This research focuses on the identification and causality analysis of coherent structures that arise in turbulent flows in square and rectangular ducts. Coherent structures are first identified from direct numerical simulation data via proper orthogonal decomposition (POD), both by using all velocity components, and after separating the streamwise and secondary components of the flow. The causal relations between the mode coefficients are analysed using pairwise-conditional Granger causality analysis. We also formulate a nonlinear Granger causality analysis that can account for nonlinear interactions between modes. Focusing on streamwise-constant structures within a duct of short streamwise extent, we show that the causal relationships are highly sensitive to whether the mode coefficients or their squared values are considered, whether nonlinear effects are explicitly accounted for, and whether streamwise and secondary flow structures are separated prior to causality analyses. We leverage these sensitivities to determine that linear mechanisms underpin causal relationships between modes that share the same symmetry or anti-symmetry properties about the corner bisector, while nonlinear effects govern the causal interactions between symmetric and antisymmetric modes. In all cases, we find that the secondary flow fluctuations (manifesting as streamwise vorticial structures) are the primary cause of both the presence and movement of near-wall streaks towards and away from the duct corners. △ Less

Submitted 11 January, 2024; originally announced January 2024.

arXiv:2401.03390 [pdf, other]

Dynamics-based Feature Augmentation of Graph Neural Networks for Variant Emergence Prediction

Authors: Majd Al Aawar, Srikar Mutnuri, Mansooreh Montazerin, Ajitesh Srivastava

Abstract: During the COVID-19 pandemic, a major driver of new surges has been the emergence of new variants. When a new variant emerges in one or more countries, other nations monitor its spread in preparation for its potential arrival. The impact of the new variant and the timings of epidemic peaks in a country highly depend on when the variant arrives. The current methods for predicting the spread of new… ▽ More During the COVID-19 pandemic, a major driver of new surges has been the emergence of new variants. When a new variant emerges in one or more countries, other nations monitor its spread in preparation for its potential arrival. The impact of the new variant and the timings of epidemic peaks in a country highly depend on when the variant arrives. The current methods for predicting the spread of new variants rely on statistical modeling, however, these methods work only when the new variant has already arrived in the region of interest and has a significant prevalence. Can we predict when a variant existing elsewhere will arrive in a given region? To address this question, we propose a variant-dynamics-informed Graph Neural Network (GNN) approach. First, we derive the dynamics of variant prevalence across pairs of regions (countries) that apply to a large class of epidemic models. The dynamics motivate the introduction of certain features in the GNN. We demonstrate that our proposed dynamics-informed GNN outperforms all the baselines, including the currently pervasive framework of Physics-Informed Neural Networks (PINNs). To advance research in this area, we introduce a benchmarking tool to assess a user-defined model's prediction performance across 87 countries and 36 variants. △ Less

Submitted 28 May, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

arXiv:2401.03108 [pdf, other]

Dress-Me-Up: A Dataset & Method for Self-Supervised 3D Garment Retargeting

Authors: Shanthika Naik, Kunwar Singh, Astitva Srivastava, Dhawal Sirikonda, Amit Raj, Varun Jampani, Avinash Sharma

Abstract: We propose a novel self-supervised framework for retargeting non-parameterized 3D garments onto 3D human avatars of arbitrary shapes and poses, enabling 3D virtual try-on (VTON). Existing self-supervised 3D retargeting methods only support parametric and canonical garments, which can only be draped over parametric body, e.g. SMPL. To facilitate the non-parametric garments and body, we propose a no… ▽ More We propose a novel self-supervised framework for retargeting non-parameterized 3D garments onto 3D human avatars of arbitrary shapes and poses, enabling 3D virtual try-on (VTON). Existing self-supervised 3D retargeting methods only support parametric and canonical garments, which can only be draped over parametric body, e.g. SMPL. To facilitate the non-parametric garments and body, we propose a novel method that introduces Isomap Embedding based correspondences matching between the garment and the human body to get a coarse alignment between the two meshes. We perform neural refinement of the coarse alignment in a self-supervised setting. Further, we leverage a Laplacian detail integration method for preserving the inherent details of the input garment. For evaluating our 3D non-parametric garment retargeting framework, we propose a dataset of 255 real-world garments with realistic noise and topological deformations. The dataset contains $44$ unique garments worn by 15 different subjects in 5 distinctive poses, captured using a multi-view RGBD capture setup. We show superior retargeting quality on non-parametric garments and human avatars over existing state-of-the-art methods, acting as the first-ever baseline on the proposed dataset for non-parametric 3D garment retargeting. △ Less

Submitted 5 January, 2024; originally announced January 2024.

arXiv:2401.01266 [pdf, other]

Modelling the mechanics of 32 T REBCO superconductor magnet using numerical simulation

Authors: Arpit Kumar Srivastava, Enric Pardo

Abstract: High temperature REBCO superconducting tapes are very promising for high-field magnets. Under high magnetic fields there are high electro-mechanical forces, and thus concern for mechanical damage. Due to large screening currents and composite structure of the tape, the mechanical design of these magnets is not straightforward. In addition, many contemporary designs use insulated winding. In this w… ▽ More High temperature REBCO superconducting tapes are very promising for high-field magnets. Under high magnetic fields there are high electro-mechanical forces, and thus concern for mechanical damage. Due to large screening currents and composite structure of the tape, the mechanical design of these magnets is not straightforward. In addition, many contemporary designs use insulated winding. In this work we develop a novel two-dimensional axisymmetric finite element tool programmed in MATLAB that assumes the displacement field within linear elastic range. The stack of pancakes and a large number of REBCO tape turns are approximated as an an-isotropic bulk hollow cylinder. Our results agree with uni-axial stress experiments in literature, validating the bulk approximation. Here, we study the following configuration. The current is first ramp up to below the critical current and we calculate the screening currents and the forces that they cause using the MEMEP model. As a case study, 32 T REBCO superconductor magnet, is taken and simulated numerically. We have done complete mechanical analysis of the magnet by including the axial and shear mechanical quantities for each pancake unlike previous work where only radial and circumferential quantities are focused. Effect on mechanical quantities without screening current is also calculated and compared. It is shown that including screening current induced field strongly affect the mechanical quantities, specially the shear stress. The latter might be the critical quantity for certain magnet configurations. Additionally, in order to overcome high stresses, a stiff over banding of different material is considered and numerically modelled which significantly reduces the mechanical stresses. The FE based model developed is efficient to calculate the mechanical behaviour of any general superconductor magnet and its devices. △ Less

Submitted 2 January, 2024; originally announced January 2024.

arXiv:2401.00338 [pdf]

A Rapid Scoping Review and Conceptual Analysis of the Educational Metaverse in the Global South: Socio-Technical Perspectives

Authors: Anmol Srivastava

Abstract: This paper presents a conceptual insight into the Design of the Metaverse to facilitate educational transformation in selected developing nations within the Global South regions, e.g., India. These regions are often afflicted with socio-economic challenges but rich in cultural diversity. By utilizing a socio-technical design approach, this study explores the specific needs and opportunities presen… ▽ More This paper presents a conceptual insight into the Design of the Metaverse to facilitate educational transformation in selected developing nations within the Global South regions, e.g., India. These regions are often afflicted with socio-economic challenges but rich in cultural diversity. By utilizing a socio-technical design approach, this study explores the specific needs and opportunities presented by these diverse settings. A rapid scoping review of the scant existing literature is conducted to provide fundamental insights. A novel design methodology was formulated that utilized ChatGPT for ideation, brainstorming, and literature survey query generation. This paper aims not only to shed light on the educational possibilities enabled by the Metaverse but also to highlight design considerations unique to the Global South. △ Less

Submitted 30 December, 2023; originally announced January 2024.

arXiv:2312.17270 [pdf, other]

Anticipated Network Surveillance -- An extrapolated study to predict cyber-attacks using Machine Learning and Data Analytics

Authors: Aviral Srivastava, Dhyan Thakkar, Dr. Sharda Valiveti, Dr. Pooja Shah, Dr. Gaurang Raval

Abstract: Machine learning and data mining techniques are utiized for enhancement of the security of any network. Researchers used machine learning for pattern detection, anomaly detection, dynamic policy setting, etc. The methods allow the program to learn from data and make decisions without human intervention, consuming a huge training period and computation power. This paper discusses a novel technique… ▽ More Machine learning and data mining techniques are utiized for enhancement of the security of any network. Researchers used machine learning for pattern detection, anomaly detection, dynamic policy setting, etc. The methods allow the program to learn from data and make decisions without human intervention, consuming a huge training period and computation power. This paper discusses a novel technique to predict an upcoming attack in a network based on several data parameters. The dataset is continuous in real-time implementation. The proposed model comprises dataset pre-processing, and training, followed by the testing phase. Based on the results of the testing phase, the best model is selected using which, event class which may lead to an attack is extracted. The event statistics are used for attack △ Less

Submitted 26 December, 2023; originally announced December 2023.

arXiv:2312.16163 [pdf, other]

Age of Information in Gossip Networks: A Friendly Introduction and Literature Survey

Authors: Priyanka Kaswan, Purbesh Mitra, Arunabh Srivastava, Sennur Ulukus

Abstract: Gossiping is a communication mechanism, used for fast information dissemination in a network, where each node of the network randomly shares its information with the neighboring nodes. To characterize the notion of fastness in the context of gossip networks, age of information (AoI) is used as a timeliness metric. In this article, we summarize the recent works related to timely gossiping in a netw… ▽ More Gossiping is a communication mechanism, used for fast information dissemination in a network, where each node of the network randomly shares its information with the neighboring nodes. To characterize the notion of fastness in the context of gossip networks, age of information (AoI) is used as a timeliness metric. In this article, we summarize the recent works related to timely gossiping in a network. We start with the introduction of randomized gossip algorithms as an epidemic algorithm for database maintenance, and how the gossiping literature was later developed in the context of rumor spreading, message passing and distributed mean estimation. Then, we motivate the need for timely gossiping in applications such as source tracking and decentralized learning. We evaluate timeliness scaling of gossiping in various network topologies, such as, fully connected, ring, grid, generalized ring, hierarchical, and sparse asymmetric networks. We discuss age-aware gossiping and the higher order moments of the age process. We also consider different variations of gossiping in networks, such as, file slicing and network coding, reliable and unreliable sources, information mutation, different adversarial actions in gossiping, and energy harvesting sensors. Finally, we conclude this article with a few open problems and future directions in timely gossiping. △ Less

Submitted 26 December, 2023; originally announced December 2023.

Showing 1–50 of 787 results for author: Srivastava, A