-
In situ observation of thermally activated and localized Li leaching from lithiated graphite
Authors:
Harrison Szeto,
Vijay Kumar,
Yangying Zhu
Abstract:
Temperature is known to impact Li-ion battery performance and safety, however, understanding its effect on Li-ion batteries has largely been limited to uniform high or low temperatures. While the insights gathered from such research are important, much less information is available on the effects of non-uniform temperatures which more accurately reflect the environments that Li-ion batteries are e…
▽ More
Temperature is known to impact Li-ion battery performance and safety, however, understanding its effect on Li-ion batteries has largely been limited to uniform high or low temperatures. While the insights gathered from such research are important, much less information is available on the effects of non-uniform temperatures which more accurately reflect the environments that Li-ion batteries are exposed to in real world applications. In this paper, we characterize the impact of a microscale, temperature hotspot on a Li-ion battery using a combination of in situ micro-Raman spectroscopy, in situ optical microscopy and COMSOL Multiphysics thermal simulations. Our results show that mild temperature heterogeneity induced by the micro-Raman laser can cause lithium to locally leach out from different lithiated graphite phases (LiC6 and LiC12) in the absence of an applied current. The Li metal is found to be largely localized to the region heated by the micro-Raman laser and is not observed upon uniform heating to comparable temperatures suggesting that temperature heterogeneity is uniquely responsible for causing Li to leach out from lithiated graphite phases. A mechanism whereby localized temperature heterogeneity induced by the laser induces heterogeneity in the degree of lithiation across the graphite anode is proposed to explain the localized Li leaching. This study highlights the sensitivity of lithiated graphite phases to minor temperature heterogeneity in the absence of an applied current.
△ Less
Submitted 5 July, 2024;
originally announced July 2024.
-
An Autoencoder Architecture for L-band Passive Microwave Retrieval of Landscape Freeze-Thaw Cycle
Authors:
Divya Kumawat,
Ardeshir Ebtehaj,
Xiaolan Xu,
Andreas Colliander,
Vipin Kumar
Abstract:
Estimating the landscape and soil freeze-thaw (FT) dynamics in the Northern Hemisphere is crucial for understanding permafrost response to global warming and changes in regional and global carbon budgets. A new framework is presented for surface FT-cycle retrievals using L-band microwave radiometry based on a deep convolutional autoencoder neural network. This framework defines the landscape FT-cy…
▽ More
Estimating the landscape and soil freeze-thaw (FT) dynamics in the Northern Hemisphere is crucial for understanding permafrost response to global warming and changes in regional and global carbon budgets. A new framework is presented for surface FT-cycle retrievals using L-band microwave radiometry based on a deep convolutional autoencoder neural network. This framework defines the landscape FT-cycle retrieval as a time series anomaly detection problem considering the frozen states as normal and thawed states as anomalies. The autoencoder retrieves the FT-cycle probabilistically through supervised reconstruction of the brightness temperature (TB) time series using a contrastive loss function that minimizes (maximizes) the reconstruction error for the peak winter (summer). Using the data provided by the Soil Moisture Active Passive (SMAP) satellite, it is demonstrated that the framework learns to isolate the landscape FT states over different land surface types with varying complexities related to the radiometric characteristics of snow cover, lake-ice phenology, and vegetation canopy. The consistency of the retrievals is evaluated over Alaska, against in situ ground-based observations, showing reduced uncertainties compared to the traditional methods that use thresholding of the normalized polarization ratio.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
Closure invariants for polarised radio interferometric observations: a graph theoretical approach
Authors:
Vinay Kumar,
Rajaram Nityananda,
Joseph Samuel
Abstract:
Aperture synthesis observations with full polarisation have long been used to study the magnetic fields of synchrotron emitting sources. Recently proposed closure invariants give us a powerful method for extracting information from measured visibilities which are corrupted by antenna and polarisation dependent gains. In this paper, a formalism developed earlier for complete graphs (where all visib…
▽ More
Aperture synthesis observations with full polarisation have long been used to study the magnetic fields of synchrotron emitting sources. Recently proposed closure invariants give us a powerful method for extracting information from measured visibilities which are corrupted by antenna and polarisation dependent gains. In this paper, a formalism developed earlier for complete graphs (where all visibilities are available) is extended to incomplete graphs. The formalism provides a complete and independent set of closure invariants from the measured visibilities in a general situation where not all visibilities are available. We then show in a simulated, quasi-realistic case that the invariants developed here contain usable information even in the presence of noise.
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
Evanescent Optothermoelectric Trapping: Deeper Potentials at a Largescale
Authors:
Chaudhary Eksha Rani,
Rahul Chand,
Ashutosh Shukla,
G V Pavan Kumar
Abstract:
Surface plasmons (SP) and their mediated effects have been widely used to manipulate micro- and nanoscale objects of dielectric and metallic nature. In this work, we show how SP excitation can be used to induce thermofluidic and thermoelectric effects to manipulate colloidal dynamics on a large scale. In an evanescent plasmonic trap, temperature gradients induce fluid flow that can facilitate part…
▽ More
Surface plasmons (SP) and their mediated effects have been widely used to manipulate micro- and nanoscale objects of dielectric and metallic nature. In this work, we show how SP excitation can be used to induce thermofluidic and thermoelectric effects to manipulate colloidal dynamics on a large scale. In an evanescent plasmonic trap, temperature gradients induce fluid flow that can facilitate particle accumulation. However, large out-of-plane flows expel particles from the trap, resulting in a shallow trap potential. Here, we numerically demonstrate how adding thermoelectric fields can overpower the optical and hydrodynamic forces to achieve a stable nanoparticle assembly at low excitation powers. We calculate the corresponding optical, fluidic, and thermoelectric trapping forces and potentials. These potentials can be enabled without resonant SP excitation, which requires careful optical alignment. Thus, we explain the mechanism of how, despite weak optical intensities and forces, sufficient trapping force can be supplied via the evanescent optothermoelectric trap to obtain large-scale reversible nanoparticle assemblies, irrespective of their shape, size, or material.
△ Less
Submitted 29 June, 2024;
originally announced July 2024.
-
Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks
Authors:
Ibrahim Abdelaziz,
Kinjal Basu,
Mayank Agarwal,
Sadhana Kumaravel,
Matthew Stallone,
Rameswar Panda,
Yara Rizk,
GP Bhargav,
Maxwell Crouse,
Chulaka Gunasekara,
Shajith Ikbal,
Sachin Joshi,
Hima Karanam,
Vineet Kumar,
Asim Munawar,
Sumit Neelam,
Dinesh Raghu,
Udit Sharma,
Adriana Meza Soria,
Dheeraj Sreedhar,
Praveen Venkateswaran,
Merve Unuvar,
David Cox,
Salim Roukos,
Luis Lastras
, et al. (1 additional authors not shown)
Abstract:
Large language models (LLMs) have recently shown tremendous promise in serving as the backbone to agentic systems, as demonstrated by their performance in multi-faceted, challenging benchmarks like SWE-Bench and Agent-Bench. However, to realize the true potential of LLMs as autonomous agents, they must learn to identify, call, and interact with external tools and application program interfaces (AP…
▽ More
Large language models (LLMs) have recently shown tremendous promise in serving as the backbone to agentic systems, as demonstrated by their performance in multi-faceted, challenging benchmarks like SWE-Bench and Agent-Bench. However, to realize the true potential of LLMs as autonomous agents, they must learn to identify, call, and interact with external tools and application program interfaces (APIs) to complete complex tasks. These tasks together are termed function calling. Endowing LLMs with function calling abilities leads to a myriad of advantages, such as access to current and domain-specific information in databases and knowledge sources, and the ability to outsource tasks that can be reliably performed by tools, e.g., a Python interpreter or calculator. While there has been significant progress in function calling with LLMs, there is still a dearth of open models that perform on par with proprietary LLMs like GPT, Claude, and Gemini. Therefore, in this work, we introduce the GRANITE-20B-FUNCTIONCALLING model under an Apache 2.0 license. The model is trained using a multi-task training approach on seven fundamental tasks encompassed in function calling, those being Nested Function Calling, Function Chaining, Parallel Functions, Function Name Detection, Parameter-Value Pair Detection, Next-Best Function, and Response Generation. We present a comprehensive evaluation on multiple out-of-domain datasets comparing GRANITE-20B-FUNCTIONCALLING to more than 15 other best proprietary and open models. GRANITE-20B-FUNCTIONCALLING provides the best performance among all open models on the Berkeley Function Calling Leaderboard and fourth overall. As a result of the diverse tasks and datasets used for training our model, we show that GRANITE-20B-FUNCTIONCALLING has better generalizability on multiple tasks in seven different evaluation datasets.
△ Less
Submitted 27 June, 2024;
originally announced July 2024.
-
UltraCortex: Submillimeter Ultra-High Field 9.4 T1 Brain MR Image Collection and Manual Cortical Segmentations
Authors:
Lucas Mahler,
Julius Steiglechner,
Benjamin Bender,
Tobias Lindig,
Dana Ramadan,
Jonas Bause,
Florian Birk,
Rahel Heule,
Edyta Charyasz,
Michael Erb,
Vinod Jangir Kumar,
Gisela E Hagberg,
Pascal Martin,
Gabriele Lohmann,
Klaus Scheffler
Abstract:
The UltraCortex repository (https://www.ultracortex.org) houses magnetic resonance imaging data of the human brain obtained at an ultra-high field strength of 9.4 T. It contains 86 structural MR images with spatial resolutions ranging from 0.6 to 0.8 mm. Additionally, the repository includes segmentations of 12 brains into gray and white matter compartments. These segmentations have been independe…
▽ More
The UltraCortex repository (https://www.ultracortex.org) houses magnetic resonance imaging data of the human brain obtained at an ultra-high field strength of 9.4 T. It contains 86 structural MR images with spatial resolutions ranging from 0.6 to 0.8 mm. Additionally, the repository includes segmentations of 12 brains into gray and white matter compartments. These segmentations have been independently validated by two expert neuroradiologists, thus establishing them as a reliable gold standard. This resource provides researchers with access to high-quality brain imaging data and validated segmentations, facilitating neuroimaging studies and advancing our understanding of brain structure and function. Existing repositories do not accommodate field strengths beyond 7 T, nor do they offer validated segmentations, underscoring the significance of this new resource.
△ Less
Submitted 5 July, 2024; v1 submitted 3 June, 2024;
originally announced June 2024.
-
SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation
Authors:
Xu Liu,
Jiuzhou Lei,
Ankit Prabhu,
Yuezhan Tao,
Igor Spasojevic,
Pratik Chaudhari,
Nikolay Atanasov,
Vijay Kumar
Abstract:
This paper develops a real-time decentralized metric-semantic Simultaneous Localization and Mapping (SLAM) approach that leverages a sparse and lightweight object-based representation to enable a heterogeneous robot team to autonomously explore 3D environments featuring indoor, urban, and forested areas without relying on GPS. We use a hierarchical metric-semantic representation of the environment…
▽ More
This paper develops a real-time decentralized metric-semantic Simultaneous Localization and Mapping (SLAM) approach that leverages a sparse and lightweight object-based representation to enable a heterogeneous robot team to autonomously explore 3D environments featuring indoor, urban, and forested areas without relying on GPS. We use a hierarchical metric-semantic representation of the environment, including high-level sparse semantic maps of object models and low-level voxel maps. We leverage the informativeness and viewpoint invariance of the high-level semantic map to obtain an effective semantics-driven place-recognition algorithm for inter-robot loop closure detection across aerial and ground robots with different sensing modalities. A communication module is designed to track each robot's own observations and those of other robots whenever communication links are available. Such observations are then used to construct a merged map. Our framework enables real-time decentralized operations onboard robots, allowing them to opportunistically leverage communication. We integrate and deploy our proposed framework on three types of aerial and ground robots. Extensive experimental results show an average inter-robot localization error of approximately 20 cm in position and 0.2 degrees in orientation, an object mapping F1 score consistently over 0.9, and a communication packet size of merely 2-3 megabytes per kilometer trajectory with as many as 1,000 landmarks. The project website can be found at https://xurobotics.github.io/slideslam/.
△ Less
Submitted 2 July, 2024; v1 submitted 24 June, 2024;
originally announced June 2024.
-
Surface phonons and possible structural phase transition in a topological semimetal PbTaSe2
Authors:
Vivek Kumar,
Pradeep Kumar
Abstract:
Topological insulators are a novel class of quantum materials characterized by protected gapless surface or edge states but insulating bulk states which is due to presence of spin-orbit interactions and time-reversal symmetry. Such an intriguing surface and bulk topology manifests itself in coupling with lattice dynamics due to electron-phonon scattering. Here we report an in-depth investigation o…
▽ More
Topological insulators are a novel class of quantum materials characterized by protected gapless surface or edge states but insulating bulk states which is due to presence of spin-orbit interactions and time-reversal symmetry. Such an intriguing surface and bulk topology manifests itself in coupling with lattice dynamics due to electron-phonon scattering. Here we report an in-depth investigation of a topological nodal line semimetal PbTaSe2 via temperature, polarization dependent Raman spectroscopy and temperature dependent single crystal X-ray diffraction (SC-XRD) measurements. Our analysis shows signature of electron-phonon coupling as reflected in the Fano asymmetry in line shape of M1-M4 modes and anomalous temperature variation of line-width of P3-P4 modes. Further polarization dependent phonon symmetry changes at different temperature (6K and 300K), discontinuities in bulk phonon dynamics for P2-P5 modes and disappearance of phonon modes i.e., M1-M5, on decreasing temperatures indicates towards a thermally induced structural phase transition which is also supported by the SC-XRD results. Hence based on our findings we propose that M1-M4 modes are surface phonon modes, the material undergoes a thermally induced structural phase transition from alpha to beta phase at T ~ 150 K or is in close proximity to the beta phase and another transition below T(CDW+beta) ~ 100K which is possibly due to the interplay of remanent completely commensurate charge density wave (CCDW) of 1H-TaSe2 and beta phase.
△ Less
Submitted 10 July, 2024; v1 submitted 22 June, 2024;
originally announced June 2024.
-
Connecting Rashba and Dresselhaus spin-orbit interactions to inversion asymmetry in perovskite oxide heterostructures
Authors:
Nirmal Ganguli,
Avishek Singh,
Vivek Kumar,
Jayita Chakraborty
Abstract:
Inversion asymmetry, combined with spin orbit interaction, leads to Rashba or Dresselhaus effects, or combinations of them that are promising for technologies based on antiferromagnetic spintronics. Since understanding the exact nature of spin-orbit interaction is crucial for developing a technology based on it, mapping the nature of inversion asymmetry with the type of spin-orbit interaction beco…
▽ More
Inversion asymmetry, combined with spin orbit interaction, leads to Rashba or Dresselhaus effects, or combinations of them that are promising for technologies based on antiferromagnetic spintronics. Since understanding the exact nature of spin-orbit interaction is crucial for developing a technology based on it, mapping the nature of inversion asymmetry with the type of spin-orbit interaction becomes the key. We simulate a perovskite oxide heterostructure LaAlO$_3|$SrIrO$_3|$SrTiO$_3$ preserving the inversion symmetry within density functional theory to demonstrate the relation between the nature of inversion asymmetry and the corresponding Rashba or Dresselhaus-type interaction. With progressive distortion in the heterostructure, we find how the structure inversion asymmetry sets in with distorted bond lengths and bond angles, leading to Rashba effect in the system. Further, introduction of tilted IrO$_6$ octahedra leads to bulk inversion asymmetry, helping a combined Rashba-Dresselhaus interaction to set in. A comparison of the spin textures obtained from our DFT calculations and theoretical modeling helps us identify the exact nature of the interactions. Besides demonstrating the connection between the nature of asymmetry with Rashba and Dresselhaus interactions, our work may serve as a guide to identifying different types of Rashba-like spin-orbit interactions.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Citation-Based Summarization of Landmark Judgments
Authors:
Purnima Bindal,
Vikas Kumar,
Vasudha Bhatnagar,
Parikshet Sirohi,
Ashwini Siwal
Abstract:
Landmark judgments are of prime importance in the Common Law System because of their exceptional jurisprudence and frequent references in other judgments. In this work, we leverage contextual references available in citing judgments to create an extractive summary of the target judgment. We evaluate the proposed algorithm on two datasets curated from the judgments of Indian Courts and find the res…
▽ More
Landmark judgments are of prime importance in the Common Law System because of their exceptional jurisprudence and frequent references in other judgments. In this work, we leverage contextual references available in citing judgments to create an extractive summary of the target judgment. We evaluate the proposed algorithm on two datasets curated from the judgments of Indian Courts and find the results promising.
△ Less
Submitted 16 June, 2024;
originally announced June 2024.
-
Interpretable MA-island clusters and fingerprints relating bainite microstructures to composition and processing temperature
Authors:
Vinod Kumar,
Sharukh Hussain,
Priyanka S,
P G Kubendran Amos
Abstract:
Realising the affect of composition and processing condition on bainite microstructures is often challenging, owing to the intricate distribution of the constituent phases. In this work, scanning electron micrographs of non-isothermally transformed bainite, with martensite-austenite (MA) islands, are analysed to relate the microstructures to the composition and quench-stop temperature. The inadequ…
▽ More
Realising the affect of composition and processing condition on bainite microstructures is often challenging, owing to the intricate distribution of the constituent phases. In this work, scanning electron micrographs of non-isothermally transformed bainite, with martensite-austenite (MA) islands, are analysed to relate the microstructures to the composition and quench-stop temperature. The inadequacy of the MA-islands' geometric features, namely aspect ratio, polygon area and compactness, in establishing this relation is made evident from Kullback-Leibler (KL) divergence at the outset. Clustering the bainite microstructures, following a combination of feature extraction and dimensionality reduction, further fails to realise the affect of composition and processing temperature. Deep-learning analysis of the individual MA islands, in contrast to the bainite microstructures, yields interpretable clusters with characteristically distinct size and morphology. These five clusters, referred to as fine- and coarse-dendrite, fine- and coarse-polygon and elongated, are exceptionally discernible and can be adopted to describe any MA island. Characterising the bainite microstructures, based on the distribution of the interpretable MA-island clusters, generates \textit{fingerprints} that sufficiently relates the composition and processing conditions with the microstructures.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
The inadequacy of the geometric features of MA islands in relating bainite microstructures to composition and processing conditions
Authors:
Vinod Kumar,
Sharukh Hussain,
Priyanka S,
P G Kubendran Amos
Abstract:
Achieving desired properties in bainite steels with MA islands demands understanding the affect of processing conditions and composition on their size and morphology. Generally, this understanding is gained by studying the change in the size and morphology of the MA islands with composition and the processing conditions. In the present work, around 8500 MA islands dispersed across of approximately…
▽ More
Achieving desired properties in bainite steels with MA islands demands understanding the affect of processing conditions and composition on their size and morphology. Generally, this understanding is gained by studying the change in the size and morphology of the MA islands with composition and the processing conditions. In the present work, around 8500 MA islands dispersed across of approximately 1500 bainite microstructures are investigated to comprehend the influence of composition and heat treatment cycle on the geometric features. The geometric features considered in this study include polygon area metric, compactness and aspect ratio. A thorough statistical analysis of these features across bainite steels of different compositions and processing conditions unravel that, though there are minor changes, no characteristic variation is introduced in the size and morphology of the MA islands. In other words, the distribution of the various forms of MA islands are almost identical in bainite steels of different chemistry and heat treatment, thereby indicating the inadequacy of geometric features in explicating the affect of processing conditions on microstructure.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
Optimal Convex Cover as Collision-free Space Approximation for Trajectory Generation
Authors:
Yuwei Wu,
Igor Spasojevic,
Pratik Chaudhari,
Vijay Kumar
Abstract:
We propose an online iterative algorithm to find a suitable convex cover to under-approximate the free space for autonomous navigation to delineate Safe Flight Corridors (SFC). The convex cover consists of a set of polytopes such that the union of the polytopes represents obstacle-free space, allowing us to find trajectories for robots that lie within the convex cover. In order to find the SFC tha…
▽ More
We propose an online iterative algorithm to find a suitable convex cover to under-approximate the free space for autonomous navigation to delineate Safe Flight Corridors (SFC). The convex cover consists of a set of polytopes such that the union of the polytopes represents obstacle-free space, allowing us to find trajectories for robots that lie within the convex cover. In order to find the SFC that facilitates optimal trajectory generation, we iteratively find overlapping polytopes of maximum volumes that include specified waypoints initialized by a geometric or kinematic planner. Constraints at waypoints appear in two alternating stages of a joint optimization problem, which is solved by a method inspired by the Alternating Direction Method of Multipliers (ADMM) with partially distributed variables. We validate the effectiveness of our proposed algorithm using a range of parameterized environments and show its applications for two-stage motion planning.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets
Authors:
Jiatong Shi,
Shih-Heng Wang,
William Chen,
Martijn Bartelds,
Vanya Bannihatti Kumar,
Jinchuan Tian,
Xuankai Chang,
Dan Jurafsky,
Karen Livescu,
Hung-yi Lee,
Shinji Watanabe
Abstract:
ML-SUPERB evaluates self-supervised learning (SSL) models on the tasks of language identification and automatic speech recognition (ASR). This benchmark treats the models as feature extractors and uses a single shallow downstream model, which can be fine-tuned for a downstream task. However, real-world use cases may require different configurations. This paper presents ML-SUPERB~2.0, which is a ne…
▽ More
ML-SUPERB evaluates self-supervised learning (SSL) models on the tasks of language identification and automatic speech recognition (ASR). This benchmark treats the models as feature extractors and uses a single shallow downstream model, which can be fine-tuned for a downstream task. However, real-world use cases may require different configurations. This paper presents ML-SUPERB~2.0, which is a new benchmark for evaluating pre-trained SSL and supervised speech models across downstream models, fine-tuning setups, and efficient model adaptation approaches. We find performance improvements over the setup of ML-SUPERB. However, performance depends on the downstream model design. Also, we find large performance differences between languages and datasets, suggesting the need for more targeted approaches to improve multilingual ASR performance.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
Spin-polarized DFT calculations for physical properties of novel KVSb half-Heusler compound for spintronic and thermodynamic applicability
Authors:
Ashwani Kumar,
Anupam,
Shyam L. Gupta,
Sumit Kumar,
Vipan Kumar,
Diwaker
Abstract:
In the reported study we have investigated the robust phase stability, elasto-mechanical, thermophysical and magnetic properties of KVSb half Heusler compound by implementing density functional theory models in Wien2k simulation package. The dynamic phase stability is computed in phase type I, II & III phase configurations by optimising their energy. It is observed that given compound is more stab…
▽ More
In the reported study we have investigated the robust phase stability, elasto-mechanical, thermophysical and magnetic properties of KVSb half Heusler compound by implementing density functional theory models in Wien2k simulation package. The dynamic phase stability is computed in phase type I, II & III phase configurations by optimising their energy. It is observed that given compound is more stable in spin-polarised state of phase type I. To explore the electronic band structure, we apply the generalised gradient approximation. The electronic band profile of the Heusler alloy display a half-metallic nature. Moreover, the calculated second-order elastic parameters divulge the ductile nature. To understand the thermodynamical and thermoelectric stability of the alloy at various temperature and pressures ranges we have utilised the Quasi-Harmonic Debye model. The computed value of magnetic moment found in good agreement with Slater-Pauling rule. Our findings confirms that the predicted half Heusler alloy can be used in various spintronics and thermoelectric applications.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
Review of Computational Epigraphy
Authors:
Vishal Kumar
Abstract:
Computational Epigraphy refers to the process of extracting text from stone inscription, transliteration, interpretation, and attribution with the aid of computational methods. Traditional epigraphy methods are time consuming, and tend to damage the stone inscriptions while extracting text. Additionally, interpretation and attribution are subjective and can vary between different epigraphers. Howe…
▽ More
Computational Epigraphy refers to the process of extracting text from stone inscription, transliteration, interpretation, and attribution with the aid of computational methods. Traditional epigraphy methods are time consuming, and tend to damage the stone inscriptions while extracting text. Additionally, interpretation and attribution are subjective and can vary between different epigraphers. However, using modern computation methods can not only be used to extract text, but also interpret and attribute the text in a robust way. We survey and document the existing computational methods that aid in the above-mentioned tasks in epigraphy.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies
Authors:
Junlin Wang,
Siddhartha Jain,
Dejiao Zhang,
Baishakhi Ray,
Varun Kumar,
Ben Athiwaratkun
Abstract:
A diverse array of reasoning strategies has been proposed to elicit the capabilities of large language models. However, in this paper, we point out that traditional evaluations which focus solely on performance metrics miss a key factor: the increased effectiveness due to additional compute. By overlooking this aspect, a skewed view of strategy efficiency is often presented. This paper introduces…
▽ More
A diverse array of reasoning strategies has been proposed to elicit the capabilities of large language models. However, in this paper, we point out that traditional evaluations which focus solely on performance metrics miss a key factor: the increased effectiveness due to additional compute. By overlooking this aspect, a skewed view of strategy efficiency is often presented. This paper introduces a framework that incorporates the compute budget into the evaluation, providing a more informative comparison that takes into account both performance metrics and computational cost. In this budget-aware perspective, we find that complex reasoning strategies often don't surpass simpler baselines purely due to algorithmic ingenuity, but rather due to the larger computational resources allocated. When we provide a simple baseline like chain-of-thought self-consistency with comparable compute resources, it frequently outperforms reasoning strategies proposed in the literature. In this scale-aware perspective, we find that unlike self-consistency, certain strategies such as multi-agent debate or Reflexion can become worse if more compute budget is utilized.
△ Less
Submitted 14 June, 2024; v1 submitted 10 June, 2024;
originally announced June 2024.
-
First-principle screening of structural, electronic and hydrogen storage properties of Vanadium based hydride perovskites XVH$_3$ (X = Li, K)
Authors:
Anupam,
Shyam Lal Gupta,
Vipan Kumar,
Sumit Kumar,
Sanjay Panwar,
Diwaker
Abstract:
V-based XVH$_3$ (X = Li,K) hydrides perovskites are investigated for their hydrogen storage capacity using the WIEN2K code. To verify the stability of these hydrides, first-principles investigations are employed to examine their structural, electronic and hydrogen storage properties. According to structural studies these compositions hydrides are stable and part of the cubic space group (221 Pm-3m…
▽ More
V-based XVH$_3$ (X = Li,K) hydrides perovskites are investigated for their hydrogen storage capacity using the WIEN2K code. To verify the stability of these hydrides, first-principles investigations are employed to examine their structural, electronic and hydrogen storage properties. According to structural studies these compositions hydrides are stable and part of the cubic space group (221 Pm-3m). We have examined many aspects of these compositions throughout, using the PBE-GGA exchange correlation potential. We obtained the energy versus volume curve and found the stable phase and structural parameter of these hydrides using equation of state given by Birch-Murnaghan's. These hydrides thermodynamic stability is expressed in terms of their gravimetric hydrogen storage capacity.The goal of this study is to compute the standard enthalpy of formation and thermal desorption to ascertain the stability of these hydrides. Based on band structure and density of state plots it is found that these compositions are metallic in nature. The study presents a preliminary theoretical approach for hydrogen storage applications of thermoelectric compositions, revealing their strong thermoelectric responses and potential for green energy sources.
△ Less
Submitted 8 June, 2024;
originally announced June 2024.
-
Ab-initio investigations of novel potential all-d metal Heusler alloys Co$_2$MnNb
Authors:
Sumit Kumar,
Diwaker,
Vivek Kumar,
Karan S. Vinayak,
Shyam Lal Gupta
Abstract:
In this study, we employ the Wien2k code to conduct ab-initio study of a novel potential all-d-metal Heusler alloy Co$_2$MnNb. The analysis utilizes the comparison of local spin density approximations (LDA) with Perdew-Burke-Ernzerh parameterized Generalized Gradient Approximation (PBE-GGA) for structural optimization while modified Becke-Jones potential (mBJ) exchange-correlation potentials to ex…
▽ More
In this study, we employ the Wien2k code to conduct ab-initio study of a novel potential all-d-metal Heusler alloy Co$_2$MnNb. The analysis utilizes the comparison of local spin density approximations (LDA) with Perdew-Burke-Ernzerh parameterized Generalized Gradient Approximation (PBE-GGA) for structural optimization while modified Becke-Jones potential (mBJ) exchange-correlation potentials to examine various characteristic properties of the alloy under study. Employing Birch-Murnaghan equation of state, we construct the energy-versus-volume curve, facilitating the determination of stable phases and structural parameters of the investigated alloys. Structural optimization in both non-magnetic (NM) and spin-polarized (FM) states reveals the stability of the alloy in the FM state. The compound exhibits metallic behavior in bulk, with notable anisotropic semiconducting behavior for down spin while pure metallic behavior for up spin electrons. Partial density of states of each element of the composition is also analysed to compare their respective contribution towards the observed band structure. The anisotropic behavior of Co$_2$MnNb for a specific spin state could be of importance in future spintronic and other thin films device applications.
△ Less
Submitted 8 June, 2024;
originally announced June 2024.
-
Exploring Child-Robot Interaction in Individual and Group settings in India
Authors:
Gayathri Manikutty,
Sai Ankith Potapragada,
Devasena Pasupuleti,
Mahesh S. Unnithan,
Arjun Venugopal,
Pranav Prabha,
Arunav H.,
Vyshnavi Anil Kumar,
Rthuraj P. R.,
Rao R Bhavani
Abstract:
This study evaluates the effectiveness of child-robot interactions with the HaKsh-E social robot in India, examining both individual and group interaction settings. The research centers on game-based interactions designed to teach hand hygiene to children aged 7-11. Utilizing video analysis, rubric assessments, and post-study questionnaires, the study gathered data from 36 participants. Findings i…
▽ More
This study evaluates the effectiveness of child-robot interactions with the HaKsh-E social robot in India, examining both individual and group interaction settings. The research centers on game-based interactions designed to teach hand hygiene to children aged 7-11. Utilizing video analysis, rubric assessments, and post-study questionnaires, the study gathered data from 36 participants. Findings indicate that children in both settings developed positive perceptions of the robot in terms of the robot's trustworthiness, closeness, and social support. The significant difference in the interaction level scores presented in the study suggests that group settings foster higher levels of interaction, potentially due to peer influence and collaborative dynamics. While both settings showed significant improvements in learning outcomes, the individual setting had more pronounced learning gains. This suggests that personal interactions with the robot might lead to deeper or more effective learning experiences. Consequently, this study concludes that individual interaction settings are more conducive for focused learning gains, while group settings enhance interaction and engagement.
△ Less
Submitted 4 June, 2024; v1 submitted 2 June, 2024;
originally announced June 2024.
-
mRNA secondary structure prediction using utility-scale quantum computers
Authors:
Dimitris Alevras,
Mihir Metkar,
Takahiro Yamamoto,
Vaibhaw Kumar,
Triet Friedhoff,
Jae-Eun Park,
Mitsuharu Takeori,
Mariana LaDue,
Wade Davis,
Alexey Galda
Abstract:
Recent advancements in quantum computing have opened new avenues for tackling long-standing complex combinatorial optimization problems that are intractable for classical computers. Predicting secondary structure of mRNA is one such notoriously difficult problem that can benefit from the ever-increasing maturity of quantum computing technology. Accurate prediction of mRNA secondary structure is cr…
▽ More
Recent advancements in quantum computing have opened new avenues for tackling long-standing complex combinatorial optimization problems that are intractable for classical computers. Predicting secondary structure of mRNA is one such notoriously difficult problem that can benefit from the ever-increasing maturity of quantum computing technology. Accurate prediction of mRNA secondary structure is critical in designing RNA-based therapeutics as it dictates various steps of an mRNA life cycle, including transcription, translation, and decay. The current generation of quantum computers have reached utility-scale, allowing us to explore relatively large problem sizes. In this paper, we examine the feasibility of solving mRNA secondary structures on a quantum computer with sequence length up to 60 nucleotides representing problems in the qubit range of 10 to 80. We use Conditional Value at Risk (CVaR)-based VQE algorithm to solve the optimization problems, originating from the mRNA structure prediction problem, on the IBM Eagle and Heron quantum processors. To our encouragement, even with ``minimal'' error mitigation and fixed-depth circuits, our hardware runs yield accurate predictions of minimum free energy (MFE) structures that match the results of the classical solver CPLEX. Our results provide sufficient evidence for the viability of solving mRNA structure prediction problems on a quantum computer and motivate continued research in this direction.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Training LLMs to Better Self-Debug and Explain Code
Authors:
Nan Jiang,
Xiaopeng Li,
Shiqi Wang,
Qiang Zhou,
Soneya Binta Hossain,
Baishakhi Ray,
Varun Kumar,
Xiaofei Ma,
Anoop Deoras
Abstract:
In the domain of code generation, self-debugging is crucial. It allows LLMs to refine their generated code based on execution feedback. This is particularly important because generating correct solutions in one attempt proves challenging for complex tasks. Prior works on self-debugging mostly focus on prompting methods by providing LLMs with few-shot examples, which work poorly on small open-sourc…
▽ More
In the domain of code generation, self-debugging is crucial. It allows LLMs to refine their generated code based on execution feedback. This is particularly important because generating correct solutions in one attempt proves challenging for complex tasks. Prior works on self-debugging mostly focus on prompting methods by providing LLMs with few-shot examples, which work poorly on small open-sourced LLMs. In this work, we propose a training framework that significantly improves self-debugging capability of LLMs. Intuitively, we observe that a chain of explanations on the wrong code followed by code refinement helps LLMs better analyze the wrong code and do refinement. We thus propose an automated pipeline to collect a high-quality dataset for code explanation and refinement by generating a number of explanations and refinement trajectories and filtering via execution verification. We perform supervised fine-tuning (SFT) and further reinforcement learning (RL) on both success and failure trajectories with a novel reward design considering code explanation and refinement quality. SFT improves the pass@1 by up to 15.92% and pass@10 by 9.30% over four benchmarks. RL training brings additional up to 3.54% improvement on pass@1 and 2.55% improvement on pass@10. The trained LLMs show iterative refinement ability, and can keep refining code continuously. Lastly, our human evaluation shows that the LLMs trained with our framework generate more useful code explanations and help developers better understand bugs in source code.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
Probing the Relationship between Defects and Enhanced Mobility in MoS2 Monolayers Grown by Mo Foil
Authors:
Sudipta Majumder,
Vaibhav Walve,
Rahul Chand,
Gokul M. A.,
Sooyeon Hwang,
G. V. Pavan Kumar,
Aparna Deshpande,
Atikur Rahman
Abstract:
Atomic vacancies, such as chalcogen vacancies in 2D TMDs, are important in changing the host material's electronic structure and transport properties. We present a straightforward one-step method for growing monolayer MoS2 utilizing oxidized Molybdenum (Mo) foil using CVD and delve into the transport properties of as-grown samples. Devices fabricated from these MoS2 sheets exhibit excellent electr…
▽ More
Atomic vacancies, such as chalcogen vacancies in 2D TMDs, are important in changing the host material's electronic structure and transport properties. We present a straightforward one-step method for growing monolayer MoS2 utilizing oxidized Molybdenum (Mo) foil using CVD and delve into the transport properties of as-grown samples. Devices fabricated from these MoS2 sheets exhibit excellent electrical responses, with the standout device achieving mobility exceeding 100 cm2V-1s-1. Structural analysis and optical signatures unveiled the presence of chalcogen defects within these samples. To decipher the influence of inherent defects on the electronic transport properties, we measured low-temperature transport on two distinct sets of devices exhibiting relatively high or low mobilities. Combining the thermally activated transport model with quantum capacitance calculations, we have shown the existence of shallow states near the conduction band, likely attributed to sulfur vacancies within MoS2. These vacancies are responsible for the hopping conduction of electrons in the device channel. Furthermore, our claims were substantiated through low-temperature scanning tunnelling microscopy measurements, which revealed an abundance of isolated and lateral double sulfur vacancies in Mo foil-grown samples. We found that these vacancies increase the density of states near the conduction band, inducing intrinsic n-type doping in the MoS2 channel. Consequently, this elevated conductivity enhances the field-effect mobility of MoS2 transistors. Our study offers insights into chalcogen vacancies in CVD-grown monolayer MoS2 and highlights their beneficial impact on electronic transport properties.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Probeable Problems for Beginner-level Programming-with-AI Contests
Authors:
Mrigank Pawagi,
Viraj Kumar
Abstract:
To broaden participation, competitive programming contests may include beginner-level problems that do not require knowledge of advanced Computer Science concepts (e.g., algorithms and data structures). However, since most participants have easy access to AI code-generation tools, these problems often become trivial to solve. For beginner-friendly programming contests that do not prohibit the use…
▽ More
To broaden participation, competitive programming contests may include beginner-level problems that do not require knowledge of advanced Computer Science concepts (e.g., algorithms and data structures). However, since most participants have easy access to AI code-generation tools, these problems often become trivial to solve. For beginner-friendly programming contests that do not prohibit the use of AI tools, we propose Probeable Problems: code writing tasks that provide (1) a problem specification that deliberately omits certain details, and (2) a mechanism to probe for these details by asking clarifying questions and receiving immediate feedback. To evaluate our proposal, we conducted a 2-hour programming contest for undergraduate Computer Science students from multiple institutions, where each student was an active member of their institution's computing club. The contest comprised of six Probeable Problems for which a popular code-generation tool (GitHub Copilot) was unable to generate accurate solutions due to the absence of details. Students were permitted to work individually or in groups, and were free to use AI tools. We obtained consent from 26 groups (67 students) to use their submissions for research. We analyze the extent to which the code submitted by these groups identifies missing details and identify ways in which Probeable Problems can support learning in formal and informal CS educational contexts.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
A common zero at the end point of the support of measure for the quasi-natured spectrally transformed polynomials
Authors:
Vikash Kumar,
A. Swaminathan
Abstract:
In this work, the explicit expressions of coefficients involved in quasi-type kernel polynomials of order one and quasi-Geronimus polynomials of order one are determined for Jacobi polynomials. These coefficients are responsible for establishing the orthogonality of quasi-spectral polynomials for Jacobi polynomials. Additionally, the orthogonality of quasi-type kernel Laguerre polynomials of order…
▽ More
In this work, the explicit expressions of coefficients involved in quasi-type kernel polynomials of order one and quasi-Geronimus polynomials of order one are determined for Jacobi polynomials. These coefficients are responsible for establishing the orthogonality of quasi-spectral polynomials for Jacobi polynomials. Additionally, the orthogonality of quasi-type kernel Laguerre polynomials of order one is derived. In the process of achieving orthogonality, one zero in both cases is located on the boundary of the support of the measure. This allows us to derive the chain sequence and minimal parameter sequence at the point lying at the end point of the support of the measure. Also, this leads to the question of characterizing such spectrally transformed polynomials.
Furthermore, the interlacing properties among the zeros of quasi-spectral orthogonal Jacobi polynomials and Jacobi polynomials are illustrated.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
Elucidating the role of electron transfer in the photoluminescence of $\mathrm{MoS_{2}}$ quantum dots synthesized by fs-pulse ablation
Authors:
Anubhab Sahoo,
Tejendra Dixit,
K. V. Anil Kumar,
K. Lakshmi Ganapathi,
Pramoda K. Nayak,
M. S. Ramachandra Rao,
Sivarama Krishnan
Abstract:
Herein, $\mathrm{MoS_{2}}$ quantum dot (QDs) with controlled optical, structural, and electronic properties are synthesized using the femtosecond pulsed laser ablation in liquid (fs-PLAL) technique by varying pulse-width, ablation power, and ablation time to harness the potential for next-generation optoelectronics and quantum technology. Furthermore, this work elucidates key aspects of the mechan…
▽ More
Herein, $\mathrm{MoS_{2}}$ quantum dot (QDs) with controlled optical, structural, and electronic properties are synthesized using the femtosecond pulsed laser ablation in liquid (fs-PLAL) technique by varying pulse-width, ablation power, and ablation time to harness the potential for next-generation optoelectronics and quantum technology. Furthermore, this work elucidates key aspects of the mechanisms underlying the near-UV and blue emission, the accompanying large Stokes-shift, and the consequent change in sample color with laser exposure parameters pertaining to $\mathrm{MoS_{2}}$ QDs. Through spectroscopic analysis, including UV-visible absorption, photoluminescence, and Raman spectroscopy, we successfully unravelled the mechanisms for the change in optoelectronic properties of $\mathrm{MoS_{2}}$ QDs with laser parameters. We realize that the occurrence of a secondary phase, specifically $\mathrm{MoO_{3-x}}$, is responsible for the significant Stokes-shift and blue emission observed in this QDs system. The primary factor influencing these activities is the electron transfer observed between these two phases, as validated by excitation dependent photoluminescence, XPS and Raman spectroscopies.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
Cosmic rays for imaging cultural heritage objects
Authors:
Andrea Giammanco,
Marwa Al Moussawi,
Matthieu Boone,
Tim De Kock,
Judy De Roy,
Sam Huysmans,
Vishal Kumar,
Maxime Lagrange,
Michael Tytgat
Abstract:
In cultural heritage conservation, it is increasingly common to rely on non-destructive imaging methods based on the absorption or scattering of photons ($X$ or $γ$ rays) or neutrons. However, physical and practical issues limit these techniques: their penetration depth may be insufficient for large and dense objects, they require transporting the objects of interest to dedicated laboratories, art…
▽ More
In cultural heritage conservation, it is increasingly common to rely on non-destructive imaging methods based on the absorption or scattering of photons ($X$ or $γ$ rays) or neutrons. However, physical and practical issues limit these techniques: their penetration depth may be insufficient for large and dense objects, they require transporting the objects of interest to dedicated laboratories, artificial radiation is hazardous and may induce activation in the material under study. Muons are elementary particles abundantly and freely produced in cosmic-ray interactions in the atmosphere. Their absorption and scattering in matter are characteristically dependent on the density and elemental composition of the material that they traverse, which offers the possibility of exploiting them for sub-surface remote imaging. This novel technique, nicknamed "muography", has been applied in use cases ranging from geophysics to archaeology to nuclear safety, but it has been so far under-explored for a vast category of cultural heritage objects that are relatively large (from decimeters to human size) and dense (stone, metals). The development of portable muon detectors makes muography particularly competitive in cases where the items to be analysed are not transportable, or set up in a confined environment. This document reviews the relevant literature, presents some exemplary use cases, and critically assesses the strengths and weaknesses of muography in this context.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Vision Transformers for End-to-End Vision-Based Quadrotor Obstacle Avoidance
Authors:
Anish Bhattacharya,
Nishanth Rao,
Dhruv Parikh,
Pratik Kunapuli,
Nikolai Matni,
Vijay Kumar
Abstract:
We demonstrate the capabilities of an attention-based end-to-end approach for high-speed quadrotor obstacle avoidance in dense, cluttered environments, with comparison to various state-of-the-art architectures. Quadrotor unmanned aerial vehicles (UAVs) have tremendous maneuverability when flown fast; however, as flight speed increases, traditional vision-based navigation via independent mapping, p…
▽ More
We demonstrate the capabilities of an attention-based end-to-end approach for high-speed quadrotor obstacle avoidance in dense, cluttered environments, with comparison to various state-of-the-art architectures. Quadrotor unmanned aerial vehicles (UAVs) have tremendous maneuverability when flown fast; however, as flight speed increases, traditional vision-based navigation via independent mapping, planning, and control modules breaks down due to increased sensor noise, compounding errors, and increased processing latency. Thus, learning-based, end-to-end planning and control networks have shown to be effective for online control of these fast robots through cluttered environments. We train and compare convolutional, U-Net, and recurrent architectures against vision transformer models for depth-based end-to-end control, in a photorealistic, high-physics-fidelity simulator as well as in hardware, and observe that the attention-based models are more effective as quadrotor speeds increase, while recurrent models with many layers provide smoother commands at lower speeds. To the best of our knowledge, this is the first work to utilize vision transformers for end-to-end vision-based quadrotor control.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Investigation of BaTiO$_3$-NiO composite as compact Dielectric Resonator Antenna
Authors:
Prithwiraj Ganguly,
Vince Kumar,
P. Maneesha,
Saptarshi Ghosh,
Somaditya Sen
Abstract:
A compact dielectric resonator antenna has been fabricated on a microstrip transmission line for the purpose of C-band wireless communication using a ceramic material made out of a sintered mixture of BTO and NiO. The antenna parameters are optimized using Ansys HFSS software and verified experimentally. Ni replaces both Ba at A site and Ti at B site. Such a solid solution has a limit depending on…
▽ More
A compact dielectric resonator antenna has been fabricated on a microstrip transmission line for the purpose of C-band wireless communication using a ceramic material made out of a sintered mixture of BTO and NiO. The antenna parameters are optimized using Ansys HFSS software and verified experimentally. Ni replaces both Ba at A site and Ti at B site. Such a solid solution has a limit depending on the amount of NiO provided during sintering. A complete study of the structural changes and the dielectric constant enables the correlation with the resonating property. All the samples retain the ferroelectric tetragonal P4mm phase with a nominal decrease in the c/a ratio. NiO incorporation in BTO decreases the sintering temperature and shows two types of morphology associated with BTO-like and NiO-like phases. It induces prominent reduction in the permittivity and loss tangent (<0.01) in the range 100Hz to 1MHz. These properties make these samples suitable for DRA application in the C-Band range [4-8 GHz]. Experimental and theoretical assessment using HFSS software yields a C-band signal at ~7.27 GHz.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
Phase separation in a binary mixture of sticky spheres
Authors:
D. C. Thakur,
Jalim Singh,
A. V. Anil Kumar
Abstract:
We numerically investigate the dependence of range of attractive potential on the phase separation of 2-D binary systems. Through extensive simulations and analysis, we show that when the range of attractive interactions approaches the sticky sphere limit, the system undergoes a phase separation at lower temperature. Further reduction in temperature causes the system to mix again. These mixing-dem…
▽ More
We numerically investigate the dependence of range of attractive potential on the phase separation of 2-D binary systems. Through extensive simulations and analysis, we show that when the range of attractive interactions approaches the sticky sphere limit, the system undergoes a phase separation at lower temperature. Further reduction in temperature causes the system to mix again. These mixing-demixing-mixing transitions are of first order. Such phase separation is not observed for systems with larger interaction range. In the phase separated region of the phase diagram, one of the components of the mixture chooses to be in crystalline configuration, while other being in disordered state
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Challenges and Opportunities for Large-Scale Exploration with Air-Ground Teams using Semantics
Authors:
Fernando Cladera,
Ian D. Miller,
Zachary Ravichandran,
Varun Murali,
Jason Hughes,
M. Ani Hsieh,
C. J. Taylor,
Vijay Kumar
Abstract:
One common and desirable application of robots is exploring potentially hazardous and unstructured environments. Air-ground collaboration offers a synergistic approach to addressing such exploration challenges. In this paper, we demonstrate a system for large-scale exploration using a team of aerial and ground robots. Our system uses semantics as lingua franca, and relies on fully opportunistic co…
▽ More
One common and desirable application of robots is exploring potentially hazardous and unstructured environments. Air-ground collaboration offers a synergistic approach to addressing such exploration challenges. In this paper, we demonstrate a system for large-scale exploration using a team of aerial and ground robots. Our system uses semantics as lingua franca, and relies on fully opportunistic communications. We highlight the unique challenges from this approach, explain our system architecture and showcase lessons learned during our experiments. All our code is open-source, encouraging researchers to use it and build upon.
△ Less
Submitted 12 May, 2024;
originally announced May 2024.
-
On Existence of Latency Optimal Uncoded Storage Schemes in Geo-Distributed Data Storage Systems
Authors:
Srivathsa Acharya,
P. Vijay Kumar,
Viveck R. Cadambe
Abstract:
We consider the problem of geographically distributed data storage in a network of servers (or nodes) where the nodes are connected to each other via communication links having certain round-trip times (RTTs). Each node serves a specific set of clients, where a client can request for any of the files available in the distributed system. The parent node provides the requested file if available loca…
▽ More
We consider the problem of geographically distributed data storage in a network of servers (or nodes) where the nodes are connected to each other via communication links having certain round-trip times (RTTs). Each node serves a specific set of clients, where a client can request for any of the files available in the distributed system. The parent node provides the requested file if available locally; else it contacts other nodes that have the data needed to retrieve the requested file. This inter-node communication incurs a delay resulting in a certain latency in servicing the data request. The worst-case latency incurred at a servicing node and the system average latency are important performance metrics of a storage system, which depend not only on inter-node RTTs, but also on how the data is stored across the nodes. Data files could be placed in the nodes as they are, i.e., in uncoded fashion, or can be coded and placed. This paper provides the necessary and sufficient conditions for the existence of uncoded storage schemes that are optimal in terms of both per-node worst-case latency and system average latency. In addition, the paper provides efficient binary storage codes for a specific case where optimal uncoded schemes do not exist.
△ Less
Submitted 13 May, 2024; v1 submitted 10 May, 2024;
originally announced May 2024.
-
On Streaming Codes for Simultaneously Correcting Burst and Random Erasures
Authors:
Shobhit Bhatnagar,
Biswadip Chakraborty,
P. Vijay Kumar
Abstract:
Streaming codes are packet-level codes that recover dropped packets within a strict decoding-delay constraint. We study streaming codes over a sliding-window (SW) channel model which admits only those erasure patterns which allow either a single burst erasure of $\le b$ packets along with $\le e$ random packet erasures, or else, $\le a$ random packet erasures, in any sliding-window of $w$ time slo…
▽ More
Streaming codes are packet-level codes that recover dropped packets within a strict decoding-delay constraint. We study streaming codes over a sliding-window (SW) channel model which admits only those erasure patterns which allow either a single burst erasure of $\le b$ packets along with $\le e$ random packet erasures, or else, $\le a$ random packet erasures, in any sliding-window of $w$ time slots. We determine the optimal rate of a streaming code constructed via the popular diagonal embedding (DE) technique over such a SW channel under delay constraint $τ=(w-1)$ and provide an $O(w)$ field size code construction. For the case $e>1$, we show that it is not possible to significantly reduce this field size requirement, assuming the well-known MDS conjecture. We then provide a block code construction whose DE yields a streaming code achieving the rate derived above, over a field of size sub-linear in $w,$ for a family of parameters having $e=1.$ We show the field size optimality of this construction for some parameters, and near-optimality for others under a sparsity constraint. Additionally, we derive an upper-bound on the $d_{\text{min}}$ of a cyclic code and characterize cyclic codes which achieve this bound via their ability to simultaneously recover from burst and random erasures.
△ Less
Submitted 10 May, 2024;
originally announced May 2024.
-
On Streaming Codes for Burst and Random Errors
Authors:
Shobhit Bhatnagar,
P. Vijay Kumar
Abstract:
Streaming codes (SCs) are packet-level codes that recover erased packets within a strict decoding-delay deadline. Streaming codes for various packet erasure channel models such as sliding-window (SW) channel models that admit random or burst erasures in any SW of a fixed length have been studied in the literature, and the optimal rate as well as rate-optimal code constructions of SCs over such cha…
▽ More
Streaming codes (SCs) are packet-level codes that recover erased packets within a strict decoding-delay deadline. Streaming codes for various packet erasure channel models such as sliding-window (SW) channel models that admit random or burst erasures in any SW of a fixed length have been studied in the literature, and the optimal rate as well as rate-optimal code constructions of SCs over such channel models are known. In this paper, we study error-correcting streaming codes ($\text{SC}_{\text{ERR}}$s), i.e., packet-level codes which recover erroneous packets within a delay constraint. We study $\text{SC}_{\text{ERR}}$s for two classes of SW channel models, one that admits random packet errors, and another that admits multiple bursts of packet errors, in any SW of a fixed length. For the case of random packet errors, we establish the equivalence of an $\text{SC}_{\text{ERR}}$ and a corresponding SC that recovers from random packet erasures, thus determining the optimal rate of an $\text{SC}_{\text{ERR}}$ for this setting, and providing a rate-optimal code construction for all parameters. We then focus on SCs that recover from multiple erasure bursts and derive a rate-upper-bound for such SCs. We show the necessity of a divisibility constraint for the existence of an SC constructed by the popular diagonal embedding technique, that achieves this rate-bound under a stringent delay requirement. We then show that a construction known in the literature achieves this rate-bound when the divisibility constraint is met. We further show the equivalence of the SCs considered and $\text{SC}_{\text{ERR}}$s for the setting of multiple error bursts, under a stringent delay requirement.
△ Less
Submitted 10 May, 2024;
originally announced May 2024.
-
Long-range magnetic order in CePdAl$_3$ enabled by orthorhombic deformation
Authors:
M. Stekiel,
P. Čermák,
C. Franz,
M. Meven,
D. Legut,
W. Simeth,
U. B. Hansen,
B. Fåk,
S. Weber,
R. Schönmann,
V. Kumar,
K. Nemkovski,
H. Deng,
A. Bauer,
C. Pfleiderer,
A. Schneidewind
Abstract:
We investigate the effect of structural deformation on the magnetic properties of orthorhombic CePdAl$_3$ in relation to its tetragonal polymorph. Utilizing x-ray and neutron diffraction we establish that the crystal structure has the $Cmcm$ space group symmetry and exhibits pseudo-tetragonal twinning. According to density-functional calculations the tetragonal-orthorhombic deformation mechanism h…
▽ More
We investigate the effect of structural deformation on the magnetic properties of orthorhombic CePdAl$_3$ in relation to its tetragonal polymorph. Utilizing x-ray and neutron diffraction we establish that the crystal structure has the $Cmcm$ space group symmetry and exhibits pseudo-tetragonal twinning. According to density-functional calculations the tetragonal-orthorhombic deformation mechanism has its grounds in relatively small free enthalpy difference between the polymorphs, allowing either phase to be quenched and fully accounts for the twinned microstructure of the orthorhombic phase. Neutron diffraction measurements show that orthorhombic CePdAl$_3$ establishes long-range magnetic order below $T_\mathrm{N}$=5.29 (5) K characterized by a collinear, antiferromagnetic arrangement of magnetic moments. Magnetic anisotropies of orthorhombic CePdAl$_3$ arise from strong spin-orbit coupling as evidenced by the crystal-field splitting of the $4f$ multiplet, fully characterised with neutron spectroscopy. We discuss the potential mechanism of frustration posed by antiferromagnetic interactions between nearest neighbours in the tetragonal phase, which hinders the formation of long-range magnetic order in tetragonal CePdAl$_3$. We propose that orthorhombic deformation releases the frustration and allows for long-range magnetic order.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Measurement of gravitational acceleration in a single laser operated atomic fountain
Authors:
Kavish Bhardwaj,
S. Singh,
S. P. Ram,
B. Jain,
Vijay Kumar,
Ayukt Pathak,
Shradha Tiwari,
V. B. Tiwari,
S. R. Mishra
Abstract:
We present measurements on Earth's gravitational acceleration (g) using an in-house developed cold atom gravimeter (CAG) in an atomic fountain geometry. In the setup, the laser cooled $^{87}Rb$ atoms are launched vertically up in the fountain geometry and Doppler sensitive two-photon Raman pulse atom interferometry is applied to detect the gravitational acceleration experienced by the atoms. Using…
▽ More
We present measurements on Earth's gravitational acceleration (g) using an in-house developed cold atom gravimeter (CAG) in an atomic fountain geometry. In the setup, the laser cooled $^{87}Rb$ atoms are launched vertically up in the fountain geometry and Doppler sensitive two-photon Raman pulse atom interferometry is applied to detect the gravitational acceleration experienced by the atoms. Using our gravimeter setup, we have measured the local value of 'g' in our laboratory with sensitivity of 621 $μ$Gal for integration time of 1350 s.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Computational complexity and quantum interpretations
Authors:
Vivek Kumar,
M. P. Singh,
R. Srikanth
Abstract:
In computational complexity theory, it remains to be understood whether $\textbf{BQP}$ is the same as $\textbf{BPP}$. Prima facie, one would expect that this mathematical question is quite unrelated to the foundational question of whether the quantum state is an element of reality or of the observer's knowledge. By contrast, here we argue that the complexity of computation in a physical theory may…
▽ More
In computational complexity theory, it remains to be understood whether $\textbf{BQP}$ is the same as $\textbf{BPP}$. Prima facie, one would expect that this mathematical question is quite unrelated to the foundational question of whether the quantum state is an element of reality or of the observer's knowledge. By contrast, here we argue that the complexity of computation in a physical theory may constrain its physical interpretation. Specifically in the quantum case, we argue that a subjective interpretation of the quantum mechanics favors the proposition $\textbf{BQP} = \textbf{BPP}$. Therefore, if $\textbf{BPP} \subset \textbf{BQP}$, then a realist interpretation of quantum mechanics would be favored.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
PhilHumans: Benchmarking Machine Learning for Personal Health
Authors:
Vadim Liventsev,
Vivek Kumar,
Allmin Pradhap Singh Susaiyah,
Zixiu Wu,
Ivan Rodin,
Asfand Yaar,
Simone Balloccu,
Marharyta Beraziuk,
Sebastiano Battiato,
Giovanni Maria Farinella,
Aki Härmä,
Rim Helaoui,
Milan Petkovic,
Diego Reforgiato Recupero,
Ehud Reiter,
Daniele Riboni,
Raymond Sterling
Abstract:
The use of machine learning in Healthcare has the potential to improve patient outcomes as well as broaden the reach and affordability of Healthcare. The history of other application areas indicates that strong benchmarks are essential for the development of intelligent systems. We present Personal Health Interfaces Leveraging HUman-MAchine Natural interactions (PhilHumans), a holistic suite of be…
▽ More
The use of machine learning in Healthcare has the potential to improve patient outcomes as well as broaden the reach and affordability of Healthcare. The history of other application areas indicates that strong benchmarks are essential for the development of intelligent systems. We present Personal Health Interfaces Leveraging HUman-MAchine Natural interactions (PhilHumans), a holistic suite of benchmarks for machine learning across different Healthcare settings - talk therapy, diet coaching, emergency care, intensive care, obstetric sonography - as well as different learning settings, such as action anticipation, timeseries modeling, insight mining, language modeling, computer vision, reinforcement learning and program synthesis
△ Less
Submitted 16 May, 2024; v1 submitted 4 May, 2024;
originally announced May 2024.
-
Pocket Schlieren: a background oriented schlieren imaging platform on a smartphone
Authors:
Diganta Rabha,
Vimod Kumar,
Akshay Kumar,
Dinesh Saini,
Manish Kumar
Abstract:
Background-oriented schlieren (BOS) is a powerful technique for flow visualization. Nevertheless, the widespread dissemination of BOS is impeded by its dependence on scientific cameras, computing hardware, and dedicated analysis software. In this work, we aim to democratize BOS by providing a smartphone based scientific tool called "Pocket Schlieren". Pocket Schlieren enables users to directly cap…
▽ More
Background-oriented schlieren (BOS) is a powerful technique for flow visualization. Nevertheless, the widespread dissemination of BOS is impeded by its dependence on scientific cameras, computing hardware, and dedicated analysis software. In this work, we aim to democratize BOS by providing a smartphone based scientific tool called "Pocket Schlieren". Pocket Schlieren enables users to directly capture, process, and visualize flow phenomena on their smartphones. The underlying algorithm incorporates consecutive frame subtraction (CFS) and optical flow (OF) techniques to compute the density gradients inside a flow. It performs on both engineered and natural background patterns. Using Pocket Schlieren, we successfully visualized the flow produced from a burning candle flame, butane lighter, hot soldering iron, room heater, water immersion heating rod, and a large outdoor butane flame. Pocket Schlieren promises to serve as a frugal yet potent instrument for scientific and educational purposes. We have made it publicly available at doi: 10.5281/zenodo.10949271.
△ Less
Submitted 15 April, 2024;
originally announced April 2024.
-
2D Monolayer Molybdenum (IV) Telluride TMD: An Efficient Electrocatalyst for Hydrogen Evolution Reaction
Authors:
Vikash Kumar,
Srimanta Pakhira
Abstract:
An electrocatalyst is needed to efficiently lower the reaction barriers to produce hydrogen through the H2 evolution reaction (HER). Recently, two-dimensional transition metal dichalcogenides (2D TMDs), such as the pure 2D monolayer MoTe2 TMD, have become attractive materials for HER. Using the first principle-based hybrid DFT-D method, we have computationally designed a pure 2D monolayer MoTe2 TM…
▽ More
An electrocatalyst is needed to efficiently lower the reaction barriers to produce hydrogen through the H2 evolution reaction (HER). Recently, two-dimensional transition metal dichalcogenides (2D TMDs), such as the pure 2D monolayer MoTe2 TMD, have become attractive materials for HER. Using the first principle-based hybrid DFT-D method, we have computationally designed a pure 2D monolayer MoTe2 TMD and examined its structural and electronic properties and electrocatalytic efficacy towards HER. A non-periodic finite molecular cluster model Mo10Te21 system was employed to explore the feasibility of both the Volmer-Heyrovsky and Volmer-Tafel reaction mechanisms for the HER. The solvent-phase calculations of the HER on the 2D monolayer MoTe2 TMD demonstrate that this material can effectively undergo either Volmer-Heyrovsky or Volmer-Tafel reaction pathways. This conclusion is supported by our determination of low reaction barriers for the H*-migration, Heyrovsky, and Tafel transition states (TSs), which were found to be approximately 9.80, 12.55, and 5.29 kcal.mol-1, respectively. These results highlight the potential utility of MoTe2 TMD as a promising electrocatalyst for HER. The unusual electrocatalytic activity of the pure 2D monolayer MoTe2 TMD is evidenced by its ability to significantly reduce reaction barriers, achieving impressive turnover frequency during the Heyrovsky and Tafel reaction steps, respectively. Additionally, it demonstrates a remarkably low Tafel slope of 29.58 mV.dec-1. Further exploration of its potential applications in electrocatalysis is warranted. The present work provides valuable insights into the atomic modulation of active sites for enhanced electrocatalytic performance towards HER, paving a way for designing advanced non-noble metal free electrocatalysts.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
Fewer Truncations Improve Language Modeling
Authors:
Hantian Ding,
Zijian Wang,
Giovanni Paolini,
Varun Kumar,
Anoop Deoras,
Dan Roth,
Stefano Soatto
Abstract:
In large language model training, input documents are typically concatenated together and then split into sequences of equal length to avoid padding tokens. Despite its efficiency, the concatenation approach compromises data integrity -- it inevitably breaks many documents into incomplete pieces, leading to excessive truncations that hinder the model from learning to compose logically coherent and…
▽ More
In large language model training, input documents are typically concatenated together and then split into sequences of equal length to avoid padding tokens. Despite its efficiency, the concatenation approach compromises data integrity -- it inevitably breaks many documents into incomplete pieces, leading to excessive truncations that hinder the model from learning to compose logically coherent and factually consistent content that is grounded on the complete context. To address the issue, we propose Best-fit Packing, a scalable and efficient method that packs documents into training sequences through length-aware combinatorial optimization. Our method completely eliminates unnecessary truncations while retaining the same training efficiency as concatenation. Empirical results from both text and code pre-training show that our method achieves superior performance (e.g., relatively +4.7% on reading comprehension; +16.8% in context following; and +9.2% on program synthesis), and reduces closed-domain hallucination effectively by up to 58.3%.
△ Less
Submitted 2 May, 2024; v1 submitted 16 April, 2024;
originally announced April 2024.
-
Higher order hypoelliptic damped wave equations on graded Lie groups with data from negative order Sobolev spaces
Authors:
Aparajita Dasgupta,
Vishvesh Kumar,
Shyam Swarup Mondal,
Michael Ruzhansky
Abstract:
Let $\mathbb G$ be a graded Lie group with homogeneous dimension $Q$. In this paper, we study the Cauchy problem for a semilinear hypoelliptic damped wave equation involving a positive Rockland operator $\mathcal{R}$ of homogeneous degree $ν\geq 2$ on $\mathbb G$ with power type nonlinearity $|u|^p$ and initial data taken from negative order homogeneous Sobolev space $\dot H^{-γ}(\mathbb G), γ>0$.…
▽ More
Let $\mathbb G$ be a graded Lie group with homogeneous dimension $Q$. In this paper, we study the Cauchy problem for a semilinear hypoelliptic damped wave equation involving a positive Rockland operator $\mathcal{R}$ of homogeneous degree $ν\geq 2$ on $\mathbb G$ with power type nonlinearity $|u|^p$ and initial data taken from negative order homogeneous Sobolev space $\dot H^{-γ}(\mathbb G), γ>0$. In the framework of Sobolev spaces of negative order, we prove that $p_{\text{Crit}}(Q, γ, ν) :=1+\frac{2ν}{Q+2γ}$ is the new critical exponent for $γ\in (0, \frac{Q}{2})$. More precisely, we show the global-in-time existence of small data Sobolev solutions of lower regularity for $p>p_{\text{Crit}}(Q, γ, ν) $ in the energy evolution space $ \mathcal{C}\left([0, T], H^{s}(\mathbb{G})\right), s\in (0, 1]$. Under certain conditions on the initial data, we also prove a finite-time blow-up of weak solutions for $1<p<p_{\text{Crit}}(Q, γ, ν)$. Furthermore, to precisely characterize the blow-up time, we derive sharp upper bound and lower bound estimates for the lifespan in the subcritical cases. We emphasize that our results are also new, even in the setting of higher-order differential operators on $\mathbb{R}^n$, and more generally, on stratified Lie groups.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Multi-Robot Target Tracking with Sensing and Communication Danger Zones
Authors:
Jiazhen Liu,
Peihan Li,
Yuwei Wu,
Gaurav S. Sukhatme,
Vijay Kumar,
Lifeng Zhou
Abstract:
Multi-robot target tracking finds extensive applications in different scenarios, such as environmental surveillance and wildfire management, which require the robustness of the practical deployment of multi-robot systems in uncertain and dangerous environments. Traditional approaches often focus on the performance of tracking accuracy with no modeling and assumption of the environments, neglecting…
▽ More
Multi-robot target tracking finds extensive applications in different scenarios, such as environmental surveillance and wildfire management, which require the robustness of the practical deployment of multi-robot systems in uncertain and dangerous environments. Traditional approaches often focus on the performance of tracking accuracy with no modeling and assumption of the environments, neglecting potential environmental hazards which result in system failures in real-world deployments. To address this challenge, we investigate multi-robot target tracking in the adversarial environment considering sensing and communication attacks with uncertainty. We design specific strategies to avoid different danger zones and proposed a multi-agent tracking framework under the perilous environment. We approximate the probabilistic constraints and formulate practical optimization strategies to address computational challenges efficiently. We evaluate the performance of our proposed methods in simulations to demonstrate the ability of robots to adjust their risk-aware behaviors under different levels of environmental uncertainty and risk confidence. The proposed method is further validated via real-world robot experiments where a team of drones successfully track dynamic ground robots while being risk-aware of the sensing and/or communication danger zones.
△ Less
Submitted 20 June, 2024; v1 submitted 11 April, 2024;
originally announced April 2024.
-
Attention based End to end network for Offline Writer Identification on Word level data
Authors:
Vineet Kumar,
Suresh Sundaram
Abstract:
Writer identification due to its widespread application in various fields has gained popularity over the years. In scenarios where optimum handwriting samples are available, whether they be in the form of a single line, a sentence, or an entire page, writer identification algorithms have demonstrated noteworthy levels of accuracy. However, in scenarios where only a limited number of handwritten sa…
▽ More
Writer identification due to its widespread application in various fields has gained popularity over the years. In scenarios where optimum handwriting samples are available, whether they be in the form of a single line, a sentence, or an entire page, writer identification algorithms have demonstrated noteworthy levels of accuracy. However, in scenarios where only a limited number of handwritten samples are available, particularly in the form of word images, there is a significant scope for improvement.
In this paper, we propose a writer identification system based on an attention-driven Convolutional Neural Network (CNN). The system is trained utilizing image segments, known as fragments, extracted from word images, employing a pyramid-based strategy. This methodology enables the system to capture a comprehensive representation of the data, encompassing both fine-grained details and coarse features across various levels of abstraction. These extracted fragments serve as the training data for the convolutional network, enabling it to learn a more robust representation compared to traditional convolution-based networks trained on word images. Additionally, the paper explores the integration of an attention mechanism to enhance the representational power of the learned features. The efficacy of the proposed algorithm is evaluated on three benchmark databases, demonstrating its proficiency in writer identification tasks, particularly in scenarios with limited access to handwriting data.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
DaF-BEVSeg: Distortion-aware Fisheye Camera based Bird's Eye View Segmentation with Occlusion Reasoning
Authors:
Senthil Yogamani,
David Unger,
Venkatraman Narayanan,
Varun Ravi Kumar
Abstract:
Semantic segmentation is an effective way to perform scene understanding. Recently, segmentation in 3D Bird's Eye View (BEV) space has become popular as its directly used by drive policy. However, there is limited work on BEV segmentation for surround-view fisheye cameras, commonly used in commercial vehicles. As this task has no real-world public dataset and existing synthetic datasets do not han…
▽ More
Semantic segmentation is an effective way to perform scene understanding. Recently, segmentation in 3D Bird's Eye View (BEV) space has become popular as its directly used by drive policy. However, there is limited work on BEV segmentation for surround-view fisheye cameras, commonly used in commercial vehicles. As this task has no real-world public dataset and existing synthetic datasets do not handle amodal regions due to occlusion, we create a synthetic dataset using the Cognata simulator comprising diverse road types, weather, and lighting conditions. We generalize the BEV segmentation to work with any camera model; this is useful for mixing diverse cameras. We implement a baseline by applying cylindrical rectification on the fisheye images and using a standard LSS-based BEV segmentation model. We demonstrate that we can achieve better performance without undistortion, which has the adverse effects of increased runtime due to pre-processing, reduced field-of-view, and resampling artifacts. Further, we introduce a distortion-aware learnable BEV pooling strategy that is more effective for the fisheye cameras. We extend the model with an occlusion reasoning module, which is critical for estimating in BEV space. Qualitative performance of DaF-BEVSeg is showcased in the video at https://streamable.com/ge4v51.
△ Less
Submitted 9 April, 2024;
originally announced April 2024.
-
Numerical modelling of flame spread over thin circular ducts
Authors:
Vipin Kumar,
Kambam Naresh,
Amit Kumar
Abstract:
This paper presents a numerical investigation into the phenomenon of flame spread over thin circular ducts in normal gravity and microgravity environments. Flame spread over such geometry is of significant interest due to its relevance in various practical applications, including tubes for flow purpose in medical system, fire safety in spacecrafts, ducts as well as wiring tubes. This study compris…
▽ More
This paper presents a numerical investigation into the phenomenon of flame spread over thin circular ducts in normal gravity and microgravity environments. Flame spread over such geometry is of significant interest due to its relevance in various practical applications, including tubes for flow purpose in medical system, fire safety in spacecrafts, ducts as well as wiring tubes. This study comprises of a comprehensive investigation of key parameters affecting flame spread rate, including fuel radius and opposed flow speed in normal gravity and microgravity environments. A 2-D axisymmetric flame spread model accounted for char and numerical simulations were performed which revealed valuable insights into the underlying mechanisms governing flame spread over such geometry. The results computed from the numerical model is compared with the experimentally observed flame spread rate to validate the numerical model which can be used to gain a comprehensive understanding of the underlying physical phenomena. As the radius of circular duct increases the flame spread rate increases both in normal gravity and microgravity environments. The conduction heat feedback and radiation heat gain coming from hot char through gas phase at inner core region are the two major mechanisms which controls the flame spread phenomena over the circular duct fuels. The flame spread rate at different flow ranging from quiescent (0 cm/s) to 30 cm/s is also evaluated and 21 % oxygen and found a non-monotonic increasing decreasing trend of flame spread rate at different opposed flow speed in both normal gravity and microgravity environments.
△ Less
Submitted 1 April, 2024;
originally announced April 2024.
-
An Active Perception Game for Robust Autonomous Exploration
Authors:
Siming He,
Yuezhan Tao,
Igor Spasojevic,
Vijay Kumar,
Pratik Chaudhari
Abstract:
We formulate active perception for an autonomous agent that explores an unknown environment as a two-player zero-sum game: the agent aims to maximize information gained from the environment while the environment aims to minimize the information gained by the agent. In each episode, the environment reveals a set of actions with their potentially erroneous information gain. In order to select the be…
▽ More
We formulate active perception for an autonomous agent that explores an unknown environment as a two-player zero-sum game: the agent aims to maximize information gained from the environment while the environment aims to minimize the information gained by the agent. In each episode, the environment reveals a set of actions with their potentially erroneous information gain. In order to select the best action, the robot needs to recover the true information gain from the erroneous one. The robot does so by minimizing the discrepancy between its estimate of information gain and the true information gain it observes after taking the action. We propose an online convex optimization algorithm that achieves sub-linear expected regret $O(T^{3/4})$ for estimating the information gain. We also provide a bound on the regret of active perception performed by any (near-)optimal prediction and trajectory selection algorithms. We evaluate this approach using semantic neural radiance fields (NeRFs) in simulated realistic 3D environments to show that the robot can discover up to 12% more objects using the improved estimate of the information gain. On the M3ED dataset, the proposed algorithm reduced the error of information gain prediction in occupancy map by over 67%. In real-world experiments using occupancy maps on a Jackal ground robot, we show that this approach can calculate complicated trajectories that efficiently explore all occluded regions.
△ Less
Submitted 31 March, 2024;
originally announced April 2024.
-
Trajectory Optimization with Global Yaw Parameterization for Field-of-View Constrained Autonomous Flight
Authors:
Yuwei Wu,
Yuezhan Tao,
Igor Spasojevic,
Vijay Kumar
Abstract:
Trajectory generation for quadrotors with limited field-of-view sensors has numerous applications such as aerial exploration, coverage, inspection, videography, and target tracking. Most previous works simplify the task of optimizing yaw trajectories by either aligning the heading of the robot with its velocity, or potentially restricting the feasible space of candidate trajectories by using a lim…
▽ More
Trajectory generation for quadrotors with limited field-of-view sensors has numerous applications such as aerial exploration, coverage, inspection, videography, and target tracking. Most previous works simplify the task of optimizing yaw trajectories by either aligning the heading of the robot with its velocity, or potentially restricting the feasible space of candidate trajectories by using a limited yaw domain to circumvent angular singularities. In this paper, we propose a novel \textit{global} yaw parameterization method for trajectory optimization that allows a 360-degree yaw variation as demanded by the underlying algorithm. This approach effectively bypasses inherent singularities by including supplementary quadratic constraints and transforming the final decision variables into the desired state representation. This method significantly reduces the needed control effort, and improves optimization feasibility. Furthermore, we apply the method to several examples of different applications that require jointly optimizing over both the yaw and position trajectories. Ultimately, we present a comprehensive numerical analysis and evaluation of our proposed method in both simulation and real-world experiments.
△ Less
Submitted 25 March, 2024;
originally announced March 2024.
-
TrustAI at SemEval-2024 Task 8: A Comprehensive Analysis of Multi-domain Machine Generated Text Detection Techniques
Authors:
Ashok Urlana,
Aditya Saibewar,
Bala Mallikarjunarao Garlapati,
Charaka Vinayak Kumar,
Ajeet Kumar Singh,
Srinivasa Rao Chalamala
Abstract:
The Large Language Models (LLMs) exhibit remarkable ability to generate fluent content across a wide spectrum of user queries. However, this capability has raised concerns regarding misinformation and personal information leakage. In this paper, we present our methods for the SemEval2024 Task8, aiming to detect machine-generated text across various domains in both mono-lingual and multi-lingual co…
▽ More
The Large Language Models (LLMs) exhibit remarkable ability to generate fluent content across a wide spectrum of user queries. However, this capability has raised concerns regarding misinformation and personal information leakage. In this paper, we present our methods for the SemEval2024 Task8, aiming to detect machine-generated text across various domains in both mono-lingual and multi-lingual contexts. Our study comprehensively analyzes various methods to detect machine-generated text, including statistical, neural, and pre-trained model approaches. We also detail our experimental setup and perform a in-depth error analysis to evaluate the effectiveness of these methods. Our methods obtain an accuracy of 86.9\% on the test set of subtask-A mono and 83.7\% for subtask-B. Furthermore, we also highlight the challenges and essential factors for consideration in future studies.
△ Less
Submitted 25 March, 2024;
originally announced March 2024.
-
Impact of Video Compression Artifacts on Fisheye Camera Visual Perception Tasks
Authors:
Madhumitha Sakthi,
Louis Kerofsky,
Varun Ravi Kumar,
Senthil Yogamani
Abstract:
Autonomous driving systems require extensive data collection schemes to cover the diverse scenarios needed for building a robust and safe system. The data volumes are in the order of Exabytes and have to be stored for a long period of time (i.e., more than 10 years of the vehicle's life cycle). Lossless compression doesn't provide sufficient compression ratios, hence, lossy video compression has b…
▽ More
Autonomous driving systems require extensive data collection schemes to cover the diverse scenarios needed for building a robust and safe system. The data volumes are in the order of Exabytes and have to be stored for a long period of time (i.e., more than 10 years of the vehicle's life cycle). Lossless compression doesn't provide sufficient compression ratios, hence, lossy video compression has been explored. It is essential to prove that lossy video compression artifacts do not impact the performance of the perception algorithms. However, there is limited work in this area to provide a solid conclusion. In particular, there is no such work for fisheye cameras, which have high radial distortion and where compression may have higher artifacts. Fisheye cameras are commonly used in automotive systems for 3D object detection task. In this work, we provide the first analysis of the impact of standard video compression codecs on wide FOV fisheye camera images. We demonstrate that the achievable compression with negligible impact depends on the dataset and temporal prediction of the video codec. We propose a radial distortion-aware zonal metric to evaluate the performance of artifacts in fisheye images. In addition, we present a novel method for estimating affine mode parameters of the latest VVC codec, and suggest some areas for improvement in video codecs for the application to fisheye imagery.
△ Less
Submitted 24 March, 2024;
originally announced March 2024.