subscribe to arXiv mailings

Maker-Breaker resolving game played on corona products of graphs

Authors: Tijo James, Sandi Klavžar, Dorota Kuziak, Savitha K S, Ambat Vijayakumar

Abstract: The Maker-Breaker resolving game is a game played on a graph $G$ by Resolver and Spoiler. The players taking turns alternately in which each player selects a not yet played vertex of $G$. The goal of Resolver is to select all the vertices in a resolving set of $G$, while that of Spoiler is to prevent this from happening. The outcome $o(G)$ of the game played is one of $\mathcal{R}$, $\mathcal{S}$,… ▽ More The Maker-Breaker resolving game is a game played on a graph $G$ by Resolver and Spoiler. The players taking turns alternately in which each player selects a not yet played vertex of $G$. The goal of Resolver is to select all the vertices in a resolving set of $G$, while that of Spoiler is to prevent this from happening. The outcome $o(G)$ of the game played is one of $\mathcal{R}$, $\mathcal{S}$, and $\mathcal{N}$, where $o(G)=\mathcal{R}$ (resp.\ $o(G)=\mathcal{S}$), if Resolver (resp.\ Spoiler) has a winning strategy no matter who starts the game, and $o(G)=\mathcal{N}$, if the first player has a winning strategy. In this paper, the game is investigated on corona products $G\odot H$ of graphs $G$ and $H$. It is proved that if $o(H)\in\{\mathcal{N}, \mathcal{S}\}$, then $o(G\odot H) = \mathcal{S}$. No such result is possible under the assumption $o(H) = \mathcal{R}$. It is proved that $o(G\odot P_k) = \mathcal{S}$ if $k=5$, otherwise $o(G\odot P_k) = \mathcal{R}$, and that $o(G\odot C_k) = \mathcal{S}$ if $k=3$, otherwise $o(G\odot C_k) = \mathcal{R}$. Several results are also given on corona products in which the second factor is of diameter at most $2$. △ Less

Submitted 21 June, 2024; originally announced June 2024.

arXiv:2406.00010 [pdf, other]

EnterpriseEM: Fine-tuned Embeddings for Enterprise Semantic Search

Authors: Kamalkumar Rathinasamy, Jayarama Nettar, Amit Kumar, Vishal Manchanda, Arun Vijayakumar, Ayush Kataria, Venkateshprasanna Manjunath, Chidambaram GS, Jaskirat Singh Sodhi, Shoeb Shaikh, Wasim Akhtar Khan, Prashant Singh, Tanishq Dattatray Ige, Vipin Tiwari, Rajab Ali Mondal, Harshini K, S Reka, Chetana Amancharla, Faiz ur Rahman, Harikrishnan P A, Indraneel Saha, Bhavya Tiwary, Navin Shankar Patel, Pradeep T S, Balaji A J , et al. (2 additional authors not shown)

Abstract: Enterprises grapple with the significant challenge of managing proprietary unstructured data, hindering efficient information retrieval. This has led to the emergence of AI-driven information retrieval solutions, designed to adeptly extract relevant insights to address employee inquiries. These solutions often leverage pre-trained embedding models and generative models as foundational components.… ▽ More Enterprises grapple with the significant challenge of managing proprietary unstructured data, hindering efficient information retrieval. This has led to the emergence of AI-driven information retrieval solutions, designed to adeptly extract relevant insights to address employee inquiries. These solutions often leverage pre-trained embedding models and generative models as foundational components. While pre-trained embeddings may exhibit proximity or disparity based on their original training objectives, they might not fully align with the unique characteristics of enterprise-specific data, leading to suboptimal alignment with the retrieval goals of enterprise environments. In this paper, we propose a methodology to fine-tune pre-trained embedding models specifically for enterprise environments. By adapting the embeddings to better suit the retrieval tasks prevalent in enterprises, we aim to enhance the performance of information retrieval solutions. We discuss the process of fine-tuning, its effect on retrieval accuracy, and the potential benefits for enterprise information management. Our findings demonstrate the efficacy of fine-tuned embedding models in improving the precision and relevance of search results in enterprise settings. △ Less

Submitted 18 May, 2024; originally announced June 2024.

ACM Class: I.2.7

arXiv:2403.05530 [pdf, other]

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February version on the great majority of capabilities and benchmarks; (2) Gemini 1.5 Flash, a more lightweight variant designed for efficiency with minimal regression in quality. Gemini 1.5 models achieve near-perfect recall on long-context retrieval tasks across modalities, improve the state-of-the-art in long-document QA, long-video QA and long-context ASR, and match or surpass Gemini 1.0 Ultra's state-of-the-art performance across a broad set of benchmarks. Studying the limits of Gemini 1.5's long-context ability, we find continued improvement in next-token prediction and near-perfect retrieval (>99%) up to at least 10M tokens, a generational leap over existing models such as Claude 3.0 (200k) and GPT-4 Turbo (128k). Finally, we highlight real-world use cases, such as Gemini 1.5 collaborating with professionals on completing their tasks achieving 26 to 75% time savings across 10 different job categories, as well as surprising new capabilities of large language models at the frontier; when given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content. △ Less

Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

arXiv:2401.10971 [pdf, ps, other]

Searching for regular, triangle-distinct graphs

Authors: Dragan Stevanović, Mohammad Ghebleh, Gilles Caporossi, Ambat Vijayakumar, Sanja Stevanović

Abstract: The triangle-degree of a vertex v of a simple graph G is the number of triangles in G that contain v. A simple graph is triangle-distinct if all its vertices have distinct triangle-degrees. Berikkyzy et al. [Discrete Math. 347 (2024) 113695] recently asked whether there exists a regular graph that is triangle-distinct. Here we showcase the examples of regular, triangle-distinct graphs with orders… ▽ More The triangle-degree of a vertex v of a simple graph G is the number of triangles in G that contain v. A simple graph is triangle-distinct if all its vertices have distinct triangle-degrees. Berikkyzy et al. [Discrete Math. 347 (2024) 113695] recently asked whether there exists a regular graph that is triangle-distinct. Here we showcase the examples of regular, triangle-distinct graphs with orders between 21 and 27, and report on the methodology used to find them. △ Less

Submitted 19 January, 2024; originally announced January 2024.

Comments: 7 pages + 11 pages appendix with examples of regular, triangle-distinct graphs

MSC Class: 05C07

arXiv:2401.00309 [pdf, other]

High-statistics measurement of Collins and Sivers asymmetries for transversely polarised deuterons

Authors: G. D. Alexeev, M. G. Alexeev, C. Alice, A. Amoroso, V. Andrieux, V. Anosov, S. Asatryan, K. Augsten, W. Augustyniak, C. D. R. Azevedo, B. Badelek, J. Barth, R. Beck, J. Beckers, Y. Bedfer, J. Bernhard, M. Bodlak, F. Bradamante, A. Bressan, W. -C. Chang, C. Chatterjee, M. Chiosso, A. G. Chumakov, S. -U. Chung, A. Cicuttin , et al. (162 additional authors not shown)

Abstract: New results are presented on a high-statistics measurement of Collins and Sivers asymmetries of charged hadrons produced in deep inelastic scattering of muons on a transversely polarised $^6$LiD target. The data were taken in 2022 with the COMPASS spectrometer using the 160 \gevv\ muon beam at CERN, balancing the existing data on transversely polarised proton targets. The first results from about… ▽ More New results are presented on a high-statistics measurement of Collins and Sivers asymmetries of charged hadrons produced in deep inelastic scattering of muons on a transversely polarised $^6$LiD target. The data were taken in 2022 with the COMPASS spectrometer using the 160 \gevv\ muon beam at CERN, balancing the existing data on transversely polarised proton targets. The first results from about two-thirds of the new data have total uncertainties smaller by up to a factor of three compared to the previous deuteron measurements. Using all the COMPASS proton and deuteron results, both the transversity and the Sivers distribution functions of the $u$ and $d$ quark, as well as the tensor charge in the measured $x$-range are extracted. In particular, the accuracy of the $d$ quark results is significantly improved. △ Less

Submitted 30 December, 2023; originally announced January 2024.

Report number: CERN-EP-2023-308

arXiv:2312.17379 [pdf, other]

Final COMPASS results on the transverse-spin-dependent azimuthal asymmetries in the pion-induced Drell-Yan process

Authors: G. D. Alexeev, M. G. Alexeev, C. Alice, A. Amoroso, V. Andrieux, V. Anosov, K. Augsten, W. Augustyniak, C. D. R. Azevedo, B. Badelek, J. Barth, R. Beck, J. Beckers, Y. Bedfer, J. Bernhard, M. Bodlak, F. Bradamante, A. Bressan, W. -C. Chang, C. Chatterjee, M. Chiosso, A. G. Chumakov, S. -U. Chung, A. Cicuttin, P. M. M. Correia , et al. (159 additional authors not shown)

Abstract: The COMPASS Collaboration performed measurements of the Drell-Yan process in 2015 and 2018 using a 190 GeV/c $π^{-}$ beam impinging on a transversely polarised ammonia target. Combining the data of both years, we present final results on the amplitudes of the five azimuthal modulations in the dimuon production cross section. Three of these transverse-spin-dependent azimuthal asymmetries (TSAs) pro… ▽ More The COMPASS Collaboration performed measurements of the Drell-Yan process in 2015 and 2018 using a 190 GeV/c $π^{-}$ beam impinging on a transversely polarised ammonia target. Combining the data of both years, we present final results on the amplitudes of the five azimuthal modulations in the dimuon production cross section. Three of these transverse-spin-dependent azimuthal asymmetries (TSAs) probe the nucleon leading-twist Sivers, transversity, and pretzelosity transverse-momentum dependent (TMD) parton distribution functions (PDFs). The other two are induced by subleading effects. These TSAs provide unique new inputs for the study of the nucleon TMD PDFs and their universality properties. In particular, the Sivers TSA observed in this measurement is consistent with the fundamental QCD prediction of a sign change of naive time-reversal-odd TMD PDFs when comparing the Drell-Yan process with semi-inclusive measurements of deep inelastic scattering. Also, within the context of model predictions, the observed transversity TSA is consistent with the expectation of a sign change for the Boer-Mulders function. △ Less

Submitted 28 December, 2023; originally announced December 2023.

Report number: CERN-EP-2023-307

arXiv:2312.11805 [pdf, other]

Gemini: A Family of Highly Capable Multimodal Models

Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultra model advances the state of the art in 30 of 32 of these benchmarks - notably being the first model to achieve human-expert performance on the well-studied exam benchmark MMLU, and improving the state of the art in every one of the 20 multimodal benchmarks we examined. We believe that the new capabilities of the Gemini family in cross-modal reasoning and language understanding will enable a wide variety of use cases. We discuss our approach toward post-training and deploying Gemini models responsibly to users through services including Gemini, Gemini Advanced, Google AI Studio, and Cloud Vertex AI. △ Less

Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

arXiv:2310.15113 [pdf]

Counting the Bugs in ChatGPT's Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model

Authors: Leonie Weissweiler, Valentin Hofmann, Anjali Kantharuban, Anna Cai, Ritam Dutt, Amey Hengle, Anubha Kabra, Atharva Kulkarni, Abhishek Vijayakumar, Haofei Yu, Hinrich Schütze, Kemal Oflazer, David R. Mortensen

Abstract: Large language models (LLMs) have recently reached an impressive level of linguistic capability, prompting comparisons with human language skills. However, there have been relatively few systematic inquiries into the linguistic capabilities of the latest generation of LLMs, and those studies that do exist (i) ignore the remarkable ability of humans to generalize, (ii) focus only on English, and (i… ▽ More Large language models (LLMs) have recently reached an impressive level of linguistic capability, prompting comparisons with human language skills. However, there have been relatively few systematic inquiries into the linguistic capabilities of the latest generation of LLMs, and those studies that do exist (i) ignore the remarkable ability of humans to generalize, (ii) focus only on English, and (iii) investigate syntax or semantics and overlook other capabilities that lie at the heart of human language, like morphology. Here, we close these gaps by conducting the first rigorous analysis of the morphological capabilities of ChatGPT in four typologically varied languages (specifically, English, German, Tamil, and Turkish). We apply a version of Berko's (1958) wug test to ChatGPT, using novel, uncontaminated datasets for the four examined languages. We find that ChatGPT massively underperforms purpose-built systems, particularly in English. Overall, our results -- through the lens of morphology -- cast a new light on the linguistic capabilities of ChatGPT, suggesting that claims of human-like language skills are premature and misleading. △ Less

Submitted 26 October, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

Comments: EMNLP 2023

arXiv:2303.02579 [pdf, other]

doi 10.1016/j.nuclphysa.2024.122874

The Present and Future of QCD

Authors: P. Achenbach, D. Adhikari, A. Afanasev, F. Afzal, C. A. Aidala, A. Al-bataineh, D. K. Almaalol, M. Amaryan, D. Androić, W. R. Armstrong, M. Arratia, J. Arrington, A. Asaturyan, E. C. Aschenauer, H. Atac, H. Avakian, T. Averett, C. Ayerbe Gayoso, X. Bai, K. N. Barish, N. Barnea, G. Basar, M. Battaglieri, A. A. Baty, I. Bautista , et al. (378 additional authors not shown)

Abstract: This White Paper presents the community inputs and scientific conclusions from the Hot and Cold QCD Town Meeting that took place September 23-25, 2022 at MIT, as part of the Nuclear Science Advisory Committee (NSAC) 2023 Long Range Planning process. A total of 424 physicists registered for the meeting. The meeting highlighted progress in Quantum Chromodynamics (QCD) nuclear physics since the 2015… ▽ More This White Paper presents the community inputs and scientific conclusions from the Hot and Cold QCD Town Meeting that took place September 23-25, 2022 at MIT, as part of the Nuclear Science Advisory Committee (NSAC) 2023 Long Range Planning process. A total of 424 physicists registered for the meeting. The meeting highlighted progress in Quantum Chromodynamics (QCD) nuclear physics since the 2015 LRP (LRP15) and identified key questions and plausible paths to obtaining answers to those questions, defining priorities for our research over the coming decade. In defining the priority of outstanding physics opportunities for the future, both prospects for the short (~ 5 years) and longer term (5-10 years and beyond) are identified together with the facilities, personnel and other resources needed to maximize the discovery potential and maintain United States leadership in QCD physics worldwide. This White Paper is organized as follows: In the Executive Summary, we detail the Recommendations and Initiatives that were presented and discussed at the Town Meeting, and their supporting rationales. Section 2 highlights major progress and accomplishments of the past seven years. It is followed, in Section 3, by an overview of the physics opportunities for the immediate future, and in relation with the next QCD frontier: the EIC. Section 4 provides an overview of the physics motivations and goals associated with the EIC. Section 5 is devoted to the workforce development and support of diversity, equity and inclusion. This is followed by a dedicated section on computing in Section 6. Section 7 describes the national need for nuclear data science and the relevance to QCD research. △ Less

Submitted 4 March, 2023; originally announced March 2023.

Comments: QCD Town Meeting White Paper, as submitted to 2023 NSAC LRP committee on Feb. 28, 2023

Journal ref: Nucl.Phys.A 1047 (2024) 122874

arXiv:2210.15019 [pdf, other]

Potential for definitive discovery of a 70 GeV dark matter WIMP with only second-order gauge couplings

Authors: Bailey Tallman, Alexandra Boone, Adhithya Vijayakumar, Fiona Lopez, Samuel Apata, Jehu Martinez, Roland Allen

Abstract: As astronomical observations and their interpretation improve, the case for cold dark matter (CDM) becomes increasingly persuasive. A particularly appealing version of CDM is a weakly interacting massive particle (WIMP) with a mass near the electroweak scale, which can naturally have the observed relic abundance after annihilation in the early universe. But in order for a WIMP to be consistent wit… ▽ More As astronomical observations and their interpretation improve, the case for cold dark matter (CDM) becomes increasingly persuasive. A particularly appealing version of CDM is a weakly interacting massive particle (WIMP) with a mass near the electroweak scale, which can naturally have the observed relic abundance after annihilation in the early universe. But in order for a WIMP to be consistent with the currently stringent experimental constraints it must have relatively small cross-sections for indirect, direct, and collider detection. Using our calculations and estimates of these cross-sections, we discuss the potential for discovery of a recently proposed dark matter WIMP which has a mass of about 70 GeV/c$^2$ and only second-order couplings to W and Z bosons. There is evidence that indirect detection may already have been achieved, since analyses of the gamma rays detected by Fermi-LAT and the antiprotons observed by AMS-02 are consistent with 70 GeV dark matter having our calculated $\langle σ_{ann} v \rangle \approx 1.2 \times 10^{-26} $ cm$^3$/s. The estimated sensitivities for LZ and XENONnT indicate that these experiments may achieve direct detection within the next few years, since we estimate the relevant cross-section to be slightly above $10^{-48}$ cm$^2$. Other experiments such as PandaX, SuperCDMS, and especially DARWIN should be able to confirm on a longer time scale. The high-luminosity LHC might achieve collider detection within about 15 years, since we estimate a collider cross-section slightly below 1 femtobarn. Definitive confirmation should come from still more powerful planned collider experiments (such as a future circular collider) within 15-35 years. △ Less

Submitted 24 October, 2022; originally announced October 2022.

Comments: 6 pages

arXiv:2010.01713 [pdf, other]

Reading Comprehension as Natural Language Inference: A Semantic Analysis

Authors: Anshuman Mishra, Dhruvesh Patel, Aparna Vijayakumar, Xiang Li, Pavan Kapanipathi, Kartik Talamadupula

Abstract: In the recent past, Natural language Inference (NLI) has gained significant attention, particularly given its promise for downstream NLP tasks. However, its true impact is limited and has not been well studied. Therefore, in this paper, we explore the utility of NLI for one of the most prominent downstream tasks, viz. Question Answering (QA). We transform the one of the largest available MRC datas… ▽ More In the recent past, Natural language Inference (NLI) has gained significant attention, particularly given its promise for downstream NLP tasks. However, its true impact is limited and has not been well studied. Therefore, in this paper, we explore the utility of NLI for one of the most prominent downstream tasks, viz. Question Answering (QA). We transform the one of the largest available MRC dataset (RACE) to an NLI form, and compare the performances of a state-of-the-art model (RoBERTa) on both these forms. We propose new characterizations of questions, and evaluate the performance of QA and NLI models on these categories. We highlight clear categories for which the model is able to perform better when the data is presented in a coherent entailment form, and a structured question-answer concatenation form, respectively. △ Less

Submitted 4 October, 2020; originally announced October 2020.

arXiv:2009.09099 [pdf, other]

Looking Beyond Sentence-Level Natural Language Inference for Downstream Tasks

Authors: Anshuman Mishra, Dhruvesh Patel, Aparna Vijayakumar, Xiang Li, Pavan Kapanipathi, Kartik Talamadupula

Abstract: In recent years, the Natural Language Inference (NLI) task has garnered significant attention, with new datasets and models achieving near human-level performance on it. However, the full promise of NLI -- particularly that it learns knowledge that should be generalizable to other downstream NLP tasks -- has not been realized. In this paper, we study this unfulfilled promise from the lens of two d… ▽ More In recent years, the Natural Language Inference (NLI) task has garnered significant attention, with new datasets and models achieving near human-level performance on it. However, the full promise of NLI -- particularly that it learns knowledge that should be generalizable to other downstream NLP tasks -- has not been realized. In this paper, we study this unfulfilled promise from the lens of two downstream tasks: question answering (QA), and text summarization. We conjecture that a key difference between the NLI datasets and these downstream tasks concerns the length of the premise; and that creating new long premise NLI datasets out of existing QA datasets is a promising avenue for training a truly generalizable NLI model. We validate our conjecture by showing competitive results on the task of QA and obtaining the best reported results on the task of Checking Factual Correctness of Summaries. △ Less

Submitted 18 September, 2020; originally announced September 2020.

arXiv:2002.06337 [pdf, other]

Manifold-based Test Generation for Image Classifiers

Authors: Taejoon Byun, Abhishek Vijayakumar, Sanjai Rayadurgam, Darren Cofer

Abstract: Neural networks used for image classification tasks in critical applications must be tested with sufficient realistic data to assure their correctness. To effectively test an image classification neural network, one must obtain realistic test data adequate enough to inspire confidence that differences between the implicit requirements and the learned model would be exposed. This raises two challen… ▽ More Neural networks used for image classification tasks in critical applications must be tested with sufficient realistic data to assure their correctness. To effectively test an image classification neural network, one must obtain realistic test data adequate enough to inspire confidence that differences between the implicit requirements and the learned model would be exposed. This raises two challenges: first, an adequate subset of the data points must be carefully chosen to inspire confidence, and second, the implicit requirements must be meaningfully extrapolated to data points beyond those in the explicit training set. This paper proposes a novel framework to address these challenges. Our approach is based on the premise that patterns in a large input data space can be effectively captured in a smaller manifold space, from which similar yet novel test cases---both the input and the label---can be sampled and generated. A variant of Conditional Variational Autoencoder (CVAE) is used for capturing this manifold with a generative function, and a search technique is applied on this manifold space to efficiently find fault-revealing inputs. Experiments show that this approach enables generation of thousands of realistic yet fault-revealing test cases efficiently even for well-trained models. △ Less

Submitted 15 February, 2020; originally announced February 2020.

arXiv:1912.01812 [pdf]

A compact single channel interferometer to study vortex beam propagation

Authors: Sruthy J. Lathika, A. Vijayakumar, Shanti Bhattacharya

Abstract: We propose and demonstrate a single channel interferometer that can be used to study how vortex beams propagate. The interferometer consists of a multifunctional diffractive optical element (MDOE) synthesized by the spatial random multiplexing of a Fresnel zone plate and a spiral Fresnel zone plate with different focal lengths. The MDOE generates two co-propagating beams, such that only the beam c… ▽ More We propose and demonstrate a single channel interferometer that can be used to study how vortex beams propagate. The interferometer consists of a multifunctional diffractive optical element (MDOE) synthesized by the spatial random multiplexing of a Fresnel zone plate and a spiral Fresnel zone plate with different focal lengths. The MDOE generates two co-propagating beams, such that only the beam carrying orbital angular momentum is modulated by an annular stack of thin scatterers located at the focal plane of the Fresnel zone plate, while the other beam passes through the centre of the annulus without any modulation. The interference pattern is recorded at the focal plane of the spiral Fresnel zone plate. The scattering of vortex beams through stacks consisting of different number of thin scatterers was studied using the proposed optical setup. Conflicting results have been reported earlier on whether higher or lower charge beams suffer more deterioration. The proposed interferometer provides a relatively simple and compact means of experimentally studying propagation of vortex beams through scattering medium. △ Less

Submitted 4 December, 2019; originally announced December 2019.

arXiv:1910.00981 [pdf, other]

doi 10.1109/TIFS.2016.2601067

Physical Design Obfuscation of Hardware: A Comprehensive Investigation of Device- and Logic-Level Techniques

Authors: Arunkumar Vijayakumar, Vinay C. Patil, Daniel E. Holcomb, Christof Paar, Sandip Kundu

Abstract: The threat of hardware reverse engineering is a growing concern for a large number of applications. A main defense strategy against reverse engineering is hardware obfuscation. In this paper, we investigate physical obfuscation techniques, which perform alterations of circuit elements that are difficult or impossible for an adversary to observe. The examples of such stealthy manipulations are chan… ▽ More The threat of hardware reverse engineering is a growing concern for a large number of applications. A main defense strategy against reverse engineering is hardware obfuscation. In this paper, we investigate physical obfuscation techniques, which perform alterations of circuit elements that are difficult or impossible for an adversary to observe. The examples of such stealthy manipulations are changes in the doping concentrations or dielectric manipulations. An attacker will, thus, extract a netlist, which does not correspond to the logic function of the device-under-attack. This approach of camouflaging has garnered recent attention in the literature. In this paper, we expound on this promising direction to conduct a systematic end-to-end study of the VLSI design process to find multiple ways to obfuscate a circuit for hardware security. This paper makes three major contributions. First, we provide a categorization of the available physical obfuscation techniques as it pertains to various design stages. There is a large and multidimensional design space for introducing obfuscated elements and mechanisms, and the proposed taxonomy is helpful for a systematic treatment. Second, we provide a review of the methods that have been proposed or in use. Third, we present recent and new device and logic-level techniques for design obfuscation. For each technique considered, we discuss feasibility of the approach and assess likelihood of its detection. Then we turn our focus to open research questions, and conclude with suggestions for future research directions. △ Less

Submitted 2 October, 2019; originally announced October 2019.

Journal ref: IEEE Transactions on Information Forensics and Security (Volume: 12, Issue: 1, Jan. 2017)

arXiv:1905.12547 [pdf]

SLM aided noninvasive imaging through thin scattering layers

Authors: Saswata Mukherjee, A. Vijayakumar, Joseph Rosen

Abstract: We propose and demonstrate a new imaging technique to noninvasively see through scattering layers with the aid of a spatial light modulator (SLM). A relay system projects the incoherent light pattern emitting from the scattering layer onto the SLM. Two coded phase masks are displayed, one after another, on the SLM to modulate the projected scattered field. Two corresponding intensity patterns are… ▽ More We propose and demonstrate a new imaging technique to noninvasively see through scattering layers with the aid of a spatial light modulator (SLM). A relay system projects the incoherent light pattern emitting from the scattering layer onto the SLM. Two coded phase masks are displayed, one after another, on the SLM to modulate the projected scattered field. Two corresponding intensity patterns are recorded by a digital camera, and subtracted one from the other in the computer to obtain a bipolar matrix. A modified phase retrieval algorithm is used to retrieve the object information from this bipolar matrix. △ Less

Submitted 29 May, 2019; originally announced May 2019.

arXiv:1904.12673 [pdf]

doi 10.1364/AO.58.005982

Implementation of a speckle correlation based optical lever (SC-OptLev) with extended dynamic range

Authors: A. Vijayakumar, D. Jayavel, M. Muthaiah, Shanti Bhattacharya, Joseph Rosen

Abstract: A speckle correlation based optical lever (SC-OptLev) is constructed for the measurement of small changes in the angle of orientation of a surface. The dynamic range of SC-OptLev is found to be twice that of a conventional OptLev for the same experimental configurations. Different filtering mechanisms are implemented and the correlation results are compared. Two types of computer automated SC-OptL… ▽ More A speckle correlation based optical lever (SC-OptLev) is constructed for the measurement of small changes in the angle of orientation of a surface. The dynamic range of SC-OptLev is found to be twice that of a conventional OptLev for the same experimental configurations. Different filtering mechanisms are implemented and the correlation results are compared. Two types of computer automated SC-OptLevs, open source based computing system with a low-cost image sensor and a commercial computing system, are presented with assistive computational modules. △ Less

Submitted 26 April, 2019; originally announced April 2019.

Comments: 9 pages, 3 figures

arXiv:1902.04300 [pdf]

doi 10.1117/1.OE.59.4.041204

Binary square axicon with chiral focusing properties for optical trapping

Authors: Balasubramani Vinoth, Anand Vijayakumar, Mani Ratnam Rai, Joseph Rosen, Chau-Jern Cheng, Oleg V Minin, Igor V Minin

Abstract: We introduce a novel phase-only diffractive optical element called chiral binary square axicon (CBSA). The CBSA is designed by linearly rotating the square half-period zones of the binary square axicon with respect to one another. A quadratic phase mask (QPM) is combined with the CBSA using modulo-2π phase addition technique to bring the far-field intensity pattern of CBSA at the focal plane of th… ▽ More We introduce a novel phase-only diffractive optical element called chiral binary square axicon (CBSA). The CBSA is designed by linearly rotating the square half-period zones of the binary square axicon with respect to one another. A quadratic phase mask (QPM) is combined with the CBSA using modulo-2π phase addition technique to bring the far-field intensity pattern of CBSA at the focal plane of the QPM and to introduce quasi-achromatic effects. The periodically rotated zones of CBSA produces a whirlpool phase profile and twisted intensity patterns at the focal plane of QPM. The degree of twisting seen in the intensity patterns is dependent upon the angular step size of rotation of the zones. The intensity pattern was found to rotate around the optical axis along the direction of propagation. The phase patterns of CBSA with different angles of zone rotation are displayed on a phase-only spatial light modulator and the experimental results were found to match with the simulation results. To evaluate the optical trapping capabilities of CBSA, an optical trapping experiment was carried out and the optical fields generated by CBSA were used for trapping and rotating yeast cells. △ Less

Submitted 12 February, 2019; originally announced February 2019.

Comments: 12 pages, 9 figures

arXiv:1901.03768 [pdf, other]

Input Prioritization for Testing Neural Networks

Authors: Taejoon Byun, Vaibhav Sharma, Abhishek Vijayakumar, Sanjai Rayadurgam, Darren Cofer

Abstract: Deep neural networks (DNNs) are increasingly being adopted for sensing and control functions in a variety of safety and mission-critical systems such as self-driving cars, autonomous air vehicles, medical diagnostics, and industrial robotics. Failures of such systems can lead to loss of life or property, which necessitates stringent verification and validation for providing high assurance. Though… ▽ More Deep neural networks (DNNs) are increasingly being adopted for sensing and control functions in a variety of safety and mission-critical systems such as self-driving cars, autonomous air vehicles, medical diagnostics, and industrial robotics. Failures of such systems can lead to loss of life or property, which necessitates stringent verification and validation for providing high assurance. Though formal verification approaches are being investigated, testing remains the primary technique for assessing the dependability of such systems. Due to the nature of the tasks handled by DNNs, the cost of obtaining test oracle data---the expected output, a.k.a. label, for a given input---is high, which significantly impacts the amount and quality of testing that can be performed. Thus, prioritizing input data for testing DNNs in meaningful ways to reduce the cost of labeling can go a long way in increasing testing efficacy. This paper proposes using gauges of the DNN's sentiment derived from the computation performed by the model, as a means to identify inputs that are likely to reveal weaknesses. We empirically assessed the efficacy of three such sentiment measures for prioritization---confidence, uncertainty, and surprise---and compare their effectiveness in terms of their fault-revealing capability and retraining effectiveness. The results indicate that sentiment measures can effectively flag inputs that expose unacceptable DNN behavior. For MNIST models, the average percentage of inputs correctly flagged ranged from 88% to 94.8%. △ Less

Submitted 11 January, 2019; originally announced January 2019.

arXiv:1812.05218 [pdf]

Structured light by discrete-phase orbital angular momentum holograms

Authors: A. Vijayakumar, Carmelo Rosales-Guzman, Mani Ratnam Rai Joseph Rosen, Oleg V. Minin, Igor V. Minin, Andrew Forbes

Abstract: Structured light has been created by a myriad of near- and far-field techniques and has found both classical and quantum applications. In the case of orbital angular momentum (OAM), continuous spiral phase patterns in dynamic or geometric phase are often employed with the phase patterns existing across the entire transverse plane. Here we exploit the uncertainty relation between OAM and angle to c… ▽ More Structured light has been created by a myriad of near- and far-field techniques and has found both classical and quantum applications. In the case of orbital angular momentum (OAM), continuous spiral phase patterns in dynamic or geometric phase are often employed with the phase patterns existing across the entire transverse plane. Here we exploit the uncertainty relation between OAM and angle to create structured OAM fields using multilevel OAM holograms. We show theoretically and experimentally that only a multilevel angular phase contour in the near-field is needed to create structured OAM light in the far-field, exploiting the reciprocal nature of angular momentum and angle. We use this approach to demonstrate exotic 3D structured light control to show the evolution of the Poynting vector in such fields and to highlight the physics underlying this phenomenon. △ Less

Submitted 12 December, 2018; originally announced December 2018.

Comments: 11 pages, 8 figures, 2 animations

arXiv:1809.10767 [pdf, ps, other]

Wiener index and Steiner 3-Wiener index of a graph

Authors: Matjaž Kovše, Rasila V A, Ambat Vijayakumar

Abstract: Let $S$ be a set of vertices of a connected graph $G$. The Steiner distance of $S$ is the minimum size of a connected subgraph of $G$ containing all the vertices of $S$. The sum of all Steiner distances on sets of size $k$ is called the Steiner $k$-Wiener index, hence for $k=2$ we get the Wiener index. The modular graphs are graphs in which every three vertices $x, y$ and $z$ have at least one med… ▽ More Let $S$ be a set of vertices of a connected graph $G$. The Steiner distance of $S$ is the minimum size of a connected subgraph of $G$ containing all the vertices of $S$. The sum of all Steiner distances on sets of size $k$ is called the Steiner $k$-Wiener index, hence for $k=2$ we get the Wiener index. The modular graphs are graphs in which every three vertices $x, y$ and $z$ have at least one median vertex $m(x,y,z)$ that belongs to shortest paths between each pair of $x, y$ and $z$. The Steiner 3-Wiener index of a modular graph is expressed in terms of its Wiener index. As a corollary formulae for the Steiner 3-Wiener index of Fibonacci and Lucas cubes are obtained. △ Less

Submitted 27 September, 2018; originally announced September 2018.

arXiv:1805.08143 [pdf, other]

Steiner Wiener index of block graphs

Authors: Matjaž Kovše, Rasila V A, Ambat Vijayakumar

Abstract: Let $S$ be a set of vertices of a connected graph $G$. The Steiner distance of $S$ is the minimum size of a connected subgraph of $G$ containing all the vertices of $S$. The Steiner $k$-Wiener index is the sum of all Steiner distances on sets of $k$ vertices of $G$. Different simple methods for calculating the Steiner $k$-Wiener index of block graphs are presented. Let $S$ be a set of vertices of a connected graph $G$. The Steiner distance of $S$ is the minimum size of a connected subgraph of $G$ containing all the vertices of $S$. The Steiner $k$-Wiener index is the sum of all Steiner distances on sets of $k$ vertices of $G$. Different simple methods for calculating the Steiner $k$-Wiener index of block graphs are presented. △ Less

Submitted 13 September, 2018; v1 submitted 21 May, 2018; originally announced May 2018.

arXiv:1708.00246 [pdf]

Incoherent digital holograms acquired by interferenceless coded aperture correlation holography system without refractive lenses

Authors: Manoj Kumar, A. Vijayakumar, Joseph Rosen

Abstract: We present a lensless, interferenceless incoherent digital holography technique based on the principle of coded aperture correlation holography. The acquired digital hologram by this technique contains a three-dimensional image of some observed scene. Light diffracted by a point object is modulated using a random-like coded phase mask (CPM) and the intensity pattern is recorded and composed as a p… ▽ More We present a lensless, interferenceless incoherent digital holography technique based on the principle of coded aperture correlation holography. The acquired digital hologram by this technique contains a three-dimensional image of some observed scene. Light diffracted by a point object is modulated using a random-like coded phase mask (CPM) and the intensity pattern is recorded and composed as a point spread hologram (PSH). A library of PSH is created using the same CPM by moving the pinhole to all possible axial locations. Intensity diffracted through the same CPM from an object placed within the axial limits of the PSH library is recorded by a digital camera. The recorded intensity this time is composed as the object hologram. The image of the object at any axial plane is reconstructed by cross-correlating the object hologram with the corresponding component of the PSH library. The reconstruction noise attached to the image is suppressed by various methods. The reconstruction results of multi-plane and thick objects by this technique are compared with regular lens-based imaging. △ Less

Submitted 1 August, 2017; originally announced August 2017.

Comments: 21 pages, 11 figures

arXiv:1703.01720 [pdf, other]

Sound-Word2Vec: Learning Word Representations Grounded in Sounds

Authors: Ashwin K Vijayakumar, Ramakrishna Vedantam, Devi Parikh

Abstract: To be able to interact better with humans, it is crucial for machines to understand sound - a primary modality of human perception. Previous works have used sound to learn embeddings for improved generic textual similarity assessment. In this work, we treat sound as a first-class citizen, studying downstream textual tasks which require aural grounding. To this end, we propose sound-word2vec - a ne… ▽ More To be able to interact better with humans, it is crucial for machines to understand sound - a primary modality of human perception. Previous works have used sound to learn embeddings for improved generic textual similarity assessment. In this work, we treat sound as a first-class citizen, studying downstream textual tasks which require aural grounding. To this end, we propose sound-word2vec - a new embedding scheme that learns specialized word embeddings grounded in sounds. For example, we learn that two seemingly (semantically) unrelated concepts, like leaves and paper are similar due to the similar rustling sounds they make. Our embeddings prove useful in textual tasks requiring aural reasoning like text-based sound retrieval and discovering foley sound effects (used in movies). Moreover, our embedding space captures interesting dependencies between words and onomatopoeia and outperforms prior work on aurally-relevant word relatedness datasets such as AMEN and ASLex. △ Less

Submitted 29 August, 2017; v1 submitted 5 March, 2017; originally announced March 2017.

Comments: Accepted at EMNLP 2017. Contains 6 pages; 3 tables; 1 figure

arXiv:1610.02424 [pdf, other]

Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models

Authors: Ashwin K Vijayakumar, Michael Cogswell, Ramprasath R. Selvaraju, Qing Sun, Stefan Lee, David Crandall, Dhruv Batra

Abstract: Neural sequence models are widely used to model time-series data. Equally ubiquitous is the usage of beam search (BS) as an approximate inference algorithm to decode output sequences from these models. BS explores the search space in a greedy left-right fashion retaining only the top-B candidates - resulting in sequences that differ only slightly from each other. Producing lists of nearly identica… ▽ More Neural sequence models are widely used to model time-series data. Equally ubiquitous is the usage of beam search (BS) as an approximate inference algorithm to decode output sequences from these models. BS explores the search space in a greedy left-right fashion retaining only the top-B candidates - resulting in sequences that differ only slightly from each other. Producing lists of nearly identical sequences is not only computationally wasteful but also typically fails to capture the inherent ambiguity of complex AI tasks. To overcome this problem, we propose Diverse Beam Search (DBS), an alternative to BS that decodes a list of diverse outputs by optimizing for a diversity-augmented objective. We observe that our method finds better top-1 solutions by controlling for the exploration and exploitation of the search space - implying that DBS is a better search algorithm. Moreover, these gains are achieved with minimal computational or memory over- head as compared to beam search. To demonstrate the broad applicability of our method, we present results on image captioning, machine translation and visual question generation using both standard quantitative metrics and qualitative human studies. Further, we study the role of diversity for image-grounded language generation tasks as the complexity of the image changes. We observe that our method consistently outperforms BS and previously proposed techniques for diverse decoding from neural sequence models. △ Less

Submitted 22 October, 2018; v1 submitted 7 October, 2016; originally announced October 2016.

Comments: 16 pages; accepted at AAAI 2018

arXiv:1603.07243 [pdf, ps, other]

Heredity for generalized power domination

Authors: Paul Dorbec, Seethu Varghese, Ambat Vijayakumar

Abstract: In this paper, we study the behaviour of the generalized power domination number of a graph by small changes on the graph, namely edge and vertex deletion and edge contraction. We prove optimal bounds for $γ\_{p,k}(G-e)$, $γ\_{p,k}(G/e)$ and for $γ\_{p,k}(G-v)$ in terms of $γ\_{p,k}(G)$, and give examples for which these bounds are tight. We characterize all graphs for which… ▽ More In this paper, we study the behaviour of the generalized power domination number of a graph by small changes on the graph, namely edge and vertex deletion and edge contraction. We prove optimal bounds for $γ\_{p,k}(G-e)$, $γ\_{p,k}(G/e)$ and for $γ\_{p,k}(G-v)$ in terms of $γ\_{p,k}(G)$, and give examples for which these bounds are tight. We characterize all graphs for which $γ\_{p,k}(G-e) = γ\_{p,k}(G)+1$ for any edge $e$. We also consider the behaviour of the propagation radius of graphs by similar modifications. △ Less

Submitted 23 March, 2016; originally announced March 2016.

Comments: Discrete Mathematics and Theoretical Computer Science, 2016

arXiv:1512.04407 [pdf, other]

We Are Humor Beings: Understanding and Predicting Visual Humor

Authors: Arjun Chandrasekaran, Ashwin K. Vijayakumar, Stanislaw Antol, Mohit Bansal, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh

Abstract: Humor is an integral part of human lives. Despite being tremendously impactful, it is perhaps surprising that we do not have a detailed understanding of humor yet. As interactions between humans and AI systems increase, it is imperative that these systems are taught to understand subtleties of human expressions such as humor. In this work, we are interested in the question - what content in a scen… ▽ More Humor is an integral part of human lives. Despite being tremendously impactful, it is perhaps surprising that we do not have a detailed understanding of humor yet. As interactions between humans and AI systems increase, it is imperative that these systems are taught to understand subtleties of human expressions such as humor. In this work, we are interested in the question - what content in a scene causes it to be funny? As a first step towards understanding visual humor, we analyze the humor manifested in abstract scenes and design computational models for them. We collect two datasets of abstract scenes that facilitate the study of humor at both the scene-level and the object-level. We analyze the funny scenes and explore the different types of humor depicted in them via human studies. We model two tasks that we believe demonstrate an understanding of some aspects of visual humor. The tasks involve predicting the funniness of a scene and altering the funniness of a scene. We show that our models perform well quantitatively, and qualitatively through human studies. Our datasets are publicly available. △ Less

Submitted 5 May, 2016; v1 submitted 14 December, 2015; originally announced December 2015.

Comments: 17 pages, 16 figures, 3 tables

arXiv:1508.00357 [pdf, ps, other]

Generalized power domination in WK-Pyramid Networks

Authors: Seethu Varghese, A. Vijayakumar

Abstract: The notion of power domination arises in the context of monitoring an electric power system with as few phase measurement units as possible. The $k-$power domination number of a graph $G$ is the minimum cardinality of a $k-$power dominating set ($k-$PDS) of $G$. In this paper, we determine the $k-$power domination number of WK-Pyramid networks, $WKP_{(C,L)}$, for all positive values of $k$ except… ▽ More The notion of power domination arises in the context of monitoring an electric power system with as few phase measurement units as possible. The $k-$power domination number of a graph $G$ is the minimum cardinality of a $k-$power dominating set ($k-$PDS) of $G$. In this paper, we determine the $k-$power domination number of WK-Pyramid networks, $WKP_{(C,L)}$, for all positive values of $k$ except for $k=C-1, C \geq 2$, for which we give an upper bound. The $k-$propagation radius of a graph $G$ is the minimum number of propagation steps needed to monitor the graph $G$ over all minimum $k-$PDS. We obtain the $k-$propagation radius of $WKP_{(C,L)}$ in some cases. △ Less

Submitted 3 August, 2015; originally announced August 2015.

Comments: 10 pages, 2 figures

MSC Class: 05C69; 94C15

arXiv:1405.3441 [pdf, ps, other]

On split graphs with four distinct eigenvalues

Authors: Felix Goldberg, Steve Kirkland, Anu Varghese, Ambat Vijayakumar

Abstract: It is a well-known fact that a graph of diameter $d$ has at least $d+1$ eigenvalues. Let us call a graph \emph{$d$-extremal} if it has diameter $d$ and exactly $d+1$ eigenvalues. Such graphs have been intensively studied by various authors. %Much attention has been devoted to the study of graphs that are extremal with respect to this relation: \emph{i.e} have diameter $d$ and exactly $d+1$ distinc… ▽ More It is a well-known fact that a graph of diameter $d$ has at least $d+1$ eigenvalues. Let us call a graph \emph{$d$-extremal} if it has diameter $d$ and exactly $d+1$ eigenvalues. Such graphs have been intensively studied by various authors. %Much attention has been devoted to the study of graphs that are extremal with respect to this relation: \emph{i.e} have diameter $d$ and exactly $d+1$ distinct eigenvalues. A graph is \emph{split} if its vertex set can be partitioned into a clique and a stable set. Such a graph has diameter at most $3$. We obtain a complete classification of the connected bidegreed $3$-extremal split graphs. We also show how to construct certain families of non-bidegreed $3$-extremal split graphs. △ Less

Submitted 14 May, 2014; originally announced May 2014.

MSC Class: 05C50; 05B05; 05C75

arXiv:1109.5245 [pdf]

The Gottschalk Conjecture

Authors: A. K. Vijayakumar

Abstract: The central idea of the proof is to show that a minimal flow v on a compact 3-manifold M implies the existence of a codimension one foliation F on it, which is transverse to the flow. If M is the 3-sphere, Novikov's theorem applies to show that one of the leaves of F is a compact surface X. It is now easy to derive a contradiction. The foliation is achieved by an induction procedure in which the m… ▽ More The central idea of the proof is to show that a minimal flow v on a compact 3-manifold M implies the existence of a codimension one foliation F on it, which is transverse to the flow. If M is the 3-sphere, Novikov's theorem applies to show that one of the leaves of F is a compact surface X. It is now easy to derive a contradiction. The foliation is achieved by an induction procedure in which the manifold M is partitioned into spaces called 'tubes', each of which is homeomorphic to a closed hollow cylinder. These cylinders are, in turn, 'filled in' with closed discs, each of which has its boundary on the tube. The first two sections are devoted to showing how a surface is chosen, which serves as a base for the first step of the induction procedure. The remainder of the paper describes the foliation procedure in detail. △ Less

Submitted 24 September, 2011; originally announced September 2011.

Comments: 66 pages, 50 figures

MSC Class: 37 - Dynamical Systems

Showing 1–30 of 30 results for author: Vijayakumar, A