-
Time Series Dataset for Modeling and Forecasting of $N_2O$ in Wastewater Treatment
Authors:
Laura Debel Hansen,
Anju Rani,
Mikkel Algren Stokholm-Bjerregaard,
Peter Alexander Stentoft,
Daniel Ortiz Arroyo,
Petar Durdevic
Abstract:
In this paper, we present two years of high-resolution nitrous oxide ($N_2O$) measurements for time series modeling and forecasting in wastewater treatment plants (WWTP). The dataset comprises frequent, real-time measurements from a full-scale WWTP, with a sample interval of 2 minutes, making it ideal for developing models for real-time operation and control. This comprehensive bio-chemical dataset includes detailed influent and effluent parameters, operational conditions, and environmental factors. Unlike existing datasets, it addresses the unique challenges of modeling $N_2O$, a potent greenhouse gas, providing a valuable resource for researchers to enhance predictive accuracy and control strategies in wastewater treatment processes. Additionally, this dataset significantly contributes to the fields of machine learning and deep learning time series forecasting by serving as a benchmark that mirrors the complexities of real-world processes, thus facilitating advancements in these domains. We provide a detailed description of the dataset along with a statistical analysis to highlight its characteristics, such as nonstationarity, nonnormality, seasonality, heteroscedasticity, structural breaks, asymmetric distributions, and intermittency, which are common in many real-world time series datasets and pose challenges for forecasting models.
Submitted 8 July, 2024;
originally announced July 2024.
-
Wastewater Treatment Plant Data for Nutrient Removal System
Authors:
Esmaeel Mohammadi,
Anju Rani,
Mikkel Stokholm-Bjerregaard,
Daniel Ortiz-Arroyo,
Petar Durdevic
Abstract:
This paper introduces the Agtrup (BlueKolding) dataset, collected from Denmark's Agtrup wastewater treatment plant, specifically designed to enhance phosphorus removal via chemical and biological methods. This rich dataset is assembled through a high-frequency Supervisory Control and Data Acquisition (SCADA) system data collection process, which captures a wide range of variables related to the operational dynamics of nutrient removal. It comprises time-series data featuring measurements sampled to a frequency of two minutes across various control, process, and environmental variables. The comprehensive dataset aims to foster significant advancements in wastewater management by supporting the development of sophisticated predictive models and optimizing operational strategies. By providing detailed insights into the interactions and efficiencies of chemical and biological phosphorus removal processes, the dataset serves as a vital resource for environmental researchers and engineers focused on improving the sustainability and effectiveness of wastewater treatment operations. The ultimate goal of this dataset is to facilitate the creation of digital twins and the application of machine learning techniques, such as deep reinforcement learning, to predict and enhance system performance under varying operational conditions.
Submitted 7 July, 2024;
originally announced July 2024.
-
Visual Hallucination: Definition, Quantification, and Prescriptive Remediations
Authors:
Anku Rani,
Vipula Rawte,
Harshad Sharma,
Neeraj Anand,
Krishnav Rajbangshi,
Amit Sheth,
Amitava Das
Abstract:
The troubling rise of hallucination presents perhaps the most significant impediment to the advancement of responsible AI. In recent times, considerable research has focused on detecting and mitigating hallucination in Large Language Models (LLMs). However, it's worth noting that hallucination is also quite prevalent in Vision-Language models (VLMs). In this paper, we offer a fine-grained discourse on profiling VLM hallucination based on two tasks: i) image captioning, and ii) Visual Question Answering (VQA). We delineate eight fine-grained orientations of visual hallucination: i) Contextual Guessing, ii) Identity Incongruity, iii) Geographical Erratum, iv) Visual Illusion, v) Gender Anomaly, vi) VLM as Classifier, vii) Wrong Reading, and viii) Numeric Discrepancy. We curate Visual HallucInation eLiciTation (VHILT), a publicly available dataset comprising 2,000 samples generated using eight VLMs across two tasks of captioning and VQA along with human annotations for the categories as mentioned earlier.
Submitted 30 March, 2024; v1 submitted 25 March, 2024;
originally announced March 2024.
-
Advancements in Point Cloud-Based 3D Defect Detection and Classification for Industrial Systems: A Comprehensive Survey
Authors:
Anju Rani,
Daniel Ortiz-Arroyo,
Petar Durdevic
Abstract:
In recent years, 3D point clouds (PCs) have gained significant attention due to their diverse applications across various fields such as computer vision (CV), condition monitoring, virtual reality, robotics, autonomous driving etc. Deep learning (DL) has proven effective in leveraging 3D PCs to address various challenges previously encountered in 2D vision. However, the application of deep neural networks (DNN) to process 3D PCs presents its own set of challenges. To address these challenges, numerous methods have been proposed. This paper provides an in-depth review of recent advancements in DL-based condition monitoring (CM) using 3D PCs, with a specific focus on defect shape classification and segmentation within industrial applications for operational and maintenance purposes. Recognizing the crucial role of these aspects in industrial maintenance, the paper provides insightful observations that offer perspectives on the strengths and limitations of the reviewed DL-based PC processing methods. This synthesis of knowledge aims to contribute to the understanding and enhancement of CM processes, particularly within the framework of remaining useful life (RUL), in industrial systems.
Submitted 20 February, 2024;
originally announced February 2024.
-
A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models
Authors:
S. M Towhidul Islam Tonmoy,
S M Mehedi Zaman,
Vinija Jain,
Anku Rani,
Vipula Rawte,
Aman Chadha,
Amitava Das
Abstract:
As Large Language Models (LLMs) continue to advance in their ability to write human-like text, a key challenge remains around their tendency to hallucinate, generating content that appears factual but is ungrounded. This issue of hallucination is arguably the biggest hindrance to safely deploying these powerful LLMs into real-world production systems that impact people's lives. The journey toward widespread adoption of LLMs in practical settings heavily relies on addressing and mitigating hallucinations. Unlike traditional AI systems focused on limited tasks, LLMs have been exposed to vast amounts of online text data during training. While this allows them to display impressive language fluency, it also means they are capable of extrapolating information from the biases in training data, misinterpreting ambiguous prompts, or modifying the information to align superficially with the input. This becomes hugely alarming when we rely on language generation capabilities for sensitive applications, such as summarizing medical records, financial analysis reports, etc. This paper presents a comprehensive survey of over 32 techniques developed to mitigate hallucination in LLMs. Notable among these are Retrieval Augmented Generation (Lewis et al, 2021), Knowledge Retrieval (Varshney et al, 2023), CoNLI (Lei et al, 2023), and CoVe (Dhuliawala et al, 2023). Furthermore, we introduce a detailed taxonomy categorizing these methods based on various parameters, such as dataset utilization, common tasks, feedback mechanisms, and retriever types. This classification helps distinguish the diverse approaches specifically designed to tackle hallucination issues in LLMs. Additionally, we analyze the challenges and limitations inherent in these techniques, providing a solid foundation for future research in addressing hallucinations and related phenomena within the realm of LLMs.
Submitted 8 January, 2024; v1 submitted 2 January, 2024;
originally announced January 2024.
-
SEPSIS: I Can Catch Your Lies -- A New Paradigm for Deception Detection
Authors:
Anku Rani,
Dwip Dalal,
Shreya Gautam,
Pankaj Gupta,
Vinija Jain,
Aman Chadha,
Amit Sheth,
Amitava Das
Abstract:
Deception is the intentional practice of twisting information. It is a nuanced societal practice deeply intertwined with human societal evolution, characterized by a multitude of facets. This research explores the problem of deception through the lens of psychology, employing a framework that categorizes deception into three forms: lies of omission, lies of commission, and lies of influence. The primary focus of this study is specifically on investigating only lies of omission. We propose a novel framework for deception detection leveraging NLP techniques. We curated an annotated dataset of 876,784 samples by amalgamating a popular large-scale fake news dataset and scraped news headlines from the Twitter handle of Times of India, a well-known Indian news media house. Each sample has been labeled with four layers, namely: (i) the type of omission (speculation, bias, distortion, sounds factual, and opinion), (ii) colors of lies (black, white, etc.), (iii) the intention of such lies (to influence, etc.), and (iv) topic of lies (political, educational, religious, etc.). We present a novel multi-task learning pipeline that leverages the dataless merging of fine-tuned language models to address the deception detection task mentioned earlier. Our proposed model achieved an F1 score of 0.87, demonstrating strong performance across all layers including the type, color, intent, and topic aspects of deceptive content. Finally, our research explores the relationship between lies of omission and propaganda techniques. To accomplish this, we conducted an in-depth analysis, uncovering compelling findings. For instance, our analysis revealed a significant correlation between loaded language and opinion, shedding light on their interconnectedness. To encourage further research in this field, we will be making the models and dataset available with the MIT License, making it favorable for open-source research.
Submitted 30 November, 2023;
originally announced December 2023.
-
Imagery Dataset for Condition Monitoring of Synthetic Fibre Ropes
Authors:
Anju Rani,
Daniel O. Arroyo,
Petar Durdevic
Abstract:
Automatic visual inspection of synthetic fibre ropes (SFRs) is a challenging task in the offshore and wind turbine industries, among others. The presence of any defect in SFRs can compromise their structural integrity and pose significant safety risks. Due to the large size and weight of these ropes, it is often impractical to detach and inspect them frequently. Therefore, there is a critical need to develop efficient defect detection methods to assess their remaining useful life (RUL). To address this challenge, a comprehensive dataset has been generated, comprising a total of 6,942 raw images representing both normal and defective SFRs. The dataset encompasses a wide array of defect scenarios which may occur throughout their operational lifespan, including but not limited to placking defects, cut strands, chafings, compressions, and core outs, as well as normal ropes. This dataset serves as a resource to support computer vision applications, including object detection, classification, and segmentation, aimed at detecting and analyzing defects in SFRs. The availability of this dataset will facilitate the development and evaluation of robust defect detection algorithms. The aim of generating this dataset is to assist in the development of automated defect detection systems that outperform traditional visual inspection methods, thereby paving the way for safer and more efficient utilization of SFRs across a wide range of applications.
Submitted 29 September, 2023;
originally announced September 2023.
-
Overview of Memotion 3: Sentiment and Emotion Analysis of Codemixed Hinglish Memes
Authors:
Shreyash Mishra,
S Suryavardan,
Megha Chakraborty,
Parth Patwa,
Anku Rani,
Aman Chadha,
Aishwarya Reganti,
Amitava Das,
Amit Sheth,
Manoj Chinnakotla,
Asif Ekbal,
Srijan Kumar
Abstract:
Analyzing memes on the internet has emerged as a crucial endeavor due to the impact this multi-modal form of content wields in shaping online discourse. Memes have become a powerful tool for expressing emotions and sentiments, possibly even spreading hate and misinformation, through humor and sarcasm. In this paper, we present the overview of the Memotion 3 shared task, as part of the DeFactify 2 workshop at AAAI-23. The task released an annotated dataset of Hindi-English code-mixed memes based on their Sentiment (Task A), Emotion (Task B), and Emotion intensity (Task C). Each of these is defined as an individual task and the participants are ranked separately for each task. Over 50 teams registered for the shared task and 5 made final submissions to the test set of the Memotion 3 dataset. CLIP, BERT modifications, ViT etc. were the most popular models among the participants along with approaches such as Student-Teacher model, Fusion, and Ensembling. The best final F1 score for Task A is 34.41, Task B is 79.77 and Task C is 59.82.
Submitted 12 September, 2023;
originally announced September 2023.
-
Defect Detection in Synthetic Fibre Ropes using Detectron2 Framework
Authors:
Anju Rani,
Daniel O. Arroyo,
Petar Durdevic
Abstract:
Fibre ropes with the latest technology have emerged as an appealing alternative to steel ropes for offshore industries due to their light weight and high tensile strength. At the same time, frequent inspection of these ropes is essential to ensure the proper functioning and safety of the entire system. The development of deep learning (DL) models in condition monitoring (CM) applications offers a simpler and more effective approach for defect detection in synthetic fibre ropes (SFRs). The present paper investigates the performance of Detectron2, a state-of-the-art library for defect detection and instance segmentation. Detectron2 with Mask R-CNN architecture is used for segmenting defects in SFRs. Mask R-CNN with various backbone configurations has been trained and tested on an experimentally obtained dataset comprising 1,803 high-dimensional images containing seven damage classes (placking high, placking medium, placking low, compression, core out, chafing, and normal) for SFRs. By leveraging the capabilities of Detectron2, this study aims to develop an automated and efficient method for detecting defects in SFRs, enhancing the inspection process, and ensuring the safety of the fibre ropes.
Submitted 28 June, 2024; v1 submitted 4 September, 2023;
originally announced September 2023.
-
Findings of Factify 2: Multimodal Fake News Detection
Authors:
S Suryavardan,
Shreyash Mishra,
Megha Chakraborty,
Parth Patwa,
Anku Rani,
Aman Chadha,
Aishwarya Reganti,
Amitava Das,
Amit Sheth,
Manoj Chinnakotla,
Asif Ekbal,
Srijan Kumar
Abstract:
With social media usage growing exponentially in the past few years, fake news has also become extremely prevalent. The detrimental impact of fake news emphasizes the need for research focused on automating the detection of false information and verifying its accuracy. In this work, we present the outcome of the Factify 2 shared task, which provides a multi-modal fact verification and satire news dataset, as part of the DeFactify 2 workshop at AAAI'23. The data calls for a comparison based approach to the task by pairing social media claims with supporting documents, with both text and image, divided into 5 classes based on multi-modal relations. In the second iteration of this task we had over 60 participants and 9 final test-set submissions. The best performances came from the use of DeBERTa for text and Swinv2 and CLIP for image. The highest F1 score averaged for all five classes was 81.82%.
Submitted 12 September, 2023; v1 submitted 19 July, 2023;
originally announced July 2023.
-
FACTIFY3M: A Benchmark for Multimodal Fact Verification with Explainability through 5W Question-Answering
Authors:
Megha Chakraborty,
Khushbu Pahwa,
Anku Rani,
Shreyas Chatterjee,
Dwip Dalal,
Harshit Dave,
Ritvik G,
Preethi Gurumurthy,
Adarsh Mahor,
Samahriti Mukherjee,
Aditya Pakala,
Ishan Paul,
Janvita Reddy,
Arghya Sarkar,
Kinjal Sensharma,
Aman Chadha,
Amit P. Sheth,
Amitava Das
Abstract:
Combating disinformation is one of the burning societal crises -- about 67% of the American population believes that disinformation produces a lot of uncertainty, and 10% of them knowingly propagate disinformation. Evidence shows that disinformation can manipulate democratic processes and public opinion, causing disruption in the share market, panic and anxiety in society, and even death during crises. Therefore, disinformation should be identified promptly and, if possible, mitigated. With approximately 3.2 billion images and 720,000 hours of video shared online daily on social media platforms, scalable detection of multimodal disinformation requires efficient fact verification. Despite progress in automatic text-based fact verification (e.g., FEVER, LIAR), the research community lacks substantial effort in multimodal fact verification. To address this gap, we introduce FACTIFY 3M, a dataset of 3 million samples that pushes the boundaries of the domain of fact verification via a multimodal fake news dataset, in addition to offering explainability through the concept of 5W question-answering. Salient features of the dataset include: (i) textual claims, (ii) ChatGPT-generated paraphrased claims, (iii) associated images, (iv) stable diffusion-generated additional images (i.e., visual paraphrases), (v) pixel-level image heatmap to foster image-text explainability of the claim, (vi) 5W QA pairs, and (vii) adversarial fake news stories.
Submitted 30 October, 2023; v1 submitted 22 May, 2023;
originally announced June 2023.
-
Free Space Continuous Variable Quantum Key Distribution with Discrete Phases
Authors:
Anju Rani,
Pooja Chandravanshi,
Jayanth Ramakrishnan,
Pravin Vaity,
P. Madhusudhan,
Tanya Sharma,
Pranav Bhardwaj,
Ayan Biswas,
R. P. Singh
Abstract:
Quantum Key Distribution (QKD) offers unconditional security in principle. Many QKD protocols have been proposed and demonstrated to ensure secure communication between two authenticated users. Continuous variable (CV) QKD offers many advantages over discrete variable (DV) QKD since it is cost-effective, compatible with current classical communication technologies, efficient even in daylight, and gives a higher secure key rate. Keeping this in view, we demonstrate a discrete modulated CVQKD protocol in the free space which is robust against polarization drift. We also present the simulation results with a noise model to account for the channel noise and the effects of various parameter changes on the secure key rate. These simulation results help us to verify the experimental values obtained for the implemented CVQKD.
Submitted 22 May, 2023;
originally announced May 2023.
-
Detecting resonance of radio-frequency cavities using fast direct integral equation solvers and augmented Bayesian optimization
Authors:
Yang Liu,
Tianhuan Luo,
Aman Rani,
Hengrui Luo,
Xiaoye Sherry Li
Abstract:
This paper presents a computationally efficient framework for identifying resonance modes of 3D radio-frequency (RF) cavities with damping waveguide ports. The proposed framework relies on surface integral equation (IE) formulations to convert the task of resonance detection to the task of finding resonance frequencies at which the lowest few eigenvalues of the system matrix are close to zero. For the linear eigenvalue problem with a fixed frequency, we propose leveraging fast direct solvers to efficiently invert the system matrix; for the frequency search problem, we develop a hybrid optimization algorithm that combines Bayesian optimization with down-hill simplex optimization. The proposed IE-based resonance detection framework (IERD) has been applied to detection of high-order resonance modes (HOMs) of realistic accelerator RF cavities to demonstrate its efficiency and accuracy.
Submitted 30 August, 2023; v1 submitted 10 May, 2023;
originally announced May 2023.
-
FACTIFY-5WQA: 5W Aspect-based Fact Verification through Question Answering
Authors:
Anku Rani,
S. M Towhidul Islam Tonmoy,
Dwip Dalal,
Shreya Gautam,
Megha Chakraborty,
Aman Chadha,
Amit Sheth,
Amitava Das
Abstract:
Automatic fact verification has received significant attention recently. Contemporary automatic fact-checking systems focus on estimating truthfulness using numerical scores which are not human-interpretable. A human fact-checker generally follows several logical steps to verify a verisimilitude claim and conclude whether it is truthful or a mere masquerade. Popular fact-checking websites follow a common structure for fact categorization such as half true, half false, false, pants on fire, etc. Therefore, it is necessary to have an aspect-based (delineating which part(s) are true and which are false) explainable system that can assist human fact-checkers in asking relevant questions related to a fact, which can then be validated separately to reach a final verdict. In this paper, we propose a 5W framework (who, what, when, where, and why) for question-answer-based fact explainability. To that end, we present a semi-automatically generated dataset called FACTIFY-5WQA, which consists of 391,041 facts along with relevant 5W QAs - underscoring our major contribution to this paper. A semantic role labeling system has been utilized to locate 5Ws, which generates QA pairs for claims using a masked language model. Finally, we report a baseline QA system to automatically locate those answers from evidence documents, which can serve as a baseline for future research in the field. Lastly, we propose a robust fact verification system that takes paraphrased claims and automatically validates them. The dataset and the baseline model are available at https://github.com/ankuranii/acl-5W-QA
Submitted 28 May, 2023; v1 submitted 7 May, 2023;
originally announced May 2023.
-
Factify 2: A Multimodal Fake News and Satire News Dataset
Authors:
S Suryavardan,
Shreyash Mishra,
Parth Patwa,
Megha Chakraborty,
Anku Rani,
Aishwarya Reganti,
Aman Chadha,
Amitava Das,
Amit Sheth,
Manoj Chinnakotla,
Asif Ekbal,
Srijan Kumar
Abstract:
The internet gives the world an open platform to express their views and share their stories. While this is very valuable, it makes fake news one of our society's most pressing problems. The manual fact-checking process is time consuming, which makes it challenging to disprove misleading assertions before they cause significant harm. This is the driving interest in automatic fact or claim verification. Some of the existing datasets aim to support development of automating fact-checking techniques; however, most of them are text based. Multi-modal fact verification has received relatively scant attention. In this paper, we provide a multi-modal fact-checking dataset called FACTIFY 2, improving Factify 1 by using new data sources and adding satire articles. Factify 2 has 50,000 new data instances. Similar to FACTIFY 1.0, we have three broad categories - support, no-evidence, and refute, with sub-categories based on the entailment of visual and textual data. We also provide a BERT and Vision Transformer based baseline, which achieves 65% F1 score in the test set. The baseline codes and the dataset will be made available at https://github.com/surya1701/Factify-2.0.
Submitted 2 October, 2023; v1 submitted 7 April, 2023;
originally announced April 2023.
-
Memotion 3: Dataset on Sentiment and Emotion Analysis of Codemixed Hindi-English Memes
Authors:
Shreyash Mishra,
S Suryavardan,
Parth Patwa,
Megha Chakraborty,
Anku Rani,
Aishwarya Reganti,
Aman Chadha,
Amitava Das,
Amit Sheth,
Manoj Chinnakotla,
Asif Ekbal,
Srijan Kumar
Abstract:
Memes are the new-age conveyance mechanism for humor on social media sites. Memes often include an image and some text. Memes can be used to promote disinformation or hatred, so it is crucial to investigate them in detail. We introduce Memotion 3, a new dataset with 10,000 annotated memes. Unlike other prevalent datasets in the domain, including prior iterations of Memotion, Memotion 3 introduces Hindi-English Codemixed memes, while prior works in the area were limited to English memes only. We describe the Memotion task, the data collection and the dataset creation methodologies. We also provide a baseline for the task. The baseline code and dataset will be made available at https://github.com/Shreyashm16/Memotion-3.0
Submitted 2 October, 2023; v1 submitted 17 March, 2023;
originally announced March 2023.
-
Action-based Early Autism Diagnosis Using Contrastive Feature Learning
Authors:
Asha Rani,
Pankaj Yadav,
Yashaswi Verma
Abstract:
Autism, also known as Autism Spectrum Disorder (or ASD), is a neurological disorder. Its main symptoms include difficulty in (verbal and/or non-verbal) communication, and rigid/repetitive behavior. These symptoms are often indistinguishable from a normal (control) individual, due to which this disorder remains undiagnosed in early childhood, leading to delayed treatment. Since the learning curve is steep during the initial age, an early diagnosis of autism could allow adequate interventions to be taken at the right time, which might positively affect the growth of an autistic child. Further, the traditional methods of autism diagnosis require multiple visits to a specialized psychiatrist, a process that can be time-consuming. In this paper, we present a learning-based approach to automate autism diagnosis using simple and small action video clips of subjects. This task is particularly challenging because the amount of annotated data available is small, and the variations among samples from the two categories (ASD and control) are generally indistinguishable. This is also evident from the poor performance of a binary classifier learned using the cross-entropy loss on top of a baseline encoder. To address this, we adopt contrastive feature learning in both self-supervised and supervised learning frameworks, and show that these can lead to a significant increase in the prediction accuracy of a binary classifier on this task. We further validate this by conducting thorough experimental analyses under different set-ups on two publicly available datasets.
Submitted 17 July, 2023; v1 submitted 12 September, 2022;
originally announced September 2022.
-
Big Tech Companies Impact on Research at the Faculty of Information Technology and Electrical Engineering
Authors:
Ahmad Hassanpour,
An Thi Nguyen,
Anshul Rani,
Sarang Shaikh,
Ying Xu,
Haoyu Zhang
Abstract:
Artificial intelligence is gaining momentum, and the ongoing pandemic is fueling it, with more opportunities in every sector, especially in the health and education sectors. But with the growth in technology, challenges associated with ethics also grow (Katharine Schwab, 2021). Whenever a new AI product is developed, companies publicize that their systems are transparent, fair, and in accordance with existing laws and regulations, as the methods and procedures a big tech company follows to ensure AI ethics not only affect the trust and perception of the public but also challenge the company's capabilities regarding business strategies in different regions and the kind of talent it can attract for its projects. AI Big Tech companies have influence over AI ethics, as many influential ethical-AI researchers have roots in Big Tech or its associated labs.
Submitted 10 April, 2022;
originally announced May 2022.
-
BBM92 quantum key distribution over a free space dusty channel of 200 meters
Authors:
Sarika Mishra,
Ayan Biswas,
Satyajeet Patil,
Pooja Chandravanshi,
Vardaan Mongia,
Tanya Sharma,
Anju Rani,
Shashi Prabhakar,
S. Ramachandran,
Ravindra P. Singh
Abstract:
Free space quantum communication assumes importance as it is a precursor for satellite-based quantum communication needed for secure key distribution over longer distances. Prepare and measure protocols like BB84 consider the satellite as a trusted device, which is fraught with security threat looking at the current trend for satellite-based optical communication. Therefore, entanglement-based protocols must be preferred, so that one can consider the satellite as an untrusted device too. The current work reports the implementation of BBM92 protocol, an entanglement-based QKD protocol over 200 m distance using an indigenous facility developed at Physical Research Laboratory (PRL), Ahmedabad, India. Our results show the effect of atmospheric aerosols on sift key rate, and eventually, secure key rate. Such experiments are important to validate the models to account for the atmospheric effects on the key rates achieved through satellite-based QKD.
Submitted 9 January, 2022; v1 submitted 22 December, 2021;
originally announced December 2021.
-
Use of Formal Ethical Reviews in NLP Literature: Historical Trends and Current Practices
Authors:
Sebastin Santy,
Anku Rani,
Monojit Choudhury
Abstract:
Ethical aspects of research in language technologies have received much attention recently. It is a standard practice to get a study involving human subjects reviewed and approved by a professional ethics committee/board of the institution. How commonly do we see mention of ethical approvals in NLP research? What types of research or aspects of studies are usually subject to such reviews? With the rising concerns and discourse around the ethics of NLP, do we also observe a rise in formal ethical reviews of NLP studies? And, if so, would this imply that there is a heightened awareness of ethical issues that was previously lacking? We aim to address these questions by conducting a detailed quantitative and qualitative analysis of the ACL Anthology, as well as comparing the trends in our field to those of other related disciplines, such as cognitive science, machine learning, data mining, and systems.
Submitted 2 June, 2021;
originally announced June 2021.
-
Polarization-orbital angular momentum duality assisted entanglement observation for indistinguishable photons
Authors:
Nijil Lal,
Sarika Mishra,
Anju Rani,
Anindya Banerji,
Chithrabhanu Perumangattu,
R. P. Singh
Abstract:
Duality in the entanglement of identical particles manifests that entanglement in only one variable can be revealed at a time. We demonstrate this using polarization and orbital angular momentum (OAM) variables of indistinguishable photons generated from parametric down conversion. We show polarization entanglement by sorting photons in even and odd OAM basis, while sorting them in two orthogonal polarization modes reveals the OAM entanglement. The duality assisted observation of entanglement can be used as a verification for the preservation of quantum indistinguishability over communication channels. Indistinguishable photons entangled in complementary variables could also evoke interest in distributed quantum sensing protocols and remote entanglement generation.
Submitted 23 April, 2021;
originally announced April 2021.
-
Enhanced photocatalytic activity of plasmonic Au nanoparticles incorporated MoS$_2$ nanosheets for degradation of organic dyes
Authors:
Anjali Rani,
Arun Singh Patel,
Anirban Chakraborti,
Kulvinder Singh,
Prianka Sharma
Abstract:
In the present paper, we investigate the effect of plasmonic Au nanoparticles (NPs) decoration on the photocatalytic efficiency of MoS$_2$ nanosheets. The Au NPs are grown on the surface of chemically exfoliated MoS$_2$ nanosheets by chemical reduction method. Au-MoS$_2$ nanostructures (NSs) are characterized by X-ray diffractometer, Raman spectrometer, absorption spectrophotometer, and transmission electron microscopy. Exfoliated MoS$_2$ and Au-MoS$_2$ NSs are used to study the photocatalytic degradation of organic dyes methyl red (MR) and methylene blue (MB). Under UV-Visible light irradiation, pristine MoS$_2$ shows photo degradation efficiencies between 30% to 46.9% for MR and 23.3% to 44% for MB, with varying exposure times from 30 to 120 min, respectively. However, Au-MoS$_2$ NSs with maximum Au NPs concentration show enhanced degradation efficiency from 70.2 to 96.7% for MR, and from 65.2 to 94.3% for MB. The manifold enhancement of degradation efficiency for both the dyes with Au-MoS$_2$ NSs may be attributed to the presence of Au NPs acting as charge trapping sites in the NSs. We believe this study would have potential application in battling the ill-effects of environmental degradation, which poses a major threat to humans as well as biodiversity.
Submitted 29 June, 2020;
originally announced June 2020.
-
Correlations in scattered perfect optical vortices
Authors:
Patnala Vanitha,
Nijil Lal,
Anju Rani,
Salla Gangi Reddy,
R. P. Singh
Abstract:
We have studied correlations in the speckle patterns generated by the scattering of perfect optical vortex (POV) beams and used them for producing a new class of coherence functions, namely Bessel coherence functions. Higher (zeroth) order Bessel coherence functions have been realized in cross (auto)-correlation between the speckle patterns generated by the scattering of perfect vortex beams of different orders. We have also studied the propagation of the produced Bessel coherence functions and characterized their divergence with respect to the radius of their first ring for different orders m=0--4. We observed that the divergence varies linearly with the order of the coherence function. We provide the exact analytical expression for the auto-correlation as well as cross-correlation functions for speckle patterns. Our experimental results are in good agreement with the analytical results.
Submitted 4 February, 2020;
originally announced February 2020.
-
Tailoring optical and magnetic properties of molybdenum disulphide nanosheets by incorporating plasmonic gold nanoparticles
Authors:
Anjali Rani,
Arun Singh Patel,
Anirban Chakraborti,
Kulvinder Singh,
Prianka Sharma
Abstract:
The present paper deals with the systematic growth of plasmonic gold nanoparticles (Au NPs) on molybdenum disulphide (MoS$_2$) nanosheets as well as the effect of Au NPs on the optical and magnetic properties. The crystalline nature of the nanocomposites is confirmed by X-ray diffraction and transmission electron microscopic techniques. The optical properties are characterized using absorption and Raman spectroscopic techniques. The electron paramagnetic resonance technique is used to study the magnetic response of the nanocomposites. The paper attempts to gain a fundamental understanding of the two-dimensional nanomaterial-based composites for their applications in the magnetic and optical devices.
Submitted 29 January, 2020; v1 submitted 9 December, 2019;
originally announced December 2019.
-
Effect of direct reaction channels on deep sub-barrier fusion in asymmetric systems
Authors:
Md. Moin Shaikh,
S. Nath,
J. Gehlot,
Tathagata Banerjee,
Ish Mukul,
R. Dubey,
A. Shamlath,
P. V. Laveen,
M. Shareef,
A. Jhingan,
N. Madhavan,
Tapan Rajbongshi,
P. Jisha,
G. Naga Jyothi,
A. Tejaswi,
Rudra N. Sahoo,
Anjali Rani
Abstract:
A steeper fall of fusion excitation function, compared to the predictions of coupled-channels models, at energies below the lowest barrier between the reaction partners, is termed as deep sub-barrier fusion hindrance. This phenomenon has been observed in many symmetric and nearly-symmetric systems. Different physical origins of the hindrance have been proposed. This work aims to study the probable effects of direct reactions on deep sub-barrier fusion cross sections. Fusion (evaporation residue) cross sections have been measured for the system $^{19}$F+$^{181}$Ta, from above the barrier down to the energies where fusion hindrance is expected to come into play. Coupled-channels calculation with standard Woods-Saxon potential gives a fair description of the fusion excitation function down to energies $\simeq 14\%$ below the barrier for the present system. This is in contrast with the observation of increasing fusion hindrance in asymmetric reactions induced by increasingly heavier projectiles, \textit{viz.} $^{6,7}$Li, $^{11}$B, $^{12}$C and $^{16}$O. The asymmetric reactions, which have not shown any signature of fusion hindrance within the measured energy range, are found to be induced by projectiles with lower $\alpha$ break-up threshold, compared to the reactions which have shown signatures of fusion hindrance. In addition, most of the $Q$-values for light particles pick-up channels are negative for the reactions which have exhibited strong signatures of fusion hindrance, \textit{viz.} $^{12}$C+$^{198}$Pt and $^{16}$O+$^{204,208}$Pb. Thus, break-up of projectile and particle transfer channels with positive $Q$-values seem to compensate for the hindrance in fusion deep below the barrier. Inclusion of break-up and transfer channels within the framework of coupled-channels calculation would be of interest.
Submitted 25 May, 2018; v1 submitted 13 March, 2018;
originally announced March 2018.
-
Perinormal rings with zero divisors
Authors:
Tiberiu Dumitrescu,
Anam Rani
Abstract:
We extend to rings with zero-divisors the concept of perinormal domain introduced by N. Epstein and J. Shapiro. A ring $A$ is called perinormal if every overring of $A$ which satisfies going down over $A$ is $A$-flat. The Prüfer rings and the Marot Krull rings are perinormal.
Submitted 29 September, 2016;
originally announced September 2016.
-
Abundance analysis of SDSS J134338.67+484426.6; an extremely metal-poor star from the MARVELS pre-survey
Authors:
A. Susmitha Rani,
T. Sivarani,
T. C. Beers,
S. Fleming,
S. Mahadevan,
J. Ge
Abstract:
We present an elemental-abundance analysis of an extremely metal-poor (EMP; [Fe/H] < -3.0) star, SDSS J134338.67+484426.6, identified during the course of the MARVELS spectroscopic pre-survey of some 20000 stars to identify suitable candidates for exoplanet searches. This star, with an apparent magnitude V = 12.14, is the lowest metallicity star found in the pre-survey, and is one of only ~20 known EMP stars that are this bright or brighter. Our high-resolution spectroscopic analysis shows that this star is a subgiant with [Fe/H] = -3.42, having "normal" carbon and no enhancement of neutron-capture abundances. Strontium is under-abundant, [Sr/Fe] = -0.47, but the derived lower limit on [Sr/Ba] indicates that Sr is likely enhanced relative to Ba. This star belongs to the sparsely populated class of alpha-poor EMP stars that exhibit low ratios of [Mg/Fe], [Si/Fe], and [Ca/Fe] compared to typical halo stars at similar metallicity. The observed variations in radial velocity from several epochs of (low- and high-resolution) spectroscopic follow-up indicate that SDSS J134338.67+484426.6 is a possible long-period binary. We also discuss the abundance trends in EMP stars for r-process elements, and compare with other magnesium-poor stars.
Submitted 1 March, 2016;
originally announced March 2016.
-
A note on perinormal domains
Authors:
Tiberiu Dumitrescu,
Anam Rani
Abstract:
Recently, N. Epstein and J. Shapiro introduced and studied the perinormal domains: those domains A whose going down overrings are flat A-modules. We show that every Prüfer v-multiplication domain is perinormal and has no proper lying over overrings. We also show that a treed perinormal domain is a Prüfer domain. We give two pull-back constructions that produce perinormal/non-perinormal domains.
Submitted 12 November, 2015;
originally announced November 2015.