Skip to main content

Showing 1–24 of 24 results for author: Dave, S

  1. arXiv:2407.06863  [pdf, other

    cs.CV

    Beyond Aesthetics: Cultural Competence in Text-to-Image Models

    Authors: Nithish Kannen, Arif Ahmad, Marco Andreetto, Vinodkumar Prabhakaran, Utsav Prabhu, Adji Bousso Dieng, Pushpak Bhattacharyya, Shachi Dave

    Abstract: Text-to-Image (T2I) models are being increasingly adopted in diverse global communities where they create visual representations of their unique cultures. Current T2I benchmarks primarily focus on faithfulness, aesthetics, and realism of generated images, overlooking the critical dimension of cultural competence. In this work, we introduce a framework to evaluate cultural competence of T2I models… ▽ More

    Submitted 11 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: 30 pages, 10 figures, preprint

  2. arXiv:2404.05866  [pdf, other

    cs.CL

    GeniL: A Multilingual Dataset on Generalizing Language

    Authors: Aida Mostafazadeh Davani, Sagar Gubbi, Sunipa Dev, Shachi Dave, Vinodkumar Prabhakaran

    Abstract: LLMs are increasingly transforming our digital ecosystem, but they often inherit societal biases learned from their training data, for instance stereotypes associating certain attributes with specific identity groups. While whether and how these biases are mitigated may depend on the specific use cases, being able to effectively detect instances of stereotype perpetuation is a crucial first step.… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  3. arXiv:2404.03602  [pdf, other

    cs.CL

    Evaluating LLMs at Detecting Errors in LLM Responses

    Authors: Ryo Kamoi, Sarkar Snigdha Sarathi Das, Renze Lou, Jihyun Janice Ahn, Yilun Zhao, Xiaoxin Lu, Nan Zhang, Yusen Zhang, Ranran Haoran Zhang, Sujeeth Reddy Vummanthala, Salika Dave, Shaobo Qin, Arman Cohan, Wenpeng Yin, Rui Zhang

    Abstract: With Large Language Models (LLMs) being widely used across various tasks, detecting errors in their responses is increasingly crucial. However, little research has been conducted on error detection of LLM responses. Collecting error annotations on LLM responses is challenging due to the subjective nature of many NLP tasks, and thus previous research focuses on tasks of little practical value (e.g.… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Benchmark and code: https://github.com/psunlpgroup/ReaLMistake

  4. arXiv:2403.05696  [pdf, other

    cs.CL cs.CV

    SeeGULL Multilingual: a Dataset of Geo-Culturally Situated Stereotypes

    Authors: Mukul Bhutani, Kevin Robinson, Vinodkumar Prabhakaran, Shachi Dave, Sunipa Dev

    Abstract: While generative multilingual models are rapidly being deployed, their safety and fairness evaluations are largely limited to resources collected in English. This is especially problematic for evaluations targeting inherently socio-cultural phenomena such as stereotyping, where it is important to build multi-lingual resources that reflect the stereotypes prevalent in respective language communitie… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  5. arXiv:2401.06310  [pdf, other

    cs.CV cs.CL cs.CY

    ViSAGe: A Global-Scale Analysis of Visual Stereotypes in Text-to-Image Generation

    Authors: Akshita Jha, Vinodkumar Prabhakaran, Remi Denton, Sarah Laszlo, Shachi Dave, Rida Qadri, Chandan K. Reddy, Sunipa Dev

    Abstract: Recent studies have shown that Text-to-Image (T2I) model generations can reflect social stereotypes present in the real world. However, existing approaches for evaluating stereotypes have a noticeable lack of coverage of global identity groups and their associated stereotypes. To address this gap, we introduce the ViSAGe (Visual Stereotypes Around the Globe) dataset to enable the evaluation of kno… ▽ More

    Submitted 14 July, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: Association for Computational Linguistics (ACL) 2024

  6. arXiv:2308.12600  [pdf, other

    cs.CV

    PoseSync: Robust pose based video synchronization

    Authors: Rishit Javia, Falak Shah, Shivam Dave

    Abstract: Pose based video sychronization can have applications in multiple domains such as gameplay performance evaluation, choreography or guiding athletes. The subject's actions could be compared and evaluated against those performed by professionals side by side. In this paper, we propose an end to end pipeline for synchronizing videos based on pose. The first step crops the region where the person pres… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

  7. arXiv:2307.10514  [pdf, other

    cs.CL cs.AI cs.HC

    Building Socio-culturally Inclusive Stereotype Resources with Community Engagement

    Authors: Sunipa Dev, Jaya Goyal, Dinesh Tewari, Shachi Dave, Vinodkumar Prabhakaran

    Abstract: With rapid development and deployment of generative language models in global settings, there is an urgent need to also scale our measurements of harm, not just in the number and types of harms covered, but also how well they account for local cultural contexts, including marginalized identities and the social biases experienced by them. Current evaluation paradigms are limited in their abilities… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

  8. arXiv:2305.11840  [pdf, other

    cs.CL cs.CY

    SeeGULL: A Stereotype Benchmark with Broad Geo-Cultural Coverage Leveraging Generative Models

    Authors: Akshita Jha, Aida Davani, Chandan K. Reddy, Shachi Dave, Vinodkumar Prabhakaran, Sunipa Dev

    Abstract: Stereotype benchmark datasets are crucial to detect and mitigate social stereotypes about groups of people in NLP models. However, existing datasets are limited in size and coverage, and are largely restricted to stereotypes prevalent in the Western society. This is especially problematic as language technologies gain hold across the globe. To address this gap, we present SeeGULL, a broad-coverage… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  9. arXiv:2305.10403  [pdf, other

    cs.CL cs.AI

    PaLM 2 Technical Report

    Authors: Rohan Anil, Andrew M. Dai, Orhan Firat, Melvin Johnson, Dmitry Lepikhin, Alexandre Passos, Siamak Shakeri, Emanuel Taropa, Paige Bailey, Zhifeng Chen, Eric Chu, Jonathan H. Clark, Laurent El Shafey, Yanping Huang, Kathy Meier-Hellstern, Gaurav Mishra, Erica Moreira, Mark Omernick, Kevin Robinson, Sebastian Ruder, Yi Tay, Kefan Xiao, Yuanzhong Xu, Yujing Zhang, Gustavo Hernandez Abrego , et al. (103 additional authors not shown)

    Abstract: We introduce PaLM 2, a new state-of-the-art language model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. PaLM 2 is a Transformer-based model trained using a mixture of objectives. Through extensive evaluations on English and multilingual language, and reasoning tasks, we demonstrate that PaLM 2 has significantly improved quality on… ▽ More

    Submitted 13 September, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  10. arXiv:2211.11206  [pdf, other

    cs.CL cs.AI cs.CY

    Cultural Re-contextualization of Fairness Research in Language Technologies in India

    Authors: Shaily Bhatt, Sunipa Dev, Partha Talukdar, Shachi Dave, Vinodkumar Prabhakaran

    Abstract: Recent research has revealed undesirable biases in NLP data and models. However, these efforts largely focus on social disparities in the West, and are not directly portable to other geo-cultural contexts. In this position paper, we outline a holistic research agenda to re-contextualize NLP fairness research for the Indian context, accounting for Indian societal context, bridging technological gap… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: Accepted to NeurIPS Workshop on "Cultures in AI/AI in Culture". This is a non-archival short version, to cite please refer to our complete paper: arXiv:2209.12226

  11. arXiv:2210.07313  [pdf, other

    cs.CL cs.LG

    Bootstrapping Multilingual Semantic Parsers using Large Language Models

    Authors: Abhijeet Awasthi, Nitish Gupta, Bidisha Samanta, Shachi Dave, Sunita Sarawagi, Partha Talukdar

    Abstract: Despite cross-lingual generalization demonstrated by pre-trained multilingual models, the translate-train paradigm of transferring English datasets across multiple languages remains to be a key mechanism for training task-specific multilingual models. However, for many low-resource languages, the availability of a reliable translation service entails significant amounts of costly human-annotated t… ▽ More

    Submitted 11 February, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: EACL-23

  12. arXiv:2209.12226  [pdf, other

    cs.CL cs.CY

    Re-contextualizing Fairness in NLP: The Case of India

    Authors: Shaily Bhatt, Sunipa Dev, Partha Talukdar, Shachi Dave, Vinodkumar Prabhakaran

    Abstract: Recent research has revealed undesirable biases in NLP data and models. However, these efforts focus on social disparities in West, and are not directly portable to other geo-cultural contexts. In this paper, we focus on NLP fair-ness in the context of India. We start with a brief account of the prominent axes of social disparities in India. We build resources for fairness evaluation in the Indian… ▽ More

    Submitted 21 November, 2022; v1 submitted 25 September, 2022; originally announced September 2022.

    Comments: Accepted to AACL-IJCNLP 2022

  13. arXiv:2209.06767  [pdf, other

    cs.CL

    Parameter-Efficient Finetuning for Robust Continual Multilingual Learning

    Authors: Kartikeya Badola, Shachi Dave, Partha Talukdar

    Abstract: We introduce and study the problem of Continual Multilingual Learning (CML) where a previously trained multilingual model is periodically updated using new data arriving in stages. If the new data is present only in a subset of languages, we find that the resulting model shows improved performance only on the languages included in the latest update (and a few closely related languages) while its p… ▽ More

    Submitted 28 August, 2023; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: Published at ACL Findings 2023

  14. arXiv:2204.09514  [pdf, other

    cs.AR cs.CR cs.CV cs.DC cs.LG

    Special Session: Towards an Agile Design Methodology for Efficient, Reliable, and Secure ML Systems

    Authors: Shail Dave, Alberto Marchisio, Muhammad Abdullah Hanif, Amira Guesmi, Aviral Shrivastava, Ihsen Alouani, Muhammad Shafique

    Abstract: The real-world use cases of Machine Learning (ML) have exploded over the past few years. However, the current computing infrastructure is insufficient to support all real-world applications and scenarios. Apart from high efficiency requirements, modern ML systems are expected to be highly reliable against hardware failures as well as secure against adversarial and IP stealing attacks. Privacy conc… ▽ More

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: Appears at 40th IEEE VLSI Test Symposium (VTS 2022), 14 pages

  15. arXiv:2103.10730  [pdf, other

    cs.CL

    MuRIL: Multilingual Representations for Indian Languages

    Authors: Simran Khanuja, Diksha Bansal, Sarvesh Mehtani, Savya Khosla, Atreyee Dey, Balaji Gopalan, Dilip Kumar Margam, Pooja Aggarwal, Rajiv Teja Nagipogu, Shachi Dave, Shruti Gupta, Subhash Chandra Bose Gali, Vish Subramanian, Partha Talukdar

    Abstract: India is a multilingual society with 1369 rationalized languages and dialects being spoken across the country (INDIA, 2011). Of these, the 22 scheduled languages have a staggering total of 1.17 billion speakers and 121 languages have more than 10,000 speakers (INDIA, 2011). India also has the second largest (and an ever growing) digital footprint (Statista, 2020). Despite this, today's state-of-th… ▽ More

    Submitted 2 April, 2021; v1 submitted 19 March, 2021; originally announced March 2021.

  16. arXiv:2101.06511  [pdf, other

    cs.LG cs.AI

    Towards Searching Efficient and Accurate Neural Network Architectures in Binary Classification Problems

    Authors: Yigit Alparslan, Ethan Jacob Moyer, Isamu Mclean Isozaki, Daniel Schwartz, Adam Dunlop, Shesh Dave, Edward Kim

    Abstract: In recent years, deep neural networks have had great success in machine learning and pattern recognition. Architecture size for a neural network contributes significantly to the success of any neural network. In this study, we optimize the selection process by investigating different search algorithms to find a neural network architecture size that yields the highest accuracy. We apply binary sear… ▽ More

    Submitted 16 January, 2021; originally announced January 2021.

    Comments: 8 pages, 11 figures

  17. arXiv:2010.12940  [pdf, other

    cs.CL

    Neural Compound-Word (Sandhi) Generation and Splitting in Sanskrit Language

    Authors: Sushant Dave, Arun Kumar Singh, Dr. Prathosh A. P., Prof. Brejesh Lall

    Abstract: This paper describes neural network based approaches to the process of the formation and splitting of word-compounding, respectively known as the Sandhi and Vichchhed, in Sanskrit language. Sandhi is an important idea essential to morphological analysis of Sanskrit texts. Sandhi leads to word transformations at word boundaries. The rules of Sandhi formation are well defined but complex, sometimes… ▽ More

    Submitted 24 October, 2020; originally announced October 2020.

    Comments: 6 pages, 3 figures, CODS-COMAD 2021, IIIT Bangalore, India

  18. arXiv:2010.12937  [pdf, other

    cs.CL

    A Benchmark Corpus and Neural Approach for Sanskrit Derivative Nouns Analysis

    Authors: Arun Kumar Singh, Sushant Dave, Dr. Prathosh A. P., Prof. Brejesh Lall, Shresth Mehta

    Abstract: This paper presents first benchmark corpus of Sanskrit Pratyaya (suffix) and inflectional words (padas) formed due to suffixes along with neural network based approaches to process the formation and splitting of inflectional words. Inflectional words spans the primary and secondary derivative nouns as the scope of current work. Pratyayas are an important dimension of morphological analysis of Sans… ▽ More

    Submitted 24 October, 2020; originally announced October 2020.

    Comments: 6 pages, 2 figures, EACL 2021 Submission

  19. arXiv:2008.01674  [pdf

    stat.ML cs.LG physics.soc-ph

    A Machine Learning Approach for Modelling Parking Duration in Urban Land-use

    Authors: Janak Parmar, Pritikana Das, Sanjaykumar Dave

    Abstract: Parking is an inevitable issue in the fast-growing developing countries. Increasing number of vehicles require more and more urban land to be allocated for parking. However, a little attention has been conferred to the parking issues in developing countries like India. This study proposes a model for analysing the influence of car users' socioeconomic and travel characteristics on parking duration… ▽ More

    Submitted 10 October, 2023; v1 submitted 4 August, 2020; originally announced August 2020.

    Journal ref: Physica A: Statistical Mechanics and its Applications, 2021

  20. arXiv:2007.00864  [pdf, other

    cs.AR cs.CV cs.DC cs.LG cs.NE

    Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights

    Authors: Shail Dave, Riyadh Baghdadi, Tony Nowatzki, Sasikanth Avancha, Aviral Shrivastava, Baoxin Li

    Abstract: Machine learning (ML) models are widely used in many important domains. For efficiently processing these computational- and memory-intensive applications, tensors of these over-parameterized models are compressed by leveraging sparsity, size reduction, and quantization of tensors. Unstructured sparsity and tensors with varying dimensions yield irregular computation, communication, and memory acces… ▽ More

    Submitted 22 July, 2021; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: Accepted for publication in Proceedings of the IEEE

  21. Visual Appearance Based Person Retrieval in Unconstrained Environment Videos

    Authors: Hiren Galiyawala, Mehul S Raval, Shivansh Dave

    Abstract: Visual appearance-based person retrieval is a challenging problem in surveillance. It uses attributes like height, cloth color, cloth type and gender to describe a human. Such attributes are known as soft biometrics. This paper proposes person retrieval from surveillance video using height, torso cloth type, torso cloth color and gender. The approach introduces an adaptive torso patch extraction a… ▽ More

    Submitted 31 October, 2019; originally announced October 2019.

    Comments: 11 pages

    Journal ref: Image and Vision Computing, 2019

  22. arXiv:1804.08774  [pdf, other

    cs.LG cs.SI stat.ML

    Neural-Brane: Neural Bayesian Personalized Ranking for Attributed Network Embedding

    Authors: Vachik S. Dave, Baichuan Zhang, Pin-Yu Chen, Mohammad Al Hasan

    Abstract: Network embedding methodologies, which learn a distributed vector representation for each vertex in a network, have attracted considerable interest in recent years. Existing works have demonstrated that vertex representation learned through an embedding method provides superior performance in many real-world applications, such as node classification, link prediction, and community detection. Howev… ▽ More

    Submitted 20 August, 2018; v1 submitted 23 April, 2018; originally announced April 2018.

  23. arXiv:1602.01537  [pdf, ps, other

    cs.DB

    TopCom: Index for Shortest Distance Query in Directed Graph

    Authors: Vachik S. Dave, Mohammad Al Hasan

    Abstract: Finding shortest distance between two vertices in a graph is an important problem due to its numerous applications in diverse domains, including geo-spatial databases, social network analysis, and information retrieval. Classical algorithms (such as, Dijkstra) solve this problem in polynomial time, but these algorithms cannot provide real-time response for a large number of bursty queries on a lar… ▽ More

    Submitted 4 December, 2016; v1 submitted 3 February, 2016; originally announced February 2016.

  24. Efficacy of Attack detection capability of IDPS based on it's deployment in wired and wireless environment

    Authors: Shalvi Dave, Bhushan Trivedi, Jimit Mahadevia

    Abstract: Intrusion Detection and/or Prevention Systems (IDPS) represent an important line of defence against a variety of attacks that can compromise the security and proper functioning of an enterprise information system. Along with the widespread evolution of new emerging services, the quantity and impact of attacks have continuously increased, attackers continuously find vulnerabilities at various level… ▽ More

    Submitted 18 April, 2013; originally announced April 2013.

    Comments: 13 pages, 10 figures

    Journal ref: International Journal of Network Security & Its Applications (IJNSA), Vol.5, No.2, March 2013