Showing 1–22 of 22 results for author: Natarajan, N

  1. arXiv:2406.11638  [pdf, other]

    cs.AI cs.SE

    MASAI: Modular Architecture for Software-engineering AI Agents

    Authors: Daman Arora, Atharv Sonwane, Nalin Wadhwa, Abhav Mehrotra, Saiteja Utpala, Ramakrishna Bairi, Aditya Kanade, Nagarajan Natarajan

    Abstract: A common method to solve complex problems in software engineering, is to divide the problem into multiple sub-problems. Inspired by this, we propose a Modular Architecture for Software-engineering AI (MASAI) agents, where different LLM-powered sub-agents are instantiated with well-defined objectives and strategies tuned to achieve those objectives. Our modular architecture offers several advantage…

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2406.10504  [pdf, other]

    cs.AI cs.CL cs.LG

    Task Facet Learning: A Structured Approach to Prompt Optimization

    Authors: Gurusha Juneja, Nagarajan Natarajan, Hua Li, Jian Jiao, Amit Sharma

    Abstract: Given a task in the form of a basic description and its training examples, prompt optimization is the problem of synthesizing the given information into a text prompt for a large language model (LLM). Humans solve this problem by also considering the different facets that define a task (e.g., counter-examples, explanations, analogies) and including them in the prompt. However, it is unclear whethe…

    Submitted 15 June, 2024; originally announced June 2024.

  3. arXiv:2403.00409  [pdf, other]

    cs.LG cs.CL

    Provably Robust DPO: Aligning Language Models with Noisy Feedback

    Authors: Sayak Ray Chowdhury, Anush Kini, Nagarajan Natarajan

    Abstract: Learning from preference-based feedback has recently gained traction as a promising approach to align language models with human interests. While these aligned generative models have demonstrated impressive capabilities across various tasks, their dependence on high-quality human preference data poses a bottleneck in practical applications. Specifically, noisy (incorrect and ambiguous) preference…

    Submitted 11 April, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

  4. arXiv:2401.15963  [pdf, other]

    cs.SE cs.AI cs.CL cs.LG

    NoFunEval: Funny How Code LMs Falter on Requirements Beyond Functional Correctness

    Authors: Manav Singhal, Tushar Aggarwal, Abhijeet Awasthi, Nagarajan Natarajan, Aditya Kanade

    Abstract: Existing evaluation benchmarks of language models of code (code LMs) focus almost exclusively on whether the LMs can generate functionally-correct code. In real-world software engineering, developers think beyond functional correctness. They have requirements on "how" a functionality should be implemented to meet overall system design objectives like efficiency, security, and maintainability. They…

    Submitted 2 February, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: Preprint

  5. arXiv:2310.20158  [pdf, other]

    cs.CL

    GAR-meets-RAG Paradigm for Zero-Shot Information Retrieval

    Authors: Daman Arora, Anush Kini, Sayak Ray Chowdhury, Nagarajan Natarajan, Gaurav Sinha, Amit Sharma

    Abstract: Given a query and a document corpus, the information retrieval (IR) task is to output a ranked list of relevant documents. Combining large language models (LLMs) with embedding-based retrieval models, recent work shows promising results on the zero-shot retrieval problem, i.e., no access to labeled data from the target domain. Two such popular paradigms are generation-augmented retrieval or GAR (g…

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: preprint

  6. arXiv:2310.19733  [pdf, other]

    cs.LG cs.CR

    Differentially Private Reward Estimation with Preference Feedback

    Authors: Sayak Ray Chowdhury, Xingyu Zhou, Nagarajan Natarajan

    Abstract: Learning from preference-based feedback has recently gained considerable traction as a promising approach to align generative models with human interests. Instead of relying on numerical rewards, the generative models are trained using reinforcement learning with human feedback (RLHF). These approaches first solicit feedback from human labelers typically in the form of pairwise comparisons between…

    Submitted 30 October, 2023; originally announced October 2023.

  7. arXiv:2309.12938  [pdf, other]

    cs.AI cs.SE

    Frustrated with Code Quality Issues? LLMs can Help!

    Authors: Nalin Wadhwa, Jui Pradhan, Atharv Sonwane, Surya Prakash Sahu, Nagarajan Natarajan, Aditya Kanade, Suresh Parthasarathy, Sriram Rajamani

    Abstract: As software projects progress, quality of code assumes paramount importance as it affects reliability, maintainability and security of software. For this reason, static analysis tools are used in developer workflows to flag code quality issues. However, developers need to spend extra efforts to revise their code to improve code quality based on the tool findings. In this work, we investigate the u…

    Submitted 22 September, 2023; originally announced September 2023.

  8. arXiv:2307.12465  [pdf, other]

    cs.SE

    StaticFixer: From Static Analysis to Static Repair

    Authors: Naman Jain, Shubham Gandhi, Atharv Sonwane, Aditya Kanade, Nagarajan Natarajan, Suresh Parthasarathy, Sriram Rajamani, Rahul Sharma

    Abstract: Static analysis tools are traditionally used to detect and flag programs that violate properties. We show that static analysis tools can also be used to perturb programs that satisfy a property to construct variants that violate the property. Using this insight we can construct paired data sets of unsafe-safe program pairs, and learn strategies to automatically repair property violations. We prese…

    Submitted 23 July, 2023; originally announced July 2023.

  9. arXiv:2305.10374  [pdf, other]

    cs.HC

    Personalizing Content Moderation on Social Media: User Perspectives on Moderation Choices, Interface Design, and Labor

    Authors: Shagun Jhaver, Alice Qian Zhang, Quanze Chen, Nikhila Natarajan, Ruotong Wang, Amy Zhang

    Abstract: Social media platforms moderate content for each user by incorporating the outputs of both platform-wide content moderation systems and, in some cases, user-configured personal moderation preferences. However, it is unclear (1) how end users perceive the choices and affordances of different kinds of personal content moderation tools, and (2) how the introduction of personalization impacts user per…

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: Will appear in ACM CSCW

  10. arXiv:2303.13526  [pdf, ps, other]

    cs.HC

    Trust Explanations to Do What They Say

    Authors: Neil Natarajan, Reuben Binns, Jun Zhao, Nigel Shadbolt

    Abstract: How much are we to trust a decision made by an AI algorithm? Trusting an algorithm without cause may lead to abuse, and mistrusting it may similarly lead to disuse. Trust in an AI is only desirable if it is warranted; thus, calibrating trust is critical to ensuring appropriate use. In the name of calibrating trust appropriately, AI developers should provide contracts specifying use cases in which…

    Submitted 14 February, 2023; originally announced March 2023.

  11. arXiv:2202.13870  [pdf, other]

    cs.NI cs.LG eess.SY

    Simulating Network Paths with Recurrent Buffering Units

    Authors: Divyam Anshumaan, Sriram Balasubramanian, Shubham Tiwari, Nagarajan Natarajan, Sundararajan Sellamanickam, Venkata N. Padmanabhan

    Abstract: Simulating physical network paths (e.g., Internet) is a cornerstone research problem in the emerging sub-field of AI-for-networking. We seek a model that generates end-to-end packet delay values in response to the time-varying load offered by a sender, which is typically a function of the previously output delays. The problem setting is unique, and renders the state-of-the-art text and time-series…

    Submitted 6 December, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

    Comments: Accepted in AAAI 2023, 19 pages, 14 figures

  12. arXiv:2112.02969  [pdf, other]

    cs.SE cs.PL

    Jigsaw: Large Language Models meet Program Synthesis

    Authors: Naman Jain, Skanda Vaidyanath, Arun Iyer, Nagarajan Natarajan, Suresh Parthasarathy, Sriram Rajamani, Rahul Sharma

    Abstract: Large pre-trained language models such as GPT-3, Codex, and Google's language model are now capable of generating code from natural language specifications of programmer intent. We view these developments with a mixture of optimism and caution. On the optimistic side, such large language models have the potential to improve productivity by providing an automated AI pair programmer for every progra…

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Accepted to ICSE'22

  13. arXiv:2102.07567  [pdf, other]

    cs.LG

    Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent

    Authors: Ajaykrishna Karthikeyan, Naman Jain, Nagarajan Natarajan, Prateek Jain

    Abstract: Decision trees provide a rich family of highly non-linear but efficient models, due to which they continue to be the go-to family of predictive models by practitioners across domains. But learning trees is challenging due to their discrete decision boundaries. The state-of-the-art (SOTA) techniques resort to (a) learning \textit{soft} trees thereby losing logarithmic inference time; or (b) using m…

    Submitted 30 September, 2022; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: Accepted to TMLR

  14. arXiv:2102.07387  [pdf, other]

    cs.LG

    Optimal Regret Algorithm for Pseudo-1d Bandit Convex Optimization

    Authors: Aadirupa Saha, Nagarajan Natarajan, Praneeth Netrapalli, Prateek Jain

    Abstract: We study online learning with bandit feedback (i.e. learner has access to only zeroth-order oracle) where cost/reward functions $\f_t$ admit a "pseudo-1d" structure, i.e. $\f_t(\w) = \loss_t(\pred_t(\w))$ where the output of $\pred_t$ is one-dimensional. At each round, the learner observes context $\x_t$, plays prediction $\pred_t(\w_t; \x_t)$ (e.g. $\pred_t(\cdot)=\langle \x_t, \cdot\rangle$) for…

    Submitted 15 February, 2021; originally announced February 2021.

  15. arXiv:2007.06835  [pdf, other]

    cs.LG cs.AI cs.PL cs.SE stat.ML

    Programming by Rewards

    Authors: Nagarajan Natarajan, Ajaykrishna Karthikeyan, Prateek Jain, Ivan Radicek, Sriram Rajamani, Sumit Gulwani, Johannes Gehrke

    Abstract: We formalize and study ``programming by rewards'' (PBR), a new approach for specifying and synthesizing subroutines for optimizing some quantitative metric such as performance, resource utilization, or correctness over a benchmark. A PBR specification consists of (1) input features $x$, and (2) a reward function $r$, modeled as a black-box component (which we can only run), that assigns a reward f…

    Submitted 14 July, 2020; originally announced July 2020.

  16. arXiv:1911.11728  [pdf, other]

    cs.LG cs.PL cs.SE stat.ML

    On Scaling Data-Driven Loop Invariant Inference

    Authors: Sahil Bhatia, Saswat Padhi, Nagarajan Natarajan, Rahul Sharma, Prateek Jain

    Abstract: Automated synthesis of inductive invariants is an important problem in software verification. Once all the invariants have been specified, software verification reduces to checking of verification conditions. Although static analyses to infer invariants have been studied for over forty years, recent years have seen a flurry of data-driven invariant inference techniques which guess invariants from…

    Submitted 16 July, 2020; v1 submitted 26 November, 2019; originally announced November 2019.

  17. arXiv:1709.05976  [pdf, ps, other]

    cs.LG cs.AI

    Leveraging Distributional Semantics for Multi-Label Learning

    Authors: Rahul Wadbude, Vivek Gupta, Piyush Rai, Nagarajan Natarajan, Harish Karnick, Prateek Jain

    Abstract: We present a novel and scalable label embedding framework for large-scale multi-label learning a.k.a ExMLDS (Extreme Multi-Label Learning using Distributional Semantics). Our approach draws inspiration from ideas rooted in distributional semantics, specifically the Skip Gram Negative Sampling (SGNS) approach, widely used to learn word embeddings for natural language processing tasks. Learning such…

    Submitted 10 November, 2017; v1 submitted 18 September, 2017; originally announced September 2017.

    Comments: 10 Pages, 0 Figures, Missing Result Joint Learning Included

  18. arXiv:1606.02077  [pdf, ps, other]

    cs.LG stat.ML

    Regret Bounds for Non-decomposable Metrics with Missing Labels

    Authors: Prateek Jain, Nagarajan Natarajan

    Abstract: We consider the problem of recommending relevant labels (items) for a given data point (user). In particular, we are interested in the practically important setting where the evaluation is with respect to non-decomposable (over labels) performance metrics like the $F_1$ measure, and the training data has missing labels. To this end, we propose a generic framework that given a performance metric…

    Submitted 7 June, 2016; originally announced June 2016.

  19. arXiv:1605.00751  [pdf, other]

    cs.LG

    Learning from Binary Labels with Instance-Dependent Corruption

    Authors: Aditya Krishna Menon, Brendan van Rooyen, Nagarajan Natarajan

    Abstract: Suppose we have a sample of instances paired with binary labels corrupted by arbitrary instance- and label-dependent noise. With sufficiently many such samples, can we optimally classify and rank instances with respect to the noise-free distribution? We provide a theoretical analysis of this question, with three main contributions. First, we prove that for instance-dependent noise, any algorithm t…

    Submitted 4 May, 2016; v1 submitted 3 May, 2016; originally announced May 2016.

  20. arXiv:1505.01802  [pdf, ps, other]

    cs.LG stat.ML

    Optimal Decision-Theoretic Classification Using Non-Decomposable Performance Metrics

    Authors: Nagarajan Natarajan, Oluwasanmi Koyejo, Pradeep Ravikumar, Inderjit S. Dhillon

    Abstract: We provide a general theoretical analysis of expected out-of-sample utility, also referred to as decision-theoretic classification, for non-decomposable binary classification metrics such as F-measure and Jaccard coefficient. Our key result is that the expected out-of-sample utility for many performance metrics is provably optimized by a classifier which is equivalent to a signed thresholding of t…

    Submitted 7 May, 2015; originally announced May 2015.

  21. arXiv:1411.6081  [pdf, other]

    cs.LG math.NA stat.ML

    PU Learning for Matrix Completion

    Authors: Cho-Jui Hsieh, Nagarajan Natarajan, Inderjit S. Dhillon

    Abstract: In this paper, we consider the matrix completion problem when the observations are one-bit measurements of some underlying matrix M, and in particular the observed samples consist only of ones and no zeros. This problem is motivated by modern applications such as recommender systems and social networks where only "likes" or "friendships" are observed. The problem of learning from only positive and…

    Submitted 21 November, 2014; originally announced November 2014.

  22. arXiv:1302.5145  [pdf, ps, other]

    cs.SI cs.LG

    Prediction and Clustering in Signed Networks: A Local to Global Perspective

    Authors: Kai-Yang Chiang, Cho-Jui Hsieh, Nagarajan Natarajan, Ambuj Tewari, Inderjit S. Dhillon

    Abstract: The study of social networks is a burgeoning research area. However, most existing work deals with networks that simply encode whether relationships exist or not. In contrast, relationships in signed networks can be positive ("like", "trust") or negative ("dislike", "distrust"). The theory of social balance shows that signed networks tend to conform to some local patterns that, in turn, induce cer…

    Submitted 4 March, 2013; v1 submitted 20 February, 2013; originally announced February 2013.