Showing 1–13 of 13 results for author: Tanwani, A

  1. arXiv:2212.13138 [pdf, other]

    cs.CL

    Large Language Models Encode Clinical Knowledge

    Authors: Karan Singhal, Shekoofeh Azizi, Tao Tu, S. Sara Mahdavi, Jason Wei, Hyung Won Chung, Nathan Scales, Ajay Tanwani, Heather Cole-Lewis, Stephen Pfohl, Perry Payne, Martin Seneviratne, Paul Gamble, Chris Kelly, Nathaneal Scharli, Aakanksha Chowdhery, Philip Mansfield, Blaise Aguera y Arcas, Dale Webster, Greg S. Corrado, Yossi Matias, Katherine Chou, Juraj Gottweis, Nenad Tomasev, Yun Liu, et al. (5 additional authors not shown)

    Abstract: Large language models (LLMs) have demonstrated impressive capabilities in natural language understanding and generation, but the quality bar for medical and clinical applications is high. Today, attempts to assess models' clinical knowledge typically rely on automated evaluations on limited benchmarks. There is no standard to evaluate model predictions and reasoning across a breadth of tasks. To a…

    Submitted 26 December, 2022; originally announced December 2022.

  2. RepsNet: Combining Vision with Language for Automated Medical Reports

    Authors: Ajay Kumar Tanwani, Joelle Barral, Daniel Freedman

    Abstract: Writing reports by analyzing medical images is error-prone for inexperienced practitioners and time consuming for experienced ones. In this work, we present RepsNet that adapts pre-trained vision and language models to interpret medical images and generate automated reports in natural language. RepsNet consists of an encoder-decoder model: the encoder aligns the images with natural language descri…

    Submitted 27 September, 2022; originally announced September 2022.

    Journal ref: MICCAI 2022, pp. 714–724

  3. arXiv:2102.09754 [pdf, other]

    cs.RO cs.AI cs.CV

    VisuoSpatial Foresight for Physical Sequential Fabric Manipulation

    Authors: Ryan Hoque, Daniel Seita, Ashwin Balakrishna, Aditya Ganapathi, Ajay Kumar Tanwani, Nawid Jamali, Katsu Yamane, Soshi Iba, Ken Goldberg

    Abstract: Robotic fabric manipulation has applications in home robotics, textiles, senior care and surgery. Existing fabric manipulation techniques, however, are designed for specific tasks, making it difficult to generalize across different but related tasks. We build upon the Visual Foresight framework to learn fabric dynamics that can be efficiently reused to accomplish different sequential fabric manipu…

    Submitted 20 July, 2021; v1 submitted 19 February, 2021; originally announced February 2021.

    Comments: Journal extension of prior work on VSF to appear in Autonomous Robots S.I. 207. arXiv admin note: text overlap with arXiv:2003.09044

  4. arXiv:2011.07589 [pdf, other]

    cs.CV cs.RO

    DIRL: Domain-Invariant Representation Learning for Sim-to-Real Transfer

    Authors: Ajay Kumar Tanwani

    Abstract: Generating large-scale synthetic data in simulation is a feasible alternative to collecting/labelling real data for training vision-based deep learning models, albeit the modelling inaccuracies do not generalize to the physical world. In this paper, we present a domain-invariant representation learning (DIRL) algorithm to adapt deep models to the physical environment with a small amount of real da…

    Submitted 7 January, 2021; v1 submitted 15 November, 2020; originally announced November 2020.

    Comments: 4th Conference on Robot Learning (CoRL), 2020 [plenary talk, Best System Paper Award Finalist]

  5. arXiv:2007.10420 [pdf, other]

    cs.RO cs.AI

    Non-Markov Policies to Reduce Sequential Failures in Robot Bin Picking

    Authors: Kate Sanders, Michael Danielczuk, Jeffrey Mahler, Ajay Tanwani, Ken Goldberg

    Abstract: A new generation of automated bin picking systems using deep learning is evolving to support increasing demand for e-commerce. To accommodate a wide variety of products, many automated systems include multiple gripper types and/or tool changers. However, for some objects, sequential grasp failures are common: when a computed grasp fails to lift and remove the object, the bin is often left unchange…

    Submitted 20 July, 2020; originally announced July 2020.

    Comments: 2020 IEEE International Conference on Automation Science and Engineering (CASE)

    ACM Class: I.2.9

  6. arXiv:2006.00545 [pdf, other]

    cs.RO cs.CV

    Motion2Vec: Semi-Supervised Representation Learning from Surgical Videos

    Authors: Ajay Kumar Tanwani, Pierre Sermanet, Andy Yan, Raghav Anand, Mariano Phielipp, Ken Goldberg

    Abstract: Learning meaningful visual representations in an embedding space can facilitate generalization in downstream tasks such as action segmentation and imitation. In this paper, we learn a motion-centric representation of surgical video demonstrations by grouping them into action segments/sub-goals/options in a semi-supervised manner. We present Motion2Vec, an algorithm that learns a deep embedding fea…

    Submitted 31 May, 2020; originally announced June 2020.

    Comments: IEEE International Conference on Robotics and Automation (ICRA), 2020

  7. arXiv:2003.09044 [pdf, other]

    cs.RO cs.AI cs.CV

    VisuoSpatial Foresight for Multi-Step, Multi-Task Fabric Manipulation

    Authors: Ryan Hoque, Daniel Seita, Ashwin Balakrishna, Aditya Ganapathi, Ajay Kumar Tanwani, Nawid Jamali, Katsu Yamane, Soshi Iba, Ken Goldberg

    Abstract: Robotic fabric manipulation has applications in home robotics, textiles, senior care and surgery. Existing fabric manipulation techniques, however, are designed for specific tasks, making it difficult to generalize across different but related tasks. We extend the Visual Foresight framework to learn fabric dynamics that can be efficiently reused to accomplish different fabric manipulation tasks wi…

    Submitted 18 February, 2021; v1 submitted 19 March, 2020; originally announced March 2020.

    Comments: Robotics: Science and Systems (RSS) 2020

  8. arXiv:1910.04854 [pdf, other]

    cs.RO cs.AI cs.CV

    Deep Imitation Learning of Sequential Fabric Smoothing From an Algorithmic Supervisor

    Authors: Daniel Seita, Aditya Ganapathi, Ryan Hoque, Minho Hwang, Edward Cen, Ajay Kumar Tanwani, Ashwin Balakrishna, Brijen Thananjeyan, Jeffrey Ichnowski, Nawid Jamali, Katsu Yamane, Soshi Iba, John Canny, Ken Goldberg

    Abstract: Sequential pulling policies to flatten and smooth fabrics have applications from surgery to manufacturing to home tasks such as bed making and folding clothes. Due to the complexity of fabric states and dynamics, we apply deep imitation learning to learn policies that, given color (RGB), depth (D), or combined color-depth (RGBD) images of a rectangular fabric sample, estimate pick points and pull…

    Submitted 2 March, 2020; v1 submitted 23 September, 2019; originally announced October 2019.

    Comments: Supplementary material is available at https://sites.google.com/view/fabric-smoothing ; Version 2 has significant improvements with new results and figures

  9. arXiv:1903.09589 [pdf, other]

    cs.RO

    A Fog Robotics Approach to Deep Robot Learning: Application to Object Recognition and Grasp Planning in Surface Decluttering

    Authors: Ajay Kumar Tanwani, Nitesh Mor, John Kubiatowicz, Joseph E. Gonzalez, Ken Goldberg

    Abstract: The growing demand of industrial, automotive and service robots presents a challenge to the centralized Cloud Robotics model in terms of privacy, security, latency, bandwidth, and reliability. In this paper, we present a `Fog Robotics' approach to deep robot learning that distributes compute, storage and networking resources between the Cloud and the Edge in a federated manner. Deep models are tra…

    Submitted 22 March, 2019; originally announced March 2019.

    Comments: IEEE International Conference on Robotics and Automation, ICRA, 2019

  10. arXiv:1811.07489 [pdf, ps, other]

    cs.RO

    Generalizing Robot Imitation Learning with Invariant Hidden Semi-Markov Models

    Authors: Ajay Kumar Tanwani, Jonathan Lee, Brijen Thananjeyan, Michael Laskey, Sanjay Krishnan, Roy Fox, Ken Goldberg, Sylvain Calinon

    Abstract: Generalizing manipulation skills to new situations requires extracting invariant patterns from demonstrations. For example, the robot needs to understand the demonstrations at a higher level while being invariant to the appearance of the objects, geometric aspects of objects such as its position, size, orientation and viewpoint of the observer in the demonstrations. In this paper, we propose an al…

    Submitted 18 November, 2018; originally announced November 2018.

    Comments: accepted in WAFR 2018

  11. arXiv:1811.02184 [pdf, other]

    cs.RO cs.LG

    Dynamic Regret Convergence Analysis and an Adaptive Regularization Algorithm for On-Policy Robot Imitation Learning

    Authors: Jonathan N. Lee, Michael Laskey, Ajay Kumar Tanwani, Anil Aswani, Ken Goldberg

    Abstract: On-policy imitation learning algorithms such as DAgger evolve a robot control policy by executing it, measuring performance (loss), obtaining corrective feedback from a supervisor, and generating the next policy. As the loss between iterations can vary unpredictably, a fundamental question is under what conditions this process will eventually achieve a converged policy. If one assumes the underlyi…

    Submitted 8 July, 2019; v1 submitted 6 November, 2018; originally announced November 2018.

  12. arXiv:1809.09810 [pdf, other]

    cs.RO cs.AI

    Deep Transfer Learning of Pick Points on Fabric for Robot Bed-Making

    Authors: Daniel Seita, Nawid Jamali, Michael Laskey, Ajay Kumar Tanwani, Ron Berenstein, Prakash Baskaran, Soshi Iba, John Canny, Ken Goldberg

    Abstract: A fundamental challenge in manipulating fabric for clothes folding and textiles manufacturing is computing "pick points" to effectively modify the state of an uncertain manifold. We present a supervised deep transfer learning approach to locate pick points using depth images for invariance to color and texture. We consider the task of bed-making, where a robot sequentially grasps and pulls at pick…

    Submitted 16 September, 2019; v1 submitted 26 September, 2018; originally announced September 2018.

    Comments: International Symposium on Robotics Research (ISRR) 2019. Expanded and revised version of arXiv:1711.02525 as well as earlier versions here under the title "Robot Bed-Making: Deep Transfer Learning Using Depth Sensing of Deformable Fabric". Project website at https://sites.google.com/view/bed-make

  13. arXiv:1610.02468 [pdf, ps, other]

    cs.RO

    Small Variance Asymptotics for Non-Parametric Online Robot Learning

    Authors: Ajay Kumar Tanwani, Sylvain Calinon

    Abstract: Small variance asymptotics is emerging as a useful technique for inference in large scale Bayesian non-parametric mixture models. This paper analyses the online learning of robot manipulation tasks with Bayesian non-parametric mixture models under small variance asymptotics. The analysis yields a scalable online sequence clustering (SOSC) algorithm that is non-parametric in the number of clusters…

    Submitted 15 October, 2018; v1 submitted 7 October, 2016; originally announced October 2016.

    Comments: accepted in International Journal of Robotics Research (IJRR)