Showing 1–34 of 34 results for author: Gandhi, D

  1. arXiv:2407.08855  [pdf, other]

    eess.IV cs.CV

    BraTS-PEDs: Results of the Multi-Consortium International Pediatric Brain Tumor Segmentation Challenge 2023

    Authors: Anahita Fathi Kazerooni, Nastaran Khalili, Xinyang Liu, Debanjan Haldar, Zhifan Jiang, Anna Zapaishchykova, Julija Pavaine, Lubdha M. Shah, Blaise V. Jones, Nakul Sheth, Sanjay P. Prabhu, Aaron S. McAllister, Wenxin Tu, Khanak K. Nandolia, Andres F. Rodriguez, Ibraheem Salman Shaikh, Mariana Sanchez Montano, Hollie Anne Lai, Maruf Adewole, Jake Albrecht, Udunna Anazodo, Hannah Anderson, Syed Muhammed Anwar, Alejandro Aristizabal, Sina Bagheri, et al. (54 additional authors not shown)

    Abstract: Pediatric central nervous system tumors are the leading cause of cancer-related deaths in children. The five-year survival rate for high-grade glioma in children is less than 20%. The development of new treatments is dependent upon multi-institutional collaborative clinical trials requiring reproducible and accurate centralized response assessment. We present the results of the BraTS-PEDs 2023 cha…

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2405.07518  [pdf, other]

    cs.AR cs.AI

    SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts

    Authors: Raghu Prabhakar, Ram Sivaramakrishnan, Darshan Gandhi, Yun Du, Mingran Wang, Xiangyu Song, Kejie Zhang, Tianren Gao, Angela Wang, Karen Li, Yongning Sheng, Joshua Brot, Denis Sokolov, Apurv Vivek, Calvin Leung, Arjun Sabnis, Jiayu Bai, Tuowen Zhao, Mark Gottscho, David Jackson, Mark Luttrell, Manish K. Shah, Edison Chen, Kaizhao Liang, Swayambhoo Jain, et al. (5 additional authors not shown)

    Abstract: Monolithic large language models (LLMs) like GPT-4 have paved the way for modern generative AI applications. Training, serving, and maintaining monolithic LLMs at scale, however, remains prohibitively expensive and challenging. The disproportionate increase in compute-to-memory ratio of modern AI accelerators have created a memory wall, necessitating new methods to deploy AI. Composition of Expert…

    Submitted 13 May, 2024; originally announced May 2024.

  3. arXiv:2404.15009  [pdf, other]

    cs.CV eess.IV

    The Brain Tumor Segmentation in Pediatrics (BraTS-PEDs) Challenge: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs)

    Authors: Anahita Fathi Kazerooni, Nastaran Khalili, Xinyang Liu, Deep Gandhi, Zhifan Jiang, Syed Muhammed Anwar, Jake Albrecht, Maruf Adewole, Udunna Anazodo, Hannah Anderson, Ujjwal Baid, Timothy Bergquist, Austin J. Borja, Evan Calabrese, Verena Chung, Gian-Marco Conte, Farouk Dako, James Eddy, Ivan Ezhov, Ariana Familiar, Keyvan Farahani, Andrea Franson, Anurag Gottipati, Shuvanjan Haldar, Juan Eugenio Iglesias, et al. (46 additional authors not shown)

    Abstract: Pediatric tumors of the central nervous system are the most common cause of cancer-related death in children. The five-year survival rate for high-grade gliomas in children is less than 20%. Due to their rarity, the diagnosis of these entities is often delayed, their treatment is mainly based on historic treatment concepts, and clinical trials require multi-institutional collaborations. Here we pr…

    Submitted 11 July, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2305.17033

  4. arXiv:2401.08404  [pdf]

    eess.IV cs.CV cs.LG physics.med-ph

    Training and Comparison of nnU-Net and DeepMedic Methods for Autosegmentation of Pediatric Brain Tumors

    Authors: Arastoo Vossough, Nastaran Khalili, Ariana M. Familiar, Deep Gandhi, Karthik Viswanathan, Wenxin Tu, Debanjan Haldar, Sina Bagheri, Hannah Anderson, Shuvanjan Haldar, Phillip B. Storm, Adam Resnick, Jeffrey B. Ware, Ali Nabavizadeh, Anahita Fathi Kazerooni

    Abstract: Brain tumors are the most common solid tumors and the leading cause of cancer-related death among children. Tumor segmentation is essential in surgical and treatment planning, and response assessment and monitoring. However, manual segmentation is time-consuming and has high inter-operator variability, underscoring the need for more efficient methods. We compared two deep learning-based 3D segment…

    Submitted 30 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

  5. arXiv:2309.08779  [pdf]

    q-bio.TO q-bio.QM

    Computational framework for the generation of one-dimensional vascular models accounting for uncertainty in networks extracted from medical images

    Authors: Michelle A Bartolo, Alyssa M Taylor-LaPole, Darsh Gandhi, Alexandria Johnson, Yaqi Li, Emma Slack, Isaiah Stevens, Zachary Turner, Justin D Weigand, Charles Puelz, Dirk Husmeier, Mette S Olufsen

    Abstract: Patient-specific computational modeling is a popular, non-invasive method to answer medical questions. Medical images are used to extract geometric domains necessary to create these models, providing a predictive tool for clinicians. However, in vivo imaging is subject to uncertainty, impacting vessel dimensions essential to the mathematical modeling process. While there are numerous programs avai…

    Submitted 8 May, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: 42 pages, 10 figures

  6. arXiv:2308.08680  [pdf, other]

    q-bio.BM

    Permutationally Invariant Networks for Enhanced Sampling (PINES): Discovery of Multi-Molecular and Solvent-Inclusive Collective Variables

    Authors: Nicholas S. M. Herringer, Siva Dasetty, Diya Gandhi, Junhee Lee, Andrew L. Ferguson

    Abstract: The typically rugged nature of molecular free energy landscapes can frustrate efficient sampling of the thermodynamically relevant phase space due to the presence of high free energy barriers. Enhanced sampling techniques can improve phase space exploration by accelerating sampling along particular collective variables (CVs). A number of techniques exist for data-driven discovery of CVs parameteri…

    Submitted 16 August, 2023; originally announced August 2023.

  7. arXiv:2305.12010  [pdf, other]

    cs.CE cond-mat.mtrl-sci cs.LG

    Chemellia: An Ecosystem for Atomistic Scientific Machine Learning

    Authors: Anant Thazhemadam, Dhairya Gandhi, Venkatasubramanian Viswanathan, Rachel C. Kurchin

    Abstract: Chemellia is an open-source framework for atomistic machine learning in the Julia programming language. The framework takes advantage of Julia's high speed as well as the ability to share and reuse code and interfaces through the paradigm of multiple dispatch. Chemellia is designed to make use of existing interfaces and avoid ``reinventing the wheel'' wherever possible. A key aspect of the Chemell…

    Submitted 19 May, 2023; originally announced May 2023.

  8. arXiv:2304.05511  [pdf, other]

    cs.LG

    Training Large Language Models Efficiently with Sparsity and Dataflow

    Authors: Venkat Srinivasan, Darshan Gandhi, Urmish Thakker, Raghu Prabhakar

    Abstract: Large foundation language models have shown their versatility in being able to be adapted to perform a wide variety of downstream tasks, such as text generation, sentiment analysis, semantic search etc. However, training such large foundational models is a non-trivial exercise that requires a significant amount of compute power and expertise from machine learning and systems experts. As models get…

    Submitted 11 April, 2023; originally announced April 2023.

  9. arXiv:2302.09243  [pdf, other]

    cs.LG cs.AI cs.CL

    A Federated Approach for Hate Speech Detection

    Authors: Jay Gala, Deep Gandhi, Jash Mehta, Zeerak Talat

    Abstract: Hate speech detection has been the subject of high research attention, due to the scale of content created on social media. In spite of the attention and the sensitive nature of the task, privacy preservation in hate speech detection has remained under-studied. The majority of research has focused on centralised machine learning infrastructures which risk leaking data. In this paper, we show that…

    Submitted 18 February, 2023; originally announced February 2023.

    Comments: EACL 2023 Main Conference (Short Paper)

  10. arXiv:2212.06952  [pdf, other]

    cond-mat.mtrl-sci

    Nonequilibrium Electrochemical Phase Maps: Beyond Butler-Volmer Kinetics

    Authors: Rachel C. Kurchin, Dhairya Gandhi, Venkatasubramanian Viswanathan

    Abstract: Electrochemical kinetics at electrode-electrolyte interfaces are crucial to understand high-rate behavior of energy storage devices. Phase transformation of electrodes is typically treated under equilibrium thermodynamic conditions, while realistic operation is at finite rates. Analyzing phase transformations under nonequilibrium conditions requires integrating nonlinear electrochemical kinetic mo…

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: main text + supplementary info (15+6 pages, 3+2 figures), as submitted to ACS Energy Letters

  11. arXiv:2211.06401  [pdf, other]

    cs.LG cs.CL

    A Federated Approach to Predicting Emojis in Hindi Tweets

    Authors: Deep Gandhi, Jash Mehta, Nirali Parekh, Karan Waghela, Lynette D'Mello, Zeerak Talat

    Abstract: The use of emojis affords a visual modality to, often private, textual communication. The task of predicting emojis however provides a challenge for machine learning as emoji use tends to cluster into the frequently used and the rarely used emojis. Much of the machine learning research on emoji use has focused on high resource languages and has conceptualised the task of predicting emojis around t…

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: EMNLP2022 Main Track Short Paper

  12. arXiv:2203.04698  [pdf, other]

    cs.LG q-bio.QM

    Score-Based Generative Models for Molecule Generation

    Authors: Dwaraknath Gnaneshwar, Bharath Ramsundar, Dhairya Gandhi, Rachel Kurchin, Venkatasubramanian Viswanathan

    Abstract: Recent advances in generative models have made exploring design spaces easier for de novo molecule generation. However, popular generative models like GANs and normalizing flows face challenges such as training instabilities due to adversarial training and architectural constraints, respectively. Score-based generative models sidestep these challenges by modelling the gradient of the log probabili…

    Submitted 7 March, 2022; originally announced March 2022.

  13. arXiv:2101.10384  [pdf, other]

    cs.RO cs.AI

    droidlet: modular, heterogenous, multi-modal agents

    Authors: Anurag Pratik, Soumith Chintala, Kavya Srinet, Dhiraj Gandhi, Rebecca Qian, Yuxuan Sun, Ryan Drew, Sara Elkafrawy, Anoushka Tiwari, Tucker Hart, Mary Williamson, Abhinav Gupta, Arthur Szlam

    Abstract: In recent years, there have been significant advances in building end-to-end Machine Learning (ML) systems that learn at scale. But most of these systems are: (a) isolated (perception, speech, or language only); (b) trained on static datasets. On the other hand, in the field of robotics, large-scale learning has always been difficult. Supervision is hard to gather and real world physical interacti…

    Submitted 25 January, 2021; originally announced January 2021.

  14. arXiv:2101.07931  [pdf, other]

    cs.CY cs.CR

    MIT SafePaths Card (MiSaCa): Augmenting Paper Based Vaccination Cards with Printed Codes

    Authors: Joseph Bae, Rohan Sukumaran, Sheshank Shankar, Saurish Srivastava, Rohan Iyer, Aryan Mahindra, Qamil Mirza, Maurizio Arseni, Anshuman Sharma, Saras Agrawal, Orna Mukhopadhyay, Colin Kang, Priyanshi Katiyar, Apurv Shekhar, Sifat Hasan, Krishnendu Dasgupta, Darshan Gandhi, Sethuramen TV, Parth Patwa, Ishaan Singh, Abhishek Singh, Ramesh Raskar

    Abstract: In this early draft, we describe a user-centric, card-based system for vaccine distribution. Our system makes use of digitally signed QR codes and their use for phased vaccine distribution, vaccine administration/record-keeping, immunization verification, and follow-up symptom reporting. Furthermore, we propose and describe a complementary scanner app system to be used by vaccination clinics, publ…

    Submitted 21 January, 2021; v1 submitted 19 January, 2021; originally announced January 2021.

    Comments: 8 pages, 4 Figures, 1 Table

  15. arXiv:2101.02556  [pdf, ps, other]

    cs.CR

    Spatial K-anonymity: A Privacy-preserving Method for COVID-19 Related Geospatial Technologies

    Authors: Rohan Iyer, Regina Rex, Kevin P. McPherson, Darshan Gandhi, Aryan Mahindra, Abhishek Singh, Ramesh Raskar

    Abstract: There is a growing need for spatial privacy considerations in the many geo-spatial technologies that have been created as solutions for COVID-19-related issues. Although effective geo-spatial technologies have already been rolled out, most have significantly sacrificed privacy for utility. In this paper, we explore spatial k-anonymity, a privacy-preserving method that can address this unnecessary…

    Submitted 4 January, 2021; originally announced January 2021.

  16. arXiv:2101.01693  [pdf, other]

    cs.CY

    COVID-19 Tests Gone Rogue: Privacy, Efficacy, Mismanagement and Misunderstandings

    Authors: Manuel Morales, Rachel Barbar, Darshan Gandhi, Sanskruti Landage, Joseph Bae, Arpita Vats, Jil Kothari, Sheshank Shankar, Rohan Sukumaran, Himi Mathur, Krutika Misra, Aishwarya Saxena, Parth Patwa, Sethuraman T. V., Maurizio Arseni, Shailesh Advani, Kasia Jakimowicz, Sunaina Anand, Priyanshi Katiyar, Ashley Mehra, Rohan Iyer, Srinidhi Murali, Aryan Mahindra, Mikhail Dmitrienko, Saurish Srivastava, et al. (5 additional authors not shown)

    Abstract: COVID-19 testing, the cornerstone for effective screening and identification of COVID-19 cases, remains paramount as an intervention tool to curb the spread of COVID-19 both at local and national levels. However, the speed at which the pandemic struck and the response was rolled out, the widespread impact on healthcare infrastructure, the lack of sufficient preparation within the public health sys…

    Submitted 7 May, 2021; v1 submitted 5 January, 2021; originally announced January 2021.

    Comments: 22 pages, 2 figures

  17. arXiv:2012.12263  [pdf, other]

    econ.GN physics.soc-ph

    Challenges of Equitable Vaccine Distribution in the COVID-19 Pandemic

    Authors: Joseph Bae, Darshan Gandhi, Jil Kothari, Sheshank Shankar, Jonah Bae, Parth Patwa, Rohan Sukumaran, Aviral Chharia, Sanjay Adhikesaven, Shloak Rathod, Irene Nandutu, Sethuraman TV, Vanessa Yu, Krutika Misra, Srinidhi Murali, Aishwarya Saxena, Kasia Jakimowicz, Vivek Sharma, Rohan Iyer, Ashley Mehra, Alex Radunsky, Priyanshi Katiyar, Ananthu James, Jyoti Dalal, Sunaina Anand, et al. (3 additional authors not shown)

    Abstract: The COVID-19 pandemic has led to a need for widespread and rapid vaccine development. As several vaccines have recently been approved for human use or are in different stages of development, governments across the world are preparing comprehensive guidelines for vaccine distribution and monitoring. In this early article, we identify challenges in logistics, health outcomes, user-centric matters, a…

    Submitted 27 April, 2022; v1 submitted 24 November, 2020; originally announced December 2020.

    Comments: 18 pages, 3 figures

  18. arXiv:2012.01772  [pdf, other]

    cs.CY

    Digital Landscape of COVID-19 Testing: Challenges and Opportunities

    Authors: Darshan Gandhi, Rohan Sukumaran, Priyanshi Katiyar, Alex Radunsky, Sunaina Anand, Shailesh Advani, Jil Kothari, Kasia Jakimowicz, Sheshank Shankar, Sethuraman T. V., Krutika Misra, Aishwarya Saxena, Sanskruti Landage, Richa Sonker, Parth Patwa, Aryan Mahindra, Mikhail Dmitrienko, Kanishka Vaish, Ashley Mehra, Srinidhi Murali, Rohan Iyer, Joseph Bae, Vivek Sharma, Abhishek Singh, Rachel Barbar, et al. (1 additional author not shown)

    Abstract: The COVID-19 Pandemic has left a devastating trail all over the world, in terms of loss of lives, economic decline, travel restrictions, trade deficit, and collapsing economy including real-estate, job loss, loss of health benefits, the decline in quality of access to care and services and overall quality of life. Immunization from the anticipated vaccines will not be the stand-alone guideline tha…

    Submitted 3 December, 2020; originally announced December 2020.

    Comments: 28 pages, 4 figures

  19. arXiv:2011.04426  [pdf, other]

    cond-mat.mtrl-sci cs.LG

    AutoMat: Accelerated Computational Electrochemical systems Discovery

    Authors: Emil Annevelink, Rachel Kurchin, Eric Muckley, Lance Kavalsky, Vinay I. Hegde, Valentin Sulzer, Shang Zhu, Jiankun Pu, David Farina, Matthew Johnson, Dhairya Gandhi, Adarsh Dave, Hongyi Lin, Alan Edelman, Bharath Ramsundar, James Saal, Christopher Rackauckas, Viral Shah, Bryce Meredig, Venkatasubramanian Viswanathan

    Abstract: Large-scale electrification is vital to addressing the climate crisis, but several scientific and technological challenges remain to fully electrify both the chemical industry and transportation. In both of these areas, new electrochemical materials will be critical, but their development currently relies heavily on human-time-intensive experimental trial and error and computationally expensive fi…

    Submitted 13 May, 2022; v1 submitted 3 November, 2020; originally announced November 2020.

    Comments: v1-3:4 pages, 1 figure, accepted to NeurIPS Climate Change and AI Workshop 2020, updating acknowledgements and citations v4: substantially updated content and author list, accepted to MRS Bulletin

  20. arXiv:2011.04202  [pdf, other]

    q-bio.OT

    Clinical Landscape of COVID-19 Testing: Difficult Choices

    Authors: Darshan Gandhi, Sanskruti Landage, Joseph Bae, Sheshank Shankar, Rohan Sukumaran, Parth Patwa, Sethuraman T V, Priyanshi Katiyar, Shailesh Advani, Rohan Iyer, Sunaina Anand, Aryan Mahindra, Rachel Barbar, Abhishek Singh, Ramesh Raskar

    Abstract: The coronavirus disease 2019 (COVID-19) pandemic has spread rapidly across the world, leading to enormous amounts of human death and economic loss. Until definitive preventive or curative measures are developed, policies regarding testing, contact tracing, and quarantine remain the best public health tools for curbing viral spread. Testing is a crucial component of these efforts, enabling the iden…

    Submitted 15 November, 2020; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: 9 pages, 12 figures

  21. arXiv:2008.04899  [pdf, other]

    cs.RO cs.CV cs.LG

    Visual Imitation Made Easy

    Authors: Sarah Young, Dhiraj Gandhi, Shubham Tulsiani, Abhinav Gupta, Pieter Abbeel, Lerrel Pinto

    Abstract: Visual imitation learning provides a framework for learning complex manipulation behaviors by leveraging human demonstrations. However, current interfaces for imitation such as kinesthetic teaching or teleoperation prohibitively restrict our ability to efficiently collect large-scale data in the wild. Obtaining such diverse demonstration data is paramount for the generalization of learned skills t…

    Submitted 11 August, 2020; originally announced August 2020.

  22. arXiv:2007.01851  [pdf, other]

    cs.RO cs.CV cs.LG

    Swoosh! Rattle! Thump! -- Actions that Sound

    Authors: Dhiraj Gandhi, Abhinav Gupta, Lerrel Pinto

    Abstract: Truly intelligent agents need to capture the interplay of all their senses to build a rich physical understanding of their world. In robotics, we have seen tremendous progress in using visual and tactile perception; however, we have often ignored a key sense: sound. This is primarily due to the lack of data that captures the interplay of action and sound. In this work, we perform the first large-s…

    Submitted 3 July, 2020; originally announced July 2020.

    Comments: To be presented at Robotics: Science and Systems 2020

  23. arXiv:2007.00643  [pdf, other]

    cs.CV cs.LG cs.RO

    Object Goal Navigation using Goal-Oriented Semantic Exploration

    Authors: Devendra Singh Chaplot, Dhiraj Gandhi, Abhinav Gupta, Ruslan Salakhutdinov

    Abstract: This work studies the problem of object goal navigation which involves navigating to an instance of the given object category in unseen environments. End-to-end learning-based navigation methods struggle at this task as they are ineffective at exploration and long-term planning. We propose a modular system called, `Goal-Oriented Semantic Exploration' which builds an episodic semantic map and uses…

    Submitted 1 July, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: Winner of the CVPR 2020 AI-Habitat Object Goal Navigation Challenge. See the project webpage at https://devendrachaplot.github.io/projects/semantic-exploration.html

  24. arXiv:2004.05155  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    Learning to Explore using Active Neural SLAM

    Authors: Devendra Singh Chaplot, Dhiraj Gandhi, Saurabh Gupta, Abhinav Gupta, Ruslan Salakhutdinov

    Abstract: This work presents a modular and hierarchical approach to learn policies for exploring 3D environments, called `Active Neural SLAM'. Our approach leverages the strengths of both classical and learning-based methods, by using analytical path planners with learned SLAM module, and global and local policies. The use of learning provides flexibility with respect to input modalities (in the SLAM module…

    Submitted 10 April, 2020; originally announced April 2020.

    Comments: Published in ICLR-2020. See the project webpage at https://devendrachaplot.github.io/projects/Neural-SLAM for supplementary videos. The code is available at https://github.com/devendrachaplot/Neural-SLAM

  25. arXiv:1910.03568  [pdf, other]

    cs.CV cs.RO

    Object-centric Forward Modeling for Model Predictive Control

    Authors: Yufei Ye, Dhiraj Gandhi, Abhinav Gupta, Shubham Tulsiani

    Abstract: We present an approach to learn an object-centric forward model, and show that this allows us to plan for sequences of actions to achieve distant desired goals. We propose to model a scene as a collection of objects, each with an explicit spatial location and implicit visual feature, and learn to model the effects of actions using random interaction data. Our model allows capturing the robot-objec…

    Submitted 8 October, 2019; originally announced October 2019.

  26. arXiv:1907.10955  [pdf, other]

    cs.RO astro-ph.IM

    Overview of Guidance, Navigation and Control System of the TeamIndus lunar lander

    Authors: Vishesh Vatsal, C. Barath, J. Yogeshwaran, Deepana Gandhi, Chhavilata Sahu, Karthic Balasubramanian, Shyam Mohan, Midhun S. Menon, P. Natarajan, Vivek Raghavan

    Abstract: TeamIndus' lunar logistics vision includes multiple lunar missions to meet requirements of science, commercial and efforts towards global exploration. The first mission is slated for launch in 2020. The prime objective is to demonstrate autonomous precision lunar landing, and Surface Exploration Rover to collect data on the vicinity of the landing site. TeamIndus has developed various technologies…

    Submitted 25 July, 2019; originally announced July 2019.

  27. arXiv:1906.08236  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    PyRobot: An Open-source Robotics Framework for Research and Benchmarking

    Authors: Adithyavairavan Murali, Tao Chen, Kalyan Vasudev Alwala, Dhiraj Gandhi, Lerrel Pinto, Saurabh Gupta, Abhinav Gupta

    Abstract: This paper introduces PyRobot, an open-source robotics framework for research and benchmarking. PyRobot is a light-weight, high-level interface on top of ROS that provides a consistent set of hardware independent mid-level APIs to control different robots. PyRobot abstracts away details about low-level controllers and inter-process communication, and allows non-robotics researchers (ML, CV researc…

    Submitted 19 June, 2019; originally announced June 2019.

  28. arXiv:1906.04161  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Self-Supervised Exploration via Disagreement

    Authors: Deepak Pathak, Dhiraj Gandhi, Abhinav Gupta

    Abstract: Efficient exploration is a long-standing problem in sensorimotor learning. Major advances have been demonstrated in noise-free, non-stochastic domains such as video games and simulation. However, most of these formulations either get stuck in environments with stochastic dynamics or are too inefficient to be scalable to real robotics setups. In this paper, we propose a formulation for exploration…

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: Accepted at ICML 2019. Website at https://pathak22.github.io/exploration-by-disagreement/

  29. arXiv:1811.01457  [pdf, other]

    cs.PL cs.LG

    Fashionable Modelling with Flux

    Authors: Michael Innes, Elliot Saba, Keno Fischer, Dhairya Gandhi, Marco Concetto Rudilosso, Neethu Mariya Joy, Tejan Karmali, Avik Pal, Viral Shah

    Abstract: Machine learning as a discipline has seen an incredible surge of interest in recent years due in large part to a perfect storm of new theory, superior tooling, renewed interest in its capabilities. We present in this paper a framework named Flux that shows how further refinement of the core ideas of machine learning, built upon the foundation of the Julia programming language, can yield an environ…

    Submitted 10 November, 2018; v1 submitted 31 October, 2018; originally announced November 2018.

  30. arXiv:1807.07049  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Robot Learning in Homes: Improving Generalization and Reducing Dataset Bias

    Authors: Abhinav Gupta, Adithyavairavan Murali, Dhiraj Gandhi, Lerrel Pinto

    Abstract: Data-driven approaches to solving robotic tasks have gained a lot of traction in recent years. However, most existing policies are trained on large-scale datasets collected in curated lab settings. If we aim to deploy these models in unstructured visual environments like people's homes, they will be unable to cope with the mismatch in data distribution. In such light, we present the first systemat…

    Submitted 18 July, 2018; originally announced July 2018.

  31. arXiv:1805.04201  [pdf, other]

    cs.RO cs.AI eess.SY

    Learning to Grasp Without Seeing

    Authors: Adithyavairavan Murali, Yin Li, Dhiraj Gandhi, Abhinav Gupta

    Abstract: Can a robot grasp an unknown object without seeing it? In this paper, we present a tactile-sensing based approach to this challenging problem of grasping novel objects without prior knowledge of their location or physical properties. Our key idea is to combine touch based object localization with tactile based re-grasping. To train our learning models, we created a large-scale grasping dataset, in…

    Submitted 10 May, 2018; originally announced May 2018.

  32. arXiv:1708.01354  [pdf, other]

    cs.RO cs.CV cs.LG

    CASSL: Curriculum Accelerated Self-Supervised Learning

    Authors: Adithyavairavan Murali, Lerrel Pinto, Dhiraj Gandhi, Abhinav Gupta

    Abstract: Recent self-supervised learning approaches focus on using a few thousand data points to learn policies for high-level, low-dimensional action spaces. However, scaling this framework for high-dimensional control require either scaling up the data collection efforts or using a clever sampling strategy for training. We present a novel approach - Curriculum Accelerated Self-Supervised Learning (CASSL)…

    Submitted 12 February, 2018; v1 submitted 3 August, 2017; originally announced August 2017.

  33. arXiv:1704.05588  [pdf, other]

    cs.RO cs.CV cs.LG

    Learning to Fly by Crashing

    Authors: Dhiraj Gandhi, Lerrel Pinto, Abhinav Gupta

    Abstract: How do you learn to navigate an Unmanned Aerial Vehicle (UAV) and avoid obstacles? One approach is to use a small dataset collected by human experts: however, high capacity learning algorithms tend to overfit when trained with little data. An alternative is to use simulation. But the gap between simulation and real world remains large especially for perception problems. The reason most research av…

    Submitted 26 April, 2017; v1 submitted 18 April, 2017; originally announced April 2017.

  34. arXiv:1604.01360  [pdf, other]

    cs.CV cs.AI cs.RO

    The Curious Robot: Learning Visual Representations via Physical Interactions

    Authors: Lerrel Pinto, Dhiraj Gandhi, Yuanfeng Han, Yong-Lae Park, Abhinav Gupta

    Abstract: What is the right supervisory signal to train visual representations? Current approaches in computer vision use category labels from datasets such as ImageNet to train ConvNets. However, in case of biological agents, visual representation learning does not require millions of semantic labels. We argue that biological agents use physical interactions with the world to learn visual representations u…

    Submitted 25 July, 2016; v1 submitted 5 April, 2016; originally announced April 2016.