Skip to main content

Showing 1–2 of 2 results for author: Balsells, M

  1. arXiv:2310.20608  [pdf, other

    cs.LG cs.AI cs.RO

    Autonomous Robotic Reinforcement Learning with Asynchronous Human Feedback

    Authors: Max Balsells, Marcel Torne, Zihan Wang, Samedh Desai, Pulkit Agrawal, Abhishek Gupta

    Abstract: Ideally, we would place a robot in a real-world environment and leave it there improving on its own by gathering more experience autonomously. However, algorithms for autonomous robotic learning have been challenging to realize in the real world. While this has often been attributed to the challenge of sample complexity, even sample-efficient techniques are hampered by two major challenges - the d… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: Project website https://guided-exploration-autonomous-rl.github.io/GEAR/

  2. arXiv:2307.11049  [pdf, other

    cs.LG cs.AI cs.RO

    Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback

    Authors: Marcel Torne, Max Balsells, Zihan Wang, Samedh Desai, Tao Chen, Pulkit Agrawal, Abhishek Gupta

    Abstract: Exploration and reward specification are fundamental and intertwined challenges for reinforcement learning. Solving sequential decision-making tasks requiring expansive exploration requires either careful design of reward functions or the use of novelty-seeking exploration bonuses. Human supervisors can provide effective guidance in the loop to direct the exploration process, but prior methods to… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.