Showing 1–42 of 42 results for author: Boularias, A

  1. arXiv:2407.05511  [pdf, other]

    cs.LG cs.RO

    Provably Efficient Long-Horizon Exploration in Monte Carlo Tree Search through State Occupancy Regularization

    Authors: Liam Schramm, Abdeslam Boularias

    Abstract: Monte Carlo tree search (MCTS) has been successful in a variety of domains, but faces challenges with long-horizon exploration when compared to sampling-based motion planning algorithms like Rapidly-Exploring Random Trees. To address these limitations of MCTS, we derive a tree search algorithm based on policy optimization with state occupancy measure regularization, which we call Volume-MCTS…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: To be published in ICML 2024 Conference Proceedings

  2. arXiv:2406.07837  [pdf, other]

    cs.RO cs.AI

    Scaling Manipulation Learning with Visual Kinematic Chain Prediction

    Authors: Xinyu Zhang, Yuhan Liu, Haonan Chang, Abdeslam Boularias

    Abstract: Learning general-purpose models from diverse datasets has achieved great success in machine learning. In robotics, however, existing methods in multi-task learning are typically constrained to a single robot and workspace, while recent work such as RT-X requires a non-trivial action normalization procedure to manually bridge the gap between different action spaces in diverse environments. In this…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Submitted to CoRL 2024

  3. arXiv:2406.07549  [pdf, other]

    cs.RO

    A3VLM: Actionable Articulation-Aware Vision Language Model

    Authors: Siyuan Huang, Haonan Chang, Yuhan Liu, Yimeng Zhu, Hao Dong, Peng Gao, Abdeslam Boularias, Hongsheng Li

    Abstract: Vision Language Models (VLMs) have received significant attention in recent years in the robotics community. VLMs are shown to be able to perform complex visual reasoning and scene understanding tasks, which makes them regarded as a potential universal solution for general robotics problems such as manipulation and navigation. However, previous VLMs for robotics such as RT-1, RT-2, and ManipLLM ha…

    Submitted 13 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

  4. arXiv:2405.13178  [pdf, other]

    cs.RO cs.AI cs.LG

    One-Shot Imitation Learning with Invariance Matching for Robotic Manipulation

    Authors: Xinyu Zhang, Abdeslam Boularias

    Abstract: Learning a single universal policy that can perform a diverse set of manipulation tasks is a promising new direction in robotics. However, existing techniques are limited to learning policies that can only perform tasks that are encountered during training, and require a large number of demonstrations to learn new tasks. Humans, on the other hand, often can learn a new task from a single unannotat…

    Submitted 4 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: RSS 2024

  5. arXiv:2311.02873  [pdf, other]

    cs.CV cs.RO

    OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D Data

    Authors: Shiyang Lu, Haonan Chang, Eric Pu Jing, Abdeslam Boularias, Kostas Bekris

    Abstract: This work presents OVIR-3D, a straightforward yet effective method for open-vocabulary 3D object instance retrieval without using any 3D data for training. Given a language query, the proposed method is able to return a ranked set of 3D object instance segments based on the feature similarity of the instance and the text query. This is achieved by a multi-view fusion of text-aligned 2D region prop…

    Submitted 6 November, 2023; originally announced November 2023.

  6. arXiv:2309.15940  [pdf, other]

    cs.RO cs.CV

    Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs

    Authors: Haonan Chang, Kowndinya Boyalakuntla, Shiyang Lu, Siwei Cai, Eric Jing, Shreesh Keskar, Shijie Geng, Adeeb Abbas, Lifeng Zhou, Kostas Bekris, Abdeslam Boularias

    Abstract: We present an Open-Vocabulary 3D Scene Graph (OVSG), a formal framework for grounding a variety of entities, such as object instances, agents, and regions, with free-form text-based queries. Unlike conventional semantic-based object localization approaches, our system facilitates context-aware entity localization, allowing for queries such as "pick up a cup on a kitchen table" or "navigate to a…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: The code and dataset used for evaluation can be found at https://github.com/changhaonan/OVSG. This paper has been accepted by CoRL 2023.

  7. arXiv:2309.15821  [pdf, other]

    cs.RO

    LGMCTS: Language-Guided Monte-Carlo Tree Search for Executable Semantic Object Rearrangement

    Authors: Haonan Chang, Kai Gao, Kowndinya Boyalakuntla, Alex Lee, Baichuan Huang, Harish Udhaya Kumar, Jinjin Yu, Abdeslam Boularias

    Abstract: We introduce a novel approach to the executable semantic object rearrangement problem. In this challenge, a robot seeks to create an actionable plan that rearranges objects within a scene according to a pattern dictated by a natural language description. Unlike existing methods such as StructFormer and StructDiffusion, which tackle the issue in two steps by first generating poses and then leveragi…

    Submitted 20 March, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: Our code and supplementary materials are accessible at https://github.com/changhaonan/LG-MCTS

  8. arXiv:2309.12969  [pdf, other]

    cs.CV

    Detect Everything with Few Examples

    Authors: Xinyu Zhang, Yuting Wang, Abdeslam Boularias

    Abstract: Few-shot object detection aims at detecting novel categories given a few example images. Recent methods focus on finetuning strategies, with complicated procedures that prohibit a wider application. In this paper, we introduce DE-ViT, a few-shot object detector without the need for finetuning. DE-ViT's novel architecture is based on a new region-propagation mechanism for localization. The propagat…

    Submitted 7 March, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

  9. arXiv:2307.13640  [pdf, other]

    cs.CV

    Optical Flow boosts Unsupervised Localization and Segmentation

    Authors: Xinyu Zhang, Abdeslam Boularias

    Abstract: Unsupervised localization and segmentation are long-standing robot vision challenges that describe the critical ability for an autonomous robot to learn to decompose images into individual objects without labeled data. These tasks are important because of the limited availability of dense image manual annotation and the promising vision of adapting to an evolving set of object categories in lifelo…

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: Accepted at IROS2023

  10. arXiv:2304.04325  [pdf, other]

    cs.CV cs.RO

    Self-Supervised Learning of Object Segmentation from Unlabeled RGB-D Videos

    Authors: Shiyang Lu, Yunfu Deng, Abdeslam Boularias, Kostas Bekris

    Abstract: This work proposes a self-supervised learning system for segmenting rigid objects in RGB images. The proposed pipeline is trained on unlabeled RGB-D videos of static objects, which can be captured with a camera carried by a mobile robot. A key feature of the self-supervised training process is a graph-matching algorithm that operates on the over-segmentation output of the point cloud that is recon…

    Submitted 9 April, 2023; originally announced April 2023.

  11. arXiv:2301.13244  [pdf, other]

    cs.RO cs.CV

    Mono-STAR: Mono-camera Scene-level Tracking and Reconstruction

    Authors: Haonan Chang, Dhruv Metha Ramesh, Shijie Geng, Yuqiu Gan, Abdeslam Boularias

    Abstract: We present Mono-STAR, the first real-time 3D reconstruction system that simultaneously supports semantic fusion, fast motion tracking, non-rigid object deformation, and topological change under a unified framework. The proposed system solves a new optimization problem incorporating optical-flow-based 2D constraints to deal with fast motion and a novel semantic-aware deformation graph (SAD-graph) f…

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: This paper has been accepted by ICRA2023

  12. arXiv:2210.03815  [pdf, other]

    cs.RO cs.CV cs.GR

    Scene-level Tracking and Reconstruction without Object Priors

    Authors: Haonan Chang, Abdeslam Boularias

    Abstract: We present the first real-time system capable of tracking and reconstructing, individually, every visible object in a given scene, without any form of prior on the rigidness of the objects, texture existence, or object category. In contrast with previous methods such as Co-Fusion and MaskFusion that first segment the scene into individual objects and then process each object independently, the pro…

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: Accepted by IROS2022

  13. arXiv:2209.06331  [pdf, other]

    cs.RO

    Learning Category-Level Manipulation Tasks from Point Clouds with Dynamic Graph CNNs

    Authors: Junchi Liang, Abdeslam Boularias

    Abstract: This paper presents a new technique for learning category-level manipulation from raw RGB-D videos of task demonstrations, with no manual labels or annotations. Category-level learning aims to acquire skills that can be generalized to new objects, with geometries and textures that are different from the ones of the objects used in the demonstrations. We address this problem by first viewing both g…

    Submitted 13 September, 2022; originally announced September 2022.

  14. arXiv:2207.06649  [pdf, other]

    cs.RO

    Parallel Monte Carlo Tree Search with Batched Rigid-body Simulations for Speeding up Long-Horizon Episodic Robot Planning

    Authors: Baichuan Huang, Abdeslam Boularias, Jingjin Yu

    Abstract: We propose a novel Parallel Monte Carlo tree search with Batched Simulations (PMBS) algorithm for accelerating long-horizon, episodic robotic planning tasks. Monte Carlo tree search (MCTS) is an effective heuristic search algorithm for solving episodic decision-making problems whose underlying search spaces are expansive. Leveraging a GPU-based large-scale simulator, PMBS introduces massive parall…

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted for IROS 2022

  15. arXiv:2207.01115  [pdf, other]

    cs.LG cs.AI cs.RO

    USHER: Unbiased Sampling for Hindsight Experience Replay

    Authors: Liam Schramm, Yunfu Deng, Edgar Granados, Abdeslam Boularias

    Abstract: Dealing with sparse rewards is a long-standing challenge in reinforcement learning (RL). Hindsight Experience Replay (HER) addresses this problem by reusing failed trajectories for one goal as successful trajectories for another. This allows for both a minimum density of reward and for generalization across multiple goals. However, this strategy is known to result in a biased value function, as th…

    Submitted 3 July, 2022; originally announced July 2022.

  16. arXiv:2203.03797  [pdf, other]

    cs.RO cs.LG

    Learning Sensorimotor Primitives of Sequential Manipulation Tasks from Visual Demonstrations

    Authors: Junchi Liang, Bowen Wen, Kostas Bekris, Abdeslam Boularias

    Abstract: This work aims to learn how to perform complex robot manipulation tasks that are composed of several, consecutively executed low-level sub-tasks, given as input a few visual demonstrations of the tasks performed by a person. The sub-tasks consist of moving the robot's end-effector until it reaches a sub-goal region in the task space, performing an action, and triggering the next sub-task when a pr…

    Submitted 7 March, 2022; originally announced March 2022.

  17. arXiv:2202.01426  [pdf, other]

    cs.RO

    Interleaving Monte Carlo Tree Search and Self-Supervised Learning for Object Retrieval in Clutter

    Authors: Baichuan Huang, Teng Guo, Abdeslam Boularias, Jingjin Yu

    Abstract: In this study, working with the task of object retrieval in clutter, we have developed a robot learning framework in which Monte Carlo Tree Search (MCTS) is first applied to enable a Deep Neural Network (DNN) to learn the intricate interactions between a robot arm and a complex scene containing many objects, allowing the DNN to partially clone the behavior of MCTS. In turn, the trained DNN is inte…

    Submitted 23 March, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

    Comments: Accepted by ICRA 2022

  18. arXiv:2106.14070  [pdf, other]

    cs.RO cs.AI cs.CV eess.SY

    Vision-driven Compliant Manipulation for Reliable, High-Precision Assembly Tasks

    Authors: Andrew S. Morgan, Bowen Wen, Junchi Liang, Abdeslam Boularias, Aaron M. Dollar, Kostas Bekris

    Abstract: Highly constrained manipulation tasks continue to be challenging for autonomous robots as they require high levels of precision, typically less than 1mm, which is often incompatible with what can be achieved by traditional perception systems. This paper demonstrates that the combination of state-of-the-art object tracking with passively adaptive mechanical hardware can be leveraged to complete pre…

    Submitted 26 June, 2021; originally announced June 2021.

  19. Visual Foresight Trees for Object Retrieval from Clutter with Nonprehensile Rearrangement

    Authors: Baichuan Huang, Shuai D. Han, Jingjin Yu, Abdeslam Boularias

    Abstract: This paper considers the problem of retrieving an object from many tightly packed objects using a combination of robotic pushing and grasping actions. Object retrieval in dense clutter is an important skill for robots to operate in households and everyday environments effectively. The proposed solution, Visual Foresight Trees (VFT), intelligently rearranges the clutter surrounding a target object…

    Submitted 9 March, 2022; v1 submitted 6 May, 2021; originally announced May 2021.

    Comments: Accepted by RA-L

  20. arXiv:2011.05459  [pdf, other]

    cs.CV cs.RO

    A Self-supervised Learning System for Object Detection in Videos Using Random Walks on Graphs

    Authors: Juntao Tan, Changkyu Song, Abdeslam Boularias

    Abstract: This paper presents a new self-supervised system for learning to detect novel and previously unseen categories of objects in images. The proposed system receives as input several unlabeled videos of scenes containing various objects. The frames of the videos are segmented into objects using depth information, and the segments are tracked along each video. The system then constructs a weighted grap…

    Submitted 24 August, 2021; v1 submitted 10 November, 2020; originally announced November 2020.

    Comments: 2021 IEEE International Conference on Robotics and Automation (ICRA 2021)

  21. arXiv:2011.04692  [pdf, other]

    cs.RO cs.AI

    DIPN: Deep Interaction Prediction Network with Application to Clutter Removal

    Authors: Baichuan Huang, Shuai D. Han, Abdeslam Boularias, Jingjin Yu

    Abstract: We propose a Deep Interaction Prediction Network (DIPN) for learning to predict complex interactions that ensue as a robot end-effector pushes multiple objects, whose physical properties, including size, shape, mass, and friction coefficients may be unknown a priori. DIPN "imagines" the effect of a push action and generates an accurate synthetic image of the predicted outcome. DIPN is shown to be…

    Submitted 4 April, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: ICRA 2021

  22. arXiv:2009.11465  [pdf, other]

    cs.RO

    Model Identification and Control of a Low-Cost Wheeled Mobile Robot Using Differentiable Physics

    Authors: Yanshi Luo, Abdeslam Boularias, Mridul Aanjaneya

    Abstract: We present the design of a low-cost wheeled mobile robot, and an analytical model for predicting its motion under the influence of motor torques and friction forces. Using our proposed model, we show how to analytically compute the gradient of an appropriate loss function, that measures the deviation between predicted motion trajectories and real-world trajectories, which are estimated using April…

    Submitted 23 September, 2020; originally announced September 2020.

  23. arXiv:2008.01921  [pdf, other]

    cs.RO eess.SY

    A Probabilistic Model for Planar Sliding of Objects with Unknown Material Properties: Identification and Robust Planning

    Authors: Changkyu Song, Abdeslam Boularias

    Abstract: This paper introduces a new technique for learning probabilistic models of mass and friction distributions of unknown objects, and performing robust sliding actions by using the learned models. The proposed method is executed in two consecutive phases. In the exploration phase, a table-top object is poked by a robot from different angles. The observed motions of the object are compared against sim…

    Submitted 4 August, 2020; originally announced August 2020.

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2020)

  24. arXiv:2008.01593  [pdf, other]

    cs.LG cs.RO stat.ML

    Learning Transition Models with Time-delayed Causal Relations

    Authors: Junchi Liang, Abdeslam Boularias

    Abstract: This paper introduces an algorithm for discovering implicit and delayed causal relations between events observed by a robot at arbitrary times, with the objective of improving data-efficiency and interpretability of model-based reinforcement learning (RL) techniques. The proposed algorithm initially predicts observations with the Markov assumption, and incrementally introduces new hidden variables…

    Submitted 4 August, 2020; originally announced August 2020.

  25. arXiv:2006.15503  [pdf, other]

    cs.RO

    Task-driven Perception and Manipulation for Constrained Placement of Unknown Objects

    Authors: Chaitanya Mitash, Rahul Shome, Bowen Wen, Abdeslam Boularias, Kostas Bekris

    Abstract: Recent progress in robotic manipulation has dealt with the case of previously unknown objects in the context of relatively simple tasks, such as bin-picking. Existing methods for more constrained problems, however, such as deliberate placement in a tight region, depend more critically on shape information to achieve safe execution. This work deals with pick-and-constrained placement of objects wit…

    Submitted 28 June, 2020; originally announced June 2020.

    Comments: 8 pages, 8 figures, RA-L and IROS 2020

  26. arXiv:2005.10418  [pdf, other]

    cs.LG eess.SY stat.ML

    Learning to Transfer Dynamic Models of Underactuated Soft Robotic Hands

    Authors: Liam Schramm, Avishai Sintov, Abdeslam Boularias

    Abstract: Transfer learning is a popular approach to bypassing data limitations in one domain by leveraging data from another domain. This is especially useful in robotics, as it allows practitioners to reduce data collection with physical robots, which can be time-consuming and cause wear and tear. The most common way of doing this with neural networks is to take an existing neural network, and simply trai…

    Submitted 20 May, 2020; originally announced May 2020.

    Comments: ICRA 2020

  27. arXiv:2005.05456  [pdf, other]

    cs.RO cs.LG eess.SY

    Learning to Slide Unknown Objects with Differentiable Physics Simulations

    Authors: Changkyu Song, Abdeslam Boularias

    Abstract: We propose a new technique for pushing an unknown object from an initial configuration to a goal configuration with stability constraints. The proposed method leverages recent progress in differentiable physics models to learn unknown mechanical properties of pushed objects, such as their distributions of mass and coefficients of friction. The proposed learning technique computes the gradient of t…

    Submitted 3 June, 2020; v1 submitted 11 May, 2020; originally announced May 2020.

    Comments: to be published in Robotics: Science and Systems, July 12-16, 2020

  28. arXiv:2005.05410  [pdf, other]

    cs.RO cs.CV eess.SY

    Identifying Mechanical Models through Differentiable Simulations

    Authors: Changkyu Song, Abdeslam Boularias

    Abstract: This paper proposes a new method for manipulating unknown objects through a sequence of non-prehensile actions that displace an object from its initial configuration to a given goal configuration on a flat surface. The proposed method leverages recent progress in differentiable physics models to identify unknown mechanical properties of manipulated objects, such as inertia matrix, friction coeffic…

    Submitted 11 May, 2020; originally announced May 2020.

    Comments: to be published in Learning for DynamIcs & Control (L4DC), June 10-11th, 2020

  29. arXiv:1910.04953  [pdf, other]

    cs.RO cs.CV

    Scene-level Pose Estimation for Multiple Instances of Densely Packed Objects

    Authors: Chaitanya Mitash, Bowen Wen, Kostas Bekris, Abdeslam Boularias

    Abstract: This paper introduces key machine learning operations that allow the realization of robust, joint 6D pose estimation of multiple instances of objects either densely packed or in unstructured piles from RGB-D data. The first objective is to learn semantic and instance-boundary detectors without manual labeling. An adversarial training framework in conjunction with physics-based simulation is used t…

    Submitted 10 October, 2019; originally announced October 2019.

    Comments: To appear at the Conference on Robot Learning (CoRL) - 2019

  30. arXiv:1905.07505  [pdf, other]

    cs.RO

    Object Rearrangement with Nested Nonprehensile Manipulation Actions

    Authors: Changkyu Song, Abdeslam Boularias

    Abstract: This paper considers the problem of rearrangement planning, i.e., finding a sequence of manipulation actions that displace multiple objects from an initial configuration to a given goal configuration. Rearrangement is a critical skill for robots so that they can effectively operate in confined spaces that contain clutter. Examples of tasks that require rearrangement include packing objects inside a…

    Submitted 17 May, 2019; originally announced May 2019.

  31. Inferring 3D Shapes of Unknown Rigid Objects in Clutter through Inverse Physics Reasoning

    Authors: Changkyu Song, Abdeslam Boularias

    Abstract: We present a probabilistic approach for building, on the fly, 3-D models of unknown objects while being manipulated by a robot. We specifically consider manipulation tasks in piles of clutter that contain previously unseen objects. Most manipulation algorithms for performing such tasks require known geometric models of the objects in order to grasp or rearrange them robustly. One of the novel aspe…

    Submitted 13 March, 2019; originally announced March 2019.

    Journal ref: The IEEE Robotics and Automation Letters (RA-L) with the IEEE International Conference on Robotics and Automation (ICRA 2019)

  32. arXiv:1903.00984  [pdf, other]

    cs.RO

    Tight Robot Packing in the Real World: A Complete Manipulation Pipeline with Robust Primitives

    Authors: Rahul Shome, Wei N. Tang, Changkyu Song, Chaitanya Mitash, Hristiyan Kourtev, Jingjin Yu, Abdeslam Boularias, Kostas E. Bekris

    Abstract: Many order fulfillment applications in logistics, such as packing, involve picking objects from unstructured piles before tightly arranging them in bins or shipping containers. Desirable robotic solutions in this space need to be low-cost, robust, easily deployable and simple to control. The current work proposes a complete pipeline for solving packing tasks for cuboid objects, given access only t…

    Submitted 30 September, 2021; v1 submitted 3 March, 2019; originally announced March 2019.

  33. arXiv:1806.10457  [pdf, other]

    cs.RO cs.CV

    Physics-based Scene-level Reasoning for Object Pose Estimation in Clutter

    Authors: Chaitanya Mitash, Abdeslam Boularias, Kostas Bekris

    Abstract: This paper focuses on vision-based pose estimation for multiple rigid objects placed in clutter, especially in cases involving occlusions and objects resting on each other. Progress has been achieved recently in object recognition given advancements in deep learning. Nevertheless, such tools typically require a large amount of training data and significant manual effort to label objects. This limi…

    Submitted 1 April, 2019; v1 submitted 25 June, 2018; originally announced June 2018.

    Comments: 18 pages, 13 figures, International Journal of Robotics Research (IJRR) 2019. arXiv admin note: text overlap with arXiv:1710.08577

  34. arXiv:1806.06888  [pdf, other]

    cs.CV

    Learning Object Localization and 6D Pose Estimation from Simulation and Weakly Labeled Real Images

    Authors: Jean-Philippe Mercier, Chaitanya Mitash, Philippe Giguère, Abdeslam Boularias

    Abstract: This work proposes a process for efficiently training a point-wise object detector that enables localizing objects and computing their 6D poses in cluttered and occluded scenes. Accurate pose estimation is typically a requirement for robust robotic grasping and manipulation of objects placed in cluttered, tight environments, such as a shelf with multiple objects. To minimize the human labor requir…

    Submitted 20 February, 2019; v1 submitted 18 June, 2018; originally announced June 2018.

  35. arXiv:1806.06392  [pdf, other]

    cs.LG stat.ML

    Task-Relevant Object Discovery and Categorization for Playing First-person Shooter Games

    Authors: Junchi Liang, Abdeslam Boularias

    Abstract: We consider the problem of learning to play first-person shooter (FPS) video games using raw screen images as observations and keyboard inputs as actions. The high-dimensionality of the observations in this type of applications leads to prohibitive needs of training data for model-free methods, such as the deep Q-network (DQN), and its recurrent variant DRQN. Thus, recent works focused on learning…

    Submitted 17 June, 2018; originally announced June 2018.

  36. arXiv:1805.06324  [pdf, other]

    cs.CV

    Robust 6D Object Pose Estimation with Stochastic Congruent Sets

    Authors: Chaitanya Mitash, Abdeslam Boularias, Kostas Bekris

    Abstract: Object pose estimation is frequently achieved by first segmenting an RGB image and then, given depth data, registering the corresponding point cloud segment against the object's 3D model. Despite the progress due to CNNs, semantic segmentation output can be noisy, especially when the CNN is only trained on synthetic data. This causes registration methods to fail in estimating a good object pose. T…

    Submitted 16 May, 2018; originally announced May 2018.

  37. arXiv:1804.04696  [pdf, other]

    cs.RO cs.AI cs.LG eess.SY

    Efficient Model Identification for Tensegrity Locomotion

    Authors: Shaojun Zhu, David Surovik, Kostas E. Bekris, Abdeslam Boularias

    Abstract: This paper aims to identify in a practical manner unknown physical parameters, such as mechanical models of actuated robot links, which are critical in dynamical robotic tasks. Key features include the use of an off-the-shelf physics engine and the Bayesian optimization framework. The task being considered is locomotion with a high-dimensional, compliant Tensegrity robot. A key insight, in this ca…

    Submitted 12 April, 2018; originally announced April 2018.

  38. arXiv:1710.08893  [pdf, other]

    cs.RO cs.AI cs.LG

    Fast Model Identification via Physics Engines for Data-Efficient Policy Search

    Authors: Shaojun Zhu, Andrew Kimmel, Kostas E. Bekris, Abdeslam Boularias

    Abstract: This paper presents a method for identifying mechanical parameters of robots or objects, such as their mass and friction coefficients. Key features are the use of off-the-shelf physics engines and the adaptation of a Bayesian optimization technique towards minimizing the number of real-world experiments needed for model-based reinforcement learning. The proposed framework reproduces in a physics e…

    Submitted 13 June, 2018; v1 submitted 24 October, 2017; originally announced October 2017.

    Comments: IJCAI 18

  39. arXiv:1710.08577  [pdf, other]

    cs.RO cs.CV

    Improving 6D Pose Estimation of Objects in Clutter via Physics-aware Monte Carlo Tree Search

    Authors: Chaitanya Mitash, Abdeslam Boularias, Kostas E. Bekris

    Abstract: This work proposes a process for efficiently searching over combinations of individual object 6D pose hypotheses in cluttered scenes, especially in cases involving occlusions and objects resting on each other. The initial set of candidate object poses is generated from state-of-the-art object detection and global point cloud registration techniques. The best-scored pose per object by using these t…

    Submitted 23 October, 2017; originally announced October 2017.

    Comments: 8 pages, 4 figures

  40. arXiv:1703.07822  [pdf, other]

    cs.RO cs.AI cs.LG

    Information-theoretic Model Identification and Policy Search using Physics Engines with Application to Robotic Manipulation

    Authors: Shaojun Zhu, Andrew Kimmel, Abdeslam Boularias

    Abstract: We consider the problem of a robot learning the mechanical properties of objects through physical interaction with the object, and introduce a practical, data-efficient approach for identifying the motion models of these objects. The proposed method utilizes a physics engine, where the robot seeks to identify the inertial and friction parameters of the object by simulating its motion under differe…

    Submitted 22 March, 2017; originally announced March 2017.

  41. arXiv:1703.03347  [pdf, other]

    cs.RO cs.CV

    A Self-supervised Learning System for Object Detection using Physics Simulation and Multi-view Pose Estimation

    Authors: Chaitanya Mitash, Kostas E. Bekris, Abdeslam Boularias

    Abstract: Progress has been achieved recently in object detection given advancements in deep learning. Nevertheless, such tools typically require a large amount of training data and significant manual effort to label objects. This limits their applicability in robotics, where solutions must scale to a large number of objects and variety of conditions. This work proposes an autonomous process for training a…

    Submitted 3 August, 2017; v1 submitted 9 March, 2017; originally announced March 2017.

    Comments: 7 pages, 6 figures, accepted at the IEEE International Conference on Intelligent Robots and Systems (IROS), Vancouver, Canada, 2017

  42. arXiv:1210.4887  [pdf]

    cs.LG cs.AI stat.ML

    Hilbert Space Embeddings of POMDPs

    Authors: Yu Nishiyama, Abdeslam Boularias, Arthur Gretton, Kenji Fukumizu

    Abstract: A nonparametric approach for policy learning for POMDPs is proposed. The approach represents distributions over the states, observations, and actions as embeddings in feature spaces, which are reproducing kernel Hilbert spaces. Distributions over states given the observations are obtained by applying the kernel Bayes' rule to these distribution embeddings. Policies and value functions are defined…

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-644-653