Robot grasping in clutter: Using a hierarchy of supervisors for learning from demonstrations

For applications such as Amazon warehouse order fulfillment, robots must grasp a desired object amid clutter: other objects that block direct access. This can be difficult to program explicitly due to uncertainty in friction and push mechanics and the variety of objects that can be encountered. Deep...

Full description

Saved in:
Bibliographic Details
Published inIEEE International Conference on Automation Science and Engineering (CASE) pp. 827 - 834
Main Authors Laskey, Michael, Lee, Jonathan, Chuck, Caleb, Gealy, David, Hsieh, Wesley, Pokorny, Florian T., Dragan, Anca D., Goldberg, Ken
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.08.2016
Subjects
Online AccessGet full text
ISSN2161-8089
DOI10.1109/COASE.2016.7743488

Cover

Loading…
More Information
Summary:For applications such as Amazon warehouse order fulfillment, robots must grasp a desired object amid clutter: other objects that block direct access. This can be difficult to program explicitly due to uncertainty in friction and push mechanics and the variety of objects that can be encountered. Deep Learning networks combined with Online Learning from Demonstration (LfD) algorithms such as DAgger and SHIV have potential to learn robot control policies for such tasks where the input is a camera image and system dynamics and the cost function are unknown. To explore this idea, we introduce a version of the grasping in clutter problem where a yellow cylinder must be grasped by a planar robot arm amid extruded objects in a variety of shapes and positions. To reduce the burden on human experts to provide demonstrations, we propose using a hierarchy of three levels of supervisors: a fast motion planner that ignores obstacles, crowd-sourced human workers who provide appropriate robot control values remotely via online videos, and a local human expert. Physical experiments suggest that with 160 expert demonstrations, using the hierarchy of supervisors can increase the probability of a successful grasp (reliability) from 55% to 90%.
ISSN:2161-8089
DOI:10.1109/COASE.2016.7743488