How to Select and Use Tools?: Active Perception of Target Objects Using Multimodal Deep Learning

Bibliographic Details
Published in: IEEE Robotics and Automation Letters, Vol. 6, No. 2, pp. 2517–2524
Main Authors: Saito, Namiko; Ogata, Tetsuya; Funabashi, Satoshi; Mori, Hiroki; Sugano, Shigeki
Format: Journal Article
Language: English
Published: Piscataway: The Institute of Electrical and Electronics Engineers, Inc. (IEEE), 01.04.2021

Summary: Selecting appropriate tools and using them to perform daily tasks is a critical capability for introducing robots into domestic applications. In previous studies, however, adaptability to target objects was limited, making it difficult to change tools and adjust actions accordingly. To manipulate various objects with tools, robots must both understand tool functions and recognize object characteristics in order to discern a tool-object-action relation. We focus on active perception using multimodal sensorimotor data while a robot interacts with objects, allowing the robot to recognize their extrinsic and intrinsic characteristics. We construct a deep neural network (DNN) model that learns to recognize object characteristics, acquires tool-object-action relations, and generates motions for tool selection and handling. As an example tool-use situation, the robot performs an ingredient-transfer task, using a turner or a ladle to transfer an ingredient from a pot to a bowl. The results confirm that the robot recognizes object characteristics and servings even when the target ingredients are unknown. We also examine the contributions of image, force, and tactile data and show that learning a variety of multimodal information results in rich perception for tool use.
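
The summary describes a DNN that fuses image, force, and tactile data and generates motions from the fused state. As a rough illustration of how such a multimodal sensorimotor network can be wired up, the following PyTorch sketch encodes each modality separately, concatenates the features, and passes them through a recurrent core that predicts the next motor command. All layer sizes, the class name MultimodalToolUseNet, and the LSTM-based fusion are assumptions made for illustration; they are not the architecture reported in the paper.

    import torch
    import torch.nn as nn

    class MultimodalToolUseNet(nn.Module):
        """Hypothetical sketch of a multimodal sensorimotor network.

        Fuses a camera image with force and tactile readings, keeps a
        recurrent state over the interaction, and predicts the next
        motor command. Sizes and structure are illustrative only.
        """

        def __init__(self, force_dim=6, tactile_dim=16, motor_dim=7, hidden_dim=128):
            super().__init__()
            # Image encoder: 3x64x64 RGB frame -> compact feature vector.
            self.image_encoder = nn.Sequential(
                nn.Conv2d(3, 16, kernel_size=4, stride=2, padding=1),   # 64 -> 32
                nn.ReLU(),
                nn.Conv2d(16, 32, kernel_size=4, stride=2, padding=1),  # 32 -> 16
                nn.ReLU(),
                nn.Flatten(),
                nn.Linear(32 * 16 * 16, 64),
            )
            # Small MLP encoders for the low-dimensional force/tactile inputs.
            self.force_encoder = nn.Sequential(nn.Linear(force_dim, 32), nn.ReLU())
            self.tactile_encoder = nn.Sequential(nn.Linear(tactile_dim, 32), nn.ReLU())
            # Recurrent core integrates the fused features over time, so
            # interaction history can inform perception of the object.
            self.rnn = nn.LSTMCell(64 + 32 + 32, hidden_dim)
            # Decoder maps the hidden state to the next motor command.
            self.motor_head = nn.Linear(hidden_dim, motor_dim)

        def forward(self, image, force, tactile, state=None):
            fused = torch.cat(
                [self.image_encoder(image),
                 self.force_encoder(force),
                 self.tactile_encoder(tactile)],
                dim=-1,
            )
            h, c = self.rnn(fused, state)
            return self.motor_head(h), (h, c)

    # One step of the perception-action loop with dummy sensor data.
    net = MultimodalToolUseNet()
    motor_cmd, state = net(
        torch.randn(1, 3, 64, 64),  # camera frame
        torch.randn(1, 6),          # force/torque reading
        torch.randn(1, 16),         # tactile array
    )

The recurrent state is the piece that corresponds to active perception as described in the summary: by feeding each step's sensor readings back through the same hidden state, the network can accumulate evidence about an object's extrinsic and intrinsic characteristics over the course of an interaction before committing to a tool and a motion.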
ISSN: 2377-3766
DOI: 10.1109/LRA.2021.3062004