Real-time robot path planning from simple to complex obstacle patterns via transfer learning of options

We consider the problem of path planning in an initially unknown environment where a robot does not have an a priori map of its environment but has access to prior information accumulated by itself from navigation in similar but not identical environments. To address the navigation problem, we propo...

Full description

Saved in:

Bibliographic Details
Published in	Autonomous robots Vol. 43; no. 8; pp. 2071 - 2093
Main Authors	Saha, Olimpiya, Dasgupta, Prithviraj, Woosley, Bradley
Format	Journal Article
Language	English
Published	New York Springer US 01.12.2019 Springer Nature B.V
Subjects	Algorithms Artificial Intelligence Barriers Collision avoidance Complexity Computer Imaging Computer simulation Control Cost analysis Engineering Machine learning Maneuvers Markov analysis Markov processes Mechatronics Navigation Path planning Pattern Recognition and Graphics Planning Robotics Robotics and Automation Robots Unknown environments Vision Markov decision processes with unawareness Robot path planning Options Transfer learning Reinforcement learning
Online Access	Get full text

Cover

Loading…

More Information
Summary:	We consider the problem of path planning in an initially unknown environment where a robot does not have an a priori map of its environment but has access to prior information accumulated by itself from navigation in similar but not identical environments. To address the navigation problem, we propose a novel, machine learning-based algorithm called Semi-Markov Decision Process with Unawareness and Transfer (SMDPU-T) where a robot records a sequence of its actions around obstacles as action sequences called options which are then reused by it within a framework called Markov Decision Process with unawareness (MDPU) to learn suitable, collision-free maneuvers around more complex obstacles in future. We have analytically derived the cost bounds of the selected option by SMDPU-T and the worst case time complexity of our algorithm. Our experimental results on simulated robots within Webots simulator illustrate that SMDPU-T takes 24 % planning time and 39 % total time to solve same navigation tasks while, our hardware results on a Turtlebot robot indicate that SMDPU-T on average takes 53 % planning time and 60 % total time as compared to a recent, sampling-based path planner.
ISSN:	0929-5593 1573-7527
DOI:	10.1007/s10514-019-09852-5