SafeCrowdNav: safety evaluation of robot crowd navigation in complex scenes

Navigating safely and efficiently in dense crowds remains a challenging problem for mobile robots. The interaction mechanisms involved in collision avoidance require robots to exhibit active and foresighted behaviors while understanding the crowd dynamics. Deep reinforcement learning methods have sh...

Full description

Saved in:

Bibliographic Details
Published in	Frontiers in neurorobotics Vol. 17; p. 1276519
Main Authors	Xu, Jing, Zhang, Wanruo, Cai, Jialun, Liu, Hong
Format	Journal Article
Language	English
Published	Lausanne Frontiers Research Foundation 12.10.2023 Frontiers Media S.A
Subjects	Algorithms collision avoidance Decision making Deep learning Exploration human-aware navigation Markov analysis mobile robot Neuroscience Pedestrians reinforcement learning Robotics Robots Safety security assessment Simulation Velocity
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Navigating safely and efficiently in dense crowds remains a challenging problem for mobile robots. The interaction mechanisms involved in collision avoidance require robots to exhibit active and foresighted behaviors while understanding the crowd dynamics. Deep reinforcement learning methods have shown superior performance compared to model-based approaches. However, existing methods lack an intuitive and quantitative safety evaluation for agents, and they may potentially trap agents in local optima during training, hindering their ability to learn optimal strategies. In addition, sparse reward problems further compound these limitations. To address these challenges, we propose SafeCrowdNav, a comprehensive crowd navigation algorithm that emphasizes obstacle avoidance in complex environments. Our approach incorporates a safety evaluation function to quantitatively assess the current safety score and an intrinsic exploration reward to balance exploration and exploitation based on scene constraints. By combining prioritized experience replay and hindsight experience replay techniques, our model effectively learns the optimal navigation policy in crowded environments. Experimental outcomes reveal that our approach enables robots to improve crowd comprehension during navigation, resulting in reduced collision probabilities and shorter navigation times compared to state-of-the-art algorithms. Our code is available at https://github.com/Janet-xujing-1216/SafeCrowdNav .
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 Reviewed by: Yuxuan Cao, Naval University of Engineering, China; Yinyan Zhang, Jinan University, China Edited by: Long Jin, Lanzhou University, China
ISSN:	1662-5218 1662-5218
DOI:	10.3389/fnbot.2023.1276519