Safe Navigation for Robotic Digestive Endoscopy via Human Intervention-based Reinforcement Learning

With the increasing application of automated robotic digestive endoscopy (RDE), ensuring safe and efficient navigation in the unstructured and narrow digestive tract has become a critical challenge. Existing automated reinforcement learning navigation algorithms, often result in potentially risky co...

Full description

Saved in:

Bibliographic Details
Published in	arXiv.org
Main Authors	Tan, Min, Yushun Tao, Zheng, Boyun, Xie, GaoSheng, Feng, Lijuan, Xia, Zeyang, Xiong, Jing
Format	Paper
Language	English
Published	Ithaca Cornell University Library, arXiv.org 24.09.2024
Subjects	Algorithms Automation Cloning Endoscopy Intervention Machine learning Navigation Reagents
Online Access	Get full text

Cover

Loading…

More Information
Summary:	With the increasing application of automated robotic digestive endoscopy (RDE), ensuring safe and efficient navigation in the unstructured and narrow digestive tract has become a critical challenge. Existing automated reinforcement learning navigation algorithms, often result in potentially risky collisions due to the absence of essential human intervention, which significantly limits the safety and effectiveness of RDE in actual clinical practice. To address this limitation, we proposed a Human Intervention (HI)-based Proximal Policy Optimization (PPO) framework, dubbed HI-PPO, which incorporates expert knowledge to enhance RDE's safety. Specifically, we introduce an Enhanced Exploration Mechanism (EEM) to address the low exploration efficiency of the standard PPO. Additionally, a reward-penalty adjustment (RPA) is implemented to penalize unsafe actions during initial interventions. Furthermore, Behavior Cloning Similarity (BCS) is included as an auxiliary objective to ensure the agent emulates expert actions. Comparative experiments conducted in a simulated platform across various anatomical colon segments demonstrate that our model effectively and safely guides RDE.
ISSN:	2331-8422