A self‐adaptive SAC‐PID control approach based on reinforcement learning for mobile robots



Bibliographic Details
Published in	International Journal of Robust and Nonlinear Control, Vol. 32, no. 18, pp. 9625–9643
Main Authors	Yu, Xinyi; Fan, Yuehai; Xu, Siyu; Ou, Linlin
Format	Journal Article
Language	English
Published	Bognor Regis: Wiley Subscription Services, Inc., 01.12.2022
Subjects	Adaptive control; Algorithms; Automatic control; Control algorithms; Control methods; Controllers; Error compensation; Fuzzy control; hierarchical structure; Learning; mobile robots; Polynomials; Proportional integral derivative; Real time; reinforcement learning; Robot control; Robots; Robustness (mathematics); SAC‐PID control

Summary:	Proportional–integral–derivative (PID) control is the most widely used control method in industrial control, robot control, and other fields. However, traditional PID control performs poorly when the system cannot be accurately modeled and the operating environment changes in real time. To tackle these problems, we propose a self‐adaptive, model‐free SAC‐PID control approach based on reinforcement learning for the automatic control of mobile robots. A new hierarchical structure is developed, consisting of an upper controller based on soft actor‐critic (SAC), one of the most competitive continuous control algorithms, and a lower controller based on incremental PID controllers. SAC receives the dynamic information of the mobile robot as input and simultaneously outputs the optimal parameters of the incremental PID controllers, which compensate for the error between the path and the mobile robot in real time. In addition, a combination of the 24‐neighborhood method and polynomial fitting is developed to improve the adaptability of the SAC‐PID control method to complex environments. The effectiveness of the SAC‐PID control method is verified on several paths of differing difficulty, both in Gazebo and on a real mecanum mobile robot. Furthermore, compared with fuzzy PID control, the SAC‐PID method offers strong robustness, generalization, and real‐time performance.
ISSN:	1049-8923 1099-1239
DOI:	10.1002/rnc.5662
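
Note:	The hierarchical structure described in the summary, an upper controller supplying gains to a lower incremental PID controller, can be illustrated with a minimal sketch. It uses the textbook incremental PID law, du(k) = Kp[e(k) - e(k-1)] + Ki e(k) + Kd[e(k) - 2e(k-1) + e(k-2)], which may differ in detail from the form used in the article. The names `IncrementalPID` and `placeholder_gain_policy` are hypothetical; the policy is a fixed stub standing in for a trained SAC actor and is not the authors' implementation.

```python
from dataclasses import dataclass


@dataclass
class IncrementalPID:
    """Textbook incremental PID; gains are supplied anew on every step."""
    e1: float = 0.0  # previous error e(k-1)
    e2: float = 0.0  # error before that, e(k-2)
    u: float = 0.0   # previous control output u(k-1)

    def step(self, error: float, kp: float, ki: float, kd: float) -> float:
        # Control increment computed from the last three errors.
        du = (kp * (error - self.e1)
              + ki * error
              + kd * (error - 2.0 * self.e1 + self.e2))
        self.u += du
        # Shift the error history for the next call.
        self.e2, self.e1 = self.e1, error
        return self.u


def placeholder_gain_policy(state):
    """Hypothetical stand-in for the upper controller.

    A trained SAC actor would map the robot state to gains; this stub
    ignores the state and returns fixed values (an assumption made
    purely for illustration).
    """
    return 2.0, 0.5, 0.1  # kp, ki, kd


if __name__ == "__main__":
    pid = IncrementalPID()
    for error in (1.0, 1.0, 1.0):
        kp, ki, kd = placeholder_gain_policy(state=(error,))
        print(pid.step(error, kp, ki, kd))
```

Because the gains are arguments of `step` rather than fields of the controller, an upper-level policy can change them at every control cycle without resetting the controller's error history.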