Reinforcement learning-based finite time control for the asymmetric underactuated tethered spacecraft with disturbances

Bibliographic Details
Published in	Acta astronautica Vol. 220; pp. 218 - 229
Main Authors	Lu, Yingbo, Wang, Xingyu, Liu, Ya, Huang, Panfeng
Format	Journal Article
Language	English
Published	Elsevier Ltd 01.07.2024
Subjects	Actor–critic; Asymmetric underactuated tethered spacecraft; Finite time control; Reinforcement learning
ISSN	0094-5765 1879-2030
DOI	10.1016/j.actaastro.2024.04.014

Summary:	This article addresses the attitude stabilization control problem for an asymmetric underactuated tethered spacecraft subject to external disturbances, and proposes a reinforcement learning (RL)-based finite-time control scheme to enhance the control performance and energy efficiency of the closed-loop system. Firstly, the error dynamics of the underactuated tethered system in the presence of external disturbances are derived using Lagrangian modeling. Then, an RL-based control algorithm is implemented with a radial basis function (RBF) neural network (NN), in which actor–critic networks are developed to obtain the optimal performance index function and the optimal controller. Using Lyapunov theory, semi-global finite-time stability of all closed-loop signals is established through rigorous mathematical analysis, and the tracking errors are guaranteed to converge to an arbitrarily small neighborhood of the origin in finite time. Finally, comparative simulation results against a hierarchical sliding mode controller are presented to demonstrate the viability of the proposed strategy.
• The proposed algorithm does not rely on the harsh symmetry condition on the mass matrix.
• Actor–critic NNs are used to obtain the performance index function and the controller.
• A novel RL-based scheme is designed to ensure semi-global finite-time convergence.
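As a rough illustration of the RBF neural network mentioned in the summary, the sketch below fits a Gaussian RBF approximator to a known scalar function with a normalized gradient update. It is not the authors' actor–critic algorithm or their update laws: the target function, the centers, the width, the learning rate, and every function name are assumptions chosen only to show how a weighted sum of radial basis functions can approximate an unknown nonlinearity.

```python
import math


def gaussian_features(x, centers, width):
    """Gaussian RBF activations for a scalar input x (hypothetical helper)."""
    return [math.exp(-((x - c) ** 2) / (2.0 * width ** 2)) for c in centers]


def rbf_output(weights, features):
    """Network output: weighted sum of the basis activations."""
    return sum(w * s for w, s in zip(weights, features))


def train_rbf(target, centers, width, samples, rate=0.5, epochs=200):
    """Fit the output weights with a normalized gradient (NLMS-style) update.

    This is a generic supervised fit, assumed here for illustration only;
    an actor-critic scheme would instead adapt weights from a cost signal.
    """
    weights = [0.0] * len(centers)
    for _ in range(epochs):
        for x in samples:
            phi = gaussian_features(x, centers, width)
            error = target(x) - rbf_output(weights, phi)
            norm = sum(s * s for s in phi) + 1e-9  # avoid division by zero
            weights = [w + rate * error * s / norm
                       for w, s in zip(weights, phi)]
    return weights


# Assumed setup: 9 evenly spaced centers on [-2, 2], target sin(x).
centers = [-2.0 + 0.5 * i for i in range(9)]
width = 0.5
samples = [-2.0 + 0.1 * i for i in range(41)]

weights = train_rbf(math.sin, centers, width, samples)
max_error = max(
    abs(math.sin(x) - rbf_output(weights, gaussian_features(x, centers, width)))
    for x in samples
)
print(f"max approximation error on the training grid: {max_error:.4f}")
```

In an actor–critic arrangement of the kind the summary describes, two such approximators would typically be used, one estimating the performance index and one producing the control input, with the weight updates driven by the closed-loop error rather than by a known target.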