Reinforcement learning for robust adaptive control of partially unknown nonlinear systems subject to unmatched uncertainties

This paper proposes a novel robust adaptive control strategy for partially unknown continuous-time nonlinear systems subject to unmatched uncertainties. Initially, the robust nonlinear control problem is converted into a nonlinear optimal control problem by constructing an appropriate value function...

Full description

Saved in:

Bibliographic Details
Published in	Information sciences Vol. 463-464; pp. 307 - 322
Main Authors	Yang, Xiong, He, Haibo, Wei, Qinglai, Luo, Biao
Format	Journal Article
Language	English
Published	Elsevier Inc 01.10.2018
Subjects	Adaptive dynamic programming Neural networks Optimal control Reinforcement learning Robust control Unmatched uncertainty Robust control Unmatched uncertainty Adaptive dynamic programming Neural networks Optimal control Reinforcement learning
Online Access	Get full text

Cover

Loading…

More Information
Summary:	This paper proposes a novel robust adaptive control strategy for partially unknown continuous-time nonlinear systems subject to unmatched uncertainties. Initially, the robust nonlinear control problem is converted into a nonlinear optimal control problem by constructing an appropriate value function for the auxiliary system. After that, within the framework of reinforcement learning, an identifier-critic architecture is developed. The presented architecture uses two neural networks: the identifier neural network (INN) which aims at estimating the unknown internal dynamics and the critic neural network (CNN) which tends to derive the approximate solution of the Hamilton-Jacobi-Bellman equation arising in the obtained optimal control problem. The INN is updated by using both the back-propagation algorithm and the e-modification technique. Meanwhile, the CNN is updated via the modified gradient descent method, which uses historical and current state data simultaneously. Based on the classic Lyapunov technique, all the signals in the closed-loop auxiliary system are proved to be uniformly ultimately bounded. Moreover, the original system is kept asymptotically stable under the obtained approximate optimal control. Finally, two illustrative examples, including the F-16 aircraft plant, are provided to demonstrate the effectiveness of the developed method.
ISSN:	0020-0255 1872-6291
DOI:	10.1016/j.ins.2018.06.022