Provably safe and robust learning-based model predictive control

Bibliographic Details
Published in Automatica (Oxford) Vol. 49; no. 5; pp. 1216-1226
Main Authors Aswani, Anil; Gonzalez, Humberto; Sastry, S. Shankar; Tomlin, Claire
Format Journal Article
Language English
Published Kidlington Elsevier Ltd 01.05.2013
Elsevier
Summary: Controller design faces a trade-off between robustness and performance, and the reliability of linear controllers has caused many practitioners to focus on the former. However, there is renewed interest in improving system performance to deal with growing energy constraints. This paper describes a learning-based model predictive control (LBMPC) scheme that provides deterministic guarantees on robustness, while statistical identification tools are used to identify richer models of the system in order to improve performance; the benefits of this framework are that it handles state and input constraints, optimizes system performance with respect to a cost function, and can be designed to use a wide variety of parametric or nonparametric statistical tools. The main insight of LBMPC is that safety and performance can be decoupled under reasonable conditions in an optimization framework by maintaining two models of the system. The first is an approximate model with bounds on its uncertainty, and the second model is updated by statistical methods. LBMPC improves performance by choosing inputs that minimize a cost subject to the learned dynamics, and it ensures safety and robustness by checking whether these same inputs keep the approximate model stable when it is subject to uncertainty. Furthermore, we show that if the system is sufficiently excited, then the LBMPC control action probabilistically converges to that of an MPC computed using the true dynamics.
ISSN: 0005-1098, 1873-2836
DOI: 10.1016/j.automatica.2013.02.003
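
Illustration: the two-model idea in the Summary can be sketched in a few lines of Python. This is a toy sketch under stated assumptions, not the formulation from the paper: it uses a scalar system, a one-step horizon, and a grid search over inputs, where the paper works with a full MPC optimization. The function name `lbmpc_one_step`, the `learned_correction` callable, the cost weights, and all default parameter values are hypothetical and chosen only for illustration.

```python
from typing import Callable, Optional


def lbmpc_one_step(
    x: float,
    learned_correction: Callable[[float], float],
    a: float = 1.0,       # assumed approximate-model dynamics: x+ = a*x + b*u + d
    b: float = 1.0,
    d_max: float = 0.2,   # assumed bound on the model uncertainty, |d| <= d_max
    x_max: float = 1.0,   # assumed state constraint, |x| <= x_max
    u_max: float = 1.0,   # assumed input constraint, |u| <= u_max
    n_grid: int = 201,
) -> Optional[float]:
    """Return the input with the lowest cost under the learned model among
    inputs that keep the approximate model inside |x| <= x_max for every
    disturbance |d| <= d_max. Returns None if no candidate input is safe."""
    best_u, best_cost = None, float("inf")
    for i in range(n_grid):
        u = -u_max + 2.0 * u_max * i / (n_grid - 1)

        # Safety check: uses only the approximate model and its uncertainty bound.
        x_nominal = a * x + b * u
        if abs(x_nominal) + d_max > x_max:
            continue

        # Performance: the cost is evaluated on the learned (statistical) model.
        x_learned = x_nominal + learned_correction(x)
        cost = x_learned ** 2 + 0.1 * u ** 2
        if cost < best_cost:
            best_u, best_cost = u, cost
    return best_u


if __name__ == "__main__":
    # Hypothetical learned model: a constant offset the approximate model misses.
    print(lbmpc_one_step(0.5, lambda x: 0.1))
```

The design point the sketch tries to show is the decoupling: `learned_correction` can be arbitrarily wrong without affecting which inputs pass the safety check, because that check never consults the learned model.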