Provably safe and robust learning-based model predictive control

Bibliographic Details
Published in	Automatica (Oxford), Vol. 49, no. 5, pp. 1216-1226
Main Authors	Aswani, Anil; Gonzalez, Humberto; Sastry, S. Shankar; Tomlin, Claire
Format	Journal Article
Language	English
Published	Kidlington: Elsevier Ltd, 01.05.2013
Subjects	Adaptative systems | Adaptive control | Applied sciences | Computer science; control theory; systems | Control synthesis | Control system synthesis | Control theory. Systems | Cost function | Deterministic approach | Energy consumption | Exact sciences and technology | Intelligent control | Internal model control | Learning control | Model matching | Model predictive control | Modeling | Optimization | Predictive control | Reference model | Reliability | Robustness | Safety analysis | State constraint | Statistical analysis | Statistical method | Statistics | Uncertain system

Summary:	Controller design faces a trade-off between robustness and performance, and the reliability of linear controllers has caused many practitioners to focus on the former. However, there is renewed interest in improving system performance to deal with growing energy constraints. This paper describes a learning-based model predictive control (LBMPC) scheme that provides deterministic guarantees on robustness, while statistical identification tools are used to identify richer models of the system in order to improve performance; the benefits of this framework are that it handles state and input constraints, optimizes system performance with respect to a cost function, and can be designed to use a wide variety of parametric or nonparametric statistical tools. The main insight of LBMPC is that safety and performance can be decoupled under reasonable conditions in an optimization framework by maintaining two models of the system. The first is an approximate model with bounds on its uncertainty, and the second model is updated by statistical methods. LBMPC improves performance by choosing inputs that minimize a cost subject to the learned dynamics, and it ensures safety and robustness by checking whether these same inputs keep the approximate model stable when it is subject to uncertainty. Furthermore, we show that if the system is sufficiently excited, then the LBMPC control action probabilistically converges to that of an MPC computed using the true dynamics.
ISSN:	0005-1098, 1873-2836
DOI:	10.1016/j.automatica.2013.02.003
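
Illustrative sketch (not from the paper): the two-model idea in the summary can be shown on a toy scalar system. This is a minimal sketch under strong simplifying assumptions, not the formulation in the article. The horizon is one step, inputs are searched on a grid, and the robust safety check is a simple worst-case bound instead of the reachability and invariance machinery a full LBMPC design would use. All names (`lbmpc_step`, `simulate`, `theta_hat`, `d_max`, `x_max`, `u_max`) and all numeric values are hypothetical. Safety is checked on the nominal model `x+ = a*x + b*u + d` with `|d| <= d_max`, while the cost is evaluated on a learned model `x+ = a*x + b*u + theta_hat*x`.

```python
import numpy as np


def lbmpc_step(x, theta_hat, a=1.0, b=1.0, d_max=0.2, x_max=1.0,
               u_max=1.0, n_grid=201):
    """Pick the input that is best under the learned model among the inputs
    that are robustly safe under the nominal model (toy, one-step horizon)."""
    candidates = np.linspace(-u_max, u_max, n_grid)

    # Safety: worst case over |d| <= d_max of the nominal next state
    # must stay inside the state constraint |x| <= x_max.
    nominal_next = a * x + b * candidates
    safe = np.abs(nominal_next) + d_max <= x_max
    if not safe.any():
        raise ValueError("no robustly safe input from this state")

    # Performance: quadratic cost predicted with the learned model.
    learned_next = nominal_next + theta_hat * x
    cost = learned_next ** 2 + 0.1 * candidates ** 2
    cost = np.where(safe, cost, np.inf)  # unsafe inputs are never chosen
    return float(candidates[np.argmin(cost)])


def simulate(x0=0.7, theta_true=0.15, steps=20, a=1.0, b=1.0):
    """Closed loop on an assumed true system x+ = a*x + b*u + theta_true*x,
    with theta_hat refit by least squares on the nominal-model residuals."""
    xs = [x0]
    num = den = 0.0
    theta_hat = 0.0
    for _ in range(steps):
        x = xs[-1]
        u = lbmpc_step(x, theta_hat, a=a, b=b)
        x_next = a * x + b * u + theta_true * x

        # Residual the nominal model fails to explain: r = theta_true * x.
        residual = x_next - (a * x + b * u)
        num += x * residual
        den += x * x
        if den > 0.0:
            theta_hat = num / den
        xs.append(x_next)
    return np.array(xs), theta_hat


if __name__ == "__main__":
    states, theta_hat = simulate()
    print("max |x| along the trajectory:", np.abs(states).max())
    print("learned theta_hat:", theta_hat)
```

In this sketch the learned parameter only influences which safe input is selected. The set of admissible inputs depends solely on the nominal model and its uncertainty bound, so a poor estimate can degrade performance but cannot cause a constraint violation as long as the true model mismatch stays within `d_max`.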