Differentiable predictive control: Deep learning alternative to explicit model predictive control for unknown nonlinear systems

We present differentiable predictive control (DPC) as a deep learning-based alternative to the explicit model predictive control (MPC) for unknown nonlinear systems. In the DPC framework, a neural state-space model is learned from time-series measurements of the system dynamics. The neural control p...

Full description

Saved in:
Bibliographic Details
Published inJournal of process control Vol. 116; no. C; pp. 80 - 92
Main Authors Drgoňa, Ján, Kiš, Karol, Tuor, Aaron, Vrabie, Draguna, Klaučo, Martin
Format Journal Article
LanguageEnglish
Published United Kingdom Elsevier Ltd 01.08.2022
Elsevier
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:We present differentiable predictive control (DPC) as a deep learning-based alternative to the explicit model predictive control (MPC) for unknown nonlinear systems. In the DPC framework, a neural state-space model is learned from time-series measurements of the system dynamics. The neural control policy is then optimized via stochastic gradient descent approach by differentiating the MPC loss function through the closed-loop system dynamics model. The proposed DPC method learns model-based control policies with state and input constraints, while supporting time-varying references and constraints. In embedded implementation using a Raspberry-Pi platform, we experimentally demonstrate that it is possible to train constrained control policies purely based on the measurements of the unknown nonlinear system. We compare the control performance of the DPC method against explicit MPC and report efficiency gains in online computational demands, memory requirements, policy complexity, and construction time. In particular, we show that our method scales linearly compared to exponential scalability of the explicit MPC solved via multiparametric programming. •Data-driven differentiable parametric optimization approach to optimal control problem.•Learning of neural state-space dynamics models and predictive constrained control policies.•Linear scalability in terms of the problem complexity.•Significantly improved computational and memory footprints compared to explicit MPC.•Experimental demonstration using embedded hardware.
Bibliography:USDOE
AC05-76RL01830
PNNL-SA-162137
ISSN:0959-1524
1873-2771
DOI:10.1016/j.jprocont.2022.06.001