Differentiable predictive control: Deep learning alternative to explicit model predictive control for unknown nonlinear systems
We present differentiable predictive control (DPC) as a deep learning-based alternative to the explicit model predictive control (MPC) for unknown nonlinear systems. In the DPC framework, a neural state-space model is learned from time-series measurements of the system dynamics. The neural control p...
Saved in:
Published in | Journal of process control Vol. 116; no. C; pp. 80 - 92 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published |
United Kingdom
Elsevier Ltd
01.08.2022
Elsevier |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | We present differentiable predictive control (DPC) as a deep learning-based alternative to the explicit model predictive control (MPC) for unknown nonlinear systems. In the DPC framework, a neural state-space model is learned from time-series measurements of the system dynamics. The neural control policy is then optimized via stochastic gradient descent approach by differentiating the MPC loss function through the closed-loop system dynamics model. The proposed DPC method learns model-based control policies with state and input constraints, while supporting time-varying references and constraints. In embedded implementation using a Raspberry-Pi platform, we experimentally demonstrate that it is possible to train constrained control policies purely based on the measurements of the unknown nonlinear system. We compare the control performance of the DPC method against explicit MPC and report efficiency gains in online computational demands, memory requirements, policy complexity, and construction time. In particular, we show that our method scales linearly compared to exponential scalability of the explicit MPC solved via multiparametric programming.
•Data-driven differentiable parametric optimization approach to optimal control problem.•Learning of neural state-space dynamics models and predictive constrained control policies.•Linear scalability in terms of the problem complexity.•Significantly improved computational and memory footprints compared to explicit MPC.•Experimental demonstration using embedded hardware. |
---|---|
Bibliography: | USDOE AC05-76RL01830 PNNL-SA-162137 |
ISSN: | 0959-1524 1873-2771 |
DOI: | 10.1016/j.jprocont.2022.06.001 |