State constrained stochastic optimal control for continuous and hybrid dynamical systems using DFBSDE

We develop a computationally efficient learning-based forward–backward stochastic differential equations (FBSDE) controller for both continuous and hybrid dynamical (HD) systems subject to stochastic noise and state constraints. Solutions to stochastic optimal control (SOC) problems satisfy the Hami...

Full description

Saved in:
Bibliographic Details
Published inAutomatica (Oxford) Vol. 155; p. 111146
Main Authors Dai, Bolun, Krishnamurthy, Prashanth, Papanicolaou, Andrew, Khorrami, Farshad
Format Journal Article
LanguageEnglish
Published Elsevier Ltd 01.09.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:We develop a computationally efficient learning-based forward–backward stochastic differential equations (FBSDE) controller for both continuous and hybrid dynamical (HD) systems subject to stochastic noise and state constraints. Solutions to stochastic optimal control (SOC) problems satisfy the Hamilton–Jacobi–Bellman (HJB) equation. Using current FBSDE-based solutions, the optimal control can be obtained from the HJB equations using deep neural networks (e.g., long short-term memory (LSTM) networks). To ensure the learned controller respects the constraint boundaries, we enforce the state constraints using a soft penalty function. In addition to previous works, we adapt the deep FBSDE (DFBSDE) control framework to handle HD systems consisting of continuous dynamics and a deterministic discrete state change. We demonstrate our proposed algorithm in simulation on a continuous nonlinear system (cart–pole) and a hybrid nonlinear system (five-link biped).
ISSN:0005-1098
1873-2836
DOI:10.1016/j.automatica.2023.111146