Design of experiments for the calibration of history-dependent models via deep reinforcement learning and an enhanced Kalman filter

Bibliographic Details
Published in	Computational Mechanics, Vol. 72, no. 1, pp. 95-124
Main Authors	Villarreal, Ruben; Vlassis, Nikolaos N.; Phan, Nhon N.; Catanach, Tommie A.; Jones, Reese E.; Trask, Nathaniel A.; Kramer, Sharlotte L. B.; Sun, WaiChing
Format	Journal Article
Language	English
Published	Berlin/Heidelberg: Springer Berlin Heidelberg, 01.07.2023 (Springer; Springer Nature B.V.)
Subjects	Algorithms; Analysis; Calibration; Classical and Continuum Physics; Computational Science and Engineering; Data acquisition; Decision trees; Deep learning; Deep reinforcement learning; Design of experiments; Elastoplasticity; Engineering; Enhanced Kalman filter; Experimental design; Kalman filters; Machine learning; Markov processes; Mechanical tests; Original Paper; Theoretical and Applied Mechanics

Summary:	Experimental data are often costly to obtain, which makes it difficult to calibrate complex models. For many models an experimental design that produces the best calibration given a limited experimental budget is not obvious. This paper introduces a deep reinforcement learning (RL) algorithm for design of experiments that maximizes the information gain measured by Kullback–Leibler divergence obtained via the Kalman filter (KF). This combination enables experimental design for rapid online experiments where manual trial-and-error is not feasible in the high-dimensional parametric design space. We formulate possible configurations of experiments as a decision tree and a Markov decision process, where a finite choice of actions is available at each incremental step. Once an action is taken, a variety of measurements are used to update the state of the experiment. This new data leads to a Bayesian update of the parameters by the KF, which is used to enhance the state representation. In contrast to the Nash–Sutcliffe efficiency index, which requires additional sampling to test hypotheses for forward predictions, the KF can lower the cost of experiments by directly estimating the values of new data acquired through additional actions. In this work our applications focus on mechanical testing of materials. Numerical experiments with complex, history-dependent models are used to verify the implementation and benchmark the performance of the RL-designed experiments.
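The summary describes scoring an experimental action by the information gain of a Kalman-filter parameter update, measured as a Kullback-Leibler divergence. The following is a minimal sketch of that idea only, assuming a linear-Gaussian measurement model y = H θ + noise; it is not the paper's enhanced Kalman filter or its RL algorithm. The function names (`kalman_update`, `gaussian_kl`) and all numerical values are hypothetical and chosen for illustration.

```python
import numpy as np


def kalman_update(mean, cov, H, R, y):
    """Standard linear Kalman measurement update of a Gaussian belief.

    Assumed model (illustrative): y = H @ theta + noise, noise ~ N(0, R).
    """
    S = H @ cov @ H.T + R                      # innovation covariance
    K = np.linalg.solve(S, H @ cov).T          # gain K = cov H^T S^{-1}
    new_mean = mean + K @ (y - H @ mean)
    new_cov = (np.eye(len(mean)) - K @ H) @ cov
    return new_mean, new_cov


def gaussian_kl(mean_q, cov_q, mean_p, cov_p):
    """KL(q || p) between two multivariate Gaussians."""
    k = len(mean_p)
    diff = mean_p - mean_q
    cov_p_inv = np.linalg.inv(cov_p)
    _, logdet_p = np.linalg.slogdet(cov_p)
    _, logdet_q = np.linalg.slogdet(cov_q)
    return 0.5 * (
        np.trace(cov_p_inv @ cov_q)
        + diff @ cov_p_inv @ diff
        - k
        + logdet_p
        - logdet_q
    )


# Hypothetical one-parameter example: prior N(0, 1), direct noisy observation.
prior_mean = np.array([0.0])
prior_cov = np.array([[1.0]])
H = np.array([[1.0]])       # assumed observation matrix
R = np.array([[1.0]])       # assumed measurement-noise covariance
y = np.array([2.0])         # made-up measurement

post_mean, post_cov = kalman_update(prior_mean, prior_cov, H, R, y)

# Information gain of this measurement: KL(posterior || prior).
gain = gaussian_kl(post_mean, post_cov, prior_mean, prior_cov)
print(post_mean, post_cov, gain)  # mean 1.0, variance 0.5, gain about 0.597
```

In a design-of-experiments loop, a quantity like `gain` could serve as the reward for the action that produced the measurement; how the paper shapes its reward and state representation is not specified in this record.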
ISSN:	0178-7675, 1432-0924
DOI:	10.1007/s00466-023-02335-6