Reinforcement learning-based approach for optimizing solvent-switch processes

In chemical and pharmaceutical industries, process control optimization is a crucial step to improve economical efficiency and the environmental impact. The current state-of-practice heavily relies on expert knowledge and extensive lab experiments. This not only increases the development time but al...

Full description

Saved in:
Bibliographic Details
Published inComputers & chemical engineering Vol. 176; p. 108310
Main Authors Elmaz, Furkan, Di Caprio, Ulderico, Wu, Min, Wouters, Yentl, Van Der Vorst, Geert, Vandervoort, Niels, Anwar, Ali, Leblebici, M. Enis, Hellinckx, Peter, Mercelis, Siegfried
Format Journal Article
LanguageEnglish
Published Elsevier Ltd 01.08.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In chemical and pharmaceutical industries, process control optimization is a crucial step to improve economical efficiency and the environmental impact. The current state-of-practice heavily relies on expert knowledge and extensive lab experiments. This not only increases the development time but also limits the discovery of new strategies. In this study, we propose Reinforcement Learning-based optimization approach for solvent-switch processes. We utilize a digital twin as the environment for a process designed to switch the THF to 1-propanol. A reward function is created for minimizing the process time and constraints are implemented using logarithmic barrier functions. A PPO agent is trained on the environment. The agent proposed a novel strategy that combines two conventionally separate phases, evaporation and constant volume distillation. This strategy resulted in an overall cost decrease of 24.9% compared to the baseline strategy. Moreover, results were verified experimentally on a pilot plant of Johnson & Johnson (J&J). •Reinforcement Learning-based approach is proposed to optimize solvent-switch processes.•Logarithmic barrier functions are utilized for handling constraints which led to stable training.•Proposed control strategy led to 24.9% decrease in cost compared to the state-of-the-art.•Results are validated experimentally on a pilot plant of Johnson & Johnson.
ISSN:0098-1354
1873-4375
DOI:10.1016/j.compchemeng.2023.108310