Machine learning-based solubility prediction and methodology evaluation of active pharmaceutical ingredients in industrial crystallization

Bibliographic Details
Published in Frontiers of chemical science and engineering Vol. 16; no. 4; pp. 523-535
Main Authors Ma, Yiming; Gao, Zhenguo; Shi, Peng; Chen, Mingyang; Wu, Songgu; Yang, Chao; Wang, Jingkang; Cheng, Jingcai; Gong, Junbo
Format Journal Article
Language English
Published Beijing Higher Education Press 01.04.2022
Springer Nature B.V
Summary: Solubility is widely regarded as a fundamental property of small-molecule drugs and drug candidates because it has a profound impact on the crystallization process. Solubility prediction, an alternative to experiments that can reduce waste and improve crystallization process efficiency, has attracted increasing attention, but many urgent challenges remain. Herein, we used seven descriptors based on an understanding of dissolution behavior to establish two solubility prediction models with machine learning (ML) algorithms. Solubility data for 120 active pharmaceutical ingredients (APIs) in ethanol were used in the prediction models, which were constructed with random decision forests and an artificial neural network, with optimized data structure and model accuracy. The models were then compared with traditional prediction methods, including the modified solubility equation and the quantitative structure-property relationships model. The ML models showed the highest accuracy on the testing set, indicating the best solubility prediction ability among the methods compared. Multiple linear regression and stepwise regression were used to further investigate the critical factors determining the solubility value. The results revealed that the API properties and the solute-solvent interaction both make a non-negligible contribution to the solubility value.
ISSN: 2095-0179, 2095-0187
DOI: 10.1007/s11705-021-2083-5