Transfer learning for materials informatics using crystal graph convolutional neural network

[Display omitted] •Transfer learning (TL) is re-using knowledge accumulated in big pretrained model.•We apply TL to crystal graph convolutional neural network (CGCNN).•TL improves predictions of various target properties suffering from their small data.•TL becomes powerful when bigger pretrained mod...

Full description

Saved in:
Bibliographic Details
Published inComputational materials science Vol. 190; p. 110314
Main Authors Lee, Joohwi, Asahi, Ryoji
Format Journal Article
LanguageEnglish
Published Elsevier B.V 01.04.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:[Display omitted] •Transfer learning (TL) is re-using knowledge accumulated in big pretrained model.•We apply TL to crystal graph convolutional neural network (CGCNN).•TL improves predictions of various target properties suffering from their small data.•TL becomes powerful when bigger pretrained model is employed.•TL becomes powerful when pretrained and target models are strongly correlated. For successful applications of machine learning in materials informatics, it is necessary to overcome the inaccuracy of predictions ascribed to insufficient amount of data. In this study, we propose a transfer learning using a crystal graph convolutional neural network (TL-CGCNN). Herein, TL-CGCNN is pretrained with big data such as formation energies for crystal structures, and then used for predicting target properties with relatively small data. We confirm that TL-CGCNN can improve predictions of various properties such as bulk moduli, dielectric constants, and quasiparticle band gaps, which are computationally demanding, to construct big data for materials. Moreover, we quantitatively observe that the prediction of properties in target models via TL-CGCNN becomes more accurate with an increase in size of training dataset in pretrained models. Finally, we confirm that TL-CGCNN is superior to other regression methods in the predictions of target properties, which suffer from small amount of data. Therefore, we conclude that TL-CGCNN is promising along with compiling big data for materials that are easy to accumulate and relevant to the target properties.
ISSN:0927-0256
1879-0801
DOI:10.1016/j.commatsci.2021.110314