Gaussian process regression for prediction and confidence analysis of fruit traits by near-infrared spectroscopy

Abstract Detection of fruit traits by using near-infrared (NIR) spectroscopy may encounter out-of-distribution samples that exceed the generalization ability of a constructed calibration model. Therefore, confidence analysis for a given prediction is required, but this cannot be done using common ca...

Full description

Saved in:
Bibliographic Details
Published inFood quality and safety Vol. 7
Main Authors Chen, Xiaojing, Xue, Jianxia, Chen, Xiao, Zhao, Xinyu, Ali, Shujat, Huang, Guangzao
Format Journal Article
LanguageEnglish
Published UK Oxford University Press 01.01.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Abstract Detection of fruit traits by using near-infrared (NIR) spectroscopy may encounter out-of-distribution samples that exceed the generalization ability of a constructed calibration model. Therefore, confidence analysis for a given prediction is required, but this cannot be done using common calibration models of NIR spectroscopy. To address this issue, this paper studied the Gaussian process regression (GPR) for fruit traits detection using NIR spectroscopy. The mean and variance of the GPR were used as the predicted value and confidence, respectively. To show this, a real NIR data set related to dry matter content measurements in mango was used. Compared to partial least squares regression (PLSR), GPR showed approximately 14% lower root mean squared error (RMSE) for the in-distribution test set. Compared with no confidence analysis, using the variance of GPR to remove abnormal samples made GPR and PLSR showed approximately 58% and 10% lower RMSE on the mixed distribution test set, respectively (when the type 1 error rate was set to 0.1). Compared with traditional one-class classification methods, the variance of the GPR can be used to effectively eliminate poorly predicted samples.
ISSN:2399-1399
2399-1402
DOI:10.1093/fqsafe/fyac068