Simulation of Daily Diffuse Solar Radiation Based on Three Machine Learning Models

Solar radiation is an important parameter in the fields of computer modeling, engineering technology and energy development. This paper evaluated the ability of three machine learning models, i.e., Extreme Gradient Boosting (XGBoost), Support Vector Machine (SVM) and Multivariate Adaptive Regression...

Full description

Saved in:
Bibliographic Details
Published inComputer modeling in engineering & sciences Vol. 123; no. 1; pp. 49 - 73
Main Authors Dong, Jianhua, Wu, Lifeng, Liu, Xiaogang, Fan, Cheng, Leng, Menghui, Yang, Qiliang
Format Journal Article
LanguageEnglish
Published Henderson Tech Science Press 01.01.2020
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Solar radiation is an important parameter in the fields of computer modeling, engineering technology and energy development. This paper evaluated the ability of three machine learning models, i.e., Extreme Gradient Boosting (XGBoost), Support Vector Machine (SVM) and Multivariate Adaptive Regression Splines (MARS), to estimate the daily diffuse solar radiation (Rd). The regular meteorological data of 1966-2015 at five stations in China were taken as the input parameters (including mean average temperature (Ta), theoretical sunshine duration (N), actual sunshine duration (n), daily average air relative humidity (RH), and extra-terrestrial solar radiation (Ra)). And their estimation accuracies were subjected to comparative analysis. The three models were first trained using meteorological data from 1966 to 2000. Then, the 2001-2015 data was used to test the trained machine learning model. The results show that the XGBoost had better accuracy than the other two models in coefficient of determination (R2), root mean square error (RMSE), mean bias error (MBE) and normalized root mean square error (NRMSE). The MARS performed better in the training phase than the testing phase, but became less accurate in the testing phase, with the R2 value falling by 2.7-16.9% on average. By contrast, the R2 values of SVM and XGBoost increased by 2.9-12.2% and 1.9-14.3%, respectively. Despite trailing slightly behind the SVM at the Beijing station, the XGBoost showed good performance at the rest of the stations in the two phases. In the training phase, the accuracy growth is small but observable. In addition, the XGBoost had a slightly lower RMSE than the SVM, a signal of its edge in stability. Therefore, the three machine learning models can estimate the daily Rd based on local inputs and the XGBoost stands out for its excellent performance and stability.
Bibliography:1526-1492(20200401)123:1L.49;1-
ISSN:1526-1492
1526-1506
1526-1506
DOI:10.32604/cmes.2020.09014