Power Transformer Defect Prediction Method Based on SMOTE and Random Forest Algorithm

Dissolved gas analysis (DGA) in oil is an essential approach for transformer defect prediction. Most of the transformer defect prediction studies use artificial intelligence methods to build individual classifiers. Artificial intelligence techniques are highly sensitive to data. Transformer data is...

Full description

Saved in:

Bibliographic Details
Published in	2022 IEEE International Conference on High Voltage Engineering and Applications (ICHVE) pp. 1 - 4
Main Authors	Wang, Xuliang, Zhai, Yuhui, Gu, Yuanli, Li, Shuqi, Zhang, Hongru, Li, Qingquan, Liu, Hongshun
Format	Conference Proceeding
Language	English
Published	IEEE 25.09.2022
Subjects	Classification algorithms Data models Dissolved gas analysis (DGA) Fuel processing industries Oil insulation Power transformers Prediction algorithms Random Forest (RF) Supervised learning Synthetic minority oversampling technique (SMOTE) Transformer defect prediction
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Dissolved gas analysis (DGA) in oil is an essential approach for transformer defect prediction. Most of the transformer defect prediction studies use artificial intelligence methods to build individual classifiers. Artificial intelligence techniques are highly sensitive to data. Transformer data is often an unbalanced data set, which leads to supervised learning models that focus more on a larger variety of samples, resulting in poorer model performance. To address this situation, this paper uses the synthetic minority oversampling technique (SMOTE) algorithm to oversample a few classes and mitigate the class imbalance problem of the sample set. Compared with a single classifier, a cluster of classifiers can better mine the information of the data set. In this paper, a feedforward neural network (FNN) is used as the base learner to construct a random forest (RF) model, and the variability among the base learners is increased by mapping the samples to a high-dimensional space through kernel principal component analysis (KPCA). Experimental results on faulty samples as well as noisy samples show that SMOTE can significantly enhance the precision of the classifier, but the problem of classification ambiguity occurs between some species. KPCA makes the characteristics of the dataset more obvious. The diagnostic effectiveness and interference resistance of the random forest model constructed in this paper are superior compared to other individual classifier algorithms mentioned in this paper.
ISSN:	2474-3852
DOI:	10.1109/ICHVE53725.2022.9961386