Power Transformer Defect Prediction Method Based on SMOTE and Random Forest Algorithm

Dissolved gas analysis (DGA) in oil is an essential approach for transformer defect prediction. Most of the transformer defect prediction studies use artificial intelligence methods to build individual classifiers. Artificial intelligence techniques are highly sensitive to data. Transformer data is...

Full description

Saved in:
Bibliographic Details
Published in2022 IEEE International Conference on High Voltage Engineering and Applications (ICHVE) pp. 1 - 4
Main Authors Wang, Xuliang, Zhai, Yuhui, Gu, Yuanli, Li, Shuqi, Zhang, Hongru, Li, Qingquan, Liu, Hongshun
Format Conference Proceeding
LanguageEnglish
Published IEEE 25.09.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Dissolved gas analysis (DGA) in oil is an essential approach for transformer defect prediction. Most of the transformer defect prediction studies use artificial intelligence methods to build individual classifiers. Artificial intelligence techniques are highly sensitive to data. Transformer data is often an unbalanced data set, which leads to supervised learning models that focus more on a larger variety of samples, resulting in poorer model performance. To address this situation, this paper uses the synthetic minority oversampling technique (SMOTE) algorithm to oversample a few classes and mitigate the class imbalance problem of the sample set. Compared with a single classifier, a cluster of classifiers can better mine the information of the data set. In this paper, a feedforward neural network (FNN) is used as the base learner to construct a random forest (RF) model, and the variability among the base learners is increased by mapping the samples to a high-dimensional space through kernel principal component analysis (KPCA). Experimental results on faulty samples as well as noisy samples show that SMOTE can significantly enhance the precision of the classifier, but the problem of classification ambiguity occurs between some species. KPCA makes the characteristics of the dataset more obvious. The diagnostic effectiveness and interference resistance of the random forest model constructed in this paper are superior compared to other individual classifier algorithms mentioned in this paper.
ISSN:2474-3852
DOI:10.1109/ICHVE53725.2022.9961386