Contrastive feature-based learning-guided elevated deep reinforcement learning: Developing an imbalanced fault quantitative diagnosis under variable working conditions

Fault diagnosis is subject to the challenge of implementing model learning in the presence of small samples and imbalanced data (i.e., variable operating conditions), which is a fundamental and crucial problem that hinders their applications in real industrial scenarios. Herein, a novel deep reinfor...

Full description

Saved in:
Bibliographic Details
Published inMechanical systems and signal processing Vol. 211; p. 111192
Main Authors He, Shuilong, Cui, Qianwen, Chen, Jinglong, Pan, Tongyang, Hu, Chaofan
Format Journal Article
LanguageEnglish
Published Elsevier Ltd 01.04.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Fault diagnosis is subject to the challenge of implementing model learning in the presence of small samples and imbalanced data (i.e., variable operating conditions), which is a fundamental and crucial problem that hinders their applications in real industrial scenarios. Herein, a novel deep reinforcement learning strategy (SIMC-PERDRL) that combines SimCLR and elevated prioritized experience replay (PER) is proposed for machinery fault quantitative diagnosis in non-ideal data scenarios. First, unsupervised contrastive learning pre-trains the feature extraction layer to mine optimal discriminative features with more optimal intra-class compactness and inter-class separability to reduce inter-class overlap. Second, the experience priority is quantified by reward and TD error to enhance the learning frequency of rare high-value samples; the reward function is skillfully constructed using adaptive unbalanced distribution, which immensely increases the agent's sensitivity to minorities, and enhances the model's domain adaptability by dynamically fine-tuning the agent's decision through real-time feedback. Moreover, ResNet utilizes the Convolutional Block Attention Module (CBAM) to construct a deep Q-network; thus, the agent's learning ability of critical fault features is enhanced. Finally, SIMC-PERDRL was validated online using three rotating machinery datasets. The results indicate that the method can automatically realize accurate qualitative identification under different rotational speeds, different loads, and class unbalanced conditions, with excellent effectiveness, stability, and versatility.
ISSN:0888-3270
1096-1216
DOI:10.1016/j.ymssp.2024.111192