Class imbalanced problem: Taxonomy, open challenges, applications and state-of-the-art solutions

The study of machine learning has revealed that it can unleash new applications in a variety of disciplines. Many limitations limit their expressiveness, and researchers are working to overcome them to fully exploit the power of data-driven machine learning (ML) and deep learning (DL) techniques. Th...

Full description

Saved in:
Bibliographic Details
Published inChina communications pp. 1 - 29
Main Authors Bhat, Khursheed Ahmad, Sofi, Shabir Ahmad
Format Journal Article
LanguageEnglish
Published China Institute of Communications 2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The study of machine learning has revealed that it can unleash new applications in a variety of disciplines. Many limitations limit their expressiveness, and researchers are working to overcome them to fully exploit the power of data-driven machine learning (ML) and deep learning (DL) techniques. The data imbalance presents major hurdles for classification and prediction problems in machine learning, restricting data analytics and acquiring relevant insights in practically all real-world research domains. In visual learning, network information security, failure prediction, digital marketing, healthcare, and a variety of other domains, raw data suffers from a biased data distribution of one class over the other. This article aims to present a taxonomy of the approaches for handling imbalanced data problems and their comparative study on the classification metrics and their application areas. We have explored very recent trends of techniques employed for solutions to class imbalance problems in datasets and have also discussed their limitations. This article has also identified open challenges for further research in the direction of class data imbalance.
ISSN:1673-5447
DOI:10.23919/JCC.ea.2022-0448.202401