Classification of Imbalanced Data:Review of Methods and Applications

Abstract Imbalance in dataset enforces numerous challenges to implement data analytic in all existing real world applications using machine learning. Data imbalance occurs when sample size from a class is very small or large then another class. Performance of predicted models is greatly affected whe...

Full description

Saved in:
Bibliographic Details
Published inIOP conference series. Materials Science and Engineering Vol. 1099; no. 1; p. 12077
Main Authors Kumar, Pradeep, Bhatnagar, Roheet, Gaur, Kuntal, Bhatnagar, Anurag
Format Journal Article
LanguageEnglish
Published Bristol IOP Publishing 01.03.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Abstract Imbalance in dataset enforces numerous challenges to implement data analytic in all existing real world applications using machine learning. Data imbalance occurs when sample size from a class is very small or large then another class. Performance of predicted models is greatly affected when dataset is highly imbalanced and sample size increases. Overall, Imbalanced training data have a major negative impact on performance. Leading machine learning technique combat with imbalanced dataset by focusing on avoiding the minority class and reducing the inaccuracy for the majority class. This article presents a review of different approaches to classify imbalanced dataset and their application areas.
ISSN:1757-8981
1757-899X
DOI:10.1088/1757-899X/1099/1/012077