Deep transfer with minority data augmentation for imbalanced breast cancer dataset
Clinical diagnosis of breast cancer is a challenging problem in the biomedical domain. The BreakHis breast cancer histopathological image dataset consists of two classes: Benign (Minority class) and Malignant (Majority class). The imbalanced class distribution results in the degradation of performan...
Saved in:
Published in | Applied soft computing Vol. 97; p. 106759 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
Elsevier B.V
01.12.2020
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Clinical diagnosis of breast cancer is a challenging problem in the biomedical domain. The BreakHis breast cancer histopathological image dataset consists of two classes: Benign (Minority class) and Malignant (Majority class). The imbalanced class distribution results in the degradation of performance of the classifier model due to biased classification towards the majority class. To tackle this problem, a novel learning strategy that involves a deep transfer network has been proposed in this paper, in collaboration with Deep Convolution Generative Adversarial network (DCGAN). DCGAN is used in the initial phase for data augmentation of the minority class only. The dataset, with the class distribution now balanced, is applied as input to the deep transfer network. The proposed deep transfer architecture has at its core, the initial pre-trained layers (until block 4 pool layer) of the VGG16 deep network architecture pre-trained on the ImageNet object classification dataset. The higher end of our transfer network comprises of Batch Normalization, 2D Convolutional (CONV2D) layer, Global Average Pooling 2D, Dropout and Dense layers that are added to enhance the network’s performance. Experiments on the benchmark BreakHis dataset for different magnification factors: 40X, 100X, 200X and 400X validate the efficiency of the proposed deep transfer learning approach due to the high scores achieved as compared to the state-of-the-art deep networks.
•A novel deep transfer approach is proposed for the imbalanced Breast cancer dataset.•The assembled deep transfer network comprises of pre-trained VGG16 layers at the lower level.•Higher level contains 2D convolution, pooling, dropout, dense layers with Batch Normalization.•Deep Convolutional Generative Adversarial Network is used for minority data augmentation.•Proposed approach outperforms state of the art networks for different Magnification factors. |
---|---|
ISSN: | 1568-4946 1872-9681 |
DOI: | 10.1016/j.asoc.2020.106759 |