Multiple instance learning for histopathological breast cancer image classification

•Multiple Instance Learning (MIL) is a framework to deal with weakly supervised data.•We investigate the relevance of MIL for histopathological cancer image classification.•Benchmark of MIL methods on a public dataset shows the interest of non parametric MIL.•Compared to single instance learning, MI...

Full description

Saved in:

Bibliographic Details
Published in	Expert systems with applications Vol. 117; pp. 103 - 111
Main Authors	Sudharshan, P.J., Petitjean, Caroline, Spanhol, Fabio, Oliveira, Luiz Eduardo, Heutte, Laurent, Honeine, Paul
Format	Journal Article
Language	English
Published	New York Elsevier Ltd 01.03.2019 Elsevier BV
Subjects	Artificial intelligence Biomedical image processing Breast cancer CAI Cancer Computer assisted instruction Diagnosis Disease Expert systems Fuzzy logic Health care Histopathology Image acquisition Image classification Machine learning Medical imaging Multiple Instance Learning Patients State of the art Breast cancer Histopathology Multiple Instance Learning Biomedical image processing Image classification
Online Access	Get full text

Cover

Loading…

More Information
Summary:	•Multiple Instance Learning (MIL) is a framework to deal with weakly supervised data.•We investigate the relevance of MIL for histopathological cancer image classification.•Benchmark of MIL methods on a public dataset shows the interest of non parametric MIL.•Compared to single instance learning, MIL can bring some insight on such application. Histopathological images are the gold standard for breast cancer diagnosis. During examination several dozens of them are acquired for a single patient. Conventional, image-based classification systems make the assumption that all the patient’s images have the same label as the patient, which is rarely verified in practice since labeling the data is expensive. We propose a weakly supervised learning framework and investigate the relevance of Multiple Instance Learning (MIL) for computer-aided diagnosis of breast cancer patients, based on the analysis of histopathological images. Multiple instance learning consists in organizing instances (images) into bags (patients), without the need to label all the instances. We compare several state-of-the-art MIL methods including the pioneering ones (APR, Diverse Density, MI-SVM, citation-kNN), and more recent ones such as a non parametric method and a deep learning based approach (MIL-CNN). The experiments are conducted on the public BreaKHis dataset which contains about 8000 microscopic biopsy images of benign and malignant breast tumors, originating from 82 patients. Among the MIL methods the non-parametric approach has the best overall results, and in some cases allows to obtain classification rates never reached by conventional (single instance) classification frameworks. The comparison between MIL and single instance classification reveals the relevance of the MIL paradigm for the task at hand. In particular, the MIL allows to obtain comparable or better results than conventional (single instance) classification without the need to label all the images.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	0957-4174 1873-6793
DOI:	10.1016/j.eswa.2018.09.049