Stacking-based ensemble learning of decision trees for interpretable prostate cancer detection

Prostate cancer is a highly incident malignant cancer among men. Early detection of prostate cancer is necessary for deciding whether a patient should receive costly and invasive biopsy with possible serious complications. However, existing cancer diagnosis methods based on data mining only focus on...

Full description

Saved in:
Bibliographic Details
Published inApplied soft computing Vol. 77; pp. 188 - 204
Main Authors Wang, Yuyan, Wang, Dujuan, Geng, Na, Wang, Yanzhang, Yin, Yunqiang, Jin, Yaochu
Format Journal Article
LanguageEnglish
Published Elsevier B.V 01.04.2019
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Prostate cancer is a highly incident malignant cancer among men. Early detection of prostate cancer is necessary for deciding whether a patient should receive costly and invasive biopsy with possible serious complications. However, existing cancer diagnosis methods based on data mining only focus on diagnostic accuracy, while neglecting the interpretability of the diagnosis model that is necessary for helping doctors make clinical decisions. To take both accuracy and interpretability into consideration, we propose a stacking-based ensemble learning method that simultaneously constructs the diagnostic model and extracts interpretable diagnostic rules. For this purpose, a multi-objective optimization algorithm is devised to maximize the classification accuracy and minimize the ensemble complexity for model selection. As for model combination, a random forest classifier-based stacking technique is explored for the integration of base learners, i.e., decision trees. Empirical results on real-world data from the General Hospital of PLA demonstrate that the classification performance of the proposed method outperforms that of several state-of-the-art methods in terms of the classification accuracy, sensitivity and specificity. Moreover, the results reveal that several diagnostic rules extracted from the constructed ensemble learning model are accurate and interpretable. •We propose a stacking-based interpretable selective ensemble learning method.•We select ensemble models with accuracy and complexity under consideration.•We combine selected effective models by random forest-based stacking.•The proposed method is more accurate and interpretable in prostate cancer detection.•We extract a few of effective diagnostic rules for clinical decision support.
ISSN:1568-4946
1872-9681
DOI:10.1016/j.asoc.2019.01.015