Computational Model for Prediction of Malignant Mesothelioma Diagnosis

Mesothelioma is an aggressive lung cancer, harms the linings of the lungs. It is one of the deadliest cancers diagnosed in those exposed to fibrous silicate minerals (asbestos). Millions of people face severe consequences as they are diagnosed at late stages. This study presents a comparison of seve...

Full description

Saved in:
Bibliographic Details
Published inComputer journal Vol. 66; no. 1; pp. 86 - 100
Main Authors Gupta, Surbhi, Gupta, Manoj Kumar
Format Journal Article
LanguageEnglish
Published 17.01.2023
Online AccessGet full text

Cover

Loading…
More Information
Summary:Mesothelioma is an aggressive lung cancer, harms the linings of the lungs. It is one of the deadliest cancers diagnosed in those exposed to fibrous silicate minerals (asbestos). Millions of people face severe consequences as they are diagnosed at late stages. This study presents a comparison of several machine learning approaches with distinct feature sets and addresses the issue of class imbalance. The dataset used in this study is available publicly on the University of California Irvine (UCI) machine learning repository. This study used the resampling technique, synthetic minority oversampling technique (SMOTE), and adaptive synthetic sampling (ADASYN) to handle the class imbalance. Most of the machine learning strategies performed well with the resampling technique. The best accuracy using the resampling strategy was achieved by artificial neural networks (ANN). The highest accuracy was recorded on the feature set selected by principal component analysis (PCA) is 96%. Overall, ensemble techniques performed well. The proposed stacking-based classifier achieved the highest accuracy (89%) on data balanced using SMOTE and ADASYN.
ISSN:0010-4620
1460-2067
DOI:10.1093/comjnl/bxab146