Low-rank and sparse representation based learning for cancer survivability prediction

•Cancer survivability prediction is a significant problem to health professionals.•A novel classification algorithm is proposed using low-rank and sparse representation.•Low-rank alternative of raw inputs is trained using a sparsity-enhanced classifier.•Experiments show superior performance compared...

Full description

Saved in:
Bibliographic Details
Published inInformation sciences Vol. 582; pp. 573 - 592
Main Authors Yang, Jie, Ma, Jun, Win, Khin Than, Gao, Junbin, Yang, Zhenyu
Format Journal Article
LanguageEnglish
Published Elsevier Inc 01.01.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:•Cancer survivability prediction is a significant problem to health professionals.•A novel classification algorithm is proposed using low-rank and sparse representation.•Low-rank alternative of raw inputs is trained using a sparsity-enhanced classifier.•Experiments show superior performance compared to state-of-the-art approaches. Cancer survivability prediction has been of great interest to health professionals and researchers. The task refers to the procedure of estimating the potential survivability according to an individual’s medical history. The difficulty is that raw data is usually subject to some noise, such as missing values. To address this issue, we propose a novel low-rank and sparse representation-based learning algorithm, which consists of two main stages of data self expressiveness and classification. Firstly, in the data self expressiveness stage, raw inputs have been decomposed into one dictionary (which is enforced with a low-rank constraint) and one coefficient matrix (which is sparsely coded), respectively. Secondly, this sparse coefficient matrix is paired with sample labels for training during the classification stage. We further integrate these two stages and formulate them into an optimization problem, which is then solved using an iterative computational strategy. Theoretically, we analyze the convergence of the proposed algorithm. The relationship between the proposed algorithm and existing approaches are also discussed. The efficiency of the proposed algorithm is experimentally verified using several benchmarking classification problems and a public longitudinal dataset. Experimental results demonstrate that the proposed algorithm achieves superior performance in terms of affordable computational complexity and high prediction accuracy, compared to state-of-the-art approaches.
ISSN:0020-0255
1872-6291
DOI:10.1016/j.ins.2021.10.013