Low-rank and sparse representation based learning for cancer survivability prediction

•Cancer survivability prediction is a significant problem to health professionals.•A novel classification algorithm is proposed using low-rank and sparse representation.•Low-rank alternative of raw inputs is trained using a sparsity-enhanced classifier.•Experiments show superior performance compared...

Full description

Saved in:

Bibliographic Details
Published in	Information sciences Vol. 582; pp. 573 - 592
Main Authors	Yang, Jie, Ma, Jun, Win, Khin Than, Gao, Junbin, Yang, Zhenyu
Format	Journal Article
Language	English
Published	Elsevier Inc 01.01.2022
Subjects	Cancer survivability prediction Healthcare modeling Low-rank representation SEER dataset Sparse representation SEER dataset Sparse representation Low-rank representation Cancer survivability prediction Healthcare modeling
Online Access	Get full text

Cover

Loading…

More Information
Summary:	•Cancer survivability prediction is a significant problem to health professionals.•A novel classification algorithm is proposed using low-rank and sparse representation.•Low-rank alternative of raw inputs is trained using a sparsity-enhanced classifier.•Experiments show superior performance compared to state-of-the-art approaches. Cancer survivability prediction has been of great interest to health professionals and researchers. The task refers to the procedure of estimating the potential survivability according to an individual’s medical history. The difficulty is that raw data is usually subject to some noise, such as missing values. To address this issue, we propose a novel low-rank and sparse representation-based learning algorithm, which consists of two main stages of data self expressiveness and classification. Firstly, in the data self expressiveness stage, raw inputs have been decomposed into one dictionary (which is enforced with a low-rank constraint) and one coefficient matrix (which is sparsely coded), respectively. Secondly, this sparse coefficient matrix is paired with sample labels for training during the classification stage. We further integrate these two stages and formulate them into an optimization problem, which is then solved using an iterative computational strategy. Theoretically, we analyze the convergence of the proposed algorithm. The relationship between the proposed algorithm and existing approaches are also discussed. The efficiency of the proposed algorithm is experimentally verified using several benchmarking classification problems and a public longitudinal dataset. Experimental results demonstrate that the proposed algorithm achieves superior performance in terms of affordable computational complexity and high prediction accuracy, compared to state-of-the-art approaches.
ISSN:	0020-0255 1872-6291
DOI:	10.1016/j.ins.2021.10.013