A nonparametric feature screening method for ultrahigh-dimensional missing response

This paper addresses the feature screening issue for ultrahigh-dimensional data with responses missing at random. A novel nonparametric feature screening procedure is developed to identify the important features via the conditionally imputing marginal Spearman rank correlation. The proposed nonparam...

Full description

Saved in:
Bibliographic Details
Published inComputational statistics & data analysis Vol. 142; p. 106828
Main Authors Li, Xiaoxia, Tang, Niansheng, Xie, Jinhan, Yan, Xiaodong
Format Journal Article
LanguageEnglish
Published Elsevier B.V 01.02.2020
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:This paper addresses the feature screening issue for ultrahigh-dimensional data with responses missing at random. A novel nonparametric feature screening procedure is developed to identify the important features via the conditionally imputing marginal Spearman rank correlation. The proposed nonparametric screening approach has several desirable merits. First, it is nonparametric without assuming any regression form of predictors on response variable. Second, it is robust to outliers and heavy-tailed data. Third, under some regularity conditions, it is shown that the proposed feature screening procedure has the sure screening and ranking consistency properties. Simulation studies evidence that the proposed screening procedure outperforms several existing model-free screening procedures. An example taken from the microarray diffuse large-B-cell lymphoma study is used to illustrate the proposed methodologies.
ISSN:0167-9473
1872-7352
DOI:10.1016/j.csda.2019.106828