A new imputation method for incomplete binary data
In data analysis problems where the data are represented by vectors of real numbers, it is often the case that some of the data-points will have “missing values”, meaning that one or more of the entries of the vector that describes the data-point is not observed. In this paper, we propose a new appr...
Saved in:
Published in | Discrete Applied Mathematics Vol. 159; no. 10; pp. 1040 - 1047 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
Elsevier B.V
28.06.2011
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | In data analysis problems where the data are represented by vectors of real numbers, it is often the case that some of the data-points will have “missing values”, meaning that one or more of the entries of the vector that describes the data-point is not observed. In this paper, we propose a new approach to the imputation of missing binary values. The technique we introduce employs a “similarity measure” introduced by Anthony and Hammer (2006)
[1]. We compare experimentally the performance of our technique with ones based on the usual Hamming distance measure and multiple imputation. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 0166-218X 1872-6771 |
DOI: | 10.1016/j.dam.2011.01.024 |