Missing data in cross-sectional networks – An extensive comparison of missing data treatment methods
•Compares state of the art missing network data treatment methods.•Evaluation on descriptive statistics, link reconstruction, and model parameters.•Tested on simulated networks varying in structure and size.•Multiple imputation outperforms simpler methods.•Multiple imputation with Bayesian ERGMs ove...
Saved in:
Published in | Social networks Vol. 62; pp. 99 - 112 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
Amsterdam
Elsevier B.V
01.07.2020
Elsevier Science Ltd |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | •Compares state of the art missing network data treatment methods.•Evaluation on descriptive statistics, link reconstruction, and model parameters.•Tested on simulated networks varying in structure and size.•Multiple imputation outperforms simpler methods.•Multiple imputation with Bayesian ERGMs overall performed best.
This paper compares several missing data treatment methods for missing network data on a diverse set of simulated networks under several missing data mechanisms. We focus the comparison on three different outcomes: descriptive statistics, link reconstruction, and model parameters. The results indicate that the often used methods (analysis of available cases and null-tie imputation) lead to considerable bias on descriptive statistics with moderate or large proportions of missing data. Multiple imputation using sophisticated imputation models based on exponential random graph models (ERGMs) lead to acceptable biases in descriptive statistics and model parameters even under large amounts of missing data. For link reconstruction multiple imputation by simple ERGM performed well on small data sets, while missing data was more accurately imputed in larger data sets with multiple imputation by complex Bayesian ERGMs (BERGMs). |
---|---|
ISSN: | 0378-8733 1879-2111 1879-2111 |
DOI: | 10.1016/j.socnet.2020.02.004 |