When survey science met web tracking: Presenting an error framework for metered data
Metered data, also called web‐tracking data, are generally collected from a sample of participants who willingly install or configure, onto their devices, technologies that track digital traces left when people go online (e.g., URLs visited). Since metered data allow for the observation of online be...
Saved in:
Published in | Journal of the Royal Statistical Society. Series A, Statistics in society Vol. 185; no. Suppl 2; pp. S408 - S436 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
England
Oxford University Press
01.12.2022
John Wiley and Sons Inc |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Metered data, also called web‐tracking data, are generally collected from a sample of participants who willingly install or configure, onto their devices, technologies that track digital traces left when people go online (e.g., URLs visited). Since metered data allow for the observation of online behaviours unobtrusively, it has been proposed as a useful tool to understand what people do online and what impacts this might have on online and offline phenomena. It is crucial, nevertheless, to understand its limitations. Although some research have explored the potential errors of metered data, a systematic categorisation and conceptualisation of these errors are missing. Inspired by the Total Survey Error, we present a Total Error framework for digital traces collected with Meters (TEM). The TEM framework (1) describes the data generation and the analysis process for metered data and (2) documents the sources of bias and variance that may arise in each step of this process. Using a case study we also show how the TEM can be applied in real life to identify, quantify and reduce metered data errors. Results suggest that metered data might indeed be affected by the error sources identified in our framework and, to some extent, biased. This framework can help improve the quality of both stand‐alone metered data research projects, as well as foster the understanding of how and when survey and metered data can be combined. |
---|---|
Bibliography: | Funding information Fundación BBVA, H2020 European Research Council, , Grant/Award Number: 849165; Ministerio de Ciencia e Innovación, , Grant/Award Number: PID2019‐106867RB‐ I00 /AEI/10.13039/501100011033 ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 Funding information Fundación BBVA, H2020 European Research Council, , Grant/Award Number: 849165; Ministerio de Ciencia e Innovación, , Grant/Award Number: PID2019‐106867RB‐ I00 /AEI/10.13039/501100011033 |
ISSN: | 0964-1998 1467-985X |
DOI: | 10.1111/rssa.12956 |