Models of linkage error for capture-recapture estimation without clerical reviews

The capture-recapture method can be applied to measure the coverage of administrative and big data sources, in official statistics. In its basic form, it involves the linkage of two sources while assuming a perfect linkage and other standard assumptions. In practice, linkage errors arise and are a p...

Full description

Saved in:
Bibliographic Details
Published inarXiv.org
Main Authors Dasylva, Abel, Goussanou, Arthur, Christian-Olivier Nambeu
Format Paper
LanguageEnglish
Published Ithaca Cornell University Library, arXiv.org 18.03.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The capture-recapture method can be applied to measure the coverage of administrative and big data sources, in official statistics. In its basic form, it involves the linkage of two sources while assuming a perfect linkage and other standard assumptions. In practice, linkage errors arise and are a potential source of bias, where the linkage is based on quasi-identifiers. These errors include false positives and false negatives, where the former arise when linking a pair of records from different units, and the latter arise when not linking a pair of records from the same unit. So far, the existing solutions have resorted to costly clerical reviews, or they have made the restrictive conditional independence assumption. In this work, these requirements are relaxed by modeling the number of links from a record instead. The same approach may be taken to estimate the linkage accuracy without clerical reviews, when linking two sources that each have some undercoverage.
ISSN:2331-8422