Developing a representative community health survey sampling frame using open-source remote satellite imagery in Mozambique

Lack of accurate data on the distribution of sub-national populations in low- and middle-income countries impairs planning, monitoring, and evaluation of interventions. Novel, low-cost methods to develop unbiased survey sampling frames at sub-national, sub-provincial, and even sub-district levels ar...

Full description

Saved in:
Bibliographic Details
Published inInternational journal of health geographics Vol. 17; no. 1; p. 37
Main Authors Wagenaar, Bradley H, Augusto, Orvalho, Ásbjörnsdóttir, Kristjana, Akullian, Adam, Manaca, Nelia, Chale, Falume, Muanido, Alberto, Covele, Alfredo, Michel, Cathy, Gimbel, Sarah, Radford, Tyler, Girardot, Blake, Sherr, Kenneth
Format Journal Article
LanguageEnglish
Published England BioMed Central 29.10.2018
BMC
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Lack of accurate data on the distribution of sub-national populations in low- and middle-income countries impairs planning, monitoring, and evaluation of interventions. Novel, low-cost methods to develop unbiased survey sampling frames at sub-national, sub-provincial, and even sub-district levels are urgently needed. This article details our experience using remote satellite imagery to develop a provincial-level representative community survey sampling frame to evaluate the effects of a 7-year health system intervention in Sofala Province, Mozambique. Mozambique's most recent census was conducted in 2007, and no data are readily available to generate enumeration areas for representative health survey sampling frames. To remedy this, we partnered with the Humanitarian OpenStreetMap Team to digitize every building in Sofala and Manica provinces (685,189 Sofala; 925,713 Manica) using up-to-date remote satellite imagery, with final results deposited in the open-source OpenStreetMap database. We then created a probability proportional to size sampling frame by overlaying a grid of 2.106 km resolution (0.02 decimal degrees) across each province, and calculating the number of buildings within each grid square. Squares containing buildings were used as our primary sampling unit with replacement. Study teams navigated to the geographic center of each selected square using geographic positioning system coordinates, and then conducted a standard "random walk" procedure to select 20 households for each time a given square was selected. Based on sample size calculations, we targeted a minimum of 1500 households in each province. We selected 88 grids within each province to reach 1760 households, anticipating ongoing conflict and transport issues could preclude the inclusion of some clusters. Civil conflict issues forced the exclusion of 8 of 31 subdistricts in Sofala and 15 of 39 subdistricts in Manica. Using Android tablets, Open Data Kit software, and a remote RedCap data capture system, our final sample included 1549 households in Sofala (4669 adults; 4766 children; 33 missing age) and 1538 households in Manica (4422 adults; 4898 children; 33 missing age). Other implementation or evaluation teams may consider employing similar methods to track population distributions for health systems planning or the development of representative sampling frames using remote satellite imagery.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1476-072X
1476-072X
DOI:10.1186/s12942-018-0158-4