Quantitative analysis of population-scale family trees with millions of relatives
Published in | Science (American Association for the Advancement of Science) Vol. 360; no. 6385; pp. 171 - 175 |
---|---|
Format | Journal Article |
Language | English |
Published | United States, The American Association for the Advancement of Science, 13.04.2018 |
Summary: | Family trees have vast applications in fields as diverse as genetics, anthropology, and economics. However, the collection of extended family trees is tedious and usually relies on resources with limited geographical scope and complex data usage restrictions. We collected 86 million profiles from publicly available online data shared by genealogy enthusiasts. After extensive cleaning and validation, we obtained population-scale family trees, including a single pedigree of 13 million individuals. We leveraged the data to partition the genetic architecture of human longevity and to provide insights into the geographical dispersion of families. We also report a simple digital procedure to overlay other data sets with our resource. |
---|---|
Bibliography: | These authors equally contributed to this manuscript. |
ISSN: | 0036-8075, 1095-9203 |
DOI: | 10.1126/science.aam9309 |