Genetics in geographically structured populations: defining, estimating and interpreting FST
Key Points Wright's F -statistics, and especially F ST , provide important insights into the evolutionary processes that influence the structure of genetic variation within and among populations, and they are among the most widely used descriptive statistics in population and evolutionary genet...
Saved in:
Published in | Nature reviews. Genetics Vol. 10; no. 9; pp. 639 - 650 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
London
Nature Publishing Group UK
01.09.2009
Nature Publishing Group |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Key Points
Wright's
F
-statistics, and especially
F
ST
, provide important insights into the evolutionary processes that influence the structure of genetic variation within and among populations, and they are among the most widely used descriptive statistics in population and evolutionary genetics.
F
ST
is a property of the distribution of allele frequencies among populations. It reflects the joint effects of drift, migration, mutation and selection on the distribution of genetic variation among populations.
F
ST
has a central role in population and evolutionary genetics and has wide applications in fields from disease association mapping to forensic science.
F
ST
can be used to describe the distribution of genetic variation among any set of samples, but it is most usefully applied when the samples represent discrete units rather than arbitrary divisions along a continuous distribution.
Statistics related to
F
ST
can be useful for haplotype or microsatellite data if an appropriate measure of evolutionary distance among alleles is available.
Comparison of an estimate of
F
ST
from marker data with an estimate of
Q
ST
from continuously varying trait data can be used to detect selection, but the estimate of
F
ST
may depend on the choice of marker and the estimate of
Q
ST
may differ from neutral expectations if there is a non-additive component of genetic variance.
Although the simple relationship between
F
ST
and migration rates in Wright's island model makes it tempting to infer migration rates from
F
ST
, caution is needed if such an approach is to be used.
If estimates of
F
ST
from many loci are available, it may be possible to identify certain loci as 'outliers' that may have been subject to different patterns of selection or to different demographic processes.
Case–control studies for association-mapping studies must account for the possibility that population substructure accounts for an observed association between a marker and a disease. The genomic control method uses background estimates of
F
ST
to control for such substructure.
In forensic applications, the probabilities of obtaining a match are sometimes calculated for subpopulations that lack specific allele frequency data. A
θ
correction, in which
θ
is
F
ST
, is used to calculate the probability of a match using allele frequency information from a broader population that the subpopulation is part of.
The massive amount of data that is being generated by population genomics projects can be understood fundamentally as allelic variation at individual loci. We therefore expect
F
-statistics to be at least as useful in understanding these data sets as they have been in population and evolutionary genetics for most of the last century.
F
ST
describes the processes that lead to genetic differentiation among and within populations and is widely used in population and evolutionary genetics. This article describes the meaning of
F
ST
and how it should be estimated and interpreted.
Wright's
F
-statistics, and especially
F
ST
, provide important insights into the evolutionary processes that influence the structure of genetic variation within and among populations, and they are among the most widely used descriptive statistics in population and evolutionary genetics. Estimates of
F
ST
can identify regions of the genome that have been the target of selection, and comparisons of
F
ST
from different parts of the genome can provide insights into the demographic history of populations. For these reasons and others,
F
ST
has a central role in population and evolutionary genetics and has wide applications in fields that range from disease association mapping to forensic science. This Review clarifies how
F
ST
is defined, how it should be estimated, how it is related to similar statistics and how estimates of
F
ST
should be interpreted. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
ISSN: | 1471-0056 1471-0064 |
DOI: | 10.1038/nrg2611 |