Haplotype frequency estimation in patient populations: The effect of departures from Hardy-Weinberg proportions and collapsing over a locus in the HLA region
Haplotype analyses are an important area in the study of the genetic components of human disease. Associations between markers and disease loci that are not evident with a single marker locus may be identified in multi‐locus marker analyses using estimated haplotype frequencies (HFs). Procedures tha...
Saved in:
Published in | Genetic epidemiology Vol. 22; no. 2; pp. 186 - 195 |
---|---|
Main Authors | , , , , , , |
Format | Journal Article |
Language | English |
Published |
New York
John Wiley & Sons, Inc
01.02.2002
Wiley-Liss |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Haplotype analyses are an important area in the study of the genetic components of human disease. Associations between markers and disease loci that are not evident with a single marker locus may be identified in multi‐locus marker analyses using estimated haplotype frequencies (HFs). Procedures that make use of the expectation‐maximization (EM) algorithm to estimate HFs from unphased genotype data are in common use in genetic studies. The EM algorithm uses these unphased genotype frequencies along with the assumption of Hardy‐Weinberg proportions (HWP) to converge on HF estimates. In this paper, we assess the accuracy of EM estimates of HFs in patients with type I diabetes for whom the true haplotypes are known, but the data are analyzed ignoring family information to allow comparison between estimated and true frequencies. The data consist of six HLA loci with high levels of polymorphism and a range of departures from HWP and linkage equilibrium. While the overall accuracy of the EM estimates is good, there can be large over‐ and underestimates of particular HFs, even for common haplotypes, especially when the loci involved deviate significantly from HWP. Estimating HFs for three or more loci and then collapsing over loci so as to generate two locus haplotypes can improve the accuracy of the estimation. The collapsing procedure is most beneficial when one of the loci in the two‐locus haplotype of interest deviates significantly from HWP and the locus collapsed over is in linkage disequilibrium with the other loci. Genet. Epidemiol. 22:186–195, 2002. © 2002 Wiley‐Liss, Inc. |
---|---|
Bibliography: | American Diabetes Association, Career Development Award (J.A.N.) ark:/67375/WNG-9WBPZ6CL-R istex:FB210E15585417210BA7BD212FCA8903A2C31257 ArticleID:GEPI0163 National Institutes of Health - No. GM35326, CA84497 (R.M.S., DM., J.A.H., G.T.); No. DK46626 (J.AN., H.A.E.) ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 ObjectType-Article-1 ObjectType-Feature-2 |
ISSN: | 0741-0395 1098-2272 |
DOI: | 10.1002/gepi.0163 |