Haplotype frequency estimation in patient populations: The effect of departures from Hardy-Weinberg proportions and collapsing over a locus in the HLA region

Bibliographic Details
Published in Genetic Epidemiology Vol. 22; no. 2; pp. 186-195
Main Authors Single, Richard M., Meyer, Diogo, Hollenbach, Jill A., Nelson, Mark P., Noble, Janelle A., Erlich, Henry A., Thomson, Glenys
Format Journal Article
Language English
Published New York John Wiley & Sons, Inc 01.02.2002
Wiley-Liss
Summary: Haplotype analyses are an important area in the study of the genetic components of human disease. Associations between markers and disease loci that are not evident with a single marker locus may be identified in multi‐locus marker analyses using estimated haplotype frequencies (HFs). Procedures that make use of the expectation‐maximization (EM) algorithm to estimate HFs from unphased genotype data are in common use in genetic studies. The EM algorithm uses these unphased genotype frequencies along with the assumption of Hardy‐Weinberg proportions (HWP) to converge on HF estimates. In this paper, we assess the accuracy of EM estimates of HFs in patients with type I diabetes for whom the true haplotypes are known, but the data are analyzed ignoring family information to allow comparison between estimated and true frequencies. The data consist of six HLA loci with high levels of polymorphism and a range of departures from HWP and linkage equilibrium. While the overall accuracy of the EM estimates is good, there can be large over‐ and underestimates of particular HFs, even for common haplotypes, especially when the loci involved deviate significantly from HWP. Estimating HFs for three or more loci and then collapsing over loci so as to generate two‐locus haplotypes can improve the accuracy of the estimation. The collapsing procedure is most beneficial when one of the loci in the two‐locus haplotype of interest deviates significantly from HWP and the locus collapsed over is in linkage disequilibrium with the other loci. Genet. Epidemiol. 22:186–195, 2002. © 2002 Wiley‐Liss, Inc.
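The estimation and collapsing steps described in the summary can be illustrated with a minimal sketch. This is not the authors' software: the function names (`phase_resolutions`, `em_haplotype_frequencies`, `collapse`), the uniform starting frequencies, and the stopping rule are all assumptions made for illustration. The E-step weights each compatible haplotype pair by the product of the current haplotype frequencies, which is exactly where the HWP assumption enters.

```python
from collections import defaultdict
from itertools import product


def phase_resolutions(genotype):
    """List the haplotype pairs compatible with one unphased genotype.

    genotype: tuple of (allele, allele) pairs, one pair per locus.
    """
    first, rest = genotype[0], genotype[1:]
    pairs = set()
    # Fix the orientation of the first locus, then flip each later locus or not.
    for flips in product((False, True), repeat=len(rest)):
        h1, h2 = [first[0]], [first[1]]
        for (a, b), flip in zip(rest, flips):
            if flip:
                a, b = b, a
            h1.append(a)
            h2.append(b)
        # Sorting the pair removes duplicates that differ only in order.
        pairs.add(tuple(sorted((tuple(h1), tuple(h2)))))
    return sorted(pairs)


def em_haplotype_frequencies(genotypes, n_iter=200, tol=1e-10):
    """EM estimates of haplotype frequencies, assuming HWP (illustrative)."""
    resolutions = [phase_resolutions(g) for g in genotypes]
    haplotypes = sorted({h for res in resolutions for pair in res for h in pair})
    # Assumed starting point: equal frequency for every candidate haplotype.
    freqs = {h: 1.0 / len(haplotypes) for h in haplotypes}
    n_chrom = 2 * len(genotypes)
    for _ in range(n_iter):
        counts = defaultdict(float)
        for res in resolutions:
            # E-step: under HWP a pair's probability is proportional to the
            # product of its haplotype frequencies. The factor 2 for unordered
            # pairs is shared by all resolutions of a genotype and cancels.
            weights = [freqs[h1] * freqs[h2] for h1, h2 in res]
            total = sum(weights)
            for (h1, h2), w in zip(res, weights):
                counts[h1] += w / total
                counts[h2] += w / total
        # M-step: expected haplotype counts divided by number of chromosomes.
        new = {h: counts[h] / n_chrom for h in haplotypes}
        delta = max(abs(new[h] - freqs[h]) for h in haplotypes)
        freqs = new
        if delta < tol:
            break
    return freqs


def collapse(freqs, keep):
    """Sum haplotype frequencies over the loci whose indices are not in keep."""
    out = defaultdict(float)
    for hap, f in freqs.items():
        out[tuple(hap[i] for i in keep)] += f
    return dict(out)
```

Under these assumptions, two-locus frequencies obtained by collapsing would come from something like `collapse(em_haplotype_frequencies(three_locus_genotypes), keep=(0, 2))`, where `three_locus_genotypes` is a hypothetical list of unphased three-locus genotypes. Enumerating every phase resolution grows exponentially with the number of heterozygous loci, so the sketch is only practical for a handful of loci.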
Bibliography: American Diabetes Association, Career Development Award (J.A.N.)
ark:/67375/WNG-9WBPZ6CL-R
istex:FB210E15585417210BA7BD212FCA8903A2C31257
ArticleID:GEPI0163
National Institutes of Health - No. GM35326, CA84497 (R.M.S., D.M., J.A.H., G.T.); No. DK46626 (J.A.N., H.A.E.)
ISSN: 0741-0395
1098-2272
DOI: 10.1002/gepi.0163