Haplotype inference from diploid sequence data: evaluating performance using non-neutral MHC sequences

The direct sequencing of PCR products from diploid organisms is problematic because of ambiguities associated with phase inference in multi‐site heterozygotes. Several molecular methods such as cloning, SSCP, and DGGE have been developed to empirically reduce diploid sequences to their constitutive...

Full description

Saved in:
Bibliographic Details
Published inHereditas Vol. 144; no. 6; pp. 228 - 234
Main Authors Bos, David H., Turner, Sara M., Andrew DeWoody, J.
Format Journal Article
LanguageEnglish
Published Copenhagen Blackwell Publishing Ltd 01.12.2007
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The direct sequencing of PCR products from diploid organisms is problematic because of ambiguities associated with phase inference in multi‐site heterozygotes. Several molecular methods such as cloning, SSCP, and DGGE have been developed to empirically reduce diploid sequences to their constitutive haploid components, but in theory these empirical approaches can be supplanted by analytical treatment of diploid sequences. Analytical approaches are more desirable than molecular methods because of the added time and expense required to generate molecular data. A variety of analytical methods have been developed to address this issue, but few have been rigorously evaluated with empirical data. Furthermore, they all assume that the sequences under consideration are evolving in a neutral fashion and assume a moderate number of heterozygous sites. Here, we use non‐neutral major histocompatibility complex (MHC) sequences comprised of large numbers of heterozygous sites that are under strong balancing selection to evaluate the performance of the popular Bayesian algorithm implemented by the program PHASE. Our results suggest that PHASE performs admirably with non‐neutral sequences of moderate length with numerous heterozygous sites typical of MHC class II sequences. We conclude that analytical approaches to haplotype inference have great potential in large‐scale population genetic assays, but recommend groundtruthing analytical results using empirical (molecular) approaches at the outset of population‐level analyses.
Bibliography:istex:F1A7518C057F74ED140F02BC92307BE81CA785B1
ark:/67375/WNG-CJZ01DKM-G
ArticleID:HRD21994
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0018-0661
1601-5223
DOI:10.1111/j.2007.0018-0661.01994.x