SNP Subset Selection for Genetic Association Studies

Summary Association studies for disease susceptibility genes rely on the high density of SNPs within candidate genes. However, the linkage disequilibrium between SNPs imply that not all SNPs identified in the candidate region need be genotyped. Here we develop several approaches to SNP subset select...

Full description

Saved in:
Bibliographic Details
Published inAnnals of human genetics Vol. 67; no. 6; pp. 543 - 556
Main Authors Byng, M. C., Whittaker, J. C., Cuthbert, A. P., Mathew, C. G., Lewis, C. M.
Format Journal Article
LanguageEnglish
Published 9600 Garsington Road , Oxford OX4 2DQ , UK Blackwell Science Ltd 01.11.2003
Cambridge University Press
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Summary Association studies for disease susceptibility genes rely on the high density of SNPs within candidate genes. However, the linkage disequilibrium between SNPs imply that not all SNPs identified in the candidate region need be genotyped. Here we develop several approaches to SNP subset selection, which can substantially reduce the number of SNPs to be genotyped in an association study. We apply clustering algorithms to pairwise linkage disequilibrium measures, with SNP subsets determined for different cut‐off values of Δ using nearest and furthest neighbour clusters. Alternatively, SNP subsets may be determined by the proportion of haplotypes they identify. We also show how power calculations, based on the average power to identify a SNP as the disease susceptibility mutation using haplotype‐based or logistic regression based statistical analyses, can be used to choose SNP subsets. All these methods provide a ranking method for subsets of a specific size, but do not provide criteria for overall choice of SNP subset size. We develop such criteria by incorporating power calculations into a decision analysis, where the choice of SNP subset size depends on the genotyping costs and the perceived benefits of identifying association. These methods are illustrated using eleven SNPs in the MMP2 gene.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0003-4800
1469-1809
DOI:10.1046/j.1529-8817.2003.00055.x