Optimal Selection of SNP Markers for Disease Association Studies

Genetic association studies with population samples hold the promise of uncovering the susceptibility genes underlying the heritability of complex or common disease. Most association studies rely on the use of surrogate markers, single-nucleotide polymorphism (SNP) being the most suitable due to the...

Full description

Saved in:

Bibliographic Details
Published in	Human heredity Vol. 58; no. 3/4; pp. 190 - 202
Main Authors	Halldórsson, Bjarni V., Istrail, Sorin, De La Vega, Francisco M.
Format	Journal Article
Language	English
Published	Basel, Switzerland S. Karger AG 01.01.2004
Subjects	Algorithms Alleles Disease Genetic Diseases, Inborn - genetics Genetic Markers Genetic Predisposition to Disease Genetic Techniques Genetics Genotype Genotype & phenotype Haplotypes Heredity Humans Linkage Disequilibrium Models, Genetic Models, Statistical Multivariate Analysis Polymorphism Polymorphism, Single Nucleotide Principal Component Analysis Software Time Factors Association mapping Single nucleotide polymorphism Haplotype tagging Minimum informative subset Linkage disequilibrium htSNP
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Genetic association studies with population samples hold the promise of uncovering the susceptibility genes underlying the heritability of complex or common disease. Most association studies rely on the use of surrogate markers, single-nucleotide polymorphism (SNP) being the most suitable due to their abundance and ease of scoring. SNP marker selection is aimed to increase the chances that at least one typed SNP would be in linkage disequilibrium (LD) with the disease causative variant, while at the same time controlling the cost of the study in terms of the number of markers genotyped and samples. Empirical studies reporting block-like segments in the genome with high LD and low haplotype diversity have motivated a marker selection strategy whereby subsets of SNPs that ‘tag’ the common haplotypes of a region are picked for genotyping, avoiding typing redundant SNPs. Based on these initial observations, a plethora of ‘tagging’ algorithms for selecting minimum informative sub-sets of SNPs has recently appeared in the literature. These differ mostly in two major aspects: the quality or correlation measure used to define tagging and the algorithm used for the minimization of the final number of tagging SNPs. In this review we describe the available tagging algorithms utilizing a 3-step unifying framework, point out their methodological and conceptual differences, and make an assessment of their assumptions, performance, and scalability.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISBN:	3805579241 9783805579247
ISSN:	0001-5652 1423-0062
DOI:	10.1159/000083546