Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

SNP Subset Selection for Genetic Association Studies

SNP Subset Selection for Genetic Association Studies Association studies for disease susceptibility genes rely on the high density of SNPs within candidate genes. However, the linkage disequilibrium between SNPs imply that not all SNPs identified in the candidate region need be genotyped. Here we develop several approaches to SNP subset selection, which can substantially reduce the number of SNPs to be genotyped in an association study. We apply clustering algorithms to pairwise linkage disequilibrium measures, with SNP subsets determined for different cut‐off values of Δ using nearest and furthest neighbour clusters. Alternatively, SNP subsets may be determined by the proportion of haplotypes they identify. We also show how power calculations, based on the average power to identify a SNP as the disease susceptibility mutation using haplotype‐based or logistic regression based statistical analyses, can be used to choose SNP subsets. All these methods provide a ranking method for subsets of a specific size, but do not provide criteria for overall choice of SNP subset size. We develop such criteria by incorporating power calculations into a decision analysis, where the choice of SNP subset size depends on the genotyping costs and the perceived benefits of identifying association. These methods are illustrated using eleven SNPs in the MMP2 gene. http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png Annals of Human Genetics Wiley

SNP Subset Selection for Genetic Association Studies

Loading next page...
 
/lp/wiley/snp-subset-selection-for-genetic-association-studies-8A7zkb0bPw

References (17)

Publisher
Wiley
Copyright
Copyright © 2003 Wiley Subscription Services
ISSN
0003-4800
eISSN
1469-1809
DOI
10.1046/j.1529-8817.2003.00055.x
Publisher site
See Article on Publisher Site

Abstract

Association studies for disease susceptibility genes rely on the high density of SNPs within candidate genes. However, the linkage disequilibrium between SNPs imply that not all SNPs identified in the candidate region need be genotyped. Here we develop several approaches to SNP subset selection, which can substantially reduce the number of SNPs to be genotyped in an association study. We apply clustering algorithms to pairwise linkage disequilibrium measures, with SNP subsets determined for different cut‐off values of Δ using nearest and furthest neighbour clusters. Alternatively, SNP subsets may be determined by the proportion of haplotypes they identify. We also show how power calculations, based on the average power to identify a SNP as the disease susceptibility mutation using haplotype‐based or logistic regression based statistical analyses, can be used to choose SNP subsets. All these methods provide a ranking method for subsets of a specific size, but do not provide criteria for overall choice of SNP subset size. We develop such criteria by incorporating power calculations into a decision analysis, where the choice of SNP subset size depends on the genotyping costs and the perceived benefits of identifying association. These methods are illustrated using eleven SNPs in the MMP2 gene.

Journal

Annals of Human GeneticsWiley

Published: Jan 1, 2003

There are no references for this article.