Open Access
2010 Prediction of Phenotype Information from Genotype Data
Nir Yosef, Jens Gramm, Qian-fei Wang, William S. Noble, Richard M. Karp, Roded Sharan
Commun. Inf. Syst. 10(2): 99-114 (2010).


The dissection of complex diseases is one of the greatest challenges of human genetics, with important clinical and scientific applications. Traditionally, associations were sought between single genetic markers and disease. The availability of large-scale SNP data makes it possible, for the first time, to study the predictive power of genotypes and haplotypes with respect to phenotype data. Here we present a novel method for predicting phenotype information from genotype data. The method is based on a support vector machine that employs new kernel functions for the similarity between genotypes or their underlying haplotypes. We demonstrate our approach on SNP data for the apolipoprotein gene cluster in baboons, predicting plasma lipid levels with significant success rates and identifying associations that were not detected using extant approaches.
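To make the idea of a genotype similarity kernel concrete, here is a minimal sketch in Python. It is not the kernel or the classifier from the paper, and it rests on several assumptions: genotypes are encoded as minor-allele counts (0, 1, or 2) per SNP, similarity is a generic allele-sharing (identity-by-state) kernel, and a kernel perceptron stands in for the support vector machine so that the example needs only the standard library. The toy genotypes and the binary "low/high" phenotype labels are invented for illustration.

```python
from typing import List, Sequence

# Assumed encoding: one entry per SNP, counting copies of the minor allele (0, 1, or 2).
Genotype = Sequence[int]


def ibs_kernel(g: Genotype, h: Genotype) -> float:
    """Fraction of alleles shared identical-by-state over all SNPs, in [0, 1]."""
    if len(g) != len(h):
        raise ValueError("genotypes must cover the same SNPs")
    shared = sum(2 - abs(a - b) for a, b in zip(g, h))
    return shared / (2 * len(g))


def decision_score(X: List[Genotype], y: List[int], alpha: List[int], g: Genotype) -> float:
    """Kernel expansion: sum of alpha_i * y_i * k(x_i, g) over training samples."""
    return sum(a * y_i * ibs_kernel(x_i, g) for a, x_i, y_i in zip(alpha, X, y))


def train_kernel_perceptron(X: List[Genotype], y: List[int], epochs: int = 20) -> List[int]:
    """Kernel perceptron (a simple stand-in for an SVM); labels must be -1 or +1."""
    alpha = [0] * len(X)
    for _ in range(epochs):
        mistakes = 0
        for i, (x_i, y_i) in enumerate(zip(X, y)):
            if y_i * decision_score(X, y, alpha, x_i) <= 0:
                alpha[i] += 1  # upweight a misclassified training sample
                mistakes += 1
        if mistakes == 0:  # every training sample is classified correctly
            break
    return alpha


def predict(X: List[Genotype], y: List[int], alpha: List[int], g: Genotype) -> int:
    """Predicted phenotype class (-1 or +1) for a new genotype."""
    return 1 if decision_score(X, y, alpha, g) > 0 else -1


# Hypothetical toy data: four individuals, four SNPs, -1 = "low", +1 = "high".
X_train = [[0, 0, 1, 0], [0, 1, 0, 0], [2, 2, 1, 2], [2, 1, 2, 2]]
y_train = [-1, -1, 1, 1]
alpha = train_kernel_perceptron(X_train, y_train)

print(predict(X_train, y_train, alpha, [0, 0, 0, 1]))
print(predict(X_train, y_train, alpha, [2, 2, 2, 1]))
```

The only part that touches the data representation is `ibs_kernel`; swapping in a different similarity function, for example one defined on inferred haplotypes rather than genotypes, leaves the training and prediction code unchanged. That separation is what makes kernel methods a natural fit for this kind of problem.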




Published: 2010
First available in Project Euclid: 9 March 2010

zbMATH: 1185.92083

Keywords: Biology and genetics (Computer Applications–Life and Medical Sciences) , Classifier design and evaluation (Computing Methodologies–Pattern Recognition–Design Methodology) , Machine learning (Computing Methodologies–Artificial Intelligence–Learning) , Parameter learning (Computing Methodologies–Artificial Intelligence–Learning)

Rights: Copyright © 2010 International Press of Boston
