Evolution and functional impact of rare coding variation from deep sequencing of human exomes

JA Tennessen, AW Bigham, TD O'connor, W Fu… - science, 2012 - science.org
science, 2012science.org
As a first step toward understanding how rare variants contribute to risk for complex
diseases, we sequenced 15,585 human protein-coding genes to an average median depth
of 111× in 2440 individuals of European (n= 1351) and African (n= 1088) ancestry. We
identified over 500,000 single-nucleotide variants (SNVs), the majority of which were rare
(86% with a minor allele frequency less than 0.5%), previously unknown (82%), and
population-specific (82%). On average, 2.3% of the 13,595 SNVs each person carried were …
As a first step toward understanding how rare variants contribute to risk for complex diseases, we sequenced 15,585 human protein-coding genes to an average median depth of 111× in 2440 individuals of European (n = 1351) and African (n = 1088) ancestry. We identified over 500,000 single-nucleotide variants (SNVs), the majority of which were rare (86% with a minor allele frequency less than 0.5%), previously unknown (82%), and population-specific (82%). On average, 2.3% of the 13,595 SNVs each person carried were predicted to affect protein function of ~313 genes per genome, and ~95.7% of SNVs predicted to be functionally important were rare. This excess of rare functional variants is due to the combined effects of explosive, recent accelerated population growth and weak purifying selection. Furthermore, we show that large sample sizes will be required to associate rare variants with complex traits.
AAAS