Abstract
Polygenic scores have recently been used to summarise genetic effects among an ensemble of markers that do not individually achieve significance in a large-scale association study. Markers are selected using an initial training sample and used to construct a score in an independent replication sample by forming the weighted sum of associated alleles within each subject. Association between a trait and this composite score implies that a genetic signal is present among the selected markers, and the score can then be used for prediction of individual trait values. This approach has been used to obtain evidence of a genetic effect when no single markers are significant, to establish a common genetic basis for related disorders, and to construct risk prediction models. In some cases, however, the desired association or prediction has not been achieved. Here, the power and predictive accuracy of a polygenic score are derived from a quantitative genetics model as a function of the sizes of the two samples, explained genetic variance, selection thresholds for including a marker in the score, and methods for weighting effect sizes in the score. Expressions are derived for quantitative and discrete traits, the latter allowing for case/control sampling. A novel approach to estimating the variance explained by a marker panel is also proposed. It is shown that published studies with significant association of polygenic scores have been well powered, whereas those with negative results can be explained by low sample size. It is also shown that useful levels of prediction may only be approached when predictors are estimated from very large samples, up to an order of magnitude greater than currently available. Therefore, polygenic scores currently have more utility for association testing than predicting complex traits, but prediction will become more feasible as sample sizes continue to grow.
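The score construction described in the abstract (select markers in a training sample, then form the weighted sum of associated alleles within each subject of the replication sample) can be illustrated with a minimal sketch. The function name `polygenic_scores`, its parameters, and the toy data are hypothetical and chosen only for illustration. The sketch assumes genotypes are coded as allele counts (0, 1, or 2) and that weights and p-values come from the training sample; it is not the paper's implementation.

```python
def polygenic_scores(genotypes, weights, p_values, threshold):
    """Return one polygenic score per subject.

    genotypes: one row per subject in the replication sample, each row
               holding allele counts (0, 1 or 2) per marker (assumed coding).
    weights:   per-marker effect estimates from the training sample.
    p_values:  per-marker association p-values from the training sample.
    threshold: markers with p-value <= threshold enter the score.
    """
    if len(weights) != len(p_values):
        raise ValueError("weights and p_values must have the same length")

    # Marker selection uses the training sample only.
    selected = [j for j, p in enumerate(p_values) if p <= threshold]

    scores = []
    for row in genotypes:
        if len(row) != len(weights):
            raise ValueError("each genotype row needs one entry per marker")
        # Weighted sum of associated alleles within the subject.
        scores.append(sum(weights[j] * row[j] for j in selected))
    return scores


# Toy data: three markers, three replication subjects (illustrative only).
toy_weights = [0.2, -0.1, 0.05]
toy_p_values = [0.001, 0.2, 0.04]
toy_genotypes = [
    [0, 1, 2],
    [2, 2, 0],
    [1, 0, 1],
]

toy_scores = polygenic_scores(toy_genotypes, toy_weights, toy_p_values, 0.05)
print(toy_scores)
```

With a threshold of 0.05, only the first and third markers enter the score. Raising the threshold admits more markers, which is the trade-off between signal and noise that the abstract's power and accuracy expressions depend on.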
Publication Info
- Year: 2013
- Type: article
- Volume: 9
- Issue: 3
- Pages: e1003348
- Citations: 1596
- Access: Closed
Identifiers
- DOI: 10.1371/journal.pgen.1003348