Abstract
With the increase in genomewide experiments and the sequencing of multiple genomes, the analysis of large data sets has become commonplace in biology. It is often the case that thousands of features in a genomewide data set are tested against some null hypothesis, where a number of features are expected to be significant. Here we propose an approach to measuring statistical significance in these genomewide studies based on the concept of the false discovery rate. This approach offers a sensible balance between the number of true and false positives that is automatically calibrated and easily interpreted. In doing so, a measure of statistical significance called the q value is associated with each tested feature. The q value is similar to the well known p value, except it is a measure of significance in terms of the false discovery rate rather than the false positive rate. Our approach avoids a flood of false positive results, while offering a more liberal criterion than what has been used in genome scans for linkage.
Keywords
Affiliated Institutions
Related Publications
A Direct Approach to False Discovery Rates
Summary Multiple-hypothesis testing involves guarding against much more complicated errors than single-hypothesis testing. Whereas we typically control the type I error rate for...
Operating Characteristics and Extensions of the False Discovery Rate Procedure
Summary We investigate the operating characteristics of the Benjamini–Hochberg false discovery rate procedure for multiple testing. This is a distribution-free method that contr...
The control of the false discovery rate in multiple testing under dependency
Benjamini and Hochberg suggest that the false discovery rate may\nbe the appropriate error rate to control in many applied multiple testing\nproblems. A simple procedure was giv...
Sifting the evidence---what's wrong with significance tests? Another comment on the role of statistical methods
The findings of medical research are often met with considerable scepticism, even when they have apparently come from studies with sound methodologies that have been subjected t...
On the Adaptive Control of the False Discovery Rate in Multiple Testing With Independent Statistics
A new approach to problems of multiple significance testing was presented in Benjamini and Hochberg (1995), which calls for controlling the expected ratio of the number of erron...
Publication Info
- Year
- 2003
- Type
- article
- Volume
- 100
- Issue
- 16
- Pages
- 9440-9445
- Citations
- 9812
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1073/pnas.1530509100