Abstract

An important problem exists in the interpretation of modern medical research data: Biological understanding and previous research play little formal role in the interpretation of quantitative results. This phenomenon is manifest in the discussion sections of research articles and ultimately can affect the reliability of conclusions. The standard statistical approach has created this situation by promoting the illusion that conclusions can be produced with certain "error rates," without consideration of information from outside the experiment. This statistical approach, the key components of which are P values and hypothesis tests, is widely perceived as a mathematically coherent approach to inference. There is little appreciation in the medical community that the methodology is an amalgam of incompatible elements, whose utility for scientific inference has been the subject of intense debate among statisticians for almost 70 years. This article introduces some of the key elements of that debate and traces the appeal and adverse impact of this methodology to the P value fallacy, the mistaken idea that a single number can capture both the long-run outcomes of an experiment and the evidential meaning of a single result. This argument is made as a prelude to the suggestion that another measure of evidence should be used--the Bayes factor, which properly separates issues of long-run behavior from evidential strength and allows the integration of background knowledge with statistical findings.

Keywords

FallacyArgument (complex analysis)Meaning (existential)Bayes' theoremEpistemologyStatistical inferenceInterpretation (philosophy)Statistical hypothesis testingInferenceValue (mathematics)MedicineCognitive psychologyStatisticsComputer sciencePsychologyBayesian probabilityMathematics

MeSH Terms

Bayes TheoremEvidence-Based MedicineHumansProbabilityResearch Design

Affiliated Institutions

Related Publications

A Direct Approach to False Discovery Rates

Summary Multiple-hypothesis testing involves guarding against much more complicated errors than single-hypothesis testing. Whereas we typically control the type I error rate for...

2002 Journal of the Royal Statistical Soci... 5607 citations

Bayes Factors

Abstract In a 1935 paper and in his book Theory of Probability, Jeffreys developed a methodology for quantifying the evidence in favor of a scientific theory. The centerpiece wa...

1995 Journal of the American Statistical A... 11631 citations

Publication Info

Year
1999
Type
article
Volume
130
Issue
12
Pages
995-1004
Citations
1178
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1178
OpenAlex
47
Influential
874
CrossRef

Cite This

Steven N. Goodman (1999). Toward Evidence-Based Medical Statistics. 1: The <i>P</i> Value Fallacy. Annals of Internal Medicine , 130 (12) , 995-1004. https://doi.org/10.7326/0003-4819-130-12-199906150-00008

Identifiers

DOI
10.7326/0003-4819-130-12-199906150-00008
PMID
10383371

Data Quality

Data completeness: 81%