Abstract

Libraries of 16S rRNA genes provide insight into the membership of microbial communities. Statistical methods help to determine whether differences in library composition are artifacts of sampling or are due to underlying differences in the communities from which the libraries are derived. To contribute to a growing statistical framework for comparing 16S rRNA libraries, we present a computer program, ∫-LIBSHUFF, which calculates the integral form of the Cramér-von Mises statistic. This implementation builds upon the LIBSHUFF program, which uses an approximation of the statistic, and makes a number of modifications that improve precision and accuracy. When the P values that ∫-LIBSHUFF calculates for pairwise comparisons are each tested at the 0.05 level, the probability of falsely identifying a significant P value is 0.098 for a study with two libraries, 0.265 for three libraries, and 0.460 for four libraries. Making multiple pairwise comparisons therefore necessitates correcting for the increased likelihood that apparent differences between treatments are due to chance and do not reflect biological differences. Using ∫-LIBSHUFF, we found that previously published 16S rRNA gene libraries constructed from Scottish and Wisconsin soils contained different bacterial lineages. We also analyzed the published libraries constructed for the zebrafish gut microflora and found statistically significant changes in the community during development of the host. These analyses illustrate the power of ∫-LIBSHUFF to detect differences between communities, providing the basis for ecological inference about the association of soil productivity or host gene expression and microbial community composition.
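
The false-positive probabilities quoted in the abstract can be reproduced with a short calculation. The sketch below is not taken from ∫-LIBSHUFF itself. It assumes that each pair of libraries is tested in both directions, giving k(k - 1) comparisons for k libraries, and that the comparisons are independent. Both assumptions are inferred from the quoted numbers rather than stated in the abstract. The function names are hypothetical, and the Bonferroni threshold is shown only as one common correction; the abstract does not say which correction the authors apply.

```python
def n_pairwise_comparisons(n_libraries: int) -> int:
    """Number of directed comparisons (X vs. Y and Y vs. X) among the libraries.

    Assumption: every pair is tested in both directions.
    """
    return n_libraries * (n_libraries - 1)


def familywise_error_rate(n_libraries: int, alpha: float = 0.05) -> float:
    """Probability of at least one false positive across all comparisons.

    Assumption: the comparisons are independent, so the chance that none
    is falsely significant is (1 - alpha) ** m.
    """
    m = n_pairwise_comparisons(n_libraries)
    return 1.0 - (1.0 - alpha) ** m


def bonferroni_threshold(n_libraries: int, alpha: float = 0.05) -> float:
    """Per-comparison significance level under a Bonferroni correction.

    This is one common correction, used here for illustration only.
    """
    return alpha / n_pairwise_comparisons(n_libraries)


if __name__ == "__main__":
    for k in (2, 3, 4):
        print(
            f"{k} libraries: "
            f"familywise error = {familywise_error_rate(k):.4f}, "
            f"Bonferroni threshold = {bonferroni_threshold(k):.5f}"
        )
```

Under these assumptions the familywise error rates come out at roughly 0.0975, 0.265, and 0.460 for two, three, and four libraries, which agrees with the values reported in the abstract.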

Keywords

Pairwise comparison, Biology, Statistic, Ecology, Statistics, Statistical hypothesis testing, Test statistic, Inference, Microbial population biology, Statistical inference, Microbial ecology, 16S ribosomal RNA, Computational biology, Computer science, Gene, Mathematics, Genetics, Artificial intelligence


Publication Info

Year: 2004
Type: article
Volume: 70
Issue: 9
Pages: 5485-5492
Citations: 348
Access: Closed

Cite This

Patrick D. Schloss, Bret Larget, Jo Handelsman (2004). Integration of Microbial Ecology and Statistics: a Test To Compare Gene Libraries. Applied and Environmental Microbiology, 70(9), 5485-5492. https://doi.org/10.1128/aem.70.9.5485-5492.2004

Identifiers

DOI: 10.1128/aem.70.9.5485-5492.2004