Abstract

Summary Deep sequencing of PCR amplicon libraries facilitates the detection of low‐abundance populations in environmental DNA surveys of complex microbial communities. At the same time, deep sequencing can lead to overestimates of microbial diversity through the generation of low‐frequency, error‐prone reads. Even with sequencing error rates below 0.005 per nucleotide position, the common method of generating operational taxonomic units (OTUs) by multiple sequence alignment and complete‐linkage clustering significantly increases the number of predicted OTUs and inflates richness estimates. We show that a 2% single‐linkage preclustering methodology followed by an average‐linkage clustering based on pairwise alignments more accurately predicts expected OTUs in both single and pooled template preparations of known taxonomic composition. This new clustering method can reduce the OTU richness in environmental samples by as much as 30–60% but does not reduce the fraction of OTUs in long‐tailed rank abundance curves that defines the rare biosphere.

Keywords

BiologyCluster analysisSpecies richnessAmpliconPairwise comparisonDeep sequencingEnvironmental DNADNA sequencingAbundance (ecology)Evolutionary biologyAmplicon sequencingComputational biologyGeneticsEcologyStatisticsBiodiversity16S ribosomal RNAGenePolymerase chain reaction

Affiliated Institutions

Related Publications

Publication Info

Year
2010
Type
article
Volume
12
Issue
7
Pages
1889-1898
Citations
1346
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1346
OpenAlex

Cite This

Susan M. Huse, David B. Mark Welch, Hilary G. Morrison et al. (2010). Ironing out the wrinkles in the rare biosphere through improved OTU clustering. Environmental Microbiology , 12 (7) , 1889-1898. https://doi.org/10.1111/j.1462-2920.2010.02193.x

Identifiers

DOI
10.1111/j.1462-2920.2010.02193.x