Abstract

Microbial communities are critical to ecosystem function. A key objective of metagenomic studies is to analyse organism-specific metabolic pathways and reconstruct community interaction networks. This requires accurate assignment of assembled genome fragments to genomes. Existing binning methods often fail to reconstruct a reasonable number of genomes and report many bins of low quality and completeness. Furthermore, the performance of existing algorithms varies between samples and biotopes. Here, we present a dereplication, aggregation and scoring strategy, DAS Tool, that combines the strengths of a flexible set of established binning algorithms. DAS Tool applied to a constructed community generated more accurate bins than any automated method. Indeed, when applied to environmental and host-associated samples of different complexity, DAS Tool recovered substantially more near-complete genomes, including previously unreported lineages, than any single binning method alone. The ability to reconstruct many near-complete genomes from metagenomics data will greatly advance genome-centric analyses of ecosystems. Here the authors present a tool that enables a flexible set of existing binning algorithms to be combined, resulting in improved binning accuracy and the recovery of more near-complete genomes from metagenomes compared to standalone methods.

Keywords

MetagenomicsGenomeComputational biologyOrganismSet (abstract data type)Bacterial genome sizeFunction (biology)BiologyComputer scienceData miningGeneEvolutionary biologyGenetics

MeSH Terms

AlgorithmsAnimalsComputational BiologyData CurationGastrointestinal MicrobiomeGenomeBacterialHumansMetagenomicsMicrobiotaSoil MicrobiologyUser-Computer InterfaceWater Microbiology

Affiliated Institutions

Related Publications

Publication Info

Year
2018
Type
article
Volume
3
Issue
7
Pages
836-843
Citations
1581
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1581
OpenAlex
165
Influential

Cite This

Christian M. K. Sieber, Alexander J. Probst, Allison Sharrar et al. (2018). Recovery of genomes from metagenomes via a dereplication, aggregation and scoring strategy. Nature Microbiology , 3 (7) , 836-843. https://doi.org/10.1038/s41564-018-0171-1

Identifiers

DOI
10.1038/s41564-018-0171-1
PMID
29807988
PMCID
PMC6786971

Data Quality

Data completeness: 86%