SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing

2012 Journal of Computational Biology 25,356 citations

Abstract

The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online ( http://bioinf.spbau.ru/spades ). It is distributed as open source software.

Keywords

MetagenomicsSequence assemblyGenomeSoftwareGenomicsComputational biologyBiologyComputer scienceContigGeneGeneticsTranscriptomeProgramming language

Affiliated Institutions

Related Publications

Publication Info

Year
2012
Type
article
Volume
19
Issue
5
Pages
455-477
Citations
25356
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

25356
OpenAlex

Cite This

Anton Bankevich, Sergey Nurk, Dmitry Antipov et al. (2012). SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing. Journal of Computational Biology , 19 (5) , 455-477. https://doi.org/10.1089/cmb.2012.0021

Identifiers

DOI
10.1089/cmb.2012.0021