Abstract
Abstract False-positive identifications are a significant problem in metagenomics classification. We present KrakenUniq, a novel metagenomics classifier that combines the fast k -mer-based classification of Kraken with an efficient algorithm for assessing the coverage of unique k -mers found in each species in a dataset. On various test datasets, KrakenUniq gives better recall and precision than other methods and effectively classifies and distinguishes pathogens with low abundance from false positives in infectious disease samples. By using the probabilistic cardinality estimator HyperLogLog, KrakenUniq runs as fast as Kraken and requires little additional memory. KrakenUniq is freely available at https://github.com/fbreitwieser/krakenuniq .
Keywords
Affiliated Institutions
Related Publications
Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies
Recent long-read assemblies often exceed the quality and completeness of available reference genomes, making validation challenging. Here we present Merqury, a novel tool for re...
A fast, lock-free approach for efficient parallel counting of occurrences of <i>k</i> -mers
Abstract Motivation: Counting the number of occurrences of every k-mer (substring of length k) in a long string is a central subproblem in many applications, including genome as...
GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes
An important assessment prior to genome assembly and related analyses is genome profiling, where the k-mer frequencies within raw sequencing reads are analyzed to estimate major...
Local homology recognition and distance measures in linear time using compressed amino acid alphabets
Methods for discovery of local similarities and estimation of evolutionary distance by identifying k-mers (contiguous subsequences of length k) common to two sequences are descr...
ALE: a generic assembly likelihood evaluation framework for assessing the accuracy of genome and metagenome assemblies
Abstract Motivation: Researchers need general purpose methods for objectively evaluating the accuracy of single and metagenome assemblies and for automatically detecting any err...
Publication Info
- Year
- 2018
- Type
- article
- Volume
- 19
- Issue
- 1
- Pages
- 198-198
- Citations
- 455
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1186/s13059-018-1568-0