Abstract

As whole-genome sequencing becomes commoditized and we begin to sequence and analyze personal genomes for clinical and diagnostic purposes, it is necessary to understand what constitutes a complete sequencing experiment for determining genotypes and detecting single-nucleotide variants. Here, we show that the current recommendation of ∼30× coverage is not adequate to produce genotype calls across a large fraction of the genome with acceptably low error rates. Our results are based on analyses of a clinical sample sequenced on two related Illumina platforms, GAII x and HiSeq 2000, to a very high depth (126×). We used these data to establish genotype-calling filters that dramatically increase accuracy. We also empirically determined how the callable portion of the genome varies as a function of the amount of sequence data used. These results help provide a “sequencing guide” for future whole-genome sequencing decisions and metrics by which coverage statistics should be reported.

Keywords

BiologyPersonal genomicsGenomeWhole genome sequencingDNA sequencingComputational biologyReference genomeGeneticsHybrid genome assemblyCancer genome sequencingGene

Affiliated Institutions

Related Publications

Publication Info

Year
2011
Type
article
Volume
21
Issue
9
Pages
1498-1505
Citations
217
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

217
OpenAlex

Cite This

Subramanian S. Ajay, Stephen C.J. Parker, Hatice Özel Abaan et al. (2011). Accurate and comprehensive sequencing of personal genomes. Genome Research , 21 (9) , 1498-1505. https://doi.org/10.1101/gr.123638.111

Identifiers

DOI
10.1101/gr.123638.111