Whole-genome sequencing and variant discovery in C. elegans

2008 Nature Methods 407 citations

Abstract

Massively parallel sequencing instruments enable rapid and inexpensive DNA sequence data production. Because these instruments are new, their data require characterization with respect to accuracy and utility. To address this, we sequenced a Caernohabditis elegans N2 Bristol strain isolate using the Solexa Sequence Analyzer, and compared the reads to the reference genome to characterize the data and to evaluate coverage and representation. Massively parallel sequencing facilitates strain-to-reference comparison for genome-wide sequence variant discovery. Owing to the short-read-length sequences produced, we developed a revised approach to determine the regions of the genome to which short reads could be uniquely mapped. We then aligned Solexa reads from C. elegans strain CB4858 to the reference, and screened for single-nucleotide polymorphisms (SNPs) and small indels. This study demonstrates the utility of massively parallel short read sequencing for whole genome resequencing and for accurate discovery of genome-wide polymorphisms.

Keywords

Massive parallel sequencingIndelReference genomeGenomeBiologyDNA sequencingWhole genome sequencingComputational biologyGeneticsINDEL MutationHybrid genome assemblyCancer genome sequencingSequence (biology)GenomicsDeep sequencingSingle-nucleotide polymorphismGeneGenotype

MeSH Terms

AnimalsBase SequenceCaenorhabditis elegansChromosome MappingDNA Mutational AnalysisGenetic VariationMolecular Sequence DataPolymorphismSingle NucleotideSequence AnalysisDNA

Affiliated Institutions

Related Publications

Publication Info

Year
2008
Type
article
Volume
5
Issue
2
Pages
183-188
Citations
407
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

407
OpenAlex
14
Influential
342
CrossRef

Cite This

La Deana W. Hillier, Gábor Marth, Aaron R. Quinlan et al. (2008). Whole-genome sequencing and variant discovery in C. elegans. Nature Methods , 5 (2) , 183-188. https://doi.org/10.1038/nmeth.1179

Identifiers

DOI
10.1038/nmeth.1179
PMID
18204455

Data Quality

Data completeness: 86%