Abstract

Abstract Next-generation sequencing technologies are quickly becoming the preferred approach for characterizing and quantifying entire genomes. Even though data produced from these technologies are proving to be the most informative of any thus far, very little attention has been paid to fundamental design aspects of data collection and analysis, namely sampling, randomization, replication, and blocking. We discuss these concepts in an RNA sequencing framework. Using simulations we demonstrate the benefits of collecting replicated RNA sequencing data according to well known statistical designs that partition the sources of biological and technical variation. Examples of these designs and their corresponding models are presented with the goal of testing differential expression.

Keywords

BiologyComputational biologyDNA sequencingRNAReplication (statistics)GenomeComputer scienceGeneticsDNAGene

Affiliated Institutions

Related Publications

Publication Info

Year
2010
Type
article
Volume
185
Issue
2
Pages
405-416
Citations
403
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

403
OpenAlex

Cite This

Paul L. Auer, R. W. Doerge (2010). Statistical Design and Analysis of RNA Sequencing Data. Genetics , 185 (2) , 405-416. https://doi.org/10.1534/genetics.110.114983

Identifiers

DOI
10.1534/genetics.110.114983