Abstract

We discuss the identification of genes that are associated with an outcome in RNA sequencing and other sequence-based comparative genomic experiments. RNA-sequencing data take the form of counts, so models based on the Gaussian distribution are unsuitable. Moreover, normalization is challenging because different sequencing experiments may generate quite different total numbers of reads. To overcome these difficulties, we use a log-linear model with a new approach to normalization. We derive a novel procedure to estimate the false discovery rate (FDR). Our method can be applied to data with quantitative, two-class, or multiple-class outcomes, and the computation is fast even for large data sets. We study the accuracy of our approaches for significance calculation and FDR estimation, and we demonstrate that our method has potential advantages over existing methods that are based on a Poisson or negative binomial model. In summary, this work provides a pipeline for the significance analysis of sequencing data.

Keywords

False discovery rateNormalization (sociology)Computer sciencePoisson distributionCount dataNegative binomial distributionData miningAlgorithmStatisticsMathematicsBiologyGenetics

Affiliated Institutions

Related Publications

A Direct Approach to False Discovery Rates

Summary Multiple-hypothesis testing involves guarding against much more complicated errors than single-hypothesis testing. Whereas we typically control the type I error rate for...

2002 Journal of the Royal Statistical Soci... 5607 citations

Publication Info

Year
2011
Type
article
Volume
13
Issue
3
Pages
523-538
Citations
378
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

378
OpenAlex

Cite This

Jun Li, Daniela Witten, Iain M. Johnstone et al. (2011). Normalization, testing, and false discovery rate estimation for RNA-sequencing data. Biostatistics , 13 (3) , 523-538. https://doi.org/10.1093/biostatistics/kxr031

Identifiers

DOI
10.1093/biostatistics/kxr031