Abstract

Abstract Next generation sequencing technology provides a powerful tool for measuring gene expression (mRNA) levels in the form of RNA-sequence data. Method development for identifying differentially expressed (DE) genes from RNA-seq data, which frequently includes many low-count integers and can exhibit severe overdispersion relative to Poisson or binomial distributions, is a popular area of ongoing research. Here we present quasi-likelihood methods with shrunken dispersion estimates based on an adaptation of Smyth's (2004) approach to estimating gene-specific error variances for microarray data. Our suggested methods are computationally simple, analogous to ANOVA and compare favorably versus competing methods in detecting DE genes and estimating false discovery rates across a variety of simulations based on real data.

Keywords

OverdispersionNegative binomial distributionCount dataPoisson distributionFalse discovery rateQuasi-likelihoodStatisticsMathematicsBinomial distributionComputational biologyBiologyGeneGenetics

Affiliated Institutions

Related Publications

Publication Info

Year
2012
Type
article
Volume
11
Issue
5
Citations
342
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

342
OpenAlex

Cite This

Steven P. Lund, Dan Nettleton, Davis J. McCarthy et al. (2012). Detecting Differential Expression in RNA-sequence Data Using Quasi-likelihood with Shrunken Dispersion Estimates. Statistical Applications in Genetics and Molecular Biology , 11 (5) . https://doi.org/10.1515/1544-6115.1826

Identifiers

DOI
10.1515/1544-6115.1826