Abstract
Summary We consider the problem of identifying differentially expressed genes under different conditions using gene expression microarrays. Because of the many steps involved in the experimental process, from hybridization to image analysis, cDNA microarray data often contain outliers. For example, an outlying data value could occur because of scratches or dust on the surface, imperfections in the glass, or imperfections in the array production. We develop a robust Bayesian hierarchical model for testing for differential expression. Errors are modeled explicitly using a t ‐distribution, which accounts for outliers. The model includes an exchangeable prior for the variances, which allows different variances for the genes but still shrinks extreme empirical variances. Our model can be used for testing for differentially expressed genes among multiple samples, and it can distinguish between the different possible patterns of differential expression when there are three or more samples. Parameter estimation is carried out using a novel version of Markov chain Monte Carlo that is appropriate when the model puts mass on subspaces of the full parameter space. The method is illustrated using two publicly available gene expression data sets. We compare our method to six other baseline and commonly used techniques, namely the t ‐test, the Bonferroni‐adjusted t ‐test, significance analysis of microarrays (SAM), Efron's empirical Bayes, and EBarrays in both its lognormal–normal and gamma–gamma forms. In an experiment with HIV data, our method performed better than these alternatives, on the basis of between‐replicate agreement and disagreement.
Keywords
Affiliated Institutions
Related Publications
Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments
The problem of identifying differentially expressed genes in designed microarray experiments is considered. Lonnstedt and Speed (2002) derived an expression for the posterior od...
RNA-seq: An assessment of technical reproducibility and comparison with gene expression arrays
Ultra-high-throughput sequencing is emerging as an attractive alternative to microarrays for genotyping, analysis of methylation patterns, and identification of transcription fa...
PADGE: analysis of heterogeneous patterns of differential gene expression
We have devised a novel analysis approach, percentile analysis for differential gene expression (PADGE), for identifying genes differentially expressed between two groups of het...
Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray
A high-capacity system was developed to monitor the expression of many genes in parallel. Microarrays prepared by high-speed robotic printing of complementary DNAs on glass were...
Significance analysis of microarrays applied to the ionizing radiation response
Microarrays can measure the expression of thousands of genes to identify changes in expression between different biological states. Methods are needed to determine the significa...
Publication Info
- Year
- 2005
- Type
- article
- Volume
- 62
- Issue
- 1
- Pages
- 10-18
- Citations
- 94
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1111/j.1541-0420.2005.00397.x