Abstract
Large-scale single-cell RNA sequencing (scRNA-seq) data sets that are produced in different laboratories and at different times contain batch effects that may compromise the integration and interpretation of the data. Existing scRNA-seq analysis methods incorrectly assume that the composition of cell populations is either known or identical across batches. We present a strategy for batch correction based on the detection of mutual nearest neighbors (MNNs) in the high-dimensional expression space. Our approach does not rely on predefined or equal population compositions across batches; instead, it requires only that a subset of the population be shared between batches. We demonstrate the superiority of our approach compared with existing methods by using both simulated and real scRNA-seq data sets. Using multiple droplet-based scRNA-seq data sets, we demonstrate that our MNN batch-effect-correction method can be scaled to large numbers of cells.
Keywords
MeSH Terms
Affiliated Institutions
Related Publications
A benchmark of batch-effect correction methods for single-cell RNA sequencing data
Abstract Background Large-scale single-cell transcriptomic datasets generated using different technologies contain batch-specific systematic variations that present a challenge ...
Missing data and technical variability in single-cell RNA-sequencing experiments
Until recently, high-throughput gene expression technology, such as RNA-Sequencing (RNA-seq) required hundreds of thousands of cells to produce reliable measurements. Recent tec...
ComBat-seq: batch effect adjustment for RNA-seq count data
Abstract The benefit of integrating batches of genomic data to increase statistical power is often hindered by batch effects, or unwanted variation in data caused by differences...
Single-cell RNA sequencing technologies and bioinformatics pipelines
Rapid progress in the development of next-generation sequencing (NGS) technologies in recent years has provided many valuable insights into complex biological systems, ranging f...
Mapping the Mouse Cell Atlas by Microwell-Seq
Single-cell RNA sequencing (scRNA-seq) technologies are poised to reshape the current cell-type classification system. However, a transcriptome-based single-cell atlas has not b...
Publication Info
- Year
- 2018
- Type
- article
- Volume
- 36
- Issue
- 5
- Pages
- 421-427
- Citations
- 2526
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1038/nbt.4091
- PMID
- 29608177
- PMCID
- PMC6152897