Abstract

Chromatin immunoprecipitation (ChIP) followed by tag sequencing (ChIP-seq) using high-throughput next-generation instrumentation is fast, replacing chromatin immunoprecipitation followed by genome tiling array analysis (ChIP-chip) as the preferred approach for mapping of sites of transcription-factor binding and chromatin modification. Using two deeply sequenced data sets for human RNA polymerase II and STAT1, each with matching input-DNA controls, we describe a general scoring approach to address unique challenges in ChIP-seq data analysis. Our approach is based on the observation that sites of potential binding are strongly correlated with signal peaks in the control, likely revealing features of open chromatin. We develop a two-pass strategy called PeakSeq to compensate for this. A two-pass strategy compensates for signal caused by open chromatin, as revealed by inclusion of the controls. The first pass identifies putative binding sites and compensates for genomic variation in the 'mappability' of sequences. The second pass filters out sites not significantly enriched compared to the normalized control, computing precise enrichments and significances. Our scoring procedure enables us to optimize experimental design by estimating the depth of sequencing required for a desired level of coverage and demonstrating that more than two replicates provides only a marginal gain in information.

Keywords

Chromatin immunoprecipitationTiling arrayChromatinComputational biologyChIP-sequencingComputer scienceChipBiologyGeneticsDNAAlgorithmGeneDNA microarrayChromatin remodelingTelecommunicationsGene expressionPromoter

MeSH Terms

Binding SitesBiotechnologyChromatinChromatin ImmunoprecipitationDNAFalse Positive ReactionsGenetic VariationGenomeGenomicsHumansModelsGeneticOligonucleotide Array Sequence AnalysisRNA Polymerase IISequence AnalysisDNASoftware

Affiliated Institutions

Related Publications

Publication Info

Year
2009
Type
article
Volume
27
Issue
1
Pages
66-75
Citations
548
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

548
OpenAlex
68
Influential
469
CrossRef

Cite This

Joel Rozowsky, Ghia Euskirchen, Raymond K. Auerbach et al. (2009). PeakSeq enables systematic scoring of ChIP-seq experiments relative to controls. Nature Biotechnology , 27 (1) , 66-75. https://doi.org/10.1038/nbt.1518

Identifiers

DOI
10.1038/nbt.1518
PMID
19122651
PMCID
PMC2924752

Data Quality

Data completeness: 90%