Abstract

Abstract Motivation: Chromatin states are the key to gene regulation and cell identity. Chromatin immunoprecipitation (ChIP) coupled with high-throughput sequencing (ChIP-Seq) is increasingly being used to map epigenetic states across genomes of diverse species. Chromatin modification profiles are frequently noisy and diffuse, spanning regions ranging from several nucleosomes to large domains of multiple genes. Much of the early work on the identification of ChIP-enriched regions for ChIP-Seq data has focused on identifying localized regions, such as transcription factor binding sites. Bioinformatic tools to identify diffuse domains of ChIP-enriched regions have been lacking. Results: Based on the biological observation that histone modifications tend to cluster to form domains, we present a method that identifies spatial clusters of signals unlikely to appear by chance. This method pools together enrichment information from neighboring nucleosomes to increase sensitivity and specificity. By using genomic-scale analysis, as well as the examination of loci with validated epigenetic states, we demonstrate that this method outperforms existing methods in the identification of ChIP-enriched signals for histone modification profiles. We demonstrate the application of this unbiased method in important issues in ChIP-Seq data analysis, such as data normalization for quantitative comparison of levels of epigenetic modifications across cell types and growth conditions. Availability: http://home.gwu.edu/∼wpeng/Software.htm Contact: wpeng@gwu.edu Supplementary information: Supplementary data are available at Bioinformatics online.

Keywords

Chromatin immunoprecipitationHistoneChromatinComputational biologyEpigeneticsNucleosomeBiologyEpigenomicsIdentification (biology)Cluster analysisChIP-sequencingGeneticsTiling arrayGeneComputer scienceDNA microarrayDNA methylationGene expressionPromoterArtificial intelligence

Affiliated Institutions

Related Publications

Publication Info

Year
2009
Type
article
Volume
25
Issue
15
Pages
1952-1958
Citations
1062
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1062
OpenAlex

Cite This

Chongzhi Zang, Dustin E. Schones, Chen Zeng et al. (2009). A clustering approach for identification of enriched domains from histone modification ChIP-Seq data. Bioinformatics , 25 (15) , 1952-1958. https://doi.org/10.1093/bioinformatics/btp340

Identifiers

DOI
10.1093/bioinformatics/btp340