Chromatin-state discovery and genome annotation with ChromHMM

Jason Ernst; Manolis Kellis

doi:10.1038/nprot.2017.124

Chromatin-state discovery and genome annotation with ChromHMM

Nat Protoc. 2017 Dec;12(12):2478-2492. doi: 10.1038/nprot.2017.124. Epub 2017 Nov 9.

Authors

Jason Ernst^{1

2

3

4

5}, Manolis Kellis^{6

7}

Affiliations

¹ Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, California, USA.
² Department of Computer Science, University of California, Los Angeles, Los Angeles, California, USA.
³ Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research at University of California, Los Angeles, Los Angeles, California, USA.
⁴ Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, California, USA.
⁵ Molecular Biology Institute, University of California, Los Angeles, Los Angeles, California, USA.
⁶ Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.
⁷ MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts, USA.

Abstract

Noncoding DNA regions have central roles in human biology, evolution, and disease. ChromHMM helps to annotate the noncoding genome using epigenomic information across one or multiple cell types. It combines multiple genome-wide epigenomic maps, and uses combinatorial and spatial mark patterns to infer a complete annotation for each cell type. ChromHMM learns chromatin-state signatures using a multivariate hidden Markov model (HMM) that explicitly models the combinatorial presence or absence of each mark. ChromHMM uses these signatures to generate a genome-wide annotation for each cell type by calculating the most probable state for each genomic segment. ChromHMM provides an automated enrichment analysis of the resulting annotations to facilitate the functional interpretations of each chromatin state. ChromHMM is distinguished by its modeling emphasis on combinations of marks, its tight integration with downstream functional enrichment analyses, its speed, and its ease of use. Chromatin states are learned, annotations are produced, and enrichments are computed within 1 d.

Publication types

Review

MeSH terms

Animals
Biomedical Research / methods*
Biomedical Research / trends
Chromatin / chemistry
Chromatin / metabolism*
Chromatin Assembly and Disassembly*
DNA, Intergenic / chemistry
DNA, Intergenic / metabolism
Epigenesis, Genetic
Epigenomics / methods
Epigenomics / trends
Genomics / methods*
Genomics / trends
Humans
Markov Chains
Models, Genetic*
Molecular Sequence Annotation
Software
Software Design

Substances

Chromatin
DNA, Intergenic

Abstract

Publication types

MeSH terms

Substances

Grants and funding