Chromatin-state discovery and genome annotation with ChromHMM

Nat Protoc. 2017 Dec;12(12):2478-2492. doi: 10.1038/nprot.2017.124. Epub 2017 Nov 9.

Abstract

Noncoding DNA regions have central roles in human biology, evolution, and disease. ChromHMM helps to annotate the noncoding genome using epigenomic information across one or multiple cell types. It combines multiple genome-wide epigenomic maps, and uses combinatorial and spatial mark patterns to infer a complete annotation for each cell type. ChromHMM learns chromatin-state signatures using a multivariate hidden Markov model (HMM) that explicitly models the combinatorial presence or absence of each mark. ChromHMM uses these signatures to generate a genome-wide annotation for each cell type by calculating the most probable state for each genomic segment. ChromHMM provides an automated enrichment analysis of the resulting annotations to facilitate the functional interpretations of each chromatin state. ChromHMM is distinguished by its modeling emphasis on combinations of marks, its tight integration with downstream functional enrichment analyses, its speed, and its ease of use. Chromatin states are learned, annotations are produced, and enrichments are computed within 1 d.

Publication types

  • Review

MeSH terms

  • Animals
  • Biomedical Research / methods*
  • Biomedical Research / trends
  • Chromatin / chemistry
  • Chromatin / metabolism*
  • Chromatin Assembly and Disassembly*
  • DNA, Intergenic / chemistry
  • DNA, Intergenic / metabolism
  • Epigenesis, Genetic
  • Epigenomics / methods
  • Epigenomics / trends
  • Genomics / methods*
  • Genomics / trends
  • Humans
  • Markov Chains
  • Models, Genetic*
  • Molecular Sequence Annotation
  • Software
  • Software Design

Substances

  • Chromatin
  • DNA, Intergenic