Description
The data was used for the study Zhou, Jian, and Olga G. Troyanskaya. "Ernst, Jason, and Manolis Kellis. "ChromHMM: automating chromatin-state discovery and characterization." Nature methods 9.3 (2012):
215-216.
The DNase-seq profile (ENCFF264NMW), histone modification profiles (ENCFF180LKW, ENCFF682WPF, ENCFF828CQV, ENCFF818GNV, ENCFF465KNK) and chromatin annotation (ENCFF001TDH) of GM12878 cell line were obtained from ENCODE project.
The human genome (hg19) was split into 200-bp bins, and the bins whose all bases were covered by the same chromatin annotation were only used for label datasets.
The profile information corresponding to bins used as labels were extracted and used to build example datasets.
Dataset
The DNase-seq and 5 different Histone modification profiles (H3K27ac, H3K4me1, H3K4me2, H3K4me3, H3K9ac) of 200bp bins (bigWig format)
Label data
Chromatin features for each bin (BED format)