ConsHMM 1.1

ConsHMM provides tools for parsing a multiple species alignment and training a Hidden Markov Model (HMM) to learn a conservation state annotation of the reference genome in the alignment, at single nucleotide resolution. The HMM is learned using an updated version of the ChromHMM software, which is included in this repository. Tools for visualizing and interpreting ConsHMM output are also provided.

The segmentation and browser files mentioned in the paper are available here. The link provides the intermediate files produced by the pipeline using the hg19 Multiz 100-way alignment.

For pre-generated ConsHMM annotations in multiple species, visit the ConsHMM Atlas.

Files from the analysis of bases prioritized by various variant scores in the paper are available here.

v1.1 updates:

Allele-specific annotations
parseMAF can now work on MAF files split by the chromosomes of a different species than the target one

Requirements

Python 3
Java 1.5 or later
Biopython
Numpy

If you are in a conda environment, the following lines will install the necessary python libraries

conda install -c conda-forge biopython
conda install -c anaconda numpy

Getting started

The Wiki contains useful tutorials, including how to reproduce the model and segmentation from the original ConsHMM paper or create your own based on a different reference species and/or multiple-sequence alignment.

Citation

For any use of the ConsHMM software or ConsHMM state annotations, please cite:

Arneson A, Ernst J. Systematic discovery of conservation states for single-nucleotide annotation of the human genome. Communications Biology, 248, 2019. doi: https://doi.org/10.1038/s42003-019-0488-1

Authors

Adriana Arneson (University of California, Los Angeles)

Jason Ernst (University of California, Los Angeles)

Collaborators

Bruins In Genomics students Brooke Felsheim (Washington University in St. Louis) and Jennifer Chien (Wellesley College) helped test the pipeline during the summer of 2018 and implemented several additional features.

Name		Name	Last commit message	Last commit date
Latest commit History 103 Commits
ChromHMM		ChromHMM
anchorFiles/hg19		anchorFiles/hg19
coords/hg19		coords/hg19
models		models
postProcessing		postProcessing
source		source
README.md		README.md
speciesUCSC100way.txt		speciesUCSC100way.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ConsHMM 1.1

Requirements

Getting started

Citation

Authors

Collaborators

About

Releases

Packages

Contributors 5

Languages

ernstlab/ConsHMM

Folders and files

Latest commit

History

Repository files navigation

ConsHMM 1.1

Requirements

Getting started

Citation

Authors

Collaborators

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages