Project repository: Phased Genome Assembly using Strand-seq (PGAS)

Citation

If you use this pipeline or extract and reuse original code/rules from this repository, please cite the following two papers:

Porubsky and Ebert et al.
"Fully Phased Human Genome Assembly without Parental Data Using Single-Cell Strand Sequencing and Long Reads."
Nature Biotechnology, December 2020
DOI: 10.1038/s41587-020-0719-5

Ebert, Audano, Zhu and Rodriguez-Martin et al.
"Haplotype-resolved diverse human genomes and integrated analysis of structural variation."
Science, February 2021
DOI: 10.1126/science.abf7117

Deprecated citations

Please do not cite the preprints (10.1101/855049 and 10.1101/2020.12.16.423102) anymore; cite the published papers listed above instead.

Scope of this repository

This repository contains the Snakemake pipeline code and auxiliary scripts needed to go from raw input data to polished haploid assemblies. Every self-contained, general-purpose software tool used in the pipeline is available via conda/bioconda or via GitHub. In either case, the pipeline implementation covers the entire software setup required for a complete pipeline run.

In particular, the code for the SaaRclust, StrandPhaseR, and breakpointR R packages is available on David Porubsky's GitHub.

Documentation

Several step-by-step manuals describe all use cases currently supported by this pipeline. First-time users should start by reading the tutorial. If you encounter any problems or "strange behaviour" during pipeline execution, please check the FAQ for explanations and solutions. If that does not help, please open a GitHub issue.


ptrebert/project-diploid-assembly

Folders and files

Latest commit

History

Repository files navigation

Project repository: Phased Genome Assembly using Strand-seq (PGAS)

Citation

Deprecated citations

Scope of this repository

Documentation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 6

Packages 0

Contributors 2

Languages

Packages