Skip to content

Runs Quality Control workflow on Illumina WGS data

Notifications You must be signed in to change notification settings

alipirani88/QCD_SMAKE

Repository files navigation

QCD - Quality Control and Contamination Detection workflow.

QCD is a smakemake worflow for microbial illumina sequencing quality control and contamination detection. As a SOP part of Snitkin lab, this pipeline should be run on raw sequencing data as soon as the data is available from the sequencing core department. Apart from QCing raw sequencing data, it performs numerous downstream tasks such as AMR gene detection, SPAdes genome assembly, MLST detection and Assembly annotation.

Installation

Clone the github directory onto your system.

git clone https://github.com/alipirani88/QCD_SMAKE.git

Create QCD conda environment

conda env create -f QCD.yaml -n QCD

Change snakemake configuration settings in config/config.yaml file and create a new sample list file - sample_test.tsv

Quick start

Run QCD on a set of samples.

$ snakemake -s QCD.smk -p --use-conda -j 999 --cluster "sbatch -A {cluster.account} -p {cluster.partition} -N {cluster.nodes}  -t {cluster.walltime} -c {cluster.procs} --mem-per-cpu {cluster.pmem}" --conda-frontend conda --cluster-config config/cluster.json --configfile config/config.yaml --latency-wait 1000

Alt text

Gather Summary files and generate a report.

$ snakemake -s QCD_report.smk -p --configfile config/config.yaml

Alt text

Dependencies

Near Essential

All the necessary software stack required for the workflow will be installed using conda package manager.

Tool stack used in workflow

About

Runs Quality Control workflow on Illumina WGS data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published