Snakemake workflow: `mt_quant`

A Snakemake workflow that preprocesses reads, quantifies them against a MAG catalogue and reports on the results:

Preprocessing:
- Trim reads with fastp
- Remove rRNAs with ribodetector
- Remove host RNA with STAR
- Screen your reads with kraken2

Quantification:
- Map reads to a MAG catalogue with bowtie2
- Get count tables with CoverM

Report:
- Generate reports with samtools, fastqc and multiqc

Usage

Make sure you have conda, mamba and snakemake installed.

```
conda --version
mamba --version
snakemake --version
```
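
The same check can be scripted. The following is a minimal sketch, not part of the workflow: it only looks the three tools up on `PATH` and does not check their versions. The helper name `missing_tools` is hypothetical.

```python
import shutil

# Tools the README asks for; the helper name `missing_tools` is hypothetical.
REQUIRED_TOOLS = ("conda", "mamba", "snakemake")


def missing_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Return the tools that cannot be found on PATH."""
    return [tool for tool in tools if which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("conda, mamba and snakemake are all on PATH")
```

The `which` parameter exists only so the lookup can be replaced in tests; in normal use the default `shutil.which` is enough.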

Clone this git repository and change into it:

```
git clone https://github.com/3d-omics/mt_quant
cd mt_quant
```

Test your installation by running the pipeline with test data. It will download all the necessary software through conda / mamba. It should take less than five minutes.
```
./run
```

Run it with your own data:

Edit `config/samples.tsv` and add your sample names, a library identifier in case you have more than one file per sample, the paths to the forward and reverse read files, and the adapters used.

```
sample_id	library_id	forward_filename	reverse_filename	forward_adapter	reverse_adapter
sample1	1	resources/reads/GBRF1.1_1.fq.gz	resources/reads/GBRF1.1_2.fq.gz	AGATCGGAAGAGCACACGTCTGAACTCCAGTCA	AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT
sample2	1	resources/reads/GBRM1.1_1.fq.gz	resources/reads/GBRM1.1_2.fq.gz	AGATCGGAAGAGCACACGTCTGAACTCCAGTCA	AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT
```
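
Before launching a long run it can help to sanity-check the sample sheet. The sketch below is an illustration rather than the workflow's own validation: it assumes the six columns shown above are all required and non-empty, and that each `sample_id` / `library_id` pair should appear only once. The function name `check_samples` is hypothetical.

```python
import csv
import io

# Column names taken from the example samples.tsv above.
EXPECTED_COLUMNS = [
    "sample_id", "library_id",
    "forward_filename", "reverse_filename",
    "forward_adapter", "reverse_adapter",
]


def check_samples(handle):
    """Return a list of problems found in a samples.tsv-like file handle."""
    reader = csv.DictReader(handle, delimiter="\t")
    if reader.fieldnames != EXPECTED_COLUMNS:
        return [f"unexpected header: {reader.fieldnames}"]
    problems = []
    seen = set()
    for row in reader:
        line_number = reader.line_num
        # Assumption: a sample/library pair identifies one pair of read files.
        key = (row["sample_id"], row["library_id"])
        if key in seen:
            problems.append(f"line {line_number}: duplicate sample/library {key}")
        seen.add(key)
        # Short rows yield None for the missing columns, which also counts as empty.
        empty = [name for name in EXPECTED_COLUMNS if not row[name]]
        if empty:
            problems.append(f"line {line_number}: empty fields {empty}")
    return problems


if __name__ == "__main__":
    sheet = "\t".join(EXPECTED_COLUMNS) + "\n" + "sample1\t1\tr1.fq.gz\tr2.fq.gz\tAGAT\tAGAT\n"
    print(check_samples(io.StringIO(sheet)) or "no problems found")
```

For a real sheet, pass an open file instead of the in-memory string, for example `check_samples(open("config/samples.tsv", newline=""))`.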

Edit `config/features.yml` with your reference hosts, MAG catalogues and external databases. You can have multiple hosts and multiple catalogues. You can also have no hosts at all, for example when analyzing environmental samples.

```
hosts:  # Comment the next lines if no host
    human:
        genome: resources/reference/chrX_sub.fa.gz
        gtf: resources/reference/chrX_sub.gtf.gz

mag_catalogues:
    mag1: resources/reference/mags_mock.fa.gz
    # mag2: resources/reference/mags_mock.fa.gz

databases:
    kraken2:  # Comment the next lines if no database
        mock: resources/databases/kraken2/kraken_mock
        # mock2: resources/databases/kraken2/kraken_mock
```
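
One detail worth knowing when commenting entries out: a YAML key whose children are all commented (such as `hosts:` with no host left) parses to an empty value, not to an empty mapping. The sketch below shows one way to summarise an already parsed `features.yml` while tolerating that case. It works on a plain dictionary, so loading the file with a YAML parser is assumed to happen elsewhere, and the function name `summarise_features` is hypothetical. How the workflow itself treats empty sections is not shown here.

```python
def summarise_features(features):
    """List the host, MAG catalogue and kraken2 database names in a parsed features.yml."""
    # A YAML key whose children are all commented out parses to None, hence `or {}`.
    hosts = features.get("hosts") or {}
    catalogues = features.get("mag_catalogues") or {}
    databases = features.get("databases") or {}
    kraken2 = databases.get("kraken2") or {}
    return {
        "hosts": sorted(hosts),
        "mag_catalogues": sorted(catalogues),
        "kraken2": sorted(kraken2),
    }


# Dictionary equivalent of the example above, as a YAML parser would return it.
example = {
    "hosts": {
        "human": {
            "genome": "resources/reference/chrX_sub.fa.gz",
            "gtf": "resources/reference/chrX_sub.gtf.gz",
        }
    },
    "mag_catalogues": {"mag1": "resources/reference/mags_mock.fa.gz"},
    "databases": {"kraken2": {"mock": "resources/databases/kraken2/kraken_mock"}},
}

if __name__ == "__main__":
    print(summarise_features(example))
    # Same file with every host and database commented out.
    print(summarise_features({"hosts": None, "mag_catalogues": {"mag1": "x"},
                              "databases": {"kraken2": None}}))
```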

Edit config/params.yml with the execution parameters. The defaults are reasonable.

Run the pipeline

```
./run -j8  # locally with 8 cpus
./run_slurm  # on a cluster with slurm
```
