Taxonomic profiling pipeline

Raw reads mOTUs and taxonomic classification pipeline.

Pipeline summary

The containerised pipeline for profiling shotgun metagenomic data is derived from the MGnify pipeline raw-reads analyses, a well-established resource used for analyzing microbiome data. Results of chosen studies are available on FTP.

Key components:

Quality control and decontamination
rRNA and ncRNA detection using Rfam database
Taxonomic classification of SSU and LSU regions based on SILVA database
Abundance analysis of mOTUs (Metagenomic Operational Taxonomic Units)

The pipeline is implemented in Nextflow and needs as second dependency either Docker or Singularity. All databases are automatically downloaded by Nextflow.

Quick Start

Install Nextflow
Install any of Docker or Singularity.
Download the pipeline and test it on a minimal dataset.

Run examples

Add your own profile to nextflow.config file including all inputs

Basic run

nextflow run EBI-Metagenomics/motus_pipeline \
-profile <choose profile> \
--mode <single/paired> \
--single_end  / --paired_end_forward --paired_end_reverse <path with fastq file/s>\
--sample_name <accession/name>

Using the fetch tool to download the reads

nextflow run EBI-Metagenomics/motus_pipeline \
-profile local \
--mode single \
--sample_name test
--reads_accession ERR4387386

Local Single End run

nextflow run EBI-Metagenomics/motus_pipeline \
-profile local \
--mode single \
--single_end my_reads/raw/test.fastq.gz \
--sample_name test

Local Paired Ends run

nextflow run EBI-Metagenomics/motus_pipeline \
-profile local \
--mode paired \
--paired_end_forward my_reads/raw/test_1.fastq.gz \
--paired_end_reverse my_reads/raw/test_2.fastq.gz \
--sample_name test

Development

Install development tools (including pre-commit hooks to run Black code formatting).

pip install -r requirements-dev.txt
pre-commit install

Code style

Use Black, this tool is configured if you install the pre-commit tools as above.

To manually run them: black .

Testing

The pipeline unit test are executed using nf-test.

To run the nextflow unit tests the databases have to downloaded manually, we are working to improve this.

nf-test test tests/*

Name		Name	Last commit message	Last commit date
Latest commit History 67 Commits
bin		bin
config		config
docs/images		docs/images
modules		modules
subworkflows		subworkflows
tests		tests
workflow		workflow
.editorconfig		.editorconfig
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
main.nf		main.nf
nextflow.config		nextflow.config
nf-test.config		nf-test.config
pyproject.toml		pyproject.toml
requirements-dev.txt		requirements-dev.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Taxonomic profiling pipeline

Pipeline summary

Quick Start

Run examples

Basic run

Using the fetch tool to download the reads

Local Single End run

Local Paired Ends run

Development

Code style

Testing

About

Releases 3

Packages

Contributors 2

Languages

License

EBI-Metagenomics/motus_pipeline

Folders and files

Latest commit

History

Repository files navigation

Taxonomic profiling pipeline

Pipeline summary

Quick Start

Run examples

Basic run

Using the fetch tool to download the reads

Local Single End run

Local Paired Ends run

Development

Code style

Testing

About

Resources

License

Stars

Watchers

Forks

Releases 3

Packages 0

Contributors 2

Languages

Packages