A Snakemake workflow to annotate mags
-
Make sure you have
conda
,mamba
andsnakemake
installed.conda --version snakemake --version mamba --version
-
Clone the git repository in your terminal and get in:
git clone git@github.com:3d-omics/mg_annotate.git cd mg_annotate
-
Test your installation.
snakemake -np
-
Run it with your own data:
- Edit
config/features.yml
with reference databases:
# All fastas should be called .fa.gz mag_catalogue_dir: resources/mag_catalogue/ databases: checkm2: resources/databases/checkm2/20210323/uniref100.KO.1.dmnd dram: resources/databases/dram/20230811 gtdbtk: resources/databases/gtdbtk/release214
- Edit
config/params.yml
with execution parameters. The defaults are reasonable.
- Edit
-
Run the pipeline
snakemake \ --use-conda \ --profile profile/default \ --cores 24 \ --jobs 8 \ --keep-going \ --rerun-incomplete
- Dereplication with
dRep
. Multiple secondary ANIs can be provided. - Annotation with
checkm2
,gtdbtk
anddram