Skip to content

Latest commit

 

History

History
49 lines (39 loc) · 2.4 KB

File metadata and controls

49 lines (39 loc) · 2.4 KB

shotgun-metagenomics

Summary

This pipeline analyses paired-end shotgun metagenomics data. In a first step is trims adapters and low-quality bases by using Skewer. Reads are then decontaminated against reference specified in the config file (default: human). Then, several metagenomics profilers (default: Kraken and Metaphlan2) and a pathway analysis with HUMAnN2 is performed. Results aggregated over all input samples. In addition resistance gene typing results are produced with SRST2.

This pipeline was originally developed by Chenhao Li (see Github repository). The SRST2 logic was added by KOH Jia Yu (Jayce).

Output

The following lists the main output files

  • {sample}/reads/all-trimmed-decont_[12].fastq.gz is the (if needed concatenated,) quality trimmed and decontaminated read pair
  • {sample}/reads/counts.txt: read counts after trimming and after decontamination
  • {sample}/srst2/{sample}__genes__{db}__results.txt and {sample}/srst2/{sample}__fullgenes__{db}__results.txt: SRST2 resistance gene typing results (see SRST2 documentation)
  • merged_table_{profiler}/{tax}.{profiler}.profile_merged.tsv where profiler can be kraken or metaphlan2 and tax is a one-letter abbreviation for taxonomic rank (e.g. g for genus). Please note that Metaphlan2 lists abundances as percentage, whereas Kraken produces read counts
  • merged_table_humann2/genefamily.tsv: Abundance (in RPK) of each gene family (each row) in each sample (each column)
  • merged_table_humann2/pathabundance.tsv: Abundance of each pathway in each sample. The pathways are stratified into species.
  • merged_table_humann2/pathcoverage.tsv: Presence (1)/absence (0) code for each pathway in each sample.

References