code_release_002

This repository contains all scripts and code used for the analyses presented in ARMS-MBON's Second Data Paper. For related datasets, please refer to:

data_release_002: Occurrence and event data.
analysis_release_002: Bioinformatics pipeline outputs.

1. decontam.R

Description:
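This script performs blank curation, the first step of data processing. It uses the prevalence method from the decontam R package, which compares how often each sequence occurs in blank (negative control) samples versus true samples, to identify potential contaminants and remove them from the dataset.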
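
As an illustration of the prevalence method, the following is a minimal sketch on toy data, not the repository's actual script. The count matrix, the sample names, and the way blanks are recognised (a `blank` name prefix) are all assumptions made for the example; `threshold = 0.1` is simply the decontam default.

```r
# Sketch only: toy data, not the repository's real inputs.
library(decontam)  # Bioconductor package

# Rows are samples, columns are ASVs (the orientation isContaminant() expects).
counts <- matrix(
  c(120,  0, 30,
     95,  0, 12,
    110,  2, 25,
     80,  0, 40,
      0, 50,  1,
      3, 65,  0),
  nrow = 6, byrow = TRUE,
  dimnames = list(
    c("S1", "S2", "S3", "S4", "blank1", "blank2"),
    c("ASV1", "ASV2", "ASV3")
  )
)

# TRUE marks blank (negative control) samples; the naming rule is hypothetical.
is_blank <- grepl("^blank", rownames(counts))

# Prevalence method: presence/absence in blanks versus true samples.
result <- isContaminant(counts, neg = is_blank,
                        method = "prevalence", threshold = 0.1)

# Keep ASVs not flagged as contaminants (treating NA as "not flagged"),
# then drop the blank samples themselves.
keep_asv <- !(result$contaminant %in% TRUE)
curated <- counts[!is_blank, keep_asv, drop = FALSE]
```

With so few samples the statistical test has little power, so a real run would normally include many more true samples and blanks.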

2. rename_samples.R

Description:
The second step renames the sample identifiers. PEMA outputs use ENA accession numbers as sample names; this script replaces them with the corresponding material sample IDs for clarity and consistency.
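
The renaming can be pictured as a lookup from accession number to material sample ID. The sketch below uses base R only; the column names (`ena_accession`, `material_sample_id`) and all identifiers are made-up placeholders, not values from the repository.

```r
# Hypothetical lookup table: ENA accession -> material sample ID.
lookup <- data.frame(
  ena_accession      = c("ERR_EXAMPLE_1", "ERR_EXAMPLE_2", "ERR_EXAMPLE_3"),
  material_sample_id = c("sample_A", "sample_B", "sample_C"),
  stringsAsFactors   = FALSE
)

# Toy count table whose sample columns are named by accession.
counts <- data.frame(
  ASV           = c("ASV1", "ASV2"),
  ERR_EXAMPLE_2 = c(10, 0),
  ERR_EXAMPLE_1 = c(5, 7),
  ERR_EXAMPLE_3 = c(0, 3),
  stringsAsFactors = FALSE
)

# Match every sample column against the lookup table.
sample_cols <- setdiff(names(counts), "ASV")
idx <- match(sample_cols, lookup$ena_accession)

# Fail loudly if an accession has no material sample ID.
stopifnot(!anyNA(idx))

# Replace the accession column names, preserving column order.
names(counts)[match(sample_cols, names(counts))] <- lookup$material_sample_id[idx]
```

Using `match()` rather than relying on column order keeps the renaming correct even when the count table and the lookup table list samples in different orders.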

3. merge_tables

Description:
This step merges data from the PEMA outputs, including read count tables, taxonomy assignments, and FASTA files, for each genetic marker. Separate scripts handle each marker:

3. COI_merge_tables.R

Processes data for the COI gene.

3. 18S_merge_tables.R

Processes data for the 18S gene.

3. ITS_merge_tables.R

Processes data for the ITS gene.
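
One way to read the merge step is as a join of the three PEMA output types on a shared sequence identifier. The sketch below shows that reading in base R. The identifier column `asv_id`, the toy values, and the assumption that each FASTA record occupies exactly two lines (header, then sequence) are illustrative and may differ from the actual scripts.

```r
# Toy read count table (one row per ASV, one column per sample).
counts <- data.frame(
  asv_id   = c("ASV1", "ASV2", "ASV3"),
  sample_A = c(10, 0, 4),
  sample_B = c(3, 8, 0),
  stringsAsFactors = FALSE
)

# Toy taxonomy assignments.
taxonomy <- data.frame(
  asv_id         = c("ASV1", "ASV2", "ASV3"),
  classification = c("Eukaryota;Arthropoda", "Eukaryota;Mollusca", "Eukaryota;Annelida"),
  stringsAsFactors = FALSE
)

# Toy FASTA content; assumes single-line sequences.
fasta_lines <- c(">ASV1", "ACGTAC", ">ASV2", "GGCATT", ">ASV3", "TTAGCA")
is_header <- startsWith(fasta_lines, ">")
sequences <- data.frame(
  asv_id   = sub("^>", "", fasta_lines[is_header]),
  sequence = fasta_lines[!is_header],
  stringsAsFactors = FALSE
)

# Join counts, taxonomy, and sequences on the shared identifier.
merged <- merge(merge(counts, taxonomy, by = "asv_id"), sequences, by = "asv_id")
```

Because `merge()` performs an inner join by default, an ASV missing from any one of the three inputs would be dropped; passing `all = TRUE` would keep it with `NA` values instead.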

4. gene_analysis.R

Description:
The final step involves exploratory data analysis and visualization, including:

Curation of merged datasets.
Assessment of sequencing depth.
Visualization of recovered phyla and species.
Creation of an UpSet plot to show the overlap in species identified across marker gene datasets.
Comparisons between datasets from the first data paper (DP001) and the second data paper (DP002).
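
Two of the items above, sequencing depth and species overlap across markers, can be sketched in base R as follows. The data are toy values and the species names are placeholders. The plotting call in the final comment uses the UpSetR package as one possible choice, which is an assumption: this README does not say which plotting package the script uses.

```r
# Toy count matrix: rows are ASVs, columns are samples.
counts <- matrix(
  c(10, 0, 4,
     3, 8, 0),
  nrow = 3,
  dimnames = list(c("ASV1", "ASV2", "ASV3"), c("sample_A", "sample_B"))
)

# Sequencing depth: total reads per sample.
depth <- colSums(counts)

# Placeholder species lists per marker gene.
species_by_marker <- list(
  COI   = c("sp_a", "sp_b", "sp_c"),
  `18S` = c("sp_b", "sp_c", "sp_d"),
  ITS   = c("sp_c", "sp_e")
)

# Presence/absence of every species in every marker dataset.
all_species <- sort(unique(unlist(species_by_marker)))
membership <- sapply(species_by_marker, function(s) all_species %in% s)
rownames(membership) <- all_species

# Species recovered by all three markers.
shared_by_all <- all_species[rowSums(membership) == length(species_by_marker)]

# An UpSet plot of the same sets could be drawn with, for example:
# UpSetR::upset(UpSetR::fromList(species_by_marker))
```

The `membership` matrix holds the same information an UpSet plot visualises: each row is a species and each column records whether a given marker recovered it.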

By providing this comprehensive code and documentation, we aim to ensure transparency and reproducibility for all analyses conducted in the ARMS-MBON project.
