UPDATES
These data are now available in the following biodiversity portals:
- OBIS dataset ITS, and its metadata record
- OBIS dataset COI, and its metadata record
- OBIS dataset 18S, and its metadata record
This is the first set of data files to be submitted to (Eur)OBIS: taxonomic occurrences from the COI, 18S, and ITS marker gene omics data, sampling event metadata, and image metadata for the events of ARMS-MBON's first sampling campaign (all ARMS deployed in 2018 and 2019 and retrieved between 2018 and 2020). This includes:
- The individual files that, when combined, make up the DwCA files that we submitted: a file of occurrences (taxonomy and event data), the DNA extension information (omics data), and the EMOF (extended measurements or facts: extra information that cannot otherwise be added to the other two files).
- The observatory, sampling event, omics and image data related to the sampling events included in this EurOBIS dataset. These expand somewhat on the information included in the data that will be published in (Eur)OBIS.
The source files for the omics and taxonomic data are in the analysis_release_001 repository, which holds the input and output files for the bioinformatics analysis done with PEMA, along with links to the PEMA pipeline. The code used to reformulate the PEMA outputs and to search various databases for associated information is in code_release_001.
For more information on ARMS-MBON, see its data landing page and the references therein.
These are the files that form the core of the DwCA:
- The CSV containing the occurrences for COI
- The CSV containing the DNA data for COI
- The CSV containing the extended measurements or facts for COI (zipped)
- The CSV containing the occurrences for 18S
- The CSV containing the DNA data for 18S
- The CSV containing the extended measurements or facts for 18S
- The CSV containing the occurrences for ITS
- The CSV containing the DNA data for ITS
- The CSV containing the extended measurements or facts for ITS
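As a rough illustration of how the three files for one marker gene relate to each other, the sketch below attaches extension rows (DNA data or extended measurements or facts) to their occurrence rows. It assumes the files are linked by an `occurrenceID` column, which is the usual Darwin Core convention but should be checked against the actual CSV headers. The rows and the `scientificName` and `target_gene` values shown are made-up placeholders, not data from this release.

```python
import csv
import io
from collections import defaultdict


def read_rows(handle):
    """Read a CSV file handle into a list of dicts keyed by column name."""
    return list(csv.DictReader(handle))


def attach_extensions(occurrences, extension_rows, key="occurrenceID"):
    """Pair each occurrence row with the extension rows that share its key.

    The key column name is an assumption; adjust it to match the files.
    """
    by_id = defaultdict(list)
    for row in extension_rows:
        by_id[row[key]].append(row)
    return [(occ, by_id.get(occ[key], [])) for occ in occurrences]


# Placeholder data standing in for an occurrence file and a DNA data file.
occ_csv = io.StringIO(
    "occurrenceID,scientificName\n"
    "occ-1,Placeholder taxon A\n"
    "occ-2,Placeholder taxon B\n"
)
dna_csv = io.StringIO(
    "occurrenceID,target_gene\n"
    "occ-1,COI\n"
)

linked = attach_extensions(read_rows(occ_csv), read_rows(dna_csv))
for occ, ext in linked:
    print(occ["occurrenceID"], len(ext))
```

With the real files, the `io.StringIO` objects would be replaced by `open(path, newline="", encoding="utf-8")` on the downloaded CSVs (after unzipping where needed).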
These are the files that contain the sampling event data. They are a subset of the entire ARMS-MBON set of combined event data, and include:
- A CSV data file containing observatory metadata, with an accompanying metadata file
- A CSV data file containing sampling event metadata, with an accompanying metadata file
- A CSV data file containing raw sequence metadata, with an accompanying metadata file; this includes the ENA accession numbers and a limited amount of additional omics metadata. For more information on the processing of the samples and the subsequent bioinformatics, see the analysis_release_001 repository.
- A CSV data file containing image metadata (note that there are 1000s of rows in this spreadsheet), with an accompanying metadata file that explains the entries therein. Images are currently stored in PlutoF and can be downloaded by accessing the links in the "Download URL" column.
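Because the image metadata file has thousands of rows, it may be easier to gather the download links programmatically than by hand. The sketch below reads the "Download URL" column named above and skips rows where it is empty. The `File name` column and the example.org addresses are hypothetical placeholders used only to make the example runnable.

```python
import csv
import io

URL_COLUMN = "Download URL"  # column name as given in this README


def collect_download_urls(handle, column=URL_COLUMN):
    """Return the non-empty values of the download URL column."""
    urls = []
    for row in csv.DictReader(handle):
        url = (row.get(column) or "").strip()
        if url:
            urls.append(url)
    return urls


# Placeholder rows standing in for the image metadata CSV.
sample = io.StringIO(
    "File name,Download URL\n"
    "placeholder_1.jpg,https://example.org/placeholder_1.jpg\n"
    "placeholder_2.jpg,\n"
)

urls = collect_download_urls(sample)
print(urls)
```

Each collected URL could then be fetched with a standard tool such as `urllib.request.urlretrieve`, ideally with a pause between requests to avoid overloading the image server.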
Note that these files in the combined data folder may also be useful to you:
- A list of the ENA project, sample, and run accession numbers for all the ARMS-MBON data to date
- A list of the area/field and sample/technical replicates for all the ARMS-MBON data to date
- Additional sequencing demultiplexing metadata (see the README in the folder for an explanation); the subset of those samples relevant to this data release can be found in demultiplexing_details_OmicsData_release001.csv.