diff --git a/OSC/PRR_STAC_introduction.ipynb b/OSC/PRR_STAC_introduction.ipynb
index bca390e0..451079cb 100644
--- a/OSC/PRR_STAC_introduction.ipynb
+++ b/OSC/PRR_STAC_introduction.ipynb
@@ -6,9 +6,7 @@
"id": "98522825",
"metadata": {},
"source": [
- "## Generating a STAC Collection for the PRR\n",
- "\n",
- "\n",
+ "# Generating a STAC Collection for the PRR\n",
"\n",
"## Introduction\n",
"\n"
@@ -21,23 +19,23 @@
"source": [
"This notebook has been created to show the core steps required of EarthCODE users to upload their research outcomes to the [ESA Project Results Repository (PRR)](https://eoresults.esa.int/). It focuses on generating metadata for a project with a single `netcdf` file.\n",
"\n",
- "PRR provides access to data, workflows, experiments and documentation from ESA Projects organised across Collections, accessible via the [STAC API](https://github.com/radiantearth/stac-api-spec). Each collection contains [STAC Items](https://github.com/radiantearth/stac-spec/blob/master/item-spec/item-spec.md), with their related assets stored within the PRR storage. Scientists/commercial companies can access the PRR via the [EarthCODE](https://earthcode.esa.int/) and [APEx](https://esa-apex.github.io/apex_documentation/) projects.\n",
+ "PRR provides access to data, workflows, experiments and documentation from ESA Projects organised across Collections, accessible via the [STAC API](https://github.com/radiantearth/stac-api-spec). Each Collection contains [STAC Items](https://github.com/radiantearth/stac-spec/blob/master/item-spec/item-spec.md), with their related Assets stored within the PRR storage. Scientists/commercial companies can access the PRR via the [EarthCODE](https://earthcode.esa.int/) and [APEx](https://esa-apex.github.io/apex_documentation/) projects.\n",
"\n",
"The [STAC Specification](https://stacspec.org/en), provides detailed explanation and more information on this metadata format. \n",
"\n",
- "In order to upload data to the ESA Project Results Repository (PRR) you have to generate a STAC Collection that is associated to your files. The STAC collection provides metadata about your files and makes them searchable and machine readable. The metadata generation process is organised in four steps process:\n",
+ "In order to upload data to the ESA Project Results Repository (PRR) you have to generate a STAC Collection that is associated with your files. The STAC Collection provides metadata about your files and makes them searchable and machine readable. The metadata generation process is organised in four steps:\n",
"\n",
"1. Generate a root STAC Collection\n",
"2. Group your dataset files into STAC Items and STAC Assets\n",
- "3. Add the Items to the collection\n",
- "4. Save the normalised collection\n",
+ "3. Add the Items to the Collection\n",
+ "4. Save the normalised Collection\n",
"\n",
"The easiest way to generate all the required files is to use a STAC library, such as `pystac` or `riostac`. This library will take care of creating the links and formating the files in the correct way. In the examples below we are using `pystac`. \n",
"\n",
- "Have a look at the steps below and learn how to prepare your dataset to generate a valid STAC Collection. You will find all the steps descibed in the markdown cell, together with the example code (executable) to make this process easier. Please adjust the information in the fields required to describe your collection and items according to the comments, starting with : \"#\" \n",
+ "Have a look at the steps below and learn how to prepare your dataset to generate a valid STAC Collection. You will find all the steps described in the markdown cells, together with executable example code to make this process easier. Please adjust the information in the fields required to describe your Collection and Items according to the comments, which start with \"#\".\n",
"\n",
"\n",
- "*NOTE: Depending on the information that you put in the assets or items the code, you may get an error about an object not being json-serialisable. If this happens, you have to transform the problem field into an object that can be described using standard JSON. For example, transforming a numpy array into a list.*"
+ "*NOTE: Depending on the information that you put in the Assets or Items, the code may raise an error about an object not being JSON-serialisable. If this happens, you have to transform the problem field into an object that can be described using standard JSON, for example by converting a numpy array into a list.*"
]
},
{
@@ -63,7 +61,14 @@
"id": "f3f344c3-c98b-45c9-9991-a0c010e2e722",
"metadata": {},
"source": [
- "## Import necessary Python libraries"
+ "## Import necessary Python libraries\n",
+ "\n",
+ "You can create an example conda/miniconda environment to run the code below using:\n",
+ "\n",
+ "```bash\n",
+ "conda create -n prr_stack_example pystac xarray shapely\n",
+ "conda activate prr_stack_example\n",
+ "```"
]
},
{
@@ -92,7 +97,7 @@
"\n",
"The root STAC Collection provides a general description of the enitre dataset, that you would like to store in ESA PRR. In the STAC Specification a Collection is defined as an extension of the STAC Catalog with additional information such as the extents, license, keywords, providers, etc that describe STAC Items that fall within the Collection.
\n",
"\n",
- "**In short: it behaves as the container to store various items that build up your dataset.
**\n",
+ "**In short: it behaves as the container to store the various Items that build up your dataset.**\n",
"\n",
"\n",
"STAC Collection has some required fields that you need to provide in order to build its valid description. Most of these metadata fields should be extracted from your data.\n",
@@ -460,12 +465,12 @@
"\n",
"For example:\n",
"\n",
- "- Microsoft Planatery Computer groups its Sentinel-2 data into Items which represent individual regions, and each item has 13 assets each representing a band - https://stacindex.org/catalogs/microsoft-pc#/43bjKKcJQfxYaT1ir3Ep6uENfjEoQrjkzhd2?cp=1&t=5 .\n",
+ "- Microsoft Planetary Computer groups its Sentinel-2 data into Items which represent individual regions, and each Item has 13 Assets, each representing a band - https://stacindex.org/catalogs/microsoft-pc#/43bjKKcJQfxYaT1ir3Ep6uENfjEoQrjkzhd2?cp=1&t=5 .\n",
"\n",
- "- The California Forest Observatory (on Google Earth Engine) groups its data into Items, where each Item represents a specific year, data type and resolution for the whole study area. Each Item has only one asset ( dataset ) associated with it - https://stacindex.org/catalogs/forest-observatory#/4dGsSbK8F5jjmhRZYE6kjUMmgWCUKe6J2qqw?t=2.\n",
+ "- The California Forest Observatory (on Google Earth Engine) groups its data into Items, where each Item represents a specific year, data type and resolution for the whole study area. Each Item has only one Asset (dataset) associated with it - https://stacindex.org/catalogs/forest-observatory#/4dGsSbK8F5jjmhRZYE6kjUMmgWCUKe6J2qqw?t=2 .\n",
"\n",
"\n",
- "- A More complex example from real-data from ESA-funded project: [ESA Projects Results Repository](https://eoresults.esa.int/browser/#/external/eoresults.esa.int/stac/?.language=en), gives the researchers flexibility in terms on how their datasets will be grouped into Items and Assets. You may need to consider that the more Items you have in your collection, the slower the browsing would be if the user would like to browse through the publicly open STAC Browser. Please have a look at one example, that provides one Sentinel-3 AMPLI Ice Sheet Elevation collection with around 400 Items complemented by around 360 assets each.\n",
+ "- A more complex example with real data from an ESA-funded project: the [ESA Projects Results Repository](https://eoresults.esa.int/browser/#/external/eoresults.esa.int/stac/?.language=en) gives researchers flexibility in how their datasets are grouped into Items and Assets. Keep in mind that the more Items your Collection has, the slower browsing becomes for users of the publicly open STAC Browser. Please have a look at one example, the Sentinel-3 AMPLI Ice Sheet Elevation Collection, which has around 400 Items with around 360 Assets each.\n",
"https://eoresults.esa.int/browser/#/external/eoresults.esa.int/stac/collections/sentinel3-ampli-ice-sheet-elevation\n",
"\n",
"- More general examples about creating STAC catalogs are available here - https://github.com/stac-utils/pystac/tree/main/docs/tutorials.\n",
@@ -508,7 +513,7 @@
},
{
"cell_type": "code",
- "execution_count": 4,
+ "execution_count": null,
"id": "c2a28021",
"metadata": {
"scrolled": true
@@ -913,14 +918,14 @@
" end_date: 2022-12-16\n",
" version: 2.0\n",
" comment: See technical documentation ATBD and EDD, and peer-review a...\n",
- " contact: robin.fraudeau(at)magellium.fr