-
Notifications
You must be signed in to change notification settings - Fork 54
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cataloging pangeo-forge datasets #25
Comments
Thanks for opening this issue, Ryan. Just to make sure I'm on the right track here, a projected plan of implementing these features should look like:
In terms of input from the pipeline creator, I think some other meta-metadata that we could collect would be:
Finally, if we opt to add these tasks to an existing pipeline, that may require an expansion to the current naming convention being considered - something like |
Thanks for this Charles! I've been thinking about this issue A LOT... I think that, before we can integrate catalog updates into pangeo forge itself, we need to settle on a catalog format, structure, and creation / update procedure. This brings us back to STAC... I'm working on some ideas here and will send an update soon. |
Excited to hear what ideas you have in mind - in the meantime, I'll be consulting PySTAC's API reference to get a feel for generating/updating absolute published catalogs. |
When a dataset gets "published" by pangeo forge, we want to create a catalog entry for it. Some options for this catalog entry are:
Regardless of how we do, we will need to collect some meta-metadata from the pipeline creator about the dataset, such as:
Pangeo forge should provide all the technical entries to the catalog like:
Related issue from @charlesbluca: in order to make zarr metadata browseable, we need
The text was updated successfully, but these errors were encountered: