Documenting usage of datasets produced by pangeo-forge #72

Open
TomAugspurger opened this issue Feb 2, 2021 · 1 comment
Labels
design question (A question of the design of Pangeo Forge), documentation (Improvements or additions to documentation)

Comments

@TomAugspurger
Contributor

What policy should pangeo-forge have on documenting how to use a dataset? The conda-forge analogy would say that you just point to the upstream's documentation. While the upstream documentation will certainly be useful for understanding the data, it won't necessarily help with using the ARCO dataset.

At a minimum, it'd be helpful to have an example showing how to load the dataset into the preferred container. This is analogous to conda-forge's conda install -c conda-forge name-of-package.

>>> import intake, intake_stac  # assuming we're using intake as our recommended API
>>> my_dataset = intake.open_stac_catalog("/path/to/pangeo-catalog.json")[collection].to_dask()
>>> my_dataset
<xarray.Dataset>
...

Perhaps we cut it off there? Or perhaps we recommend / require that recipes come with an example in pangeo-gallery? It now occurs to me that pangeo-gallery is federated, so we could have a gallery of notebooks in the recipe repository, and then register them with pangeo-gallery.

TomAugspurger added the design question and documentation labels on Feb 2, 2021
@rabernat
Contributor

rabernat commented Feb 2, 2021

Totally agree that we should provide usage hints.

To me, this is tightly coupled with cataloging (#25). All the datasets that get processed by forge will be ingested into a catalog. That catalog should contain enough information about the format that we can generate example code for opening the data. Ideally this would include examples in many languages, not just Python.
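
As a rough sketch of what generating example code from a catalog entry could look like: the entry keys ("id", "url", "format"), the bucket URL, and the `usage_snippets` helper below are all made up for illustration and are not an actual Pangeo Forge catalog schema or API. Only a Python template is shown; templates for other languages would be added under their own keys.

```python
# Hypothetical catalog entry: the keys "id", "url", and "format" are
# assumptions for illustration, not a real Pangeo Forge catalog schema.
entry = {
    "id": "example-dataset",
    "url": "gs://example-bucket/example-dataset.zarr",
    "format": "zarr",
}

# One opening-code template per (format, language) pair.
# Only Python is filled in here; other languages would get their own keys.
TEMPLATES = {
    ("zarr", "python"): (
        "import xarray as xr\n"
        'ds = xr.open_zarr("{url}")\n'
    ),
}


def usage_snippets(entry):
    """Return {language: code} for every template matching the entry's format."""
    return {
        language: template.format(url=entry["url"])
        for (fmt, language), template in TEMPLATES.items()
        if fmt == entry["format"]
    }


# Print each generated snippet under a comment naming its language.
for language, code in usage_snippets(entry).items():
    print(f"# {language}\n{code}")
```

An entry whose format has no template simply yields no snippets, so the website could fall back to pointing at the upstream documentation in that case.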

Once we solve the catalog problem, then the feature you describe would probably live in https://github.com/pangeo-forge/pangeo-forge-vue-website.
