-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Refactor data step, inference step, Jupyter notebooks #97
Conversation
b483fd0
to
87a0cae
Compare
Check out this pull request on See visual diffs & provide feedback on Jupyter Notebooks. Powered by ReviewNB |
d5755c3
to
60813a6
Compare
0c93ea2
to
ebfcd52
Compare
1a5daf0
to
998cf51
Compare
Can I request that the module in the |
The section marked |
Rewrite as a library (set of functions) and a CLI.
Cleaner subdomain configuration.
Also locks intake catalog to current HEAD.
No need to repeat sigma according to docs.
Also does more operations up front in the CLI for testing purposes.
Seems that Dask needs the explicit map_blocks to schedule properly. Otherwise memory usage balloons.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We discussed the changes previously, just a couple of tidy-up comments/questions below.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is ok to be merged in
Commented out since first commit. Uses the wrong variable for temperature in CM2.6 (surface_temp, not surface_temperature).
This PR covers multiple "feature" changes.
Extra:
Important to-dos:
cli/testing.py
(needs better name! check?)cli/infer.py
, but you would have to split the data again yourself for the same behaviour. we could rewrite the dataset splitting used incli/train.py
in some way that it is easy to obtain the correct data to use incli/infer.py
.cli/infer.py
too, we could removecli/testing.py
test_global_control
)[ ] Provide option to use the ML flow style, or to load from file (e.g.,xarray.open_zarr
on the forcings and output of inference step).Closes #87 , #90 , #98 , #4 .
Known bugs:
IndexError
when performing train/test dataset splitting #104when training,
IndexError: index x is out of bounds for axis 0 with size y
wherex > y