Benchmarking recipes (Lauer et al.) #3598

axel-lauer · 2024-05-16T06:56:43Z

Description

This PR implements a set of benchmarking recipes for comparison of different metrics (RMSE, bias, correlation, EMD) calculated for a given model simulation to the results from an ensemble of (model) datasets:

model_evaluation/recipe_model_benchmarking_annual_cycle.yml (annual cycle)
model_evaluation/recipe_model_benchmarking_boxplots.yml (boxplots)
model_evaluation/recipe_model_benchmarking_diurnal_cycle.yml (diurnal cycle)
model_evaluation/recipe_model_benchmarking_maps.yml (map plots of 2-dim variables)
model_evaluation/recipe_model_benchmarking_timeseries.yml (time series)
model_evaluation/recipe_model_benchmarking_zonal.yml (zonal mean plots of 3-dim variables)

For this, the existing monitoring diagnostics monitoring/monitor.py and monitor/multi_datasets.py have been extended.

The new diurnal cycle plot has also been added to the following existing recipes:

Documentation for the benchmarking recipes is available in recipes/recipe_benchmarking.rst, the documentation for monitoring and model evaluation have been updated to include the diurnal cycle plots.

Closes #3498

Note for testing

The benchmarking recipes require the new preprocessor functions local_solar_time and distance_metric and the extended version of preprocessor resample_hours.
The recipe creating the portrait diagram additionally requires PR #3551 providing the diagnostic script for this plot.

Checklist

It is the responsibility of the author to make sure the pull request is ready to review. The icons indicate whether the item will be subject to the 🛠 Technical or 🧪 Scientific review.

🛠 This pull request has a descriptive title
🛠 Code is written according to the code quality guidelines
🛠 Documentation is available
🛠 Tests run successfully
🛠 The list of authors is up to date
🛠 Any changed dependencies have been added or removed correctly
🛠 All checks below this pull request were successful

New or updated recipe/diagnostic

🧪 Recipe runs successfully
🧪 Recipe is well documented
🧪 Figure(s) and data look as expected from literature
🛠 Provenance information has been added

…on_clouds_cycles.yml

…/ESMValTool into benchmarking_maps4monitoring

schlunma

I just pushed some commits to make this PR up-to-date with other developments of the monitoring diagnostic that have been merged while this PR was open. The merge with main only adapted existing lines, but not those added via this PR. I also fixed some minor issues with the doc about allowed options for these diagnostics.

I also fixed a bug which lead to no output from the boxplot diagnostic. Now all recipes run fine and produce the expected output (just tested) 🎉

The only remaining comment (apart from two minor ones on the code) I have is about the lauer25gmd recipes. These include data from the EMAC model, which is not easily available. Usually we don't include these kind of recipes in the main repository because they are not really reproducible without the data. The alternative would be to put them on Zenodo (e.g., https://zenodo.org/records/7254313).

I know that the paper is almost published and it's probably hard to make any additional changes to it, but if that's possible, putting the recipe on Zenodo would be the preferred solution. If that's not possible, it would be great to add a note about that (ideally with a link to the data). We also need to make sure that those recipes are not part of the automated recipe test workflow.

doc/sphinx/source/recipes/recipe_benchmarking.rst

…lished

…/ESMValTool into benchmarking_maps4monitoring

…198444

axel-lauer · 2025-01-27T08:56:07Z

The only remaining comment (apart from two minor ones on the code) I have is about the lauer25gmd recipes. These include data from the EMAC model, which is not easily available. Usually we don't include these kind of recipes in the main repository because they are not really reproducible without the data. The alternative would be to put them on Zenodo (e.g., https://zenodo.org/records/7254313).

Very good point! I removed the 'lauer25gmd' recipes from the branch and put them on Zenodo. The recipes are available at 10.5281/zenodo.11198444, this will be updated in the paper once we get the proofs.

alistairsellar

Thanks for the great work everyone. I'm happy that my comments have been addressed (thanks!) so I'll approve now.

A note for future reviews - it really helps the reviewer if when you address a comment with e.g. "I have changed x" or "I have added this to y", your message also includes a link to the commit that addresses the comment. It makes it quicker for the reviewer to check what has changed in response.

…/ESMValTool into benchmarking_maps4monitoring

schlunma

Thanks Axel for addressing all my comments and sorry for pushing while you were working on it, I think my changes broke the tests, so I wanted to fix them quickly.

I have two tiny comments, then will approve! Tests are failing because this requires a new ESMValCore release. We can merge once the first release candidate is out.

Cheers! 🚀

doc/sphinx/source/recipes/recipe_benchmarking.rst

esmvaltool/recipes/model_evaluation/recipe_model_evaluation_portraits.yml

…uplication

axel-lauer · 2025-01-28T13:19:10Z

I have two tiny comments, then will approve! Tests are failing because this requires a new ESMValCore release. We can merge once the first release candidate is out.

Should be all done now!! Thanks for your review and help with this PR!

schlunma

Awesome, thanks @axel-lauer and everyone!

axel-lauer and others added 30 commits January 15, 2024 14:25

quick and dirty implementation of diurnal cycle

94c826e

added diurnal cycle plot to monitor.py

7c7a2e6

added diurnal cycle example to model_evaluation/recipe_model_evaluati…

8a73469

…on_clouds_cycles.yml

added docu examples diurnal cycle

2daf3a9

fixed style issues in monitor/multi_datasets.py

244d8d8

fixed typo in docu example

f45b25d

draft version of first benchmarking recipe (maps)

530106e

snapshot 2024-02-01

c8aa55c

Merge branch 'main' into diurnal_cycle

080b8f5

snapshot 2024-02-02

40d9167

first working version

e707342

fixed some flake8 issues

904b291

adding benchmarking boxplot

a7ab4e4

Merge branch 'benchmarking_boxplot' into benchmarking_maps4monitoring

b9b0a40

extract plotting function

b25b9c6

added draft of recipe_model_benchmarking_timeseries.yml

ec4b1c1

fix filename

2438b26

Merge branch 'benchmarking_maps4monitoring' of github.com:ESMValGroup…

83d972e

…/ESMValTool into benchmarking_maps4monitoring

boxplots for more variables

30b8453

mv recipe

b864979

added zonal mean benchmarking plot

dddc3a5

merged with lastest branch

a8c5e1e

fixed some flake8 issues

d154eed

updated zonal mean benchmarking recipe

128a77e

addressing review comments

4ccc12c

Merge branch 'main' into diurnal_cycle

a99b522

clean recipe

413cb61

add var order and different distance metrics

ed1e991

first version of plot benchmarking_timeseries

1241f20

added benchmarking annual cycle plot

dff982e

schlunma added 11 commits January 22, 2025 10:24

Remove unused options from benchmarking maps and zonal plots

4a817e8

Allow datasets w/o timerange for benchmarking diags (see #3528)

da473da

Fix contourf plots (see #3797 and #3789)

5f6c3e9

More flexible font sizes (see #3844)

e647152

Make sure that boxplots are actually created

2e9ef36

Properly format figure captions for model evaluation recipe doc

378c313

Delete superfluous ':'

6c166a9

Use YAML syntax for YAML code

5ff0d45

Minor doc changes

baa2096

Re-add default show_stats for zonal mean plot

2c44d27

Do not use ERA5 in monitor recipe so it can be run with bot

51f9d1b

schlunma reviewed Jan 22, 2025

View reviewed changes

doc/sphinx/source/recipes/recipe_benchmarking.rst Outdated Show resolved Hide resolved

doc/sphinx/source/recipes/recipe_benchmarking.rst Show resolved Hide resolved

schlunma and others added 8 commits January 22, 2025 17:39

Fix doc build

2550c05

Merge branch 'main' into benchmarking_maps4monitoring

4b8c65f

changed reference lauer25gmd to preprint version until article is pub…

6414a7f

…lished

update docs

d58b02a

Make portrait plot work

0240bbb

added info on benchmark_dataset: true to multi_datasets.py

664417e

Merge branch 'benchmarking_maps4monitoring' of github.com:ESMValGroup…

92f09a9

…/ESMValTool into benchmarking_maps4monitoring

remove recipe_lauer25gmd_fig*.yml, now available at 10.5281/zenodo.11…

d230fd0

…198444

alistairsellar approved these changes Jan 27, 2025

View reviewed changes

schlunma and others added 3 commits January 27, 2025 15:09

Fix flake8 issues

bfa5873

removed EMAC from recipes

0d862a2

Merge branch 'benchmarking_maps4monitoring' of github.com:ESMValGroup…

78b6021

…/ESMValTool into benchmarking_maps4monitoring

schlunma reviewed Jan 27, 2025

View reviewed changes

doc/sphinx/source/recipes/recipe_benchmarking.rst Show resolved Hide resolved

esmvaltool/recipes/model_evaluation/recipe_model_evaluation_portraits.yml Outdated Show resolved Hide resolved

axel-lauer added 2 commits January 28, 2025 12:56

removed commented out lines in recipe_model_evaluation_portraits.yml

8417240

removed recipe_model_evaluation_portraits.yml from this PR to avoid d…

a221dcb

…uplication

schlunma approved these changes Jan 28, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Benchmarking recipes (Lauer et al.) #3598

Benchmarking recipes (Lauer et al.) #3598

axel-lauer commented May 16, 2024 •

edited by schlunma

Loading

schlunma left a comment •

edited

Loading

axel-lauer commented Jan 27, 2025

alistairsellar left a comment

schlunma left a comment

axel-lauer commented Jan 28, 2025

schlunma left a comment

Benchmarking recipes (Lauer et al.) #3598

Are you sure you want to change the base?

Benchmarking recipes (Lauer et al.) #3598

Conversation

axel-lauer commented May 16, 2024 • edited by schlunma Loading

Description

Note for testing

Checklist

New or updated recipe/diagnostic

schlunma left a comment • edited Loading

Choose a reason for hiding this comment

axel-lauer commented Jan 27, 2025

alistairsellar left a comment

Choose a reason for hiding this comment

schlunma left a comment

Choose a reason for hiding this comment

axel-lauer commented Jan 28, 2025

schlunma left a comment

Choose a reason for hiding this comment

axel-lauer commented May 16, 2024 •

edited by schlunma

Loading

schlunma left a comment •

edited

Loading