Add expected value CLI and plugin #1719

tjtg · 2022-05-16T09:15:04Z

Add a plugin and CLI to calculate the expected value from a probability distribution.

The expected value is the mean of random outcomes (eg. ensemble members) and can be used to produce a deterministic "best guess" forecast from a probabilistic forecast as processed by IMPROVER. The expected value will often be similar to the 50th percentile, but may differ, such as in the case of a positively or negatively skewed (asymmetrical) distribution.

The calculation of expected value for threshold probability data added here is a quick to implement method using existing IMPROVER functionality for conversion to percentiles - this has the correct input and output interfaces, but has high memory usage and has an impact on the accuracy of the output data. I expect to add direct calculation from threshold data (via numerical integration over the threshold values) in a future pull request.

Testing:

Ran tests and they passed OK
Added new tests for the new feature(s)

codecov · 2022-05-16T09:45:32Z

Codecov Report

Merging #1719 (4880cde) into master (b18f795) will decrease coverage by 0.11%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master    #1719      +/-   ##
==========================================
- Coverage   98.13%   98.01%   -0.12%     
==========================================
  Files         111      113       +2     
  Lines       10190    10332     +142     
==========================================
+ Hits        10000    10127     +127     
- Misses        190      205      +15

Impacted Files	Coverage Δ
improver/expected_value.py	`100.00% <100.00%> (ø)`
improver/metadata/probabilistic.py	`100.00% <100.00%> (ø)`
improver/feels_like_temperature.py	`100.00% <0.00%> (ø)`
improver/calibration/dataframe_utilities.py	`100.00% <0.00%> (ø)`
improver/calibration/rainforest_calibration.py	`39.28% <0.00%> (ø)`
improver/utilities/spatial.py	`98.86% <0.00%> (+0.04%)`	⬆️
...erate_ancillaries/generate_derived_solar_fields.py	`98.48% <0.00%> (+9.19%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update b18f795...4880cde. Read the comment docs.

anja-bom

Great work Tom, it was amazing to see how easily we can switch from thresholds to percentiles.

MoseleyS · 2022-05-18T07:31:42Z

Something for a reviewer to consider. This PR calculates the mean value of the ensemble members, and the meta-data is updated to say exactly this. An "Expected Value" is a weighted mean, but this PR provides no mechanism for weighting the members. Should the CLI / plugin be renamed?

benowen-bom

This is a good start to adding in the capability to evaluate the mean from an ensemble forecast. There are a couple of points below that need some consideration, but hopefully these should be reasonably straight forward to address.

I think renaming the CLI to ensemble_mean or something similar would be warranted, given (as@MoseleyS highlights) it is the mean being calculated here and strictly speaking not the expectation value.

One thing worth considering is the way multiple ensemble dimensions are handled. Currently the case of two percentile dimensions is identified through a ValueError, but this case is treated as not being a percentile cube. I appreciate that taking the expected value is ambiguous without knowing which percentile dimension to perform the mean over, but I think this case should raise an exception to highlight this ambiguity. The issue becomes even more complicated when one factors in the possibility of mixed ensemble dimensions (for example, both threshold, realization dims present).

improver/metadata/probabilistic.py

improver/expected_value.py

improver_tests/acceptance/test_expected_value.py

improver_tests/expected_value/test_expected_value.py

improver_tests/metadata/test_probabilistic.py

improver/expected_value.py

benowen-bom

Thanks for updates, and your consideration of other points and raising associated issues. This all looks good to me now.

fionaRust

Thanks Tom, could you let me know where I can find your new acceptance test data, so I can take a quick look and have it ready to merge in when this PR gets merged.

improver/metadata/probabilistic.py

improver_tests/expected_value/test_expected_value.py

Co-authored-by: fionaRust <fiona.rust@metoffice.gov.uk>

* Skeleton for expected value * Update style and copyright * Add basic implementation * Add tests for is_percentile * Add expected value tests * Fix imports and tests * Add handling of threshold data via conversion to percentiles * Update tests for threshold calculation * Add acceptance tests * Fix black making flake8 fail * Changes from review comments * Fix unused import * Docstring fix Co-authored-by: fionaRust <fiona.rust@metoffice.gov.uk> Co-authored-by: fionaRust <fiona.rust@metoffice.gov.uk>

tjtg added 6 commits May 13, 2022 10:12

Skeleton for expected value

0e13391

Update style and copyright

8622c81

Add basic implementation

729d6e5

Add tests for is_percentile

5a86b2f

Add expected value tests

b72d658

Fix imports and tests

51e0f5e

tjtg added 3 commits May 17, 2022 11:02

Add handling of threshold data via conversion to percentiles

78a9bf1

Update tests for threshold calculation

3fdcdd3

Add acceptance tests

3372877

tjtg force-pushed the expectedvalue branch from e8e6023 to 3372877 Compare May 18, 2022 03:57

Fix black making flake8 fail

8922b8c

tjtg changed the title ~~WIP: Add expected value CLI and plugin~~ Add expected value CLI and plugin May 18, 2022

anja-bom requested review from anja-bom and benowen-bom May 18, 2022 04:28

anja-bom previously approved these changes May 18, 2022

View reviewed changes

tjtg added the MO review required PRs opened by non-Met Office developers that require a Met Office review label May 18, 2022

benowen-bom reviewed May 19, 2022

View reviewed changes

benowen-bom reviewed May 23, 2022

View reviewed changes

improver/expected_value.py Outdated Show resolved Hide resolved

benowen-bom reviewed May 23, 2022

View reviewed changes

improver/expected_value.py Outdated Show resolved Hide resolved

fionaRust assigned tjtg May 23, 2022

Changes from review comments

817f3a5

tjtg dismissed anja-bom’s stale review via 817f3a5 May 24, 2022 05:19

Fix unused import

529c57c

tjtg mentioned this pull request May 24, 2022

Inconsistency between find_percentile_coordinate and find_threshold_coordinate functions #1723

Open

benowen-bom previously approved these changes May 25, 2022

View reviewed changes

tjtg removed their assignment May 26, 2022

fionaRust self-assigned this May 30, 2022

fionaRust reviewed May 30, 2022

View reviewed changes

improver/metadata/probabilistic.py Outdated Show resolved Hide resolved

improver_tests/expected_value/test_expected_value.py Show resolved Hide resolved

fionaRust assigned tjtg and unassigned fionaRust May 30, 2022

Docstring fix

4880cde

Co-authored-by: fionaRust <fiona.rust@metoffice.gov.uk>

tjtg dismissed benowen-bom’s stale review via 4880cde May 31, 2022 01:03

fionaRust approved these changes May 31, 2022

View reviewed changes

fionaRust merged commit 493f9e4 into metoppv:master May 31, 2022

tjtg mentioned this pull request Jun 3, 2022

Implement expected value via integration over probability thresholds #1734

Merged

2 tasks

tjtg deleted the expectedvalue branch June 24, 2022 03:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add expected value CLI and plugin #1719

Add expected value CLI and plugin #1719

tjtg commented May 16, 2022 •

edited

Loading

codecov bot commented May 16, 2022 •

edited

Loading

anja-bom left a comment

MoseleyS commented May 18, 2022

benowen-bom left a comment

benowen-bom left a comment

fionaRust left a comment •

edited

Loading

Add expected value CLI and plugin #1719

Add expected value CLI and plugin #1719

Conversation

tjtg commented May 16, 2022 • edited Loading

codecov bot commented May 16, 2022 • edited Loading

Codecov Report

anja-bom left a comment

Choose a reason for hiding this comment

MoseleyS commented May 18, 2022

benowen-bom left a comment

Choose a reason for hiding this comment

benowen-bom left a comment

Choose a reason for hiding this comment

fionaRust left a comment • edited Loading

Choose a reason for hiding this comment

tjtg commented May 16, 2022 •

edited

Loading

codecov bot commented May 16, 2022 •

edited

Loading

fionaRust left a comment •

edited

Loading