
Feature/mask NaNs in training loss function #56

Draft · wants to merge 8 commits into base: develop

Conversation

@sahahner (Member) commented Oct 2, 2024

Variables with missing values that are imputed by the imputer should not be considered in the loss.

The NaN masks are prepared in the imputer. The remapper contains a new function to remap the NaN masks from the imputer.

This goes together with PR #72 from anemoi-training.
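The intent described above can be sketched as a loss that simply excludes imputed positions. This is a minimal illustration, not the actual anemoi-models API; the function name and the convention that the mask is True where the input was NaN are assumptions:

```python
import torch


def masked_mse(pred: torch.Tensor, target: torch.Tensor, nan_mask: torch.Tensor) -> torch.Tensor:
    """MSE over valid (non-NaN) entries only; imputed positions contribute nothing."""
    # nan_mask is True where the original data was NaN, i.e. where the imputer filled a value
    valid = ~nan_mask
    sq_err = (pred - target) ** 2
    # zero out imputed positions and normalise by the number of valid entries,
    # so the loss scale stays comparable across batches with different NaN counts
    return (sq_err * valid).sum() / valid.sum().clamp(min=1)


# example: the middle entry was NaN and imputed to 0.0
target = torch.tensor([1.0, 0.0, 3.0])
nan_mask = torch.tensor([False, True, False])
pred = torch.tensor([1.0, 5.0, 3.0])
loss = masked_mse(pred, target, nan_mask)
print(float(loss))  # 0.0 — the large error at the imputed position is ignored
```

Without the mask, the same prediction would be penalised heavily for disagreeing with an imputed placeholder value, which is exactly what this PR avoids.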

@codecov-commenter commented Oct 2, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 99.84%. Comparing base (f96bcf9) to head (87647b7).

Additional details and impacted files
@@           Coverage Diff            @@
##           develop      #56   +/-   ##
========================================
  Coverage    99.84%   99.84%           
========================================
  Files           23       23           
  Lines         1301     1301           
========================================
  Hits          1299     1299           
  Misses           2        2           


@floriankrb (Member)

This functionality seems to be related to ecmwf/anemoi-training#79. Perhaps the masks.py created by @JPXKQX should move into anemoi-models, and a [refactored version of] OutputMask could be used here?

@JPXKQX (Member) commented Oct 15, 2024

I see some similarities between the output masking and the post-processors, but the part that doesn't fit is that the post-processors are applied only at the end of the rollout. The masking, by contrast, is called not only at the end but also between all the rollout steps (to roll out the boundary forcing). So I am not sure whether it is better to include it as a special post-processor or to leave it in anemoi-training.

I would say we can do the loss masking here, similarly to the imputer, but I think the output masking should remain in anemoi-training.
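A minimal sketch of why the output mask does not fit the end-of-rollout post-processor pattern: it has to be re-applied inside the rollout loop so boundary points are overwritten with the prescribed forcing at every step. All names here are hypothetical, not the anemoi-training implementation:

```python
import torch


def rollout(model, x, boundary_mask, boundary_forcing, n_steps):
    """Toy rollout that re-imposes boundary forcing after *every* model step."""
    preds = []
    for step in range(n_steps):
        x = model(x)
        # overwrite boundary points with the prescribed forcing for this step --
        # this in-loop call is what a plain end-of-rollout post-processor cannot do
        x = torch.where(boundary_mask, boundary_forcing[step], x)
        preds.append(x)
    return preds


# toy usage: a "model" that adds 1 everywhere, with zero boundary forcing
model = lambda t: t + 1
x0 = torch.zeros(4)
mask = torch.tensor([True, False, False, False])  # first point is a boundary point
forcing = torch.zeros(2, 4)                       # prescribed boundary values per step
preds = rollout(model, x0, mask, forcing, n_steps=2)
print(preds[-1])  # tensor([0., 2., 2., 2.]) — the boundary point is held at its forcing
```

If the mask were applied only once at the end, the boundary value would still have drifted through the intermediate steps and contaminated the interior predictions.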
