Reduction binary ops on floating-points (and NaN) might not be strictly associative/commutative #550

davidozog · 2024-10-09T16:04:19Z

Problem Statement

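Binary reduction operations on floating-point values (including NaN) might not be strictly associative or commutative, so the result of a reduction can depend on the order in which the operands are combined.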

The result should be the same on all PEs.
The result should be the same from run to run.
The result may be different on different architectures, due to differing arithmetic.
On IEEE 754-compatible platforms, the result should be identical across platforms and across runs.
We might consider new APIs or environment variables that enforce more consistent results.
A team could take an optional config parameter indicating a desire for a well-ordered reduction (compare C++ std::reduce, which may regroup and reorder operands, with std::accumulate, which folds strictly left to right).
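The difference between a well-ordered and an unordered reduction can be sketched in C++ as follows. This is only an illustration of the idea, not a proposed OpenSHMEM API. `ordered_sum` and `tree_sum` are hypothetical names, and the tree shape is just one grouping an unordered reduction might choose.

```cpp
#include <cstddef>
#include <numeric>
#include <vector>

// Fixed-order reduction: std::accumulate folds strictly left to right,
// so the grouping is always (((0.0 + v[0]) + v[1]) + v[2]) + ...
double ordered_sum(const std::vector<double>& v) {
    return std::accumulate(v.begin(), v.end(), 0.0);
}

// Hypothetical tree-shaped reduction over the half-open range [lo, hi):
// sums each half recursively, then combines the two partial sums.
double tree_sum(const std::vector<double>& v, std::size_t lo, std::size_t hi) {
    if (hi - lo == 0) return 0.0;
    if (hi - lo == 1) return v[lo];
    const std::size_t mid = lo + (hi - lo) / 2;
    return tree_sum(v, lo, mid) + tree_sum(v, mid, hi);
}
```

For the input {1e20, 1.0, -1e20, 1.0}, `ordered_sum` returns 1.0, whereas `tree_sum(v, 0, v.size())` returns 0.0 because each 1.0 is absorbed by a large operand before the large values cancel. A fixed order makes the result reproducible, usually at the cost of the freedom to regroup operands for performance.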

davidozog added this to the OpenSHMEM 1.7 milestone Oct 9, 2024

davidozog mentioned this issue Oct 9, 2024

reductions: add note about FP associativity davidozog/openshmem-specification#16

Merged
