DLA-Future 0.5.0
Changes
- Introduced an option (*) for forcing contiguous GPU communication buffers. (#1096)
- Introduced an option (*) for enabling GPU-aware MPI communication. (#1102)
- Removed special handling of Intel MKL, as it could lead to broken installations. (#1149)
  - Spack installations: spack will set the correct variables.
  - Manual installations: the user is responsible for setting the variables correctly (see BUILD.md).
(*) These options are available as spack variants.
Performance improvements
- Algorithms no longer communicate when using single-rank communicators. (#1097)
- Fixed slow performance of the local version of bt_band_to_tridiagonal. (#1144)