DLA-Future 0.5.0

@msimberg msimberg released this 31 May 12:40
· 40 commits to master since this release
v0.5.0
88f2543

Changes

  • Introduced an option (*) for forcing contiguous GPU communication buffers. (#1096)
  • Introduced an option (*) for enabling GPU-aware MPI communication. (#1102)
  • Removed special handling of Intel MKL, as it could lead to broken installations. (#1149)
    • Spack installations: Spack sets the correct variables.
    • Manual installations: the user is responsible for setting the variables correctly (see BUILD.md).

(*) These options are available as Spack variants.

Performance improvements

  • Don't communicate in algorithms when using single rank communicators. (#1097)
  • Fixed slow performance of the local version of bt_band_to_tridiagonal. (#1144)

Bug fixes

  • Implemented a workaround for hipMemcpyDefault 2D memcpys, due to bugs in HIP. (#1106)
  • Miniapps now initialize HIP before MPI, because on older Cray MPICH versions initializing HIP after MPI leaves HIP unable to see any devices. (#1090)