AOMP Release 0.6-4
Pre-release
Pre-release
Like 0.6-3, this release 0.6-4 of aomp is based off the stable version of clang/llvm 8.0.
These are the changes found in 0.6-4 compared to the previous 0.6-3 release.
- support for building on SLES15 SP1
- rpm package for SLES15 SP1
- do not create a host thread for GPU hostcall services if no services are used by any kernel in the application. This fixes a performance regression we saw with openmpapps in 0.6-3 because none of those apps currently use printf on the device. This still needs more study.
- Reorganized the github README and linked pages to make it less confusing and to ready support for more platforms.
- removed hip wrapper scripts such as hipcc. Users must compile hip with clang++ as demonstrated in the examples to get openmp support with hip.
- properly set amdgpu-flat-work-group-size for generic mode: add wave_size
- add -lelf to link step of libomptarget.rtl.hsa.so
- more gracefully exit when gpu arch of kernel does not match device arch
- refine LIBPOMPTARGET_KERNEL_TRACE 1=>minimal, 2=>verbos'er