clBLAS-2.6.0 Release for ACL 1.0 Beta 1

@TimmyLiu TimmyLiu released this 06 Aug 16:26

This clBLAS release, tagged v2.6, is part of AMD Compute Libraries (ACL) 1.0 beta 1. It is based on a merge from the develop branch into the master branch.

The highlights of the release:

  • Introduced offline kernel compilation
  • Improved performance (with offline kernel compilation) of
    • sgemm small matrices NN, TN, NT
    • sgemm large matrices NN, TN, NT
    • zgemm large matrices NT, where m, n, k are multiples of 32, 64, 8 respectively
    • dtrsm large matrices, where m and n are multiples of 192
  • Incorporated some CMake configuration changes
  • Released binaries now include an offline-compiled library for specific device and driver combinations.
    • "clBLAS-2.6.0-Windows-x64-Hawaii-14502.zip" is a binary built for the Hawaii device with the 14.502 driver on the Windows platform
    • A binary built for the Hawaii device with the 14.502 driver on the Linux platform will be released once the driver is released
    • (update 08/06/2015) "clBLAS-2.6.0-Linux-x64-Hawaii-14502.tar.gz" is a binary built for the Hawaii device with the 14.502 driver on the Linux platform
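
The size conditions listed for zgemm NT and dtrsm can be checked before choosing problem dimensions. The following is a minimal sketch, not part of clBLAS: the function names are hypothetical, and the notes above do not state what counts as a "large" matrix, so only the divisibility conditions are tested.

```python
# Hypothetical helpers (not clBLAS API): they only encode the divisibility
# conditions quoted in the highlights of this release.

def zgemm_nt_meets_multiples(m: int, n: int, k: int) -> bool:
    """True when m, n, k are positive multiples of 32, 64, 8 respectively."""
    return (
        m > 0 and n > 0 and k > 0
        and m % 32 == 0
        and n % 64 == 0
        and k % 8 == 0
    )


def dtrsm_meets_multiples(m: int, n: int) -> bool:
    """True when m and n are both positive multiples of 192."""
    return m > 0 and n > 0 and m % 192 == 0 and n % 192 == 0


if __name__ == "__main__":
    print(zgemm_nt_meets_multiples(4096, 4096, 4096))
    print(dtrsm_meets_multiples(3840, 3840))
```

Sizes that fail these checks are presumably still supported by the library, but the notes above only claim the improvement for sizes that satisfy them.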