Commit 5f36f18

Update with 0.3.7 changes
1 parent d47fe78 commit 5f36f18

1 file changed: 43 additions, 0 deletions

Changelog.txt

Lines changed: 43 additions & 0 deletions
@@ -1,4 +1,47 @@
 OpenBLAS ChangeLog
+====================================================================
+Version 0.3.7
+11-Aug 2019
+
+common:
+    * having the gmake special variables TARGET_ARCH or TARGET_MACH
+      defined no longer causes build failures in ctest or utest
+    * defining NO_AFFINITY or USE_TLS to 0 in gmake builds no longer
+      has the same effect as setting them to 1
+    * a new test program was added to allow checking the library for
+      thread safety
+    * a new option USE_LOCKING was added to ensure thread safety when
+      OpenBLAS itself is built without multithreading but will be
+      called from multiple threads.
+    * a build failure on Linux with glibc versions earlier than 2.5
+      was fixed
+    * a runtime error with CPU enumeration (and NO_AFFINITY not set)
+      on glibc 2.6 was fixed
+    * NO_AFFINITY was added to the CMAKE options (and defaults to being
+      active on Linux, as in the gmake builds)
+
+x86_64:
+    * the build-time logic for detection of AVX512 availability in
+      the processor and compiler was fixed
+    * gmake builds on OSX now set the internal name of the library to
+      libopenblas.0.dylib (consistent with CMAKE)
+    * the Haswell DGEMM kernel received a significant speedup through
+      improved prefetch and load instructions
+    * performance of DGEMM, DTRMM, DTRSM and ZDOT on Zen/Zen2 was markedly
+      increased by avoiding vpermpd instructions
+    * the SKYLAKEX (AVX512) DGEMM helper functions have now been disabled
+      to fix remaining errors in DGEMM, DSYMM and DTRMM
+
+## POWER:
+    * added support for building on FreeBSD/powerpc64 and FreeBSD/ppc970
+    * added optimized kernels for POWER9 single and double precision complex BLAS3
+    * added optimized kernels for POWER9 SGEMM and STRMM
+
+## ARMV7:
+    * fixed the softfp implementations of xAMAX and IxAMAX
+    * removed the predefined -march= flags on both ARMV5 and ARMV6 as
+      they were appropriate for only a subset of platforms
+
 ====================================================================
 Version 0.3.6
 29-Apr-2019
