diff options
Diffstat (limited to 'CHANGELOG')
-rw-r--r-- | CHANGELOG | 7 |
1 files changed, 2 insertions, 5 deletions
@@ -1,5 +1,7 @@ Development version (next release) +- Made it possible to compile the performance tests (clients) separately from the correctness tests +- Made a reference BLAS and head-to-head performance comparison optional in the clients - Added support for half-precision floating-point (fp16) in the library - Added half-precision routines: * Level-1: HSWAP/HSCAL/HCOPY/HAXPY/HDOT/HNRM2/HASUM/HSUM/iHAMAX/iHMAX/iHMIN @@ -11,11 +13,6 @@ Version 0.7.1 - Fixed a bug in the xGEMM routine related to the event incorrectly set - Made MSVC link the run-time libraries statically -Version 0.7.1 -- Improved performance of large power-of-2 xGEMM kernels for AMD GPUs -- Fixed a bug in the xGEMM routine related to the event incorrectly set -- Made MSVC link the run-time libraries statically - Version 0.7.0 - Added exports to be able to create a DLL on Windows (thanks to Marco Hutter) - Made the library thread-safe |