diff options
author | Cedric Nugteren <web@cedricnugteren.nl> | 2016-05-30 11:11:28 +0200 |
---|---|---|
committer | Cedric Nugteren <web@cedricnugteren.nl> | 2016-05-30 11:11:28 +0200 |
commit | 61105e38100d323ea270f2cbee0a824d401eaa77 (patch) | |
tree | a6f8af9f6e75b57870bfce119f037093a46d2e9c /CHANGELOG | |
parent | 182d2cffa163688e2ae08d5d526f8eb63914b6ac (diff) | |
parent | 03182f9d07533f795a498936391da744d982e8e2 (diff) |
Merge branch 'half_precision' into development
Diffstat (limited to 'CHANGELOG')
-rw-r--r-- | CHANGELOG | 11 |
1 files changed, 10 insertions, 1 deletions
@@ -1,6 +1,15 @@ Development version (next release) -- +- Added support for half-precision floating-point (fp16) in the library +- Added half-precision routines: + * Level-1: HSWAP/HSCAL/HCOPY/HAXPY/HDOT/HNRM2/HASUM/HSUM/iHAMAX/iHMAX/iHMIN + * Level-2: HGEMV/HGBMV/HHEMV/HHBMV/HHPMV/HSYMV/HSBMV/HSPMV/HTRMV/HTBMV/HTPMV/HGER/HSYR/HSPR/HSYR2/HSPR2 + * Level-3: HGEMM/HSYMM/HSYRK/HSYR2K/HTRMM + +Version 0.7.1 +- Improved performance of large power-of-2 xGEMM kernels for AMD GPUs +- Fixed a bug in the xGEMM routine related to the event incorrectly set +- Made MSVC link the run-time libraries statically Version 0.7.1 - Improved performance of large power-of-2 xGEMM kernels for AMD GPUs |