author Cedric Nugteren <web@cedricnugteren.nl> 2016-05-30 11:11:28 +0200
committer Cedric Nugteren <web@cedricnugteren.nl> 2016-05-30 11:11:28 +0200
commit 61105e38100d323ea270f2cbee0a824d401eaa77 (patch)
tree a6f8af9f6e75b57870bfce119f037093a46d2e9c /CHANGELOG
parent 182d2cffa163688e2ae08d5d526f8eb63914b6ac (diff)
parent 03182f9d07533f795a498936391da744d982e8e2 (diff)
Merge branch 'half_precision' into development
Diffstat (limited to 'CHANGELOG')
-rw-r--r-- CHANGELOG 11
1 files changed, 10 insertions, 1 deletions
diff --git a/CHANGELOG b/CHANGELOG
index 0d2ad140..d01f674f 100644
--- a/CHANGELOG
+++ b/CHANGELOG
@@ -1,6 +1,15 @@
Development version (next release)
--
+- Added support for half-precision floating-point (fp16) in the library
+- Added half-precision routines:
+ * Level-1: HSWAP/HSCAL/HCOPY/HAXPY/HDOT/HNRM2/HASUM/HSUM/iHAMAX/iHMAX/iHMIN
+ * Level-2: HGEMV/HGBMV/HHEMV/HHBMV/HHPMV/HSYMV/HSBMV/HSPMV/HTRMV/HTBMV/HTPMV/HGER/HSYR/HSPR/HSYR2/HSPR2
+ * Level-3: HGEMM/HSYMM/HSYRK/HSYR2K/HTRMM
+
+Version 0.7.1
+- Improved performance of large power-of-2 xGEMM kernels for AMD GPUs
+- Fixed a bug in the xGEMM routine related to the event incorrectly set
+- Made MSVC link the run-time libraries statically
Version 0.7.1
- Improved performance of large power-of-2 xGEMM kernels for AMD GPUs