summaryrefslogtreecommitdiff
path: root/CHANGELOG
diff options
context:
space:
mode:
authorcnugteren <web@cedricnugteren.nl>2016-05-15 14:04:34 +0200
committercnugteren <web@cedricnugteren.nl>2016-05-15 14:04:34 +0200
commit9065b3468478818e9c5918380af665f2d499a322 (patch)
treeeb505cb765d7375125b8423ce2f8079602efb408 /CHANGELOG
parent1c72d225c53c123ed810cf3f56f5c92603f7f791 (diff)
Added support for staggered/shuffled offsets for GEMM to improve performance for large power-of-2 kernels on AMD GPUs
Diffstat (limited to 'CHANGELOG')
-rw-r--r--CHANGELOG2
1 files changed, 1 insertions, 1 deletions
diff --git a/CHANGELOG b/CHANGELOG
index 92c0c5ad..187fca73 100644
--- a/CHANGELOG
+++ b/CHANGELOG
@@ -1,6 +1,6 @@
Development version (next release)
--
+- Improved performance of large power-of-2 xGEMM kernels for AMD GPUs
Version 0.7.0
- Added exports to be able to create a DLL on Windows (thanks to Marco Hutter)