From a76dc2f09c283d6ee8d92e1c4ababc6a7cb22c91 Mon Sep 17 00:00:00 2001
From: CNugteren
Date: Fri, 24 Jul 2015 08:16:41 +0200
Subject: Updated the docs to reflect the performance improvements

---
 CHANGELOG | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/CHANGELOG b/CHANGELOG
index c6cf612a..0fee63af 100644
--- a/CHANGELOG
+++ b/CHANGELOG
@@ -1,7 +1,8 @@
 
 Development version (next release)
 - Re-organized test/client infrastructure to avoid code duplication
-- Bypasses pre/post-processing kernels if possible (in level-3 routines)
+- Added an optional bypass for pre/post-processing kernels in level-3 routines
+- Significantly improved performance of level-3 routines on AMD GPUs
 - Added level-3 routines:
   * CHEMM/ZHEMM
   * SSYRK/DSYRK/CSYRK/ZSYRK
--
cgit v1.2.3