author Cedric Nugteren <web@cedricnugteren.nl> 2017-12-31 16:13:13 +0100
committer Cedric Nugteren <web@cedricnugteren.nl> 2017-12-31 16:13:13 +0100
commit ad483123e6f7aab223417d8387baf74ae098a2a2
tree 9904f1d5bf352004d15c280ccbf3bc95f09871f5 /CHANGELOG
parent 1511909b6ffeb1cc1f3ee6b414c079e35a72a60d
Fixed the issue with AMD's APP compiler not being able to compile the invert kernel
Diffstat (limited to 'CHANGELOG')
-rw-r--r-- CHANGELOG 1
1 files changed, 1 insertions, 0 deletions
diff --git a/CHANGELOG b/CHANGELOG
index e2f0d872..d49cb3f5 100644
--- a/CHANGELOG
+++ b/CHANGELOG
@@ -5,6 +5,7 @@ Development (next version)
 - Added OpenCL pre-processor to unroll loops and perform array-to-register promotions for compilers
   which don't do this themselves (ARM Mali) - greatly improves performance on these platforms
 - Added first tuners for the TRSV (block size) and TRSM (invert kernel) routines
+- Fixed an issue with a crashing/hanging AMD APP compiler with the TRSM routine (invert kernel)
 - Improved compilation time by splitting the tuning database into multiple compilation units
 - Various minor fixes and enhancements
 - Added tuned parameters for various devices (see README)