diff options
author | Cedric Nugteren <web@cedricnugteren.nl> | 2017-12-31 16:13:13 +0100 |
---|---|---|
committer | Cedric Nugteren <web@cedricnugteren.nl> | 2017-12-31 16:13:13 +0100 |
commit | ad483123e6f7aab223417d8387baf74ae098a2a2 (patch) | |
tree | 9904f1d5bf352004d15c280ccbf3bc95f09871f5 /CHANGELOG | |
parent | 1511909b6ffeb1cc1f3ee6b414c079e35a72a60d (diff) |
Fixed the issue with AMD's APP compiler not being able to compile the invert kernel
Diffstat (limited to 'CHANGELOG')
-rw-r--r-- | CHANGELOG | 1 |
1 files changed, 1 insertions, 0 deletions
@@ -5,6 +5,7 @@ Development (next version) - Added OpenCL pre-processor to unroll loops and perform array-to-register promotions for compilers which don't do this themselves (ARM Mali) - greatly improves performance on these platforms - Added first tuners for the TRSV (block size) and TRSM (invert kernel) routines +- Fixed an issue with a crashing/hanging AMD APP compiler with the TRSM routine (invert kernel) - Improved compilation time by splitting the tuning database into multiple compilation units - Various minor fixes and enhancements - Added tuned parameters for various devices (see README) |