summaryrefslogtreecommitdiff
path: root/CHANGELOG
diff options
context:
space:
mode:
authorCedric Nugteren <web@cedricnugteren.nl>2023-01-21 20:28:32 +0100
committerGitHub <noreply@github.com>2023-01-21 20:28:32 +0100
commite72f87ae5eca5e2ea8aea4f2ce49408c1faa0521 (patch)
tree588a426b6350a5c982d89d98749ae78667fd23b4 /CHANGELOG
parent03cffa83c5f7742f8ec0c5e762bb7048e38952f3 (diff)
parent73f49e9b3d4abc4214122e4b8c07a736e01626ee (diff)
Merge pull request #451 from CodeLinaro/master
CLBlast modifications to address Qualcomm Adreno performance
Diffstat (limited to 'CHANGELOG')
-rw-r--r--CHANGELOG5
1 files changed, 5 insertions, 0 deletions
diff --git a/CHANGELOG b/CHANGELOG
index 7088fb49..824f4520 100644
--- a/CHANGELOG
+++ b/CHANGELOG
@@ -1,5 +1,10 @@
Development version (next version)
- Fixes two small issues in the plotting script
+- Modifications to improve performance on Qualcomm Adreno GPUs:
+ * Unique database entries for specific Adreno devices
+ * Toggle OpenCL kernel compilation options for Adreno
+ * New preprocessor directive RELAX_WORKGROUP_SIZE
+- Fixed a bug in handling of #undef in CLBlast loop unrolling and array-to-register mapping functions
Version 1.5.3
- Fix a correctness issue with DGEMM on SM 7.5 Turing GPUs