summaryrefslogtreecommitdiff
path: root/CHANGELOG
diff options
context:
space:
mode:
Diffstat (limited to 'CHANGELOG')
-rw-r--r--CHANGELOG20
1 files changed, 20 insertions, 0 deletions
diff --git a/CHANGELOG b/CHANGELOG
index f815d316..3b8b1123 100644
--- a/CHANGELOG
+++ b/CHANGELOG
@@ -1,3 +1,23 @@
+Development version (next version)
+-
+
+Version 1.6.0
+- Modifications to improve performance on Qualcomm Adreno GPUs:
+ * Unique database entries for specific Adreno devices
+ * Toggle OpenCL kernel compilation options for Adreno
+ * New preprocessor directive RELAX_WORKGROUP_SIZE
+- Fixed a bug in handling of #undef in CLBlast loop unrolling and array-to-register mapping functions
+- Fixed a bug in XAMAX/XAMIN routines related to inadvertently including the increment and offset in the result
+- Fixed a bug in XAMAX/XAMIN routines that would cause only the real part of a complex number to be taken into account
+- Fixed a bug that caused tests to not properly do integer-output testing (for XAMAX/XAMIN)
+- Fixes a minor issue with the expected input buffer size in the TRMV/TBMV/TPMV/TRSV routines
+- Fixes an issue with crashes on Android related to calling clReleaseProgram
+- Fixes two small issues in the plotting script
+- Fixed a documentation bug in the 'ld' requirements
+- Enabled Github Actions CI builds for testing and releasing
+- Various minor fixes and enhancements
+- Added tuned parameters for various devices (see doc/tuning.md)
+
Version 1.5.3
- Fix a correctness issue with DGEMM on SM 7.5 Turing GPUs
- Various minor fixes and enhancements