author Cedric Nugteren <web@cedricnugteren.nl> 2021-08-19 20:37:46 +0200
committer Cedric Nugteren <web@cedricnugteren.nl> 2021-08-19 20:37:46 +0200
commit c2951b8a2a3205fca0276cc93c1c8b7c09d119ab
tree da5e747b95442a0108d4e0a17d798c08401e992d
parent 5a9bd270f80bd206ae4e5f047154d60dbfc044b2
Updated README and tuning list
-rw-r--r-- CHANGELOG 5
-rw-r--r-- doc/tuning.md 6
2 files changed, 11 insertions, 0 deletions
diff --git a/CHANGELOG b/CHANGELOG
index e3614d80..424cab04 100644
--- a/CHANGELOG
+++ b/CHANGELOG
@@ -1,3 +1,8 @@
+Development version (next version)
+- Fix a correctness issue with DGEMM on SM 7.5 Turing GPUs
+- Various minor fixes and enhancements
+- Added tuned parameters for various devices (see doc/tuning.md)
+
 Version 1.5.2
 - Changed XAMAX/XAMIN to more likely return first rather than last min/max index, updated API docs
 - Added batched routines to pyclblast
diff --git a/doc/tuning.md b/doc/tuning.md
index 793bce88..200cb7cc 100644
--- a/doc/tuning.md
+++ b/doc/tuning.md
@@ -34,6 +34,11 @@ The CLBlast library is already tuned for the most commonly used OpenCL devices a
  - Tesla K20m
  - Tesla K40m
  - Tesla P100 16GB
+ - Tesla T4
+ - Tesla V100
+ - Quadro T2000
+ - Quadro GV100
+ - A100
 * AMD GPUs:
  - Radeon HD 6750M
  - Radeon HD 6770M
@@ -66,6 +71,7 @@ The CLBlast library is already tuned for the most commonly used OpenCL devices a
  - Core i7-4790K
  - Core i7-5930K
  - Core i7-6770HQ
+ - Core i9-9980HK
  - Xeon E5-2630 v3
  - Xeon E5-2630 v4
 * Other devices: