diff options
-rw-r--r-- | CHANGELOG | 1 | ||||
-rw-r--r-- | doc/tuning.md | 2 | ||||
-rw-r--r-- | src/api_common.cpp | 2 |
3 files changed, 3 insertions, 2 deletions
@@ -3,6 +3,7 @@ Development (next version) - Added support for shuffle instructions for NVIDIA GPUs (thanks to 'tyler-utah') - Added an option to compile the Netlib API with static OpenCL device and context (-DNETLIB_PERSISTENT_OPENCL=ON) - The tuners now check beforehand on invalid local thread sizes and skip those completely +- Made the tuning API (OverrideParameters) more flexible, disregarding superfluous parameters - Fixed an issue with conjugate transpose not being executed in certain cases for a.o. XOMATCOPY - Fixed an issue with AMD GPUs and the new GEMMK == 1 kernel - Fixed an issue with the preprocessor and the new GEMMK == 1 kernel diff --git a/doc/tuning.md b/doc/tuning.md index 3117ffad..6243d135 100644 --- a/doc/tuning.md +++ b/doc/tuning.md @@ -201,7 +201,7 @@ These two functions require/retrieve the parameters as given in [src/database/ke | --------------------|-----------------------| | Xaxpy | VW, WGS, WPT | | Xdot | WGS1, WGS2 | -| Xgemv | WGS1, WPT1, UNROLL1 | +| Xgemv | WGS1, WPT1 | | XgemvFast | VW2, WGS2, WPT2 | | XgemvFastRot | VW3, WGS3, WPT3 | | Xger | WGS1, WGS2, WPT | diff --git a/src/api_common.cpp b/src/api_common.cpp index 4e08f1ef..4a4d5f8c 100644 --- a/src/api_common.cpp +++ b/src/api_common.cpp @@ -161,7 +161,7 @@ StatusCode OverrideParameters(const RawDeviceID device, const std::string &kerne // Verifies the parameters size const auto current_parameter_names = current_database.GetParameterNames(); - if (current_parameter_names.size() != parameters.size()) { + if (current_parameter_names.size() > parameters.size()) { return StatusCode::kMissingOverrideParameter; } |