summaryrefslogtreecommitdiff
path: root/CHANGELOG
AgeCommit message (Collapse)Author
2023-05-21Update to version 1.6.0 (#475)Cedric Nugteren
2023-05-10Fixes an issue under Android when the driver was already unloaded (#462)Cedric Nugteren
2023-05-10TBMV/TPMV/TRSV: Use the minimum x buffer size for copying to a temp buffer ↵Cedric Nugteren
(#461)
2023-05-07TRMV: Use the minimum x buffer size for copying to a temp buffer (#458)Cedric Nugteren
2023-05-07AMAX/AMIN integer testing and bug fixes (#457)Cedric Nugteren
* Fixed a bug in XAMAX/XMIN routines that caused the increment and offset to be included in the result * Perform proper integer-output testing in XAMAX tests * A few changes towards getting it ready for a PR * Also fix compilation for clBLAS and cuBLAS references * Fix a bug that would only use the real part of complex numbers in the amax/amin routines * A few small fixes related to the AMAX tests
2023-03-25Fix documentation bug w.r.t. ld values and matrix layoutCedric Nugteren
2023-01-21Add tuning results for Adreno 740Cedric Nugteren
2023-01-17Updated according to feedback from CNugterenAngus, Alexander
2022-10-13Update changelogCedric Nugteren
2022-09-22Replace the broken khronos registry link for cl.hpp with a new github link ↵Cedric Nugteren
for opencl.hpp
2022-09-22Update to version 1.5.3Cedric Nugteren
2022-05-13dev versionJustin Graham
2022-05-13changelog messageJustin Graham
2021-08-19Updated README and tuning listCedric Nugteren
2021-01-19Updated to version 1.5.2Cedric Nugteren
2020-10-10Add tuning results for Radeon RX VegaCedric Nugteren
2020-06-07Add a cautionary note in Program::GetIR and mention the fix in CHANGELOGPradeep Garigipati
2020-05-12Added CLBLAST_VERSION_MAJOR/MINOR/PATCH defines in headers to store version ↵Cedric Nugteren
numbering
2020-05-10Made sure that the global workgroup size is a multiple of the local size in ↵Cedric Nugteren
the tuners
2020-05-10Updated PyCLBlast version numberCedric Nugteren
2020-03-08Update API documentationCedric Nugteren
2020-03-08Silenced a new OpenCL warning messageCedric Nugteren
2020-02-18Updated to version 1.5.1Cedric Nugteren
2020-02-17Catches all exceptions of the tunersCedric Nugteren
2019-09-06Added notion of fixes in XhadFasterCedric Nugteren
2019-05-19Fixed a bug in the absolute-min index kernelCedric Nugteren
2019-02-09Added tuning parameters for Xeon E5-2630 v3 and v4Cedric Nugteren
2018-12-21Update changelogKoichi Akabe
2018-12-04Updated to version 1.5.0Cedric Nugteren
2018-12-01Added a FAQ documentCedric Nugteren
2018-11-30Fixed an issue for unequal MWG and NWG and the new GEMMK == 1 kernelCedric Nugteren
2018-11-01Added new col2im routine to the documentationCedric Nugteren
2018-10-15Fixed a bug in the XaxpyFaster kernel for specific parametersCedric Nugteren
2018-10-14Merge pull request #319 from CNugteren/convgemm_multi_kernelCedric Nugteren
First im2col+GEMM implementation of convolution
2018-10-13Updated changelog regarding tuning API changeCedric Nugteren
2018-09-16Merge branch 'master' into convgemm_multi_kernelCedric Nugteren
2018-09-15Disabled Intel subgroup shuffling for double-precisionCedric Nugteren
2018-09-07Added xCONVGEMM as im2col plus a batched GEMM kernelCedric Nugteren
2018-08-07Name change of setting to NETLIB_PERSISTENT_OPENCLCedric Nugteren
2018-08-05Added an option to compile the Netlib API with static OpenCL device and contextCedric Nugteren
2018-07-31Fixed issue with not performing complex conjugation under certain cases when ↵Cedric Nugteren
transposing
2018-07-28The tuners now also check for valid local thread configurations and skip ↵Cedric Nugteren
invalid ones completely, saving compilation time
2018-07-27Fixed an issue with AMD GPUs and the new GEMMK == 1 kernelCedric Nugteren
2018-07-25Added code to report the average tuning resultsCedric Nugteren
2018-07-14Updated to CLBlast version 1.4.1Cedric Nugteren
2018-07-13Added tuning results for HD Graphics 6000 Broadwell GT3Cedric Nugteren
2018-07-06Updated changelogCedric Nugteren
2018-06-28Disabled calls to clReleaseProgram under Windows to avoid segfaults when the ↵Cedric Nugteren
OpenCL driver unloads first
2018-06-03Updated to CLBlast version 1.4.0Cedric Nugteren
2018-06-02Added MKL as an alternative for CBLAS for correctness and performance ↵Cedric Nugteren
comparisons