path: root/CHANGELOG
Age | Commit message | Author
2020-02-18 | Updated to version 1.5.1 | Cedric Nugteren
2020-02-17 | Catches all exceptions of the tuners | Cedric Nugteren
2019-09-06 | Added notion of fixes in XhadFaster | Cedric Nugteren
2019-05-19 | Fixed a bug in the absolute-min index kernel | Cedric Nugteren
2019-02-09 | Added tuning parameters for Xeon E5-2630 v3 and v4 | Cedric Nugteren
2018-12-21 | Update changelog | Koichi Akabe
2018-12-04 | Updated to version 1.5.0 | Cedric Nugteren
2018-12-01 | Added a FAQ document | Cedric Nugteren
2018-11-30 | Fixed an issue for unequal MWG and NWG and the new GEMMK == 1 kernel | Cedric Nugteren
2018-11-01 | Added new col2im routine to the documentation | Cedric Nugteren
2018-10-15 | Fixed a bug in the XaxpyFaster kernel for specific parameters | Cedric Nugteren
2018-10-14 | Merge pull request #319 from CNugteren/convgemm_multi_kernel | Cedric Nugteren
2018-10-13 | Updated changelog regarding tuning API change | Cedric Nugteren
2018-09-16 | Merge branch 'master' into convgemm_multi_kernel | Cedric Nugteren
2018-09-15 | Disabled Intel subgroup shuffling for double-precision | Cedric Nugteren
2018-09-07 | Added xCONVGEMM as im2col plus a batched GEMM kernel | Cedric Nugteren
2018-08-07 | Name change of setting to NETLIB_PERSISTENT_OPENCL | Cedric Nugteren
2018-08-05 | Added an option to compile the Netlib API with static OpenCL device and context | Cedric Nugteren
2018-07-31 | Fixed issue with not performing complex conjugation under certain cases when ... | Cedric Nugteren
2018-07-28 | The tuners now also check for valid local thread configurations and skip inva... | Cedric Nugteren
2018-07-27 | Fixed an issue with AMD GPUs and the new GEMMK == 1 kernel | Cedric Nugteren
2018-07-25 | Added code to report the average tuning results | Cedric Nugteren
2018-07-14 | Updated to CLBlast version 1.4.1 | Cedric Nugteren
2018-07-13 | Added tuning results for HD Graphics 6000 Broadwell GT3 | Cedric Nugteren
2018-07-06 | Updated changelog | Cedric Nugteren
2018-06-28 | Disabled calls to clReleaseProgram under Windows to avoid segfaults when the ... | Cedric Nugteren
2018-06-03 | Updated to CLBlast version 1.4.0 | Cedric Nugteren
2018-06-02 | Added MKL as an alternative for CBLAS for correctness and performance compari... | Cedric Nugteren
2018-05-19 | The GEMM routine tuner now loads kernel JSON tuning results from disk if avai... | Cedric Nugteren
2018-05-18 | Merge branch 'master' into canary_buffer_overflow_protection | Cedric Nugteren
2018-05-17 | Added a canary region for overflow detection to the correctness tests | Cedric Nugteren
2018-05-01 | Now stores a shared_ptr to the Program class in the cache | Cedric Nugteren
2018-04-29 | Merge pull request #277 from CNugteren/CLBlast-257-intel-subgroups | Cedric Nugteren
2018-04-29 | Updated the changelog | Cedric Nugteren
2018-04-26 | Fixed an access violation when compiled with Visual Studio upon releasing the... | Cedric Nugteren
2018-04-15 | Updated tuning results for the Skylake ULT GT2 GPU with the new kernel | Cedric Nugteren
2018-04-07 | Added tuning results for NVIDIA GeForce 920MX | Cedric Nugteren
2018-03-22 | Added the OpenCL local memory size constraint to the tuners | Cedric Nugteren
2018-03-10 | Updated the documentation for the tuner API | Cedric Nugteren
2018-02-26 | Updated the changelog | Cedric Nugteren
2018-02-20 | Fixed several issues in the new invert tuner | Cedric Nugteren
2018-02-18 | Updated changelog and roadmap: Python package created | Cedric Nugteren
2018-02-02 | Implemented the XHAD Hadamard product routine | Cedric Nugteren
2018-01-29 | Updated to CLBlast version 1.3.0 | Cedric Nugteren
2018-01-11 | Added a RetrieveParameters function to inspect tuning parameters | Cedric Nugteren
2018-01-08 | Implemented the in-direct version of the strided-batched GEMM kernel | Cedric Nugteren
2018-01-06 | Updated changelog and roadmap | Cedric Nugteren
2017-12-31 | Fixed the issue with AMD's APP compiler not being able to compile the invert ... | Cedric Nugteren
2017-12-27 | Split the database into multiple small compilation units | Cedric Nugteren
2017-12-23 | Updated the database to use the new TRSV and Invert tuners | Cedric Nugteren