path: root/CHANGELOG
Age | Commit message | Author
2018-07-13 | Added tuning results for HD Graphics 6000 Broadwell GT3 | Cedric Nugteren
2018-07-06 | Updated changelog | Cedric Nugteren
2018-06-28 | Disabled calls to clReleaseProgram under Windows to avoid segfaults when the ... | Cedric Nugteren
2018-06-03 | Updated to CLBlast version 1.4.0 | Cedric Nugteren
2018-06-02 | Added MKL as an alternative for CBLAS for correctness and performance compari... | Cedric Nugteren
2018-05-19 | The GEMM routine tuner now loads kernel JSON tuning results from disk if avai... | Cedric Nugteren
2018-05-18 | Merge branch 'master' into canary_buffer_overflow_protection | Cedric Nugteren
2018-05-17 | Added a canary region for overflow detection to the correctness tests | Cedric Nugteren
2018-05-01 | Now stores a shared_ptr to the Program class in the cache | Cedric Nugteren
2018-04-29 | Merge pull request #277 from CNugteren/CLBlast-257-intel-subgroups | Cedric Nugteren
2018-04-29 | Updated the changelog | Cedric Nugteren
2018-04-26 | Fixed an access violation when compiled with Visual Studio upon releasing the... | Cedric Nugteren
2018-04-15 | Updated tuning results for the Skylake ULT GT2 GPU with the new kernel | Cedric Nugteren
2018-04-07 | Added tuning results for NVIDIA GeForce 920MX | Cedric Nugteren
2018-03-22 | Added the OpenCL local memory size constraint to the tuners | Cedric Nugteren
2018-03-10 | Updated the documentation for the tuner API | Cedric Nugteren
2018-02-26 | Updated the changelog | Cedric Nugteren
2018-02-20 | Fixed several issues in the new invert tuner | Cedric Nugteren
2018-02-18 | Updated changelog and roadmap: Python package created | Cedric Nugteren
2018-02-02 | Implemented the XHAD Hadamard product routine | Cedric Nugteren
2018-01-29 | Updated to CLBlast version 1.3.0 | Cedric Nugteren
2018-01-11 | Added a RetrieveParameters function to inspect tuning parameters | Cedric Nugteren
2018-01-08 | Implemented the in-direct version of the strided-batched GEMM kernel | Cedric Nugteren
2018-01-06 | Updated changelog and roadmap | Cedric Nugteren
2017-12-31 | Fixed the issue with AMD's APP compiler not being able to compile the invert ... | Cedric Nugteren
2017-12-27 | Split the database into multiple small compilation units | Cedric Nugteren
2017-12-23 | Updated the database to use the new TRSV and Invert tuners | Cedric Nugteren
2017-12-20 | Added try-except to database script parser to skip invalid files | Cedric Nugteren
2017-12-17 | Removed all ARM Mali tuning results; re-added Mali-T760 and Mali-T628 results... | Cedric Nugteren
2017-12-10 | Updated roadmap: completed pre-processor implementation | Cedric Nugteren
2017-12-09 | Made the pre-processor run by default for ARM and Qualcomm GPUs | Cedric Nugteren
2017-11-24 | Added precision check to parameter override for the clients | Cedric Nugteren
2017-11-19 | Revived the GEMM routine tuner; minor formatting changes | Cedric Nugteren
2017-11-09 | Added tuning results for the GeForce GTX750Ti | Cedric Nugteren
2017-11-08 | Updated to CLBlast version 1.2.0 | Cedric Nugteren
2017-11-07 | Merge pull request #212 from CNugteren/kernel_selection_tuner | Cedric Nugteren
2017-11-02 | Integrated the GEMM routine tuner for kernel selection; added first tuning re... | Cedric Nugteren
2017-10-29 | Made it possible to compile the CLBlast performance clients for Android with ... | Cedric Nugteren
2017-10-27 | Fixed a bug when using the matrix A-offset argument for the TRSM routine | Cedric Nugteren
2017-10-27 | Added GEMV synchronisation for the TRSV routine: similar bug as in TRSM | Cedric Nugteren
2017-10-25 | Fixed a bug in TRSM routine due to missing event synchronisations after GEMM ... | Cedric Nugteren
2017-10-20 | Added tuning parameters for GeForce GTX 580, GeForce GTX 1080Ti, and Core i5-... | Cedric Nugteren
2017-10-16 | Added CUDA API documentation | Cedric Nugteren
2017-10-03 | Gemm in-direct implementation now uses only 1 larger instead of max 3 optiona... | Cedric Nugteren
2017-10-01 | Allow OverrideParameters function to work before a kernel was first used | Cedric Nugteren
2017-09-30 | Kernels are now cached based on their routine name and their tuning parameters | Cedric Nugteren
2017-09-30 | Updated to version 1.1.0 | Cedric Nugteren
2017-09-23 | Added extra benchmarks to verify new database caching keys performance | Cedric Nugteren
2017-09-22 | Added OpenCL properties printing to the diagnostics helper | Cedric Nugteren
2017-09-16 | Added tuning results for Intel Core i7 6770HQ | Cedric Nugteren