summaryrefslogtreecommitdiff
path: root/CHANGELOG
AgeCommit message (Expand)Author
2017-11-19Revived the GEMM routine tuner; minor formatting changesCedric Nugteren
2017-11-09Added tuning results for the GeForce GTX750TiCedric Nugteren
2017-11-08Updated to CLBlast version 1.2.0Cedric Nugteren
2017-11-07Merge pull request #212 from CNugteren/kernel_selection_tunerCedric Nugteren
2017-11-02Integrated the GEMM routine tuner for kernel selection; added first tuning re...Cedric Nugteren
2017-10-29Made it possible to compile the CLBlast performance clients for Android with ...Cedric Nugteren
2017-10-27Fixed a bug when using the matrix A-offset argument for the TRSM routineCedric Nugteren
2017-10-27Added GEMV synchronisation for the TRSV routine: similar bug as in TRSMCedric Nugteren
2017-10-25Fixed a bug in TRSM routine due to missing event synchronisations after GEMM ...Cedric Nugteren
2017-10-20Added tuning parameters for GeForce GTX 580, GeForce GTX 1080Ti, and Core i5-...Cedric Nugteren
2017-10-16Added CUDA API documentationCedric Nugteren
2017-10-03Gemm in-direct implementation now uses only 1 larger instead of max 3 optiona...Cedric Nugteren
2017-10-01Allow OverrideParameters function to work before a kernel was first usedCedric Nugteren
2017-09-30Kernels are now cached based on their routine name and their tuning parametersCedric Nugteren
2017-09-30Updated to version 1.1.0Cedric Nugteren
2017-09-23Added extra benchmarks to verify new database caching keys performanceCedric Nugteren
2017-09-22Added OpenCL properties printing to the diagnostics helperCedric Nugteren
2017-09-16Added tuning results for Intel Core i7 6770HQCedric Nugteren
2017-09-16Improved compilation time of the tuner databaseCedric Nugteren
2017-09-14Added architecture layer in the tuning database for better performance on uns...Cedric Nugteren
2017-09-04Removed an assumption that the 'default' tuning parameters have to be stored ...Cedric Nugteren
2017-08-24Merge branch 'master' into im_to_colCedric Nugteren
2017-08-24Completed im2col implementationCedric Nugteren
2017-08-21Minor updates after merging in the PSO addition to the tunersCedric Nugteren
2017-08-08Updated to version 1.0.1 (bugfix release)Cedric Nugteren
2017-07-30Updated to version 1.0.0Cedric Nugteren
2017-07-24Added status badges for correctness tests; updated list of contributors; fixe...Cedric Nugteren
2017-06-30Fixed an if-statement in the direct GEMM kernel causing a bug with specific s...Cedric Nugteren
2017-06-26Fixed and suppresses several warnings for MSVCCedric Nugteren
2017-06-21Fixes some compilation issues related to the database structure changeCedric Nugteren
2017-06-01Added tuning results for GeForce GT 650M (thanks to bzcheeseman)Cedric Nugteren
2017-05-12Added the IxAMIN routines: absolute minimum version of IxAMAXCedric Nugteren
2017-05-12Fixed a bug in the TRSM routine; tests now passCedric Nugteren
2017-05-12Removed the included performance reports; README now redirects to the new ext...Cedric Nugteren
2017-05-11Added tuning results for the AMD Radeon Fiji GPUCedric Nugteren
2017-05-11Minor naming fixes to the benchmark scriptCedric Nugteren
2017-05-02Updated to version 0.11.0Cedric Nugteren
2017-04-16Finalized support for performance testing against cuBLASCedric Nugteren
2017-04-10Updated the changelog with the Apple CPU overrideCedric Nugteren
2017-03-26Replaced the R graph scripts with Python/Matplotlib benchmark scriptsCedric Nugteren
2017-03-11Added initial naive version of the batched GEMM routine based on the direct G...Cedric Nugteren
2017-03-10Added proper testing of the alpha parameter; finalized the batched AXPY imple...Cedric Nugteren
2017-02-27Added L2 error computation and checking for half-precision testsCedric Nugteren
2017-02-27Fixed half-precision bugs in HTBMV/HTPMV/HTRMV/HSYR2K/HTRMM related to incorr...Cedric Nugteren
2017-02-26Merge branch 'development' into triangular_solversCedric Nugteren
2017-02-25Added documentation for the TRSV and TRSM routinesCedric Nugteren
2017-02-18Added documentation for the OverrideParameters functionCedric Nugteren
2017-01-24Updated the changelog for PR131 and PR132Cedric Nugteren
2017-01-07Updated the link to cl.hpp in the Khronos registry for the samplesCedric Nugteren
2017-01-07Always enables cl_khr_fp64 when running double-precision, not just for OpenCL...Cedric Nugteren