Date  Commit message  Author
2018-02-02  Fixed the XHAD documentation  Cedric Nugteren
2018-01-31  Created the API and stubs for the HAD (hadamard-product) routines  Cedric Nugteren
2018-01-29  Updated to CLBlast version 1.3.0  Cedric Nugteren
2018-01-29  Merge branch 'master' of github.com:CNugteren/CLBlast  Cedric Nugteren
2018-01-29  Fixed a compilation error of the kernel-preprocessor test under MSVC  Cedric Nugteren
2018-01-28  Updated the known issues  Cedric Nugteren
2018-01-27  Some fixes to the benchmark scripts  Cedric Nugteren
2018-01-26  Minor displaying improvements to the graph plotting scripts  Cedric Nugteren
2018-01-26  Fixed an event synchronisation issue in the batched gemm routines  Cedric Nugteren
2018-01-25  Improved the benchmark scripts; added gemmstridedbatched benchmark  Cedric Nugteren
2018-01-25  Moved some constants from global scope to a function; removed unnecessary inc...  Cedric Nugteren
2018-01-25  Changed the default number of runs for the GEMV tuner to fix issues for FP16  Cedric Nugteren
2018-01-20  Merge pull request #244 from CNugteren/kernel_selection_batched_gemm  Cedric Nugteren
2018-01-18  Made GEMM routine tuning a bit more generic in preparation of possible separa...  Cedric Nugteren
2018-01-18  Made the batched routines also chose direct/indirect kernel like the main GEM...  Cedric Nugteren
2018-01-15  Factored out the generic parts of the GEMM routine tuner  Cedric Nugteren
2018-01-14  Small improvements to benchmarking for cuBLAS  Cedric Nugteren
2018-01-11  Merge pull request #240 from CNugteren/retrieve_tuning_parameters  Cedric Nugteren
2018-01-11  Added test for the RetrieveParameters function  Cedric Nugteren
2018-01-11  Added a RetrieveParameters function to inspect tuning parameters  Cedric Nugteren
2018-01-11  Fixed bug in override parameters test  Cedric Nugteren
2018-01-11  Merge pull request #239 from CNugteren/gemm_strided_batched  Cedric Nugteren
2018-01-08  Implemented the in-direct version of the strided-batched GEMM kernel  Cedric Nugteren
2018-01-07  Implemented direct version of strided-batched GEMM kernel  Cedric Nugteren
2018-01-07  Added API and tests for new GemmStridedBatched routine  Cedric Nugteren
2018-01-06  Fixed a minor nullptr related issue in the code generator  Cedric Nugteren
2018-01-06  Prevented half-precision batched routines from failing in the tests  Cedric Nugteren
2018-01-06  Reduced duplicate code in the batched GEMM implementation  Cedric Nugteren
2018-01-06  Updated changelog and roadmap  Cedric Nugteren
2018-01-06  Fixed a vendor naming bug in the tuners and in the database  Cedric Nugteren
2018-01-06  Merge pull request #238 from CNugteren/gemm_api_with_temp_buffer  Cedric Nugteren
2018-01-06  Fixed the CUDA interface: replaced nullptr with 0  Cedric Nugteren
2018-01-06  Fixed a performance overhead in database creation: it is again a static varia...  Cedric Nugteren
2018-01-06  Added CUDA interface to get temporary-buffer size for GEMM routine  Cedric Nugteren
2018-01-04  Added a CUDA version of the GEMM temp-buffer optional argument  Cedric Nugteren
2018-01-04  Updated the generator script to automatically generate the temp-buffer code  Cedric Nugteren
2018-01-03  Updated the ROADMAP  Cedric Nugteren
2018-01-03  Added the temp-buffer to the GEMM testers and clients  Cedric Nugteren
2018-01-03  Added a queue argument to the get-size function when running the tests/clients  Cedric Nugteren
2018-01-01  Merge pull request #236 from CNugteren/trsm_compilation  Cedric Nugteren
2017-12-31  Fixed the issue with AMD's APP compiler not being able to compile the invert ...  Cedric Nugteren
2017-12-31  Revert "Added a simple test to check compilation of the invert kernels (issue...  Cedric Nugteren
2017-12-31  Revert "Added options to disable parts of the invert kernel to find out where...  Cedric Nugteren
2017-12-31  Made plotting script more flexible: extra argument to set the comparison library  Cedric Nugteren
2017-12-31  Changed the invert kernel slightly; added part1a/part1b disable-defines  Cedric Nugteren
2017-12-30  Fixed ifdef's into ifndef's  Cedric Nugteren
2017-12-30  Added options to disable parts of the invert kernel to find out where the AMD...  Cedric Nugteren
2017-12-30  Added optional temp-buffer argument to C++ interface of GEMM  Cedric Nugteren
2017-12-28  Added interface to compute the required temporary buffer size for GEMM  Cedric Nugteren
2017-12-28  Factored out argument processing from the GEMM routine  Cedric Nugteren