path: root/src
Age | Commit message | Author
2018-03-10 | Fixed a few things for the new tuning API | Cedric Nugteren
2018-03-10 | Completed the API for all tuneable kernels | Cedric Nugteren
2018-03-09 | Added several more tuner API functions | Cedric Nugteren
2018-03-06 | Fixed compilation issue in Xger tuner | Cedric Nugteren
2018-03-06 | First version of the tuning API, added interface for copy-kernel, added sample | Cedric Nugteren
2018-03-03 | Separate kernel tuners in .cpp with main and .hpp with settings | Cedric Nugteren
2018-03-03 | Fixed some small issues regarding PR#253 | Cedric Nugteren
2018-03-03 | Added C API for getting GEMM temp buffer size | sivagnanamn
2018-02-25 | Generated function signatures/inspect for PyCLBlast | Cedric Nugteren
2018-02-25 | Generated PyCLBlast docstrings | Cedric Nugteren
2018-02-20 | Fixed several issues in the new invert tuner | Cedric Nugteren
2018-02-18 | Set initial pyclblast to be version 1.0.0 | Cedric Nugteren
2018-02-18 | Prepared PyCLBlast for release as a package on PyPi | Cedric Nugteren
2018-02-18 | Added PyCLBlast samples | Cedric Nugteren
2018-02-18 | Added all other level 1/2/3 routines to pyclblast | Cedric Nugteren
2018-02-18 | Added GEMM to the Python wrapper | Cedric Nugteren
2018-02-14 | First agenerated version (clblastXswap only for now) of the pyclblast wrapper | Cedric Nugteren
2018-02-13 | Added skeleton for Python interface using Cython | Cedric Nugteren
2018-02-11 | Fixed a minor typo | Cedric Nugteren
2018-02-02 | Implemented the XHAD Hadamard product routine | Cedric Nugteren
2018-01-31 | Created the API and stubs for the HAD (hadamard-product) routines | Cedric Nugteren
2018-01-26 | Fixed an event synchronisation issue in the batched gemm routines | Cedric Nugteren
2018-01-25 | Moved some constants from global scope to a function; removed unnecessary inc... | Cedric Nugteren
2018-01-25 | Changed the default number of runs for the GEMV tuner to fix issues for FP16 | Cedric Nugteren
2018-01-18 | Made GEMM routine tuning a bit more generic in preparation of possible separa... | Cedric Nugteren
2018-01-18 | Made the batched routines also chose direct/indirect kernel like the main GEM... | Cedric Nugteren
2018-01-15 | Factored out the generic parts of the GEMM routine tuner | Cedric Nugteren
2018-01-11 | Added a RetrieveParameters function to inspect tuning parameters | Cedric Nugteren
2018-01-08 | Implemented the in-direct version of the strided-batched GEMM kernel | Cedric Nugteren
2018-01-07 | Implemented direct version of strided-batched GEMM kernel | Cedric Nugteren
2018-01-07 | Added API and tests for new GemmStridedBatched routine | Cedric Nugteren
2018-01-06 | Reduced duplicate code in the batched GEMM implementation | Cedric Nugteren
2018-01-06 | Fixed a vendor naming bug in the tuners and in the database | Cedric Nugteren
2018-01-06 | Merge pull request #238 from CNugteren/gemm_api_with_temp_buffer | Cedric Nugteren
2018-01-06 | Fixed the CUDA interface: replaced nullptr with 0 | Cedric Nugteren
2018-01-06 | Fixed a performance overhead in database creation: it is again a static varia... | Cedric Nugteren
2018-01-06 | Added CUDA interface to get temporary-buffer size for GEMM routine | Cedric Nugteren
2018-01-04 | Added a CUDA version of the GEMM temp-buffer optional argument | Cedric Nugteren
2018-01-04 | Updated the generator script to automatically generate the temp-buffer code | Cedric Nugteren
2017-12-31 | Revert "Added options to disable parts of the invert kernel to find out where... | Cedric Nugteren
2017-12-31 | Changed the invert kernel slightly; added part1a/part1b disable-defines | Cedric Nugteren
2017-12-30 | Fixed ifdef's into ifndef's | Cedric Nugteren
2017-12-30 | Added options to disable parts of the invert kernel to find out where the AMD... | Cedric Nugteren
2017-12-30 | Added optional temp-buffer argument to C++ interface of GEMM | Cedric Nugteren
2017-12-28 | Added interface to compute the required temporary buffer size for GEMM | Cedric Nugteren
2017-12-28 | Factored out argument processing from the GEMM routine | Cedric Nugteren
2017-12-28 | Refactored GEMM code in preparation of separate temp-buffer computation | Cedric Nugteren
2017-12-27 | Simplified invert kernel a little | Cedric Nugteren
2017-12-27 | Split the database into multiple small compilation units | Cedric Nugteren
2017-12-26 | Made the database-vector a non-static member | Cedric Nugteren