path: root/src
Age | Commit message | Author
2018-04-07 | Extended the GEMM tuner to be able to tune the new 'kernel 1' | Cedric Nugteren
2018-04-07 | Fixed a compilation issue for complex datatypes and vload | Cedric Nugteren
2018-04-06 | Fixed a compilation issue for complex datatypes and vload | Cedric Nugteren
2018-04-03 | Added first version of 2D register tiling kernel with A and C transposed as well | Cedric Nugteren
2018-03-30 | Updated pyclblast to 1.1.0 and uploaded to PyPi | Cedric Nugteren
2018-03-30 | Merge pull request #255 from kodonnell/py_override | Cedric Nugteren
2018-03-30 | Added argument checking for the GEMM tuner: expects m/n to be multiples of MW... | Cedric Nugteren
2018-03-30 | Merge branch 'CLBlast-227-vivante-compiler-errors' | Cedric Nugteren
2018-03-27 | merged | kodonell
2018-03-27 | moved override_parameters example out of sgemm example | kodonell
2018-03-26 | tidying up pyclblast override_parameters api, and added example | kodonell
2018-03-23 | Removed arrays as function argument from GEMM kernels for Vivante OpenCL comp... | Cedric Nugteren
2018-03-22 | Added the OpenCL local memory size constraint to the tuners | Cedric Nugteren
2018-03-21 | Re-added support for local memory size constraint checking in the tuner | Cedric Nugteren
2018-03-15 | Fixed a failing TRSM test using a CPU with Apple OpenCL | Cedric Nugteren
2018-03-15 | Fixed a failing TRSV test using a CPU with Apple OpenCL | Cedric Nugteren
2018-03-15 | Added queue-finish commands to PyCLBlast samples and tests | Cedric Nugteren
2018-03-11 | Merge pull request #262 from CNugteren/CLBlast-237-tuning-api | Cedric Nugteren
2018-03-11 | Added basic tests for PyCLBlast | Cedric Nugteren
2018-03-10 | Fixed an issue for DLL linking under Windows | Cedric Nugteren
2018-03-10 | Fixed a few things for the new tuning API | Cedric Nugteren
2018-03-10 | Completed the API for all tuneable kernels | Cedric Nugteren
2018-03-10 | ok, device id working | kodonell
2018-03-09 | Added several more tuner API functions | Cedric Nugteren
2018-03-09 | initial add of override parameters to pyclblast - cython not complaining, but... | kodonell
2018-03-06 | Fixed compilation issue in Xger tuner | Cedric Nugteren
2018-03-06 | First version of the tuning API, added interface for copy-kernel, added sample | Cedric Nugteren
2018-03-03 | Separate kernel tuners in .cpp with main and .hpp with settings | Cedric Nugteren
2018-03-03 | Fixed some small issues regarding PR#253 | Cedric Nugteren
2018-03-03 | Added C API for getting GEMM temp buffer size | sivagnanamn
2018-02-25 | Generated function signatures/inspect for PyCLBlast | Cedric Nugteren
2018-02-25 | Generated PyCLBlast docstrings | Cedric Nugteren
2018-02-20 | Fixed several issues in the new invert tuner | Cedric Nugteren
2018-02-18 | Set initial pyclblast to be version 1.0.0 | Cedric Nugteren
2018-02-18 | Prepared PyCLBlast for release as a package on PyPi | Cedric Nugteren
2018-02-18 | Added PyCLBlast samples | Cedric Nugteren
2018-02-18 | Added all other level 1/2/3 routines to pyclblast | Cedric Nugteren
2018-02-18 | Added GEMM to the Python wrapper | Cedric Nugteren
2018-02-14 | First agenerated version (clblastXswap only for now) of the pyclblast wrapper | Cedric Nugteren
2018-02-13 | Added skeleton for Python interface using Cython | Cedric Nugteren
2018-02-11 | Fixed a minor typo | Cedric Nugteren
2018-02-02 | Implemented the XHAD Hadamard product routine | Cedric Nugteren
2018-01-31 | Created the API and stubs for the HAD (hadamard-product) routines | Cedric Nugteren
2018-01-26 | Fixed an event synchronisation issue in the batched gemm routines | Cedric Nugteren
2018-01-25 | Moved some constants from global scope to a function; removed unnecessary inc... | Cedric Nugteren
2018-01-25 | Changed the default number of runs for the GEMV tuner to fix issues for FP16 | Cedric Nugteren
2018-01-18 | Made GEMM routine tuning a bit more generic in preparation of possible separa... | Cedric Nugteren
2018-01-18 | Made the batched routines also chose direct/indirect kernel like the main GEM... | Cedric Nugteren
2018-01-15 | Factored out the generic parts of the GEMM routine tuner | Cedric Nugteren
2018-01-11 | Added a RetrieveParameters function to inspect tuning parameters | Cedric Nugteren