summaryrefslogtreecommitdiff
path: root/src
AgeCommit message (Expand)Author
2018-04-13Made GEMM rotation expectations kernel-specificCedric Nugteren
2018-04-10Updated database with defaults of GEMMK=0 and KREG=1Cedric Nugteren
2018-04-08Extended the maximum number of tuning parameters from 14 to 16Cedric Nugteren
2018-04-08Fixed issues with the pre-processorCedric Nugteren
2018-04-07Merge branch 'master' into CLBlast-228-2d-register-gemm-kernelCedric Nugteren
2018-04-07Added tuning results for NVIDIA GeForce 970Cedric Nugteren
2018-04-07Added tuning results for NVIDIA GeForce 920MXCedric Nugteren
2018-04-07Added tuning results for Intel HD Graphics 620Cedric Nugteren
2018-04-07Extended the GEMM tuner to be able to tune the new 'kernel 1'Cedric Nugteren
2018-04-07Fixed a compilation issue for complex datatypes and vloadCedric Nugteren
2018-04-06Fixed a compilation issue for complex datatypes and vloadCedric Nugteren
2018-04-03Added first version of 2D register tiling kernel with A and C transposed as wellCedric Nugteren
2018-03-30Updated pyclblast to 1.1.0 and uploaded to PyPiCedric Nugteren
2018-03-30Merge pull request #255 from kodonnell/py_overrideCedric Nugteren
2018-03-30Added argument checking for the GEMM tuner: expects m/n to be multiples of MW...Cedric Nugteren
2018-03-30Merge branch 'CLBlast-227-vivante-compiler-errors'Cedric Nugteren
2018-03-27mergedkodonell
2018-03-27moved override_parameters example out of sgemm examplekodonell
2018-03-26tidying up pyclblast override_parameters api, and added examplekodonell
2018-03-23Removed arrays as function argument from GEMM kernels for Vivante OpenCL comp...Cedric Nugteren
2018-03-22Added the OpenCL local memory size constraint to the tunersCedric Nugteren
2018-03-21Re-added support for local memory size constraint checking in the tunerCedric Nugteren
2018-03-15Fixed a failing TRSM test using a CPU with Apple OpenCLCedric Nugteren
2018-03-15Fixed a failing TRSV test using a CPU with Apple OpenCLCedric Nugteren
2018-03-15Added queue-finish commands to PyCLBlast samples and testsCedric Nugteren
2018-03-11Merge pull request #262 from CNugteren/CLBlast-237-tuning-apiCedric Nugteren
2018-03-11Added basic tests for PyCLBlastCedric Nugteren
2018-03-10Fixed an issue for DLL linking under WindowsCedric Nugteren
2018-03-10Fixed a few things for the new tuning APICedric Nugteren
2018-03-10Completed the API for all tuneable kernelsCedric Nugteren
2018-03-10ok, device id workingkodonell
2018-03-09Added several more tuner API functionsCedric Nugteren
2018-03-09initial add of override parameters to pyclblast - cython not complaining, but...kodonell
2018-03-06Fixed compilation issue in Xger tunerCedric Nugteren
2018-03-06First version of the tuning API, added interface for copy-kernel, added sampleCedric Nugteren
2018-03-03Separate kernel tuners in .cpp with main and .hpp with settingsCedric Nugteren
2018-03-03Fixed some small issues regarding PR#253Cedric Nugteren
2018-03-03Added C API for getting GEMM temp buffer sizesivagnanamn
2018-02-25Generated function signatures/inspect for PyCLBlastCedric Nugteren
2018-02-25Generated PyCLBlast docstringsCedric Nugteren
2018-02-20Fixed several issues in the new invert tunerCedric Nugteren
2018-02-18Set initial pyclblast to be version 1.0.0Cedric Nugteren
2018-02-18Prepared PyCLBlast for release as a package on PyPiCedric Nugteren
2018-02-18Added PyCLBlast samplesCedric Nugteren
2018-02-18Added all other level 1/2/3 routines to pyclblastCedric Nugteren
2018-02-18Added GEMM to the Python wrapperCedric Nugteren
2018-02-14First agenerated version (clblastXswap only for now) of the pyclblast wrapperCedric Nugteren
2018-02-13Added skeleton for Python interface using CythonCedric Nugteren
2018-02-11Fixed a minor typoCedric Nugteren
2018-02-02Implemented the XHAD Hadamard product routineCedric Nugteren