summaryrefslogtreecommitdiff
AgeCommit message (Expand)Author
2018-04-07Extended the GEMM tuner to be able to tune the new 'kernel 1'Cedric Nugteren
2018-04-07Fixed a compilation issue for complex datatypes and vloadCedric Nugteren
2018-04-06Fixed a compilation issue for complex datatypes and vloadCedric Nugteren
2018-04-03Added first version of 2D register tiling kernel with A and C transposed as wellCedric Nugteren
2018-03-30Updated pyclblast to 1.1.0 and uploaded to PyPiCedric Nugteren
2018-03-30Merge pull request #255 from kodonnell/py_overrideCedric Nugteren
2018-03-30Added argument checking for the GEMM tuner: expects m/n to be multiples of MW...Cedric Nugteren
2018-03-30Updated the roadmapCedric Nugteren
2018-03-30Merge branch 'CLBlast-227-vivante-compiler-errors'Cedric Nugteren
2018-03-27mergedkodonell
2018-03-27got the generator thing workingkodonell
2018-03-27moved override_parameters example out of sgemm examplekodonell
2018-03-26tidying up pyclblast override_parameters api, and added examplekodonell
2018-03-23Removed arrays as function argument from GEMM kernels for Vivante OpenCL comp...Cedric Nugteren
2018-03-22Merge pull request #269 from CNugteren/CLBlast-266-local-mem-constraintCedric Nugteren
2018-03-22Added the OpenCL local memory size constraint to the tunersCedric Nugteren
2018-03-21Re-added support for local memory size constraint checking in the tunerCedric Nugteren
2018-03-15Fixed a failing TRSM test using a CPU with Apple OpenCLCedric Nugteren
2018-03-15Fixed a failing TRSV test using a CPU with Apple OpenCLCedric Nugteren
2018-03-15Fixed breaking preprocessor test on certain platforms due to empty kernel stringCedric Nugteren
2018-03-15Added queue-finish commands to PyCLBlast samples and testsCedric Nugteren
2018-03-11Merge pull request #262 from CNugteren/CLBlast-237-tuning-apiCedric Nugteren
2018-03-11Added basic tests for PyCLBlastCedric Nugteren
2018-03-10Made benchmarking script also work for complex numbersCedric Nugteren
2018-03-10Added initial glossaryCedric Nugteren
2018-03-10Added badge for OSX-Intel-CPU buildsCedric Nugteren
2018-03-10Fixed an issue for DLL linking under WindowsCedric Nugteren
2018-03-10Fixed an issue for DLL linking under WindowsCedric Nugteren
2018-03-10Updated the documentation for the tuner APICedric Nugteren
2018-03-10Fixed a few things for the new tuning APICedric Nugteren
2018-03-10Completed the API for all tuneable kernelsCedric Nugteren
2018-03-10ok, device id workingkodonell
2018-03-09Added several more tuner API functionsCedric Nugteren
2018-03-09initial add of override parameters to pyclblast - cython not complaining, but...kodonell
2018-03-06Fixed compilation issue in Xger tunerCedric Nugteren
2018-03-06First version of the tuning API, added interface for copy-kernel, added sampleCedric Nugteren
2018-03-03Separate kernel tuners in .cpp with main and .hpp with settingsCedric Nugteren
2018-03-03Fixed the buildbot badges in the READMECedric Nugteren
2018-03-03Updated documentation and build badgesCedric Nugteren
2018-03-03Fixed some small issues regarding PR#253Cedric Nugteren
2018-03-03Merge pull request #253 from sivagnanamn/masterCedric Nugteren
2018-03-03Added C API for getting GEMM temp buffer sizesivagnanamn
2018-02-26Updated the changelogCedric Nugteren
2018-02-26Added a note on preventing segfaults with OpenCL using the AMD APP SDKCedric Nugteren
2018-02-25Merge pull request #249 from CNugteren/documation_reorgCedric Nugteren
2018-02-25Fixed Ubuntu PPA package nameCedric Nugteren
2018-02-25Generated function signatures/inspect for PyCLBlastCedric Nugteren
2018-02-25Generated PyCLBlast docstringsCedric Nugteren
2018-02-25Some style improvements in the pyclblast code generatorCedric Nugteren
2018-02-25Added API documentation for two missing C++ functionsCedric Nugteren