| Age | Commit message | Author |
|---|---|---|
| 2021-08-19 | Add tuning results for NVIDIA Tesla V100 | Cedric Nugteren |
| 2021-08-19 | Add tuning results for NVIDIA Tesla T4 | Cedric Nugteren |
| 2021-08-19 | Add tuning results for NVIDIA Quadro T2000 | Cedric Nugteren |
| 2021-08-19 | Add tuning results for NVIDIA Quadro GV100 | Cedric Nugteren |
| 2021-08-19 | Add tuning results for Intel Core i9-9980HK | Cedric Nugteren |
| 2021-08-19 | Add tuning results for NVIDIA A100 | Cedric Nugteren |
| 2021-05-23 | Merge pull request #419 from CNugteren/fix_tuner_out_of_bounds_access | Cedric Nugteren |
| 2021-05-22 | Fix issue with printing out-of-bounds local/global sizes for level 1 tuners | Cedric Nugteren |
| 2021-04-30 | Merge pull request #417 from gspr/gspr/capitalization-typo | Cedric Nugteren |
| 2021-04-30 | Correct capitalization typo | Gard Spreemann |
| 2021-03-15 | Merge pull request #416 from JishinMaster/master | Cedric Nugteren |
| 2021-03-13 | set the correct flop count for xgemm | JishinMaster |
| 2021-02-06 | Merge pull request #414 from CNugteren/CLBlast-412-python-runtime-libs-fix | Cedric Nugteren |
| 2021-02-05 | Fix Windows paths in pyclblast | Cedric Nugteren |
| 2021-02-04 | Merge pull request #413 from CNugteren/CLBlast-412-python-runtime-libs | Cedric Nugteren |
| 2021-02-04 | Added second Windows library path | Cedric Nugteren |
| 2021-01-30 | Add library path for Windows as well | Cedric Nugteren |
| 2021-01-29 | Add library dir on Linux for pyclblast | Cedric Nugteren |
| 2021-01-21 | Update pyclblast package version number | Cedric Nugteren |
| 2021-01-21 | Merge pull request #410 from jamesjer/master | Cedric Nugteren |
| 2021-01-20 | Use reference types to prevent unnecessary copying | Jerry James |
| 2021-01-19 | Updated to version 1.5.2 | Cedric Nugteren |
| 2020-10-10 | Add tuning results for TITAN RTX | Cedric Nugteren |
| 2020-10-10 | Add tuning results for Radeon RX Vega | Cedric Nugteren |
| 2020-10-05 | Merge pull request #400 from baryluk/patch-6 | Cedric Nugteren |
| 2020-10-05 | Allow single graph / subplot on plot | Witold Baryluk |
| 2020-10-04 | Merge pull request #399 from baryluk/patch-3 | Cedric Nugteren |
| 2020-10-04 | Fix a typo in benchmark when running fp 16 vs 32 | Witold Baryluk |
| 2020-10-04 | Merge pull request #397 from baryluk/patch-1 | Cedric Nugteren |
| 2020-10-04 | Merge pull request #398 from baryluk/patch-2 | Cedric Nugteren |
| 2020-10-04 | Fix --load_from_disk argument help message | Witold Baryluk |
| 2020-10-04 | Fix Python SyntaxWarning | Witold Baryluk |
| 2020-10-03 | Merge pull request #396 from CNugteren/CLBlast-395-fix-benchmark-script | Cedric Nugteren |
| 2020-10-02 | Fix a Python 3 bug in the benchmark script | Cedric Nugteren |
| 2020-08-16 | Added FUNDING.yml file | Cedric Nugteren |
| 2020-06-07 | Merge pull request #392 from 9prady9/fix_Program_getIR | Cedric Nugteren |
| 2020-06-07 | Add a cautionary note in Program::GetIR and mention the fix in CHANGELOG | Pradeep Garigipati |
| 2020-06-05 | Fix Program::GetIR to handle programs with multiple devices | Pradeep Garigipati |
| 2020-05-13 | Merge pull request #389 from CNugteren/CLBlast-385-version-defines | Cedric Nugteren |
| 2020-05-12 | Added CLBLAST_VERSION_MAJOR/MINOR/PATCH defines in headers to store version n... | Cedric Nugteren |
| 2020-05-11 | Merge pull request #388 from CNugteren/CLBlast-381-gemm-direct-tuner-failure | Cedric Nugteren |
| 2020-05-11 | Increase display width of the local/global sizes | Cedric Nugteren |
| 2020-05-10 | Made sure that the global workgroup size is a multiple of the local size in t... | Cedric Nugteren |
| 2020-05-10 | Added logging of local/global workgroup sizes when run the tuners | Cedric Nugteren |
| 2020-05-10 | Merge pull request #386 from CNugteren/CLBlast-384-pyclblast-missing-routines | Cedric Nugteren |
| 2020-05-10 | Updated PyCLBlast version number | Cedric Nugteren |
| 2020-05-10 | Added a sample to demonstrate a batched routine | Cedric Nugteren |
| 2020-05-10 | Added pyclblast bindings for the 3 batched routines | Cedric Nugteren |
| 2020-05-04 | Merge pull request #383 from CNugteren/CLBlast-382-improve-tuner | Cedric Nugteren |
| 2020-05-03 | Move queue creation out of the tuner loop | Cedric Nugteren |