summaryrefslogtreecommitdiff
AgeCommit message (Collapse)Author
2023-03-25Fix documentation bug w.r.t. ld values and matrix layoutCedric Nugteren
2023-01-22Merge pull request #452 from CNugteren/add_tuning_results_adrenoCedric Nugteren
Add tuning results for 4 devies
2023-01-21Add tuning results for Intel FPGA emulation deviceCedric Nugteren
2023-01-21Add tuning results for Radeon Pro 450Cedric Nugteren
2023-01-21Add tuning results for Adreno 740Cedric Nugteren
2023-01-21Add tuning results for Adreno 730Cedric Nugteren
2023-01-21Merge pull request #451 from CodeLinaro/masterCedric Nugteren
CLBlast modifications to address Qualcomm Adreno performance
2023-01-17Updated according to feedback from CNugterenAngus, Alexander
2023-01-12Adreno 730 + 740 CLBlast tuning resultsAngus, Alexander
2023-01-03implemented changes to boost Adreno performance according to ↵Angus, Alexander
https://jira-dc.qualcomm.com/jira/browse/OSR-8731
2022-10-14Merge pull request #447 from CNugteren/small_plotting_fixesCedric Nugteren
Fix two small issues in the plotting script
2022-10-13Update changelogCedric Nugteren
2022-10-13Fix plotting issue with a single row or columnCedric Nugteren
2022-10-13Fix plotting issue in case of 'inf' valuesCedric Nugteren
2022-09-27Merge pull request #442 from CNugteren/update_version_to_1_5_3Cedric Nugteren
Update to version 1.5.3
2022-09-27Fix opencl.hpp download in CMakeCedric Nugteren
2022-09-27Properly set OpenCL target to version 2.1Cedric Nugteren
2022-09-22Replace the broken khronos registry link for cl.hpp with a new github link ↵Cedric Nugteren
for opencl.hpp
2022-09-22Update PyCLBlast version numberCedric Nugteren
2022-09-22Update to version 1.5.3Cedric Nugteren
2022-06-24Fix typo in commentCedric Nugteren
Resolves https://github.com/CNugteren/CLBlast/issues/440
2022-05-25Merge pull request #438 from CNugteren/cupp11_api_inconsistencyCedric Nugteren
Fix API inconsistency in cupp11.hpp
2022-05-23Fix API inconsistency in cupp11.hppCedric Nugteren
The function `CopyToAsync` has an optional event argument in the OpenCL version, which is used in CLBlast. This makes the code not compile at all if CUDA (through cupp11.hpp`) is used as backend. This issue was found by a CLBlast user and reported privately by email. This PR should fix that.
2022-05-17Merge pull request #437 from umar456/blas_fixCedric Nugteren
Add logic to find intel OpenMP on oneMKL.
2022-05-16Merge pull request #432 from justingra/sum-fixCedric Nugteren
sum fix
2022-05-15Add logic to find intel OpenMP on oneMKL.Umar Arshad
2022-05-13dev versionJustin Graham
2022-05-13changelog messageJustin Graham
2022-04-25Merge pull request #436 from CNugteren/add_tuning_resultsCedric Nugteren
Add tuning results for 2 AMD GPUs and 1 Qualcomm GPU
2022-04-25Add tuning results for Adreno 540Cedric Nugteren
2022-04-25Add tuning results for Radeon RX 6500 XTCedric Nugteren
2022-04-25Add tuning results for Radeon RX 6800 XTCedric Nugteren
2022-04-25Merge pull request #434 from CNugteren/update_test_status_machinesCedric Nugteren
Remove old test machines and add new ones
2022-04-25Remove old test machines and add new onesCedric Nugteren
2022-04-22sum fixJustin Graham
2022-04-14Merge pull request #431 from danyougle/patch-2Cedric Nugteren
android.hpp: custom header guard _clang_
2022-04-13android.hpp: custom header guard of _clang_danyougle
In order not to have ambiguous definitions, exclude the functions for other compilers
2022-04-13Merge pull request #430 from danyougle/patch-1Cedric Nugteren
add AMD OCL SDK light path in ENV section
2022-04-13add AMD OCL SDK light path in ENV sectiondanyougle
2021-08-27Merge pull request #425 from CNugteren/tesla_t4_correctnessCedric Nugteren
Tesla T4 tuning parameters
2021-08-27Add Quadro T2000 tuning parameters for the Tesla T4Cedric Nugteren
2021-08-27Remove Tesla T4 tuning resultsCedric Nugteren
2021-08-24Merge pull request #424 from gspr/gspr/prebuiltCedric Nugteren
Update documentation to reflect CLBlast in Debian & Ubuntu
2021-08-24PPA for older UbuntusGard Spreemann
2021-08-24Let the installation documentation reflect the fact that CLBlast is now in ↵Gard Spreemann
Debian and Ubuntu
2021-08-20Merge pull request #423 from CNugteren/new_tuning_resultsCedric Nugteren
New tuning results for 1 Intel CPU and 5 NVIDIA GPUs
2021-08-19Added a note on clock frequencies for tuningCedric Nugteren
2021-08-19Updated README and tuning listCedric Nugteren
2021-08-19Add tuning results for NVIDIA Tesla V100Cedric Nugteren
2021-08-19Add tuning results for NVIDIA Tesla T4Cedric Nugteren