index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2018-07-31
Added note about AMD southern islands GPU issue and the required workaround
Cedric Nugteren
2018-07-31
Added Beignet 1.2.1 requirement to the README for IvyBridge GPUs
Cedric Nugteren
2018-07-31
Updated the tuning results for Intel IvyBridge M GT2
Cedric Nugteren
2018-07-30
Merge pull request #305 from CNugteren/CLBlast-303-tuner-check-local-size
Cedric Nugteren
2018-07-28
Added print statements to indicate the 4 stages of GEMM tuning
Cedric Nugteren
2018-07-28
The tuners now also check for valid local thread configurations and skip inva...
Cedric Nugteren
2018-07-28
Merge pull request #304 from CNugteren/CLBlast-300-fix-staggered-indices-AMD-...
Cedric Nugteren
2018-07-28
Disabled the use of staggered indices on AMD GPUs for the new GEMMK == 1 kern...
Cedric Nugteren
2018-07-27
Fixed an issue with AMD GPUs and the new GEMMK == 1 kernel
Cedric Nugteren
2018-07-25
Added code to report the average tuning results
Cedric Nugteren
2018-07-23
Merge pull request #297 from tyler-utah/master
Cedric Nugteren
2018-07-16
moved a two-line macro to a single line
Tyler Sorensen
2018-07-14
forgot to add test cases back in, oops
Tyler Sorensen
2018-07-14
Applied feedback from Cedric from first pull request
Tyler Sorensen
2018-07-14
Updated to CLBlast version 1.4.1
Cedric Nugteren
2018-07-13
Added tuning results for Intel i5-4970S
Cedric Nugteren
2018-07-13
Added device-name removal code to handle POCL naming convention
Cedric Nugteren
2018-07-13
Added tuning results for GeForce GTX 1070 Ti
Cedric Nugteren
2018-07-13
Added tuning results for HD Graphics 6000 Broadwell GT3
Cedric Nugteren
2018-07-11
restored some of the changed tuning files for xgemm
Tyler Sorensen
2018-07-11
added inline ptx to support shuffle on Nvidia GPUs
Tyler Sorensen
2018-07-06
Updated changelog
Cedric Nugteren
2018-07-06
Merge pull request #296 from alycm/CLBlast-291-eliminate-temporary-program
Cedric Nugteren
2018-07-06
Eliminate a temporary Program object
Alastair Murray
2018-06-28
Merge pull request #295 from CNugteren/CLBlast-292-no-cl-program-release-windows
Cedric Nugteren
2018-06-28
Disabled calls to clReleaseProgram under Windows to avoid segfaults when the ...
Cedric Nugteren
2018-06-03
Updated to CLBlast version 1.4.0
Cedric Nugteren
2018-06-03
Added list of tuners to be run by 'alltuners' target
Cedric Nugteren
2018-06-03
Fixes for CUDA version of CLBlast
Cedric Nugteren
2018-06-02
Added MKL as an alternative for CBLAS for correctness and performance compari...
Cedric Nugteren
2018-06-01
Fixes for Apple OpenCL CPU implementation which requires a LWGS of 1 when bar...
Cedric Nugteren
2018-05-31
Added error-checking for half-empty local work group sizes; fixed a minor TRS...
Cedric Nugteren
2018-05-31
Some potential fixes for error -54 when launching TRSV and TRSM kernels
Cedric Nugteren
2018-05-30
Widened Apple OpenCL check, added way to debug too-large-workgroups issue
Cedric Nugteren
2018-05-29
Added Apple OpenCL TRSV block size override; removed failing old Intel GPU te...
Cedric Nugteren
2018-05-27
Merge pull request #287 from CNugteren/apple-opencl-limitations-fixes
Cedric Nugteren
2018-05-27
Merge pull request #286 from CNugteren/runtime_statistics_in_client
Cedric Nugteren
2018-05-27
Added a check to return 'NotImplemented' error code in case of systems with <...
Cedric Nugteren
2018-05-27
Made FillMatrix and FillVector functions take a configurable local workgroup ...
Cedric Nugteren
2018-05-27
Added maximum time reporting to the client statistics
Cedric Nugteren
2018-05-23
Added an option in the clients to output timing statistics: minimum, mean, an...
Cedric Nugteren
2018-05-19
Merge pull request #285 from CNugteren/size_specific_routine_tuner
Cedric Nugteren
2018-05-19
Added an option to run the routine tuner for a single specific GEMM size
Cedric Nugteren
2018-05-19
Merge pull request #284 from CNugteren/routine_tuners_read_kernel_json_from_disk
Cedric Nugteren
2018-05-19
Fixed compilation issues
Cedric Nugteren
2018-05-19
The GEMM routine tuner now loads kernel JSON tuning results from disk if avai...
Cedric Nugteren
2018-05-19
Fixed a bug in loading xgemm-direct JSON data from disk
Cedric Nugteren
2018-05-18
Merge pull request #283 from CNugteren/canary_buffer_overflow_protection
Cedric Nugteren
2018-05-18
Merge branch 'master' into canary_buffer_overflow_protection
Cedric Nugteren
2018-05-17
Merge pull request #282 from CNugteren/CLBlast-276-program-release-improvements
Cedric Nugteren
[next]