index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
Age
Commit message (
Expand
)
Author
2018-05-27
Made FillMatrix and FillVector functions take a configurable local workgroup ...
Cedric Nugteren
2018-05-19
Added an option to run the routine tuner for a single specific GEMM size
Cedric Nugteren
2018-05-19
Fixed compilation issues
Cedric Nugteren
2018-05-19
The GEMM routine tuner now loads kernel JSON tuning results from disk if avai...
Cedric Nugteren
2018-05-18
Merge branch 'master' into canary_buffer_overflow_protection
Cedric Nugteren
2018-05-17
Added a canary region for overflow detection to the tuners
Cedric Nugteren
2018-05-01
Now stores a shared_ptr to the Program class in the cache
Cedric Nugteren
2018-04-29
Merge pull request #277 from CNugteren/CLBlast-257-intel-subgroups
Cedric Nugteren
2018-04-26
Fixed an access violation when compiled with Visual Studio upon releasing the...
Cedric Nugteren
2018-04-24
Added Intel subgroup shuffle support to the 2D register caching GEMM kernel
Cedric Nugteren
2018-04-24
Added a define to enable subgroup shuffling if supported by the device
Cedric Nugteren
2018-04-20
Fixes for the CUDA API
Cedric Nugteren
2018-04-18
Expressed HER2K as two HERK calls
Cedric Nugteren
2018-04-18
Expressed SYR2K as two SYRK calls
Cedric Nugteren
2018-04-17
Updated HERK and SYRK to follow the GEMM style and functions to make it work ...
Cedric Nugteren
2018-04-15
Fixed some failing tests for GEMM and batched GEMM routines
Cedric Nugteren
2018-04-15
Updated tuning results for the Skylake ULT GT2 GPU with the new kernel
Cedric Nugteren
2018-04-13
Made GEMM rotation expectations kernel-specific
Cedric Nugteren
2018-04-10
Updated database with defaults of GEMMK=0 and KREG=1
Cedric Nugteren
2018-04-08
Extended the maximum number of tuning parameters from 14 to 16
Cedric Nugteren
2018-04-08
Fixed issues with the pre-processor
Cedric Nugteren
2018-04-07
Merge branch 'master' into CLBlast-228-2d-register-gemm-kernel
Cedric Nugteren
2018-04-07
Added tuning results for NVIDIA GeForce 970
Cedric Nugteren
2018-04-07
Added tuning results for NVIDIA GeForce 920MX
Cedric Nugteren
2018-04-07
Added tuning results for Intel HD Graphics 620
Cedric Nugteren
2018-04-07
Extended the GEMM tuner to be able to tune the new 'kernel 1'
Cedric Nugteren
2018-04-07
Fixed a compilation issue for complex datatypes and vload
Cedric Nugteren
2018-04-06
Fixed a compilation issue for complex datatypes and vload
Cedric Nugteren
2018-04-03
Added first version of 2D register tiling kernel with A and C transposed as well
Cedric Nugteren
2018-03-30
Updated pyclblast to 1.1.0 and uploaded to PyPi
Cedric Nugteren
2018-03-30
Merge pull request #255 from kodonnell/py_override
Cedric Nugteren
2018-03-30
Added argument checking for the GEMM tuner: expects m/n to be multiples of MW...
Cedric Nugteren
2018-03-30
Merge branch 'CLBlast-227-vivante-compiler-errors'
Cedric Nugteren
2018-03-27
merged
kodonell
2018-03-27
moved override_parameters example out of sgemm example
kodonell
2018-03-26
tidying up pyclblast override_parameters api, and added example
kodonell
2018-03-23
Removed arrays as function argument from GEMM kernels for Vivante OpenCL comp...
Cedric Nugteren
2018-03-22
Added the OpenCL local memory size constraint to the tuners
Cedric Nugteren
2018-03-21
Re-added support for local memory size constraint checking in the tuner
Cedric Nugteren
2018-03-15
Fixed a failing TRSM test using a CPU with Apple OpenCL
Cedric Nugteren
2018-03-15
Fixed a failing TRSV test using a CPU with Apple OpenCL
Cedric Nugteren
2018-03-15
Added queue-finish commands to PyCLBlast samples and tests
Cedric Nugteren
2018-03-11
Merge pull request #262 from CNugteren/CLBlast-237-tuning-api
Cedric Nugteren
2018-03-11
Added basic tests for PyCLBlast
Cedric Nugteren
2018-03-10
Fixed an issue for DLL linking under Windows
Cedric Nugteren
2018-03-10
Fixed a few things for the new tuning API
Cedric Nugteren
2018-03-10
Completed the API for all tuneable kernels
Cedric Nugteren
2018-03-10
ok, device id working
kodonell
2018-03-09
Added several more tuner API functions
Cedric Nugteren
2018-03-09
initial add of override parameters to pyclblast - cython not complaining, but...
kodonell
[next]