index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
/
kernels
/
level3
/
xgemm_part3.opencl
Age
Commit message (
Expand
)
Author
2017-12-10
Split GEMM kernel in 4 files instead of 3 due to MSVC 2013 string length limit
Cedric Nugteren
2017-12-09
Reformatted GEMM kernel to support array-to-register promotion
Cedric Nugteren
2017-12-09
Fixed defines parsing and substituting in pre-processor; fixed some variable ...
Cedric Nugteren
2017-12-07
Added register promotion to the main GEMM kernel
Cedric Nugteren
2017-12-03
Added GEMM (direct and in-direct) to the pre-processor testing; modified the ...
Cedric Nugteren
2017-10-14
Make local memory pointers a define in OpenCL; some fixes to the recently cha...
Cedric Nugteren
2017-10-03
Gemm in-direct implementation now uses only 1 larger instead of max 3 optiona...
Cedric Nugteren
2017-07-08
Made the inline keyword in kernels optional currently only enabled for NVIDIA...
Cedric Nugteren
2016-10-22
Fixed a bug in the SYRK/SYR2K/HERK/HER2K routines that would occur with speci...
Cedric Nugteren
2016-10-22
Fixed a bug in the SYRK/SYR2K/HERK/HER2K routines that would occur with speci...
Cedric Nugteren
2016-09-12
Split the XGEMM kernel further up: now in 3 parts. This is done because MSVC ...
Cedric Nugteren