debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
path: root/src/kernels/level3/xgemm_part2.opencl
Age | Commit message | Author
2017-12-10 | Split GEMM kernel in 4 files instead of 3 due to MSVC 2013 string length limit | Cedric Nugteren
2017-12-09 | Reformatted GEMM kernel to support array-to-register promotion | Cedric Nugteren
2017-12-07 | Added register promotion to the main GEMM kernel | Cedric Nugteren
2017-12-03 | Added GEMM (direct and in-direct) to the pre-processor testing; modified the ... | Cedric Nugteren
2017-07-08 | Made the inline keyword in kernels optional currently only enabled for NVIDIA... | Cedric Nugteren
2016-09-12 | Split the XGEMM kernel further up: now in 3 parts. This is done because MSVC ... | Cedric Nugteren
2016-09-04 | The GEMM kernel no longer adds beta*C in case beta is zero; this would cause ... | Cedric Nugteren
2016-08-20 | Merge branch 'master' of https://github.com/dvasschemacq/CLBlast into dvassch... | Cedric Nugteren
2016-08-18 | Adapt opencl files for 1.1 OpenCL | D. Van Assche
2016-07-10 | Now passing alpha/beta to the kernel as arguments as before fp16 support; in ... | Cedric Nugteren
2016-06-08 | Added global memory synchronisation for better cache performance on ARM Mali ... | Cedric Nugteren
2016-05-18 | Merged in latest changes from 0.7.1 release | Cedric Nugteren
2016-05-16 | Prepared GEMM and supporting kernels and tuners for half-precision support | Cedric Nugteren
2016-02-08 | Separated the GEMM kernel in two parts to reduce string length for MSVC | Cedric Nugteren