debian-clblast
Debian package for CLBlast.
gspr@nonempty.org
path: root/src/tuning/kernels
| Age | Commit message | Author |
|---|---|---|
| 2016-10-01 | Added default num-runs to the tuner adding averaging over 10 runs as a defaul... | Cedric Nugteren |
| 2016-10-01 | Merge branch 'development' into gemm_direct | Cedric Nugteren |
| 2016-09-27 | Fixed the local memory size computation for the GEMM tuners | Cedric Nugteren |
| 2016-09-25 | Added a first version of a tuner for the GEMM direct kernel; collapsed MWGD, ... | Cedric Nugteren |
| 2016-09-12 | Split the XGEMM kernel further up: now in 3 parts. This is done because MSVC ... | Cedric Nugteren |
| 2016-09-06 | Split GEMM tuning in two parts: a small set of tuning parameters which is exp... | Cedric Nugteren |
| 2016-08-21 | Increased the ratio of GEMM tuning results to explore; reduced the tuning sea... | Cedric Nugteren |
| 2016-07-25 | Moved the XgemvFast and XgemvFastRot tuning database into a separate file | Cedric Nugteren |
| 2016-07-23 | Fixe a bug in the new XgemvFastRot kernel related to local memory size | Cedric Nugteren |
| 2016-07-23 | Further improvements to the XgemvFastRot kernel, properly enables coalescing now | Cedric Nugteren |
| 2016-07-23 | Improved the XgemvFastRot kernel by tiled loading of the input matrix A, enab... | Cedric Nugteren |
| 2016-07-10 | Now passing alpha/beta to the kernel as arguments as before fp16 support; in ... | Cedric Nugteren |
| 2016-06-19 | Renamed all C++ source files to .cpp to match the .hpp extension better | Cedric Nugteren |
| 2016-06-18 | Moved all headers into the source tree, changed headers to .hpp extension | Cedric Nugteren |