index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
/
tuning
/
xgemv.cc
Age
Commit message (
Collapse
)
Author
2016-06-18
Moved all headers into the source tree, changed headers to .hpp extension
Cedric Nugteren
2016-05-22
Prepared the GEMV kernels and tuner for half-precision support
Cedric Nugteren
2016-02-08
Split-up the XGEMV kernel in two parts
Cedric Nugteren
2016-02-06
Reduced the maximum workgroup-size for GEMV kernels further
CNugteren
2016-02-06
Reduced unrolling factor in xgemv kernel to reduce compilation times
CNugteren
2015-10-28
Now sets local memory size in xgemv tuner properly
CNugteren
2015-10-25
Fixed an arguments-related bug in the GEMV tuner
CNugteren
2015-09-18
Added first version of banded matrix-vector multiplication
CNugteren
2015-09-14
Added extra temporary buffer to tuners in preparation of Xdot routines
CNugteren
2015-08-09
Refactored the tuners, added JSON output
CNugteren
2015-07-19
The kernel source string is now a routine's member variable
CNugteren
2015-06-16
Added support for conjugate transpose in GEMV
CNugteren
2015-06-14
Split the three variations of the GEMV kernel for maximal tuning freedom
CNugteren
2015-06-13
Added a fast GEMV kernel with vector loads, no tail, and fewer if-statements
CNugteren
2015-06-13
Improved GEMV kernel with local memory and a tunable WPT
CNugteren
2015-06-13
Added initial version of GEMV including tester and performance client
CNugteren
2015-06-10
Added initial naive version of Xgemv kernel
CNugteren