index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
/
tuning
/
kernels
/
xger.cpp
Age
Commit message (
Collapse
)
Author
2020-02-17
Catches all exceptions of the tuners
Cedric Nugteren
2018-03-22
Added the OpenCL local memory size constraint to the tuners
Cedric Nugteren
2018-03-10
Fixed a few things for the new tuning API
Cedric Nugteren
2018-03-03
Separate kernel tuners in .cpp with main and .hpp with settings
Cedric Nugteren
2017-12-18
Reformatted tuning code to make compilation faster
Cedric Nugteren
2017-11-19
Modified the kernel tuners to use the newly integrated auto-tuner
Cedric Nugteren
2017-09-30
Refactored the tuning architecture: less duplicate now; more defaults
Cedric Nugteren
2017-08-21
Remove multistrategy and related functions
mcian
2017-07-23
Code refactoring
mcian
2017-04-22
Increased the default number of runs for the tuner from 2 up to 10 for fast ↵
Cedric Nugteren
kernels
2017-04-17
Fixed a namespace clash with CUDA FP16 for the half-datatype
Cedric Nugteren
2017-03-14
Added the possibility to tune batched kernels
Cedric Nugteren
2016-11-27
Made it possible to use the command-line environmental variables for each ↵
Cedric Nugteren
executable and without re-running CMake
2016-10-22
Moved files around a bit; created a utilities subfolder
Cedric Nugteren
2016-10-02
Set the default number of runs for all kernels to at least 2 runs
Cedric Nugteren
2016-10-01
Added default num-runs to the tuner adding averaging over 10 runs as a ↵
Cedric Nugteren
default for the GEMM direct kernel
2016-07-10
Now passing alpha/beta to the kernel as arguments as before fp16 support; in ↵
Cedric Nugteren
case of fp16 arguments are cast on host and in kernel
2016-06-19
Renamed all C++ source files to .cpp to match the .hpp extension better
Cedric Nugteren