debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
path: root/src/tuning
Age
Commit message
Author
2018-09-15
Fixed an MSVC compilation error due to large strings
Cedric Nugteren
2018-07-28
Added print statements to indicate the 4 stages of GEMM tuning
Cedric Nugteren
2018-07-28
The tuners now also check for valid local thread configurations and skip inva...
Cedric Nugteren
2018-07-25
Added code to report the average tuning results
Cedric Nugteren
2018-05-19
Added an option to run the routine tuner for a single specific GEMM size
Cedric Nugteren
2018-05-19
Fixed compilation issues
Cedric Nugteren
2018-05-19
The GEMM routine tuner now loads kernel JSON tuning results from disk if avai...
Cedric Nugteren
2018-05-17
Added a canary region for overflow detection to the tuners
Cedric Nugteren
2018-04-07
Extended the GEMM tuner to be able to tune the new 'kernel 1'
Cedric Nugteren
2018-03-30
Added argument checking for the GEMM tuner: expects m/n to be multiples of MW...
Cedric Nugteren
2018-03-22
Added the OpenCL local memory size constraint to the tuners
Cedric Nugteren
2018-03-21
Re-added support for local memory size constraint checking in the tuner
Cedric Nugteren
2018-03-10
Fixed an issue for DLL linking under Windows
Cedric Nugteren
2018-03-10
Fixed a few things for the new tuning API
Cedric Nugteren
2018-03-10
Completed the API for all tuneable kernels
Cedric Nugteren
2018-03-09
Added several more tuner API functions
Cedric Nugteren
2018-03-06
Fixed compilation issue in Xger tuner
Cedric Nugteren
2018-03-06
First version of the tuning API, added interface for copy-kernel, added sample
Cedric Nugteren
2018-03-03
Separate kernel tuners in .cpp with main and .hpp with settings
Cedric Nugteren
2018-02-20
Fixed several issues in the new invert tuner
Cedric Nugteren
2018-01-25
Moved some constants from global scope to a function; removed unnecessary inc...
Cedric Nugteren
2018-01-25
Changed the default number of runs for the GEMV tuner to fix issues for FP16
Cedric Nugteren
2018-01-18
Made GEMM routine tuning a bit more generic in preparation of possible separa...
Cedric Nugteren
2018-01-15
Factored out the generic parts of the GEMM routine tuner
Cedric Nugteren
2018-01-06
Fixed a vendor naming bug in the tuners and in the database
Cedric Nugteren
2017-12-23
Fixed unused variable warnings showing up with Clang
Cedric Nugteren
2017-12-23
Now calling main TRSV routine again to fix compilation in MSVC
Cedric Nugteren
2017-12-23
Split the invert kernel in two parts to prevent error C1091 in MSVC 2013
Cedric Nugteren
2017-12-23
Updated the database to use the new TRSV and Invert tuners
Cedric Nugteren
2017-12-23
Added TRSV block-size tuner
Cedric Nugteren
2017-12-19
Added skeleton for a tuner for the invert kernel
Cedric Nugteren
2017-12-18
Reformatted tuning code to make compilation faster
Cedric Nugteren
2017-12-17
Fixed an issue with the tuner: it was using platform vendor rather than devic...
Cedric Nugteren
2017-12-17
Fixed an unnecessary overflow issue on 32-bit systems
Cedric Nugteren
2017-12-10
Split GEMM kernel in 4 files instead of 3 due to MSVC 2013 string length limit
Cedric Nugteren
2017-12-10
Fixed an issue in the tuners to prevent error -14 from persisting (CL_EXEC_ST...
Cedric Nugteren
2017-12-09
Made the pre-processor run by default for ARM and Qualcomm GPUs
Cedric Nugteren
2017-11-30
Integrated pre-processor in compilation flow, default is still disabled
Cedric Nugteren
2017-11-20
Fixes some displaying issues in the GEMM routine tuner
Cedric Nugteren
2017-11-19
Fixed a variety of warnings and an error for MSVC2013 compilation
Cedric Nugteren
2017-11-19
Added compilation timing and better compilation error reporting
Cedric Nugteren
2017-11-19
Some fixes for the new auto-tuner to be compatible with the Python scripts
Cedric Nugteren
2017-11-19
Revived the GEMM routine tuner; minor formatting changes
Cedric Nugteren
2017-11-19
Modified the kernel tuners to use the newly integrated auto-tuner
Cedric Nugteren
2017-11-17
Moved some tuning functions from .hpp to .cpp
Cedric Nugteren
2017-11-17
Moved compilation function to separate file; removed dependency of tuners of ...
Cedric Nugteren
2017-11-16
Added printing of the best parameters for the new tuner
Cedric Nugteren
2017-11-15
Added first version of integrated and re-written auto-tuner
Cedric Nugteren
2017-11-06
Changed GEMM routine tuner's scoring to use L2 measure instead for better ave...
Cedric Nugteren
2017-11-02
Integrated the GEMM routine tuner for kernel selection; added first tuning re...
Cedric Nugteren