index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
/
tuning
/
kernels
Age
Commit message (
Expand
)
Author
2018-07-28
Added print statements to indicate the 4 stages of GEMM tuning
Cedric Nugteren
2018-04-07
Extended the GEMM tuner to be able to tune the new 'kernel 1'
Cedric Nugteren
2018-03-30
Added argument checking for the GEMM tuner: expects m/n to be multiples of MW...
Cedric Nugteren
2018-03-22
Added the OpenCL local memory size constraint to the tuners
Cedric Nugteren
2018-03-10
Fixed a few things for the new tuning API
Cedric Nugteren
2018-03-06
Fixed compilation issue in Xger tuner
Cedric Nugteren
2018-03-03
Separate kernel tuners in .cpp with main and .hpp with settings
Cedric Nugteren
2018-02-20
Fixed several issues in the new invert tuner
Cedric Nugteren
2018-01-25
Changed the default number of runs for the GEMV tuner to fix issues for FP16
Cedric Nugteren
2017-12-23
Fixed unused variable warnings showing up with Clang
Cedric Nugteren
2017-12-23
Split the invert kernel in two parts to prevent error C1091 in MSVC 2013
Cedric Nugteren
2017-12-19
Added skeleton for a tuner for the invert kernel
Cedric Nugteren
2017-12-18
Reformatted tuning code to make compilation faster
Cedric Nugteren
2017-12-10
Split GEMM kernel in 4 files instead of 3 due to MSVC 2013 string length limit
Cedric Nugteren
2017-11-19
Modified the kernel tuners to use the newly integrated auto-tuner
Cedric Nugteren
2017-10-03
Gemm in-direct implementation now uses only 1 larger instead of max 3 optiona...
Cedric Nugteren
2017-09-30
Refactored the tuning architecture: less duplicate now; more defaults
Cedric Nugteren
2017-08-31
Fixed some things in the tuner: bugs, style, and defaults to random search
Cedric Nugteren
2017-08-21
Minor updates after merging in the PSO addition to the tuners
Cedric Nugteren
2017-08-21
Remove multistrategy and related functions
mcian
2017-08-09
Revert the xgemm strategy to default. If user wants to use multistrategy can ...
mcian
2017-08-09
Use cltune::SearchMethod enum instead of int values
mcian
2017-07-23
Code refactoring
mcian
2017-07-17
Add PSO parameters support and search strategy selection from command line
mcian
2017-05-11
Re-added random tuning for GEMM after accidental removal
Cedric Nugteren
2017-04-22
Increased the default number of runs for the tuner from 2 up to 10 for fast k...
Cedric Nugteren
2017-04-21
Increased the default number of runs for GEMV tuning; updated GEMV tuning res...
Cedric Nugteren
2017-04-17
Fixed a namespace clash with CUDA FP16 for the half-datatype
Cedric Nugteren
2017-04-14
Added a new Xaxpy kernel in between the regular and fast version in
Cedric Nugteren
2017-03-14
Added the possibility to tune batched kernels
Cedric Nugteren
2016-11-27
Made it possible to use the command-line environmental variables for each exe...
Cedric Nugteren
2016-10-22
Moved files around a bit; created a utilities subfolder
Cedric Nugteren
2016-10-03
Re-organised GEMM direct kernel and added faster fall-back version for incomp...
Cedric Nugteren
2016-10-02
Set the default number of runs for all kernels to at least 2 runs
Cedric Nugteren
2016-10-02
Specialised the GEMM direct kernel in four ways for transposing/non-transposi...
Cedric Nugteren
2016-10-02
Split the GEMM direct kernel into two files; set the default tuning target to...
Cedric Nugteren
2016-10-01
Added padding to the local memory of the GEMM direct kernel
Cedric Nugteren
2016-10-01
Added default num-runs to the tuner adding averaging over 10 runs as a defaul...
Cedric Nugteren
2016-10-01
Merge branch 'development' into gemm_direct
Cedric Nugteren
2016-09-27
Fixed the local memory size computation for the GEMM tuners
Cedric Nugteren
2016-09-25
Added a first version of a tuner for the GEMM direct kernel; collapsed MWGD, ...
Cedric Nugteren
2016-09-12
Split the XGEMM kernel further up: now in 3 parts. This is done because MSVC ...
Cedric Nugteren
2016-09-06
Split GEMM tuning in two parts: a small set of tuning parameters which is exp...
Cedric Nugteren
2016-08-21
Increased the ratio of GEMM tuning results to explore; reduced the tuning sea...
Cedric Nugteren
2016-07-25
Moved the XgemvFast and XgemvFastRot tuning database into a separate file
Cedric Nugteren
2016-07-23
Fixe a bug in the new XgemvFastRot kernel related to local memory size
Cedric Nugteren
2016-07-23
Further improvements to the XgemvFastRot kernel, properly enables coalescing now
Cedric Nugteren
2016-07-23
Improved the XgemvFastRot kernel by tiled loading of the input matrix A, enab...
Cedric Nugteren
2016-07-10
Now passing alpha/beta to the kernel as arguments as before fp16 support; in ...
Cedric Nugteren
2016-06-19
Renamed all C++ source files to .cpp to match the .hpp extension better
Cedric Nugteren
[next]