summaryrefslogtreecommitdiff
path: root/src/tuning/kernels/xgemv.cpp
AgeCommit message (Expand)Author
2020-02-17Catches all exceptions of the tunersCedric Nugteren
2018-03-22Added the OpenCL local memory size constraint to the tunersCedric Nugteren
2018-03-10Fixed a few things for the new tuning APICedric Nugteren
2018-03-03Separate kernel tuners in .cpp with main and .hpp with settingsCedric Nugteren
2018-01-25Changed the default number of runs for the GEMV tuner to fix issues for FP16Cedric Nugteren
2017-12-18Reformatted tuning code to make compilation fasterCedric Nugteren
2017-11-19Modified the kernel tuners to use the newly integrated auto-tunerCedric Nugteren
2017-09-30Refactored the tuning architecture: less duplicate now; more defaultsCedric Nugteren
2017-08-21Remove multistrategy and related functionsmcian
2017-07-23Code refactoringmcian
2017-04-21Increased the default number of runs for GEMV tuning; updated GEMV tuning res...Cedric Nugteren
2017-04-17Fixed a namespace clash with CUDA FP16 for the half-datatypeCedric Nugteren
2017-03-14Added the possibility to tune batched kernelsCedric Nugteren
2016-11-27Made it possible to use the command-line environmental variables for each exe...Cedric Nugteren
2016-10-22Moved files around a bit; created a utilities subfolderCedric Nugteren
2016-10-02Set the default number of runs for all kernels to at least 2 runsCedric Nugteren
2016-10-01Added default num-runs to the tuner adding averaging over 10 runs as a defaul...Cedric Nugteren
2016-07-25Moved the XgemvFast and XgemvFastRot tuning database into a separate fileCedric Nugteren
2016-07-23Fixe a bug in the new XgemvFastRot kernel related to local memory sizeCedric Nugteren
2016-07-23Further improvements to the XgemvFastRot kernel, properly enables coalescing nowCedric Nugteren
2016-07-23Improved the XgemvFastRot kernel by tiled loading of the input matrix A, enab...Cedric Nugteren
2016-07-10Now passing alpha/beta to the kernel as arguments as before fp16 support; in ...Cedric Nugteren
2016-06-19Renamed all C++ source files to .cpp to match the .hpp extension betterCedric Nugteren