summaryrefslogtreecommitdiff
path: root/src/tuning/kernels/copy_fast.cpp
AgeCommit message (Expand)Author
2020-02-17Catches all exceptions of the tunersCedric Nugteren
2018-03-22Added the OpenCL local memory size constraint to the tunersCedric Nugteren
2018-03-10Fixed a few things for the new tuning APICedric Nugteren
2018-03-03Separate kernel tuners in .cpp with main and .hpp with settingsCedric Nugteren
2017-12-18Reformatted tuning code to make compilation fasterCedric Nugteren
2017-11-19Modified the kernel tuners to use the newly integrated auto-tunerCedric Nugteren
2017-09-30Refactored the tuning architecture: less duplicate now; more defaultsCedric Nugteren
2017-08-21Remove multistrategy and related functionsmcian
2017-07-23Code refactoringmcian
2017-04-22Increased the default number of runs for the tuner from 2 up to 10 for fast k...Cedric Nugteren
2017-04-17Fixed a namespace clash with CUDA FP16 for the half-datatypeCedric Nugteren
2017-03-14Added the possibility to tune batched kernelsCedric Nugteren
2016-11-27Made it possible to use the command-line environmental variables for each exe...Cedric Nugteren
2016-10-22Moved files around a bit; created a utilities subfolderCedric Nugteren
2016-10-02Set the default number of runs for all kernels to at least 2 runsCedric Nugteren
2016-10-01Added default num-runs to the tuner adding averaging over 10 runs as a defaul...Cedric Nugteren
2016-07-10Now passing alpha/beta to the kernel as arguments as before fp16 support; in ...Cedric Nugteren
2016-06-19Renamed all C++ source files to .cpp to match the .hpp extension betterCedric Nugteren