path: root/src/tuning/tuning.cpp
Age | Commit message | Author
2021-05-22 | Fix issue with printing out-of-bounds local/global sizes for level 1 tuners | Cedric Nugteren
2021-01-20 | Use reference types to prevent unnecessary copying | Jerry James
2020-05-11 | Increase display width of the local/global sizes | Cedric Nugteren
2020-05-10 | Made sure that the global workgroup size is a multiple of the local size in t... | Cedric Nugteren
2020-05-10 | Added logging of local/global workgroup sizes when run the tuners | Cedric Nugteren
2020-05-03 | Move queue creation out of the tuner loop | Cedric Nugteren
2018-12-31 | Added support for the convgemm tuner in the tuner database | Cedric Nugteren
2018-12-18 | Added first version of a tuner for the ConvGemm direct kernel | Cedric Nugteren
2018-07-28 | The tuners now also check for valid local thread configurations and skip inva... | Cedric Nugteren
2018-07-25 | Added code to report the average tuning results | Cedric Nugteren
2018-05-17 | Added a canary region for overflow detection to the tuners | Cedric Nugteren
2018-03-21 | Re-added support for local memory size constraint checking in the tuner | Cedric Nugteren
2018-01-25 | Moved some constants from global scope to a function; removed unnecessary inc... | Cedric Nugteren
2018-01-06 | Fixed a vendor naming bug in the tuners and in the database | Cedric Nugteren
2017-12-18 | Reformatted tuning code to make compilation faster | Cedric Nugteren
2017-12-17 | Fixed an issue with the tuner: it was using platform vendor rather than devic... | Cedric Nugteren
2017-11-19 | Added compilation timing and better compilation error reporting | Cedric Nugteren
2017-11-19 | Some fixed for the new auto-tuner to be compatible with the Python scripts | Cedric Nugteren
2017-11-19 | Revived the GEMM routine tuner; minor formatting changes | Cedric Nugteren
2017-11-17 | Moved some tuning functions from .hpp to .cpp | Cedric Nugteren
2017-11-15 | Added first version of integrated and re-written auto-tuner | Cedric Nugteren