path: root/src/database/kernels
Age | Commit message | Author
2016-11-26 | Improved the default parameters for cases with non-common parameters across all devices | Cedric Nugteren
2016-11-15 | Updated the tuning results for the Intel Skylake ULT GT2 GPU | Cedric Nugteren
2016-10-24 | Added tuning results for GeForce GTX TITAN Black | Cedric Nugteren
2016-10-22 | Added tuning results for the AMD Tonga GPU | Cedric Nugteren
2016-10-14 | Fixed an issue with a growing database: the database is now a global variable in a namespace and its container uses const-pointers to the actual data | Cedric Nugteren
2016-10-13 | Added tuning results for Intel HD Graphics IvyBridge GPU | Cedric Nugteren
2016-10-10 | Updated the tuning results for the GTX 750 Ti GPU | Cedric Nugteren
2016-10-08 | Fixed a performance bug for Intel Iris Pro GPUs due to incorrect tuning results | Cedric Nugteren
2016-10-06 | Added first tuning results for the single-kernel direct GEMM implementation | Cedric Nugteren
2016-10-01 | Added padding to the local memory of the GEMM direct kernel | Cedric Nugteren
2016-09-25 | Added a first version of a tuner for the GEMM direct kernel; collapsed MWGD, NWGD and KWGD into one WGD parameter | Cedric Nugteren
2016-09-25 | Separated the tuning parameters of the new direct GEMM kernel from the indirect version | Cedric Nugteren
2016-09-12 | Added XgemvFastRot and Xgemm 16-bit tuning results: just defaults which are now automatically taken from 32-bit if there are no entries at all | Cedric Nugteren
2016-09-11 | Complete re-write of the database script. Changed Pandas for the much faster and convenient plain JSON/dict data-type | Cedric Nugteren
2016-09-10 | Updated database based on exhaustive tuning results for GEMM for the R9 M370X GPU | Cedric Nugteren
2016-09-10 | Updated the database script to remove duplicate entries: keeps only the best-performing cases for a specific parameters combination | Cedric Nugteren
2016-09-03 | Added tuning results for Intel Broadwell 5500 GT2 GPU | Cedric Nugteren
2016-09-03 | Updated tuning results for Haswell GT2 Mobile GPU; fixed database script to handle duplicate entries of different runs | Cedric Nugteren
2016-08-21 | Also changed the default-default for unknown device types to use the same method as for known device groups | Cedric Nugteren
2016-08-15 | Updated the database script to calculate the relative best performance of tuning results common for a device/vendor type | Cedric Nugteren
2016-07-25 | Removed all old tuning results for the XgemvFastRot kernel; re-added for a couple of devices | Cedric Nugteren
2016-07-25 | Moved the XgemvFast and XgemvFastRot tuning database into a separate file | Cedric Nugteren
2016-07-23 | Further improvements to the XgemvFastRot kernel, properly enables coalescing now | Cedric Nugteren
2016-07-23 | Improved the XgemvFastRot kernel by tiled loading of the input matrix A, enabling better memory performance | Cedric Nugteren
2016-07-10 | Added tuning results for AMD Oland and for Intel Graphics HD 530 | Cedric Nugteren
2016-07-03 | Added tuning results for GTX670, GTX750, and GTX1070 (thanks to gcp) | Cedric Nugteren
2016-06-19 | Added tuning results for 'Intel(R) HD Graphics Haswell Ultrabook GT2 Mobile' (thanks to OursDesCavernes) | Cedric Nugteren
2016-06-18 | Moved all headers into the source tree, changed headers to .hpp extension | Cedric Nugteren