path: root/src/kernels/level3/xgemm_part3.opencl
Age  Commit message  Author
2017-12-10  Split GEMM kernel in 4 files instead of 3 due to MSVC 2013 string length limit  Cedric Nugteren
2017-12-09  Reformatted GEMM kernel to support array-to-register promotion  Cedric Nugteren
2017-12-09  Fixed defines parsing and substituting in pre-processor; fixed some variable ...  Cedric Nugteren
2017-12-07  Added register promotion to the main GEMM kernel  Cedric Nugteren
2017-12-03  Added GEMM (direct and in-direct) to the pre-processor testing; modified the ...  Cedric Nugteren
2017-10-14  Make local memory pointers a define in OpenCL; some fixes to the recently cha...  Cedric Nugteren
2017-10-03  Gemm in-direct implementation now uses only 1 larger instead of max 3 optiona...  Cedric Nugteren
2017-07-08  Made the inline keyword in kernels optional currently only enabled for NVIDIA...  Cedric Nugteren
2016-10-22  Fixed a bug in the SYRK/SYR2K/HERK/HER2K routines that would occur with speci...  Cedric Nugteren
2016-10-22  Fixed a bug in the SYRK/SYR2K/HERK/HER2K routines that would occur with speci...  Cedric Nugteren
2016-09-12  Split the XGEMM kernel further up: now in 3 parts. This is done because MSVC ...  Cedric Nugteren