summaryrefslogtreecommitdiff
path: root/src/kernels/level3
AgeCommit message (Expand)Author
2016-08-18Adapt opencl files for 1.1 OpenCLD. Van Assche
2016-07-17Improved the GEMM direct kernel by adding register blocking. Still not fast t...Cedric Nugteren
2016-07-16Created infrastructure to support a direct GEMM kernel; added correct but slo...Cedric Nugteren
2016-07-10Now passing alpha/beta to the kernel as arguments as before fp16 support; in ...Cedric Nugteren
2016-06-16Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and...Cedric Nugteren
2016-06-14Re-organised the level-3 supporting kernels (copy, pad, transpose, convert) a...Cedric Nugteren
2016-06-08Added global memory synchronisation for better cache performance on ARM Mali ...Cedric Nugteren
2016-05-18Merged in latest changes from 0.7.1 releaseCedric Nugteren
2016-05-16Prepared GEMM and supporting kernels and tuners for half-precision supportCedric Nugteren
2016-02-08Separated the GEMM kernel in two parts to reduce string length for MSVCCedric Nugteren
2015-10-13Added guards for routine-specific level-3 pad kernelsCNugteren
2015-10-12Moved level3 kernel files to a subfolderCNugteren