summaryrefslogtreecommitdiff
path: root/src/routines/levelx
AgeCommit message (Expand)Author
2019-12-09Reduce TestMatrix calls for xgemmstridedbatched.Tarmo Räntilä
2019-12-09Reduce TestMatrix calls for xgemmbatched.Tarmo Räntilä
2019-01-19Merge pull request #345 from CNugteren/convolution-fixes-and-tunerCedric Nugteren
2019-01-05Added a check to prevent the stride of matrix C being set to 0 for the stride...Cedric Nugteren
2018-12-31Added convgemm to the CLBlast database, added initial parameters for Skylake GPUCedric Nugteren
2018-12-18Fix xconvgemm kernel and enable ConvGemmMethod::kSingleKernelKoichi Akabe
2018-11-12Add kernel_mode option to im2col, col2im, and convgemm functionsKoichi Akabe
2018-10-30Fix col2im implementationKoichi Akabe
2018-10-23Added groundwork for col2im algorithm plus first non-working version of kerne...Cedric Nugteren
2018-10-22Some name changes in im2col codeCedric Nugteren
2018-09-16Merge branch 'master' into convgemm_multi_kernelCedric Nugteren
2018-09-15Fixed an MSVC compilation error due to large stringsCedric Nugteren
2018-09-07Added xCONVGEMM as im2col plus a batched GEMM kernelCedric Nugteren
2018-06-03Merge branch 'master' into CLBlast-267-convgemmCedric Nugteren
2018-06-01Fixes for Apple OpenCL CPU implementation which requires a LWGS of 1 when bar...Cedric Nugteren
2018-05-31Added error-checking for half-empty local work group sizes; fixed a minor TRS...Cedric Nugteren
2018-05-31Some potential fixes for error -54 when launching TRSV and TRSM kernelsCedric Nugteren
2018-05-27Added a check to return 'NotImplemented' error code in case of systems with <...Cedric Nugteren
2018-05-27Made FillMatrix and FillVector functions take a configurable local workgroup ...Cedric Nugteren
2018-05-21Added method selection option to switch between im2col and single-kernel appr...Cedric Nugteren
2018-05-19Moved new convgemm kernel to levelx kernel folderCedric Nugteren
2018-05-19Second version of direct reading from image tensor for convgemm: also with lo...Cedric Nugteren
2018-05-17First version of direct reading from image tensor for convgemm: only for edge...Cedric Nugteren
2018-05-13Created a dedicated convgemm GEMM kernel as a copy of the batched direct gemm...Cedric Nugteren
2018-05-13Plugged in the code of strided-batched-gemm into convgemm in preparation of a...Cedric Nugteren
2018-05-09Changed temporary convgemm implementation to use batched-strided GEMMCedric Nugteren
2018-05-09Implemented convolution as im2col + GEMMCedric Nugteren
2018-05-06Added convgemm skeleton, test infrastructure, and first reference implementationCedric Nugteren
2018-04-15Fixed some failing tests for GEMM and batched GEMM routinesCedric Nugteren
2018-04-13Made GEMM rotation expectations kernel-specificCedric Nugteren
2018-02-02Implemented the XHAD Hadamard product routineCedric Nugteren
2018-01-31Created the API and stubs for the HAD (hadamard-product) routinesCedric Nugteren
2018-01-26Fixed an event synchronisation issue in the batched gemm routinesCedric Nugteren
2018-01-18Made the batched routines also chose direct/indirect kernel like the main GEM...Cedric Nugteren
2018-01-08Implemented the in-direct version of the strided-batched GEMM kernelCedric Nugteren
2018-01-07Implemented direct version of strided-batched GEMM kernelCedric Nugteren
2018-01-07Added API and tests for new GemmStridedBatched routineCedric Nugteren
2018-01-06Reduced duplicate code in the batched GEMM implementationCedric Nugteren
2017-12-23Split the invert kernel in two parts to prevent error C1091 in MSVC 2013Cedric Nugteren
2017-12-23Added TRSV block-size tunerCedric Nugteren
2017-12-10Split GEMM kernel in 4 files instead of 3 due to MSVC 2013 string length limitCedric Nugteren
2017-11-02Integrated the GEMM routine tuner for kernel selection; added first tuning re...Cedric Nugteren
2017-10-17Made buffers of batched routines read/write (was: read-only)Cedric Nugteren
2017-09-19Fixed type conversion warnings under MSVC 2013Cedric Nugteren
2017-08-31Fixed a bug in im2col: process only valid channel IDsCedric Nugteren
2017-08-31Fixed a bug in im2col confusing first and second workgroup size; made im2col ...Cedric Nugteren
2017-08-24Completed im2col implementationCedric Nugteren
2017-08-19First version of im2col kernel, unoptimized but workingCedric Nugteren
2017-08-12Merge branch 'master' into im_to_colCedric Nugteren
2017-07-12Relaxed requirement on a_ld and b_ld for batched GEMMCedric Nugteren