summaryrefslogtreecommitdiff
path: root/test
AgeCommit message (Expand)Author
2018-12-17Fix half-float+kernel_mode test cases of im2col, col2im, and convgemmKoichi Akabe
2018-11-12Add kernel_mode option to im2col, col2im, and convgemm functionsKoichi Akabe
2018-11-07Changed col2im to append to the existing im-bufferCedric Nugteren
2018-11-01Fixed half-precision tests for im2col and col2imCedric Nugteren
2018-10-30Fix col2im implementationKoichi Akabe
2018-10-23Added groundwork for col2im algorithm plus first non-working version of kerne...Cedric Nugteren
2018-10-22Some name changes in im2col codeCedric Nugteren
2018-10-17Fixed MSVC's compilation error C1061 due to too many for-loopsCedric Nugteren
2018-09-16Merge branch 'master' into convgemm_multi_kernelCedric Nugteren
2018-09-15Added pre-processor test for GEMMK=1 kernelCedric Nugteren
2018-09-07Reduced size of the xCONVGEMM correctness testsCedric Nugteren
2018-09-07Added reference implementation for xCONVGEMM for half-precisionCedric Nugteren
2018-07-31Fixed the tests of OMATCOPY to include proper complex conjugationCedric Nugteren
2018-07-31Fixed an error reporting issue related to the canary regionCedric Nugteren
2018-07-29Removed complex numbers support for CONVGEMMCedric Nugteren
2018-06-03Merge branch 'master' into CLBlast-267-convgemmCedric Nugteren
2018-06-02Added MKL as an alternative for CBLAS for correctness and performance compari...Cedric Nugteren
2018-05-27Added maximum time reporting to the client statisticsCedric Nugteren
2018-05-23Added an option in the clients to output timing statistics: minimum, mean, an...Cedric Nugteren
2018-05-21Further implemented single-kernel approach of convgemm; extended test to capt...Cedric Nugteren
2018-05-19Merge branch 'master' into CLBlast-267-convgemmCedric Nugteren
2018-05-19Fixed a bug in loading xgemm-direct JSON data from diskCedric Nugteren
2018-05-17Fixed a few issues with canary region testingCedric Nugteren
2018-05-17Added a canary region for overflow detection to the correctness testsCedric Nugteren
2018-05-09Fixed the performance client for convgemm and added GFLOPS measurementsCedric Nugteren
2018-05-09Split channels/strides testing values off from kernel sizes for more flexibilityCedric Nugteren
2018-05-06Added convgemm skeleton, test infrastructure, and first reference implementationCedric Nugteren
2018-04-15Fixed some failing tests for GEMM and batched GEMM routinesCedric Nugteren
2018-03-15Fixed breaking preprocessor test on certain platforms due to empty kernel stringCedric Nugteren
2018-02-02Implemented the XHAD Hadamard product routineCedric Nugteren
2018-01-31Created the API and stubs for the HAD (hadamard-product) routinesCedric Nugteren
2018-01-14Small improvements to benchmarking for cuBLASCedric Nugteren
2018-01-11Added test for the RetrieveParameters functionCedric Nugteren
2018-01-11Fixed bug in override parameters testCedric Nugteren
2018-01-07Added API and tests for new GemmStridedBatched routineCedric Nugteren
2018-01-06Prevented half-precision batched routines from failing in the testsCedric Nugteren
2018-01-06Added CUDA interface to get temporary-buffer size for GEMM routineCedric Nugteren
2018-01-03Added the temp-buffer to the GEMM testers and clientsCedric Nugteren
2018-01-03Added a queue argument to the get-size function when running the tests/clientsCedric Nugteren
2017-12-24Fixes for the CUDA backend of CLBlastCedric Nugteren
2017-12-23Fixed unused variable warnings showing up with ClangCedric Nugteren
2017-12-10Split GEMM kernel in 4 files instead of 3 due to MSVC 2013 string length limitCedric Nugteren
2017-12-09Completed kernel modifications for pre-processor of all other kernelsCedric Nugteren
2017-12-09Made the pre-processor run by default for ARM and Qualcomm GPUsCedric Nugteren
2017-12-09Fixed defines parsing and substituting in pre-processor; fixed some variable ...Cedric Nugteren
2017-12-05Improved array-to-register promotion, now handling function calls as wellCedric Nugteren
2017-12-03Added GEMM (direct and in-direct) to the pre-processor testing; modified the ...Cedric Nugteren
2017-12-03Reformated transpose kernels for the pre-processor; extended the amount of testsCedric Nugteren
2017-11-30Improved the pre-processor's handling of defines; added a special nested defi...Cedric Nugteren
2017-11-30Integrated pre-processor in compilation flow, default is still disabledCedric Nugteren