summaryrefslogtreecommitdiff
path: root/test/routines
AgeCommit message (Expand)Author
2021-03-13set the correct flop count for xgemmJishinMaster
2018-12-17Fix half-float+kernel_mode test cases of im2col, col2im, and convgemmKoichi Akabe
2018-11-12Add kernel_mode option to im2col, col2im, and convgemm functionsKoichi Akabe
2018-11-07Changed col2im to append to the existing im-bufferCedric Nugteren
2018-11-01Fixed half-precision tests for im2col and col2imCedric Nugteren
2018-10-30Fix col2im implementationKoichi Akabe
2018-10-23Added groundwork for col2im algorithm plus first non-working version of kerne...Cedric Nugteren
2018-10-22Some name changes in im2col codeCedric Nugteren
2018-09-16Merge branch 'master' into convgemm_multi_kernelCedric Nugteren
2018-09-07Added reference implementation for xCONVGEMM for half-precisionCedric Nugteren
2018-07-31Fixed the tests of OMATCOPY to include proper complex conjugationCedric Nugteren
2018-05-19Merge branch 'master' into CLBlast-267-convgemmCedric Nugteren
2018-05-17Fixed a few issues with canary region testingCedric Nugteren
2018-05-09Fixed the performance client for convgemm and added GFLOPS measurementsCedric Nugteren
2018-05-06Added convgemm skeleton, test infrastructure, and first reference implementationCedric Nugteren
2018-02-02Implemented the XHAD Hadamard product routineCedric Nugteren
2018-01-31Created the API and stubs for the HAD (hadamard-product) routinesCedric Nugteren
2018-01-07Added API and tests for new GemmStridedBatched routineCedric Nugteren
2018-01-06Prevented half-precision batched routines from failing in the testsCedric Nugteren
2018-01-06Added CUDA interface to get temporary-buffer size for GEMM routineCedric Nugteren
2018-01-03Added the temp-buffer to the GEMM testers and clientsCedric Nugteren
2018-01-03Added a queue argument to the get-size function when running the tests/clientsCedric Nugteren
2017-12-23Fixed unused variable warnings showing up with ClangCedric Nugteren
2017-11-19Fixed a variety of warnings and an error for MSVC2013 compilationCedric Nugteren
2017-11-08Fixed an FP16 issue in the homatcopy test; added a comment about improper tes...Cedric Nugteren
2017-11-02Integrated the GEMM routine tuner for kernel selection; added first tuning re...Cedric Nugteren
2017-10-25Fixed small bug in (unused) invert testerCedric Nugteren
2017-10-15Fixed a small copy-paste typoCedric Nugteren
2017-10-15Modified test interfaces such that they support either OpenCL or CUDACedric Nugteren
2017-10-15Fixes for the CUDA API: first tests pass and the client runsCedric Nugteren
2017-10-15Prepared test and client infrastructure for use with the CUDA APICedric Nugteren
2017-10-01GEMM tests now test both the in-direct and the direct kernels seperatelyCedric Nugteren
2017-08-31Fixed a bug in im2col confusing first and second workgroup size; made im2col ...Cedric Nugteren
2017-08-23Made the im2col client properly handle the argumentsCedric Nugteren
2017-08-19Implemented proper im2col reference function and completd testsCedric Nugteren
2017-08-12Merge branch 'master' into im_to_colCedric Nugteren
2017-08-12Moved some utility functions to a test-specific utility compilation-unitCedric Nugteren
2017-07-16First step towards supporting im2col in the test infrastructureCedric Nugteren
2017-07-12Relaxed requirement on a_ld and b_ld for batched GEMMCedric Nugteren
2017-06-26Fixed and suppresses several warnings for MSVCCedric Nugteren
2017-05-11Bug-fix in the half-precision test of the amax routineCedric Nugteren
2017-04-23Fixed a compiler warning messageCedric Nugteren
2017-04-13Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now w...Cedric Nugteren
2017-04-11Made compilation of the cuBLAS wrapper work properlyCedric Nugteren
2017-04-10Added reference implementations for performance-testing against cuBLASCedric Nugteren
2017-04-03Fixes the CUDA wrapper (now actually tested on a system with CUDA)Cedric Nugteren
2017-04-02Factored out inclusion of clBLAS and CBLAS from the test-routine filesCedric Nugteren
2017-04-02Factored out inclusion of clBLAS and CBLAS from the test-routine filesCedric Nugteren
2017-04-01Separated host-device and device-host memory copies from execution of the CBL...Cedric Nugteren
2017-03-10Added API and test infrastructure for the batched GEMM routineCedric Nugteren