path: root/test
Age | Commit message | Author
2018-04-15 | Fixed some failing tests for GEMM and batched GEMM routines | Cedric Nugteren
2018-03-15 | Fixed breaking preprocessor test on certain platforms due to empty kernel string | Cedric Nugteren
2018-02-02 | Implemented the XHAD Hadamard product routine | Cedric Nugteren
2018-01-31 | Created the API and stubs for the HAD (hadamard-product) routines | Cedric Nugteren
2018-01-14 | Small improvements to benchmarking for cuBLAS | Cedric Nugteren
2018-01-11 | Added test for the RetrieveParameters function | Cedric Nugteren
2018-01-11 | Fixed bug in override parameters test | Cedric Nugteren
2018-01-07 | Added API and tests for new GemmStridedBatched routine | Cedric Nugteren
2018-01-06 | Prevented half-precision batched routines from failing in the tests | Cedric Nugteren
2018-01-06 | Added CUDA interface to get temporary-buffer size for GEMM routine | Cedric Nugteren
2018-01-03 | Added the temp-buffer to the GEMM testers and clients | Cedric Nugteren
2018-01-03 | Added a queue argument to the get-size function when running the tests/clients | Cedric Nugteren
2017-12-24 | Fixes for the CUDA backend of CLBlast | Cedric Nugteren
2017-12-23 | Fixed unused variable warnings showing up with Clang | Cedric Nugteren
2017-12-10 | Split GEMM kernel in 4 files instead of 3 due to MSVC 2013 string length limit | Cedric Nugteren
2017-12-09 | Completed kernel modifications for pre-processor of all other kernels | Cedric Nugteren
2017-12-09 | Made the pre-processor run by default for ARM and Qualcomm GPUs | Cedric Nugteren
2017-12-09 | Fixed defines parsing and substituting in pre-processor; fixed some variable ... | Cedric Nugteren
2017-12-05 | Improved array-to-register promotion, now handling function calls as well | Cedric Nugteren
2017-12-03 | Added GEMM (direct and in-direct) to the pre-processor testing; modified the ... | Cedric Nugteren
2017-12-03 | Reformated transpose kernels for the pre-processor; extended the amount of tests | Cedric Nugteren
2017-11-30 | Improved the pre-processor's handling of defines; added a special nested defi... | Cedric Nugteren
2017-11-30 | Integrated pre-processor in compilation flow, default is still disabled | Cedric Nugteren
2017-11-29 | Extended the preprocessor tests to include CopyFast and CopyPad | Cedric Nugteren
2017-11-28 | Improved the pre-processor tester, added GEMV and GER kernels | Cedric Nugteren
2017-11-25 | Moved string splitting functions; added string character removal function | Cedric Nugteren
2017-11-25 | Added stub for a preprocessor and a corresponding compilation test | Cedric Nugteren
2017-11-24 | Fixed a Clang compilation error | Cedric Nugteren
2017-11-24 | Added missing include file | Cedric Nugteren
2017-11-24 | Added precision check to parameter override for the clients | Cedric Nugteren
2017-11-22 | Made parameter override in the clients a command-line argument and added supp... | Cedric Nugteren
2017-11-21 | Implemented first version of reading JSON files from disk in the client to ov... | Cedric Nugteren
2017-11-19 | Fixed a variety of warnings and an error for MSVC2013 compilation | Cedric Nugteren
2017-11-13 | Moved square-difference utility function for use in the tuners | Cedric Nugteren
2017-11-08 | Fixed an FP16 issue in the homatcopy test; added a comment about improper tes... | Cedric Nugteren
2017-11-07 | Merge pull request #212 from CNugteren/kernel_selection_tuner | Cedric Nugteren
2017-11-02 | Integrated the GEMM routine tuner for kernel selection; added first tuning re... | Cedric Nugteren
2017-10-28 | Merge branch 'master' into android_support | Cedric Nugteren
2017-10-28 | Moved timing function to a separate file | Cedric Nugteren
2017-10-25 | Fixed small bug in (unused) invert tester | Cedric Nugteren
2017-10-17 | Fix an incompatibility with CUDA's FP16 definition | Cedric Nugteren
2017-10-15 | Fixed a small copy-paste typo | Cedric Nugteren
2017-10-15 | Modified test interfaces such that they support either OpenCL or CUDA | Cedric Nugteren
2017-10-15 | Fixes for the CUDA API: first tests pass and the client runs | Cedric Nugteren
2017-10-15 | Prepared test and client infrastructure for use with the CUDA API | Cedric Nugteren
2017-10-09 | Fixed the Python generator script w.r.t. the recent change of testing direct/... | Cedric Nugteren
2017-10-08 | Moved the remaining OpenCL specific host code to the clpp11.h header where it... | Cedric Nugteren
2017-10-07 | Synchronizes clpp11.h with CLCudaAPI 9.0 | Cedric Nugteren
2017-10-01 | GEMM tests now test both the in-direct and the direct kernels seperately | Cedric Nugteren
2017-09-26 | Added missing headers | Cedric Nugteren