Age | Commit message (Expand) | Author |
---|---|---|
2018-01-03 | Added a queue argument to the get-size function when running the tests/clients | Cedric Nugteren |
2017-10-15 | Modified test interfaces such that they support either OpenCL or CUDA | Cedric Nugteren |
2017-07-12 | Relaxed requirement on a_ld and b_ld for batched GEMM | Cedric Nugteren |
2017-06-26 | Fixed and suppresses several warnings for MSVC | Cedric Nugteren |
2017-04-13 | Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now w... | Cedric Nugteren |
2017-04-10 | Added reference implementations for performance-testing against cuBLAS | Cedric Nugteren |
2017-04-02 | Factored out inclusion of clBLAS and CBLAS from the test-routine files | Cedric Nugteren |
2017-04-01 | Separated host-device and device-host memory copies from execution of the CBL... | Cedric Nugteren |
2017-03-10 | Added API and test infrastructure for the batched GEMM routine | Cedric Nugteren |