Age | Commit message | Author |
---|---|---|
2017-06-26 | Fixed and suppressed several warnings for MSVC | Cedric Nugteren |
2017-04-13 | Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now w... | Cedric Nugteren |
2017-04-10 | Added reference implementations for performance-testing against cuBLAS | Cedric Nugteren |
2017-04-02 | Factored out inclusion of clBLAS and CBLAS from the test-routine files | Cedric Nugteren |
2017-04-01 | Separated host-device and device-host memory copies from execution of the CBL... | Cedric Nugteren |
2017-03-10 | Added proper testing of the alpha parameter; finalized the batched AXPY imple... | Cedric Nugteren |
2017-03-08 | Make batched routines based on offsets instead of a vector of cl_mem objects ... | Cedric Nugteren |
2017-03-05 | Minor fixes to the client w.r.t. the addition of the batch count | Cedric Nugteren |
2017-03-05 | Added first naive version of the batched AXPY routine | Cedric Nugteren |