Age | Commit message | Author |
---|---|---|
2017-04-13 | Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now w... | Cedric Nugteren |
2017-04-10 | Added reference implementations for performance-testing against cuBLAS | Cedric Nugteren |
2017-04-02 | Factored out inclusion of clBLAS and CBLAS from the test-routine files | Cedric Nugteren |
2017-04-01 | Separated host-device and device-host memory copies from execution of the CBL... | Cedric Nugteren |
2017-03-04 | Fixed a missing include for the tests | Cedric Nugteren |
2017-03-04 | Added a proper data-preparation function for the TRSM tests | Cedric Nugteren |
2017-02-26 | Added a guard against invalid buffer sizes in the prepare-data functions for ... | Cedric Nugteren |
2017-02-25 | Added PrepareData function for TRSM to create proper test input | Cedric Nugteren |
2017-01-18 | Added first version of the TRSM routine based on the diagonal invert kernel | Cedric Nugteren |
2016-12-18 | Prepared for the addition of the TRSM triangular solver kernel | Cedric Nugteren |