Age | Commit message | Author | |
---|---|---|---|
2017-07-16 | First step towards supporting im2col in the test infrastructure | Cedric Nugteren | |
2017-04-17 | Fixed a namespace clash with CUDA FP16 for the half-datatype | Cedric Nugteren | |
2017-04-03 | In-lined the float2 and double2 types to avoid collision with CUDA's definitions | Cedric Nugteren | |
2017-03-10 | Added API and test infrastructure for the batched GEMM routine | Cedric Nugteren | |
2017-03-05 | Prepared generator for batched routines; added batched AXPY routine interface | Cedric Nugteren | |
2017-01-15 | Added a first version of the diagonal block invert routine in preparation of TRSM | Cedric Nugteren | |
2016-06-19 | Renamed all C++ source files to .cpp to match the .hpp extension better | Cedric Nugteren | |
2016-06-18 | Moved all headers into the source tree, changed headers to .hpp extension | Cedric Nugteren | |
2016-06-16 | Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and/or transposing | Cedric Nugteren | |