Age | Commit message (Expand) | Author |
---|---|---|
2018-01-06 | Reduced duplicate code in the batched GEMM implementation | Cedric Nugteren |
2017-12-10 | Split GEMM kernel in 4 files instead of 3 due to MSVC 2013 string length limit | Cedric Nugteren |
2017-11-02 | Integrated the GEMM routine tuner for kernel selection; added first tuning re... | Cedric Nugteren |
2017-10-17 | Made buffers of batched routines read/write (was: read-only) | Cedric Nugteren |
2017-09-19 | Fixed type conversion warnings under MSVC 2013 | Cedric Nugteren |
2017-07-12 | Relaxed requirement on a_ld and b_ld for batched GEMM | Cedric Nugteren |
2017-03-19 | Added an (optional) non-direct implementation of the batched GEMM routine | Cedric Nugteren |
2017-03-11 | Added initial naive version of the batched GEMM routine based on the direct G... | Cedric Nugteren |
2017-03-10 | Added API and test infrastructure for the batched GEMM routine | Cedric Nugteren |