Age | Commit message | Author | |
---|---|---|---|
2017-12-10 | Split GEMM kernel in 4 files instead of 3 due to MSVC 2013 string length limit | Cedric Nugteren | |
2017-11-02 | Integrated the GEMM routine tuner for kernel selection; added first tuning results | Cedric Nugteren | |
2017-10-17 | Made buffers of batched routines read/write (was: read-only) | Cedric Nugteren | |
2017-09-19 | Fixed type conversion warnings under MSVC 2013 | Cedric Nugteren | |
2017-07-12 | Relaxed requirement on a_ld and b_ld for batched GEMM | Cedric Nugteren | |
2017-03-19 | Added an (optional) non-direct implementation of the batched GEMM routine | Cedric Nugteren | |
2017-03-11 | Added initial naive version of the batched GEMM routine based on the direct GEMM kernel | Cedric Nugteren | |
2017-03-10 | Added API and test infrastructure for the batched GEMM routine | Cedric Nugteren | |