Age | Commit message (Collapse) | Author | |
---|---|---|---|
2016-12-18 | Fixed a bug when using offsets in the direct GEMM kernels | Cedric Nugteren | |
2016-10-03 | Fixed a const-correctness issue with complex conjugation in the GEMM direct ↵ | Cedric Nugteren | |
kernel | |||
2016-10-03 | Added functions to load from off-chip to local memory without vector loads ↵ | Cedric Nugteren | |
for the GEMM direct kernels | |||
2016-10-03 | Re-organised GEMM direct kernel and added faster fall-back version for ↵ | Cedric Nugteren | |
incomplete rectangles | |||
2016-10-02 | Specialised the GEMM direct kernel in four ways for ↵ | Cedric Nugteren | |
transposing/non-transposing: NN, NT, TN, TT | |||
2016-10-02 | Split the GEMM direct kernel into two files; set the default tuning target ↵ | Cedric Nugteren | |
to 256-256-256 |