Age | Commit message | Author |
|
test from README
|
|
Apple OpenCL limitations for TRSV/TRSM now return not-implemented status
|
|
< 16 LWGS for TRSV and TRSM
|
|
size
|
|
and standard-deviation
|
|
capture other parts of the kernel code
|
|
approach for convgemm
|
|
|
|
local memory support now
|
available; now run part of alltuners target
|
edge cases now
|
|
gemm kernel
|
|
a new kernel
|
Intel subgroup shuffling
|
|
the OpenCL program
|
with the new kernel
|
Adding override parameters to pyclblast
|
|
MWG/NWG
|