Age | Commit message (Collapse) | Author |
|
|
|
|
|
|
|
|
|
comparisons
|
|
barriers are present
|
|
TRSV global worksize issue
|
|
|
|
|
|
test from README
|
|
Apple opencl limitations for TRSV/TRSM now return not-implemented status
|
|
Runtime statistics in client
|
|
< 16 LWGS for TSRV and TRSM
|
|
size
|
|
|
|
and standard-deviation
|
|
capture other parts of the kernel code
|
|
approach for convgemm
|
|
Added an option to run the routine tuner for a single specific GEMM size
|
|
|
|
local memory support now
|
|
|
|
|
|
Routine tuners read kernel JSON from disk
|
|
|
|
available; now run part of alltuners target
|
|
|
|
Canary buffer overflow protection
|
|
|
|
Better cache behaviour of OpenCL programs
|
|
|
|
|
|
|
|
|
|
|
|
|
|
edge cases now
|
|
gemm kernel
|
|
a new kernel
|
|
|
|
|
|
Update ci links to use doman names and build names instead of IP/id
|
|
|
|
|
|
|
|
Updates the README badges to point to the domain name instead of
IP addresses. Also updates the names of the builds to the name
of the build instead of the id of the build.
|
|
|
|
|
|
|
|
|