Age | Commit message (Collapse) | Author |
|
|
|
|
|
|
|
|
|
Convolution with single kernel
|
|
|
|
strided-batched-GEMM routine
|
|
|
|
|
|
executions
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
kernel and test
|
|
|
|
|
|
|
|
First im2col+GEMM implementation of convolution
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Fixes bug in conjugate transpose not being executed
|
|
Added workaround for AMD Southern Islands GPU issue
|
|
transposing
|
|
|
|
|
|
|
|
|
|
|
|
invalid ones completely, saving compilation time
|
|
kernels to improve performance
|
|
|
|
|
|
|
|
|