Age | Commit message (Collapse) | Author |
|
|
|
|
|
where the AMD compiler crashes"
This reverts commit 407ed52cec41445f02e85cb45d08f590960216bb.
|
|
|
|
|
|
AMD compiler crashes
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
names in kernels
|
|
|
|
|
|
loops in kernel accordingly
|
|
|
|
|
|
pragma for several kernels
|
|
based on assumptions
|
|
|
|
|
|
|
|
|
|
changed transpose kernel code
|
|
|
|
|
|
|
|
|
|
optional temporary buffers
|
|
|
|
|
|
kernel 2d instead of 3d
|
|
|
|
|
|
NVIDIA and ARM GPUs
|
|
sets of input parameters
|
|
|
|
|
|
|
|
struct; fixes bug on Apple OpenCL
|
|
|
|
|
|
GEMM kernel
|
|
implementation
|
|
|
|
- undoing many earlier changes
|
|
|
|
|