Age | Commit message | Author |
|
https://jira-dc.qualcomm.com/jira/browse/OSR-8731
|
pragma for several kernels
|
NVIDIA and ARM GPUs
|
dvasschemacq-master
Conflicts:
    src/kernels/level1/xaxpy.opencl
    src/kernels/level2/xgemv.opencl
    src/kernels/level2/xgemv_fast.opencl
    src/kernels/level2/xger.opencl
    src/kernels/level2/xher.opencl
    src/kernels/level2/xher2.opencl
    src/kernels/level3/xgemm_part2.opencl
|
In OpenCL 1.1, __kernel has to come before __attribute__, at least with
the Vivante compiler.
|
enabling better memory performance
|
case of fp16 arguments are cast on host and in kernel