Age | Commit message (Collapse) | Author |
|
|
|
times; requires CLTune 2.5.0
|
|
|
|
|
|
reproducability of results
|
|
|
|
Link clBLAS together with pthread
|
|
|
|
|
|
NWGD and KWGD into one WGD parameter
|
|
indirect version
|
|
|
|
|
|
|
|
servers
|
|
dividiti/add_ref_includes_to_test_correctness_common
Add path to ref library header when building tests.
|
|
|
|
call in the tests and samples
|
|
|
|
environmental variable
|
|
|
|
Fixed link in README.md
|
|
The GitHub link could be https://github.com/gpu
(without "s"), but the website should be OK, too
|
|
Update to version 0.9.0
|
|
|
|
with recent usages of CLBLAST_DEVICE and CLBLAST_PLATFORM
|
|
CMakeLists.txt: use -Wno-ignored-attributes to silence unfixable warnings
|
|
|
|
can't handle long strings
|
|
|
|
now automatically taken from 32-bit if there are no entries at all
|
|
and convienient plain JSON/dict data-type
|
|
|
|
M370X GPU
|
|
best-performing cases for a specific parameters combination
|
|
explored exhaustively and a larger set which is explored randomly
|
|
|
|
problems if C contains NaNs
|
|
|
|
handle duplicate entries of different runs
|
|
test/correctness: read platform and device from environment
|
|
Support passing environment variables CLBLAST_PLATFORM and CLBLAST_DEVICE
instead of -platform and -device arguments to test executables.
This is for `ctest`.
|
|
|
|
method as for known device groups
|
|
search space to have a better chance to evaluate more likely parameter combinations
|
|
|
|
Conflicts:
README.md
|
|
|
|
dvasschemacq-master
Conflicts:
src/kernels/level1/xaxpy.opencl
src/kernels/level2/xgemv.opencl
src/kernels/level2/xgemv_fast.opencl
src/kernels/level2/xger.opencl
src/kernels/level2/xher.opencl
src/kernels/level2/xher2.opencl
src/kernels/level3/xgemm_part2.opencl
|
|
In OpenCL 1.1 __kernel has to be before __attribute__, at least with
Vivante compiler.
|