index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2017-01-24
Routine: cache the database instance as well
Ivan Shapovalov
2017-01-24
Database: ref-count the internal map for caching
Ivan Shapovalov
2017-01-24
Routine, Cache: generalize, reduce amount of copying in fast path
Ivan Shapovalov
2017-01-24
Merge pull request #131 from intelfx/misc
Cedric Nugteren
2017-01-24
.travis.yml: do not build for osx twice, there's no gcc there
Ivan Shapovalov
2017-01-24
treewide: silence type mismatch warnings in *printf()
Ivan Shapovalov
2017-01-24
Tester: always fail on OpenCL and CLBlast internal errors
Ivan Shapovalov
2017-01-24
FillCache: perform compilation for each precision separately
Ivan Shapovalov
2017-01-24
Routine: fix semi-warm routine construction (when binary is in cache)
Ivan Shapovalov
2017-01-24
src/clpp11.hpp: check pointers before clRelease*()
Ivan Shapovalov
2017-01-24
src/clpp11.hpp: do not store program source/binary in Program
Ivan Shapovalov
2017-01-24
samples: add CL_USE_DEPRECATED_OPENCL_1_*_APIS where needed
Ivan Shapovalov
2017-01-20
treewide: include clpp11.hpp first to silence deprecation warnings
Ivan Shapovalov
2017-01-20
Routine: use PrecisionSupported<>() instead of duplicating the check
Ivan Shapovalov
2017-01-19
Added tuning results for NVIDIA GTX 1080 and Intel Core i7-4790K
Cedric Nugteren
2017-01-07
Updated the link to cl.hpp in the Khronos registry for the samples
Cedric Nugteren
2017-01-07
Always enables cl_khr_fp64 when running double-precision, not just for OpenCL...
Cedric Nugteren
2017-01-03
Added tuning results for the AMD Turks GPU and the Intel Core i7-2670QM CPU
Cedric Nugteren
2016-12-18
Fixed a bug when using offsets in the direct GEMM kernels
Cedric Nugteren
2016-11-29
Made Intel GPUs always use the indirect version of the GEMM kernel
Cedric Nugteren
2016-11-27
Updated to version 0.10.0
Cedric Nugteren
2016-11-27
Made it possible to use the command-line environmental variables for each exe...
Cedric Nugteren
2016-11-27
Merge branch 'better_defaults' into development
Cedric Nugteren
2016-11-26
Improved the default parameters for cases with non-common parameters across a...
Cedric Nugteren
2016-11-24
Merge pull request #125 from CNugteren/netlib_blas_api
Cedric Nugteren
2016-11-23
Made the Netlib SGEMM example also optionally compiled
Cedric Nugteren
2016-11-23
Fixed a vector-size related bug in the CLBlast Netlib API
Cedric Nugteren
2016-11-23
Made compilation of the Netlib CBLAS API conditional
Cedric Nugteren
2016-11-23
Fixed a bug in the HSCAL routine
Cedric Nugteren
2016-11-22
Minor changes to ensure full compatibility with the Netlib CBLAS API
Cedric Nugteren
2016-11-20
Made functions with scalar-buffers as output properly return values
Cedric Nugteren
2016-11-20
Added performance results for the Skylake ULT GT2 GPU
Cedric Nugteren
2016-11-20
Now correctly tests for validaty of the B matrix in the TRMM routine
Cedric Nugteren
2016-11-20
Forced OpenCL 1.1 compilation and disabled a deprecation warning
Cedric Nugteren
2016-11-20
Fixed a bug in the TRMM routine caused by overwriting input data before consu...
Cedric Nugteren
2016-11-19
Generating FP16 performance graphs now uses FP32 as a reference for comparison
Cedric Nugteren
2016-11-19
Changed the GEMM kernel selection parameters for Skylake GPUs to always favou...
Cedric Nugteren
2016-11-17
Added a proper half-precision reference for testing of xomatcopy
Cedric Nugteren
2016-11-17
Fixed a bug in the error margins; relaxed the error margins for half-precision
Cedric Nugteren
2016-11-15
Updated the tuning results for the Intel Skylake ULT GT2 GPU
Cedric Nugteren
2016-10-25
Added an example and documentation for the Netlib CBLAS API
Cedric Nugteren
2016-10-25
Renamed the include and source files of the Netlib CBLAS API
Cedric Nugteren
2016-10-25
Removed the clblast namespace from the Netlib C API source file to ensure pro...
Cedric Nugteren
2016-10-25
Fixed some issues preventing the Netlib CBLAS API from linking correctly
Cedric Nugteren
2016-10-25
Made the Netlib CBLAS API use the same enums with prefixes as the regular C A...
Cedric Nugteren
2016-10-25
Sets the proper sizes for the buffers for the Netlib CBLAS API
Cedric Nugteren
2016-10-25
Added initial version of a Netlib CBLAS implementation. TODO: Set correct buf...
Cedric Nugteren
2016-10-25
Merge branch 'development' into netlib_blas_api
Cedric Nugteren
2016-10-24
Updated list of acknowledgments and thanks
Cedric Nugteren
2016-10-24
Added tuning results for GeForce GTX TITAN Black
Cedric Nugteren
[next]