index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2017-02-18
Fixed the naming of the C API of OverrideParameters and fixed the description
Cedric Nugteren
2017-02-18
Added missing documentation for the fill and clear cache functions
Cedric Nugteren
2017-02-16
Added a C interface to the OverrideParameters function; added some in-line co...
Cedric Nugteren
2017-02-16
Added input-sanity checks for the OverrideParameters function
Cedric Nugteren
2017-02-14
Added simple tests for the OverrideParameters function
Cedric Nugteren
2017-02-13
Added first version of the OverrideParameters function
Cedric Nugteren
2017-02-13
Fixed a small bug in GEMV: unused kernel in parameter list
Cedric Nugteren
2017-02-12
Split the database into several smaller cached per-kernel databases (in prepa...
Cedric Nugteren
2017-02-12
Made RemoveBySubset from the cache work with references to keys
Cedric Nugteren
2017-02-11
Added an option to remove items from the caches, optionally by a subset of 2 ...
Cedric Nugteren
2017-02-08
Added tuning results for Titan X (Pascal version)
Cedric Nugteren
2017-01-24
Updated the changelog for PR131 and PR132
Cedric Nugteren
2017-01-24
Merge pull request #132 from intelfx/cache
Cedric Nugteren
2017-01-24
Database: pass Device instead of Queue for clarity
Ivan Shapovalov
2017-01-24
Routine: cache the database instance as well
Ivan Shapovalov
2017-01-24
Database: ref-count the internal map for caching
Ivan Shapovalov
2017-01-24
Routine, Cache: generalize, reduce amount of copying in fast path
Ivan Shapovalov
2017-01-24
Merge pull request #131 from intelfx/misc
Cedric Nugteren
2017-01-24
.travis.yml: do not build for osx twice, there's no gcc there
Ivan Shapovalov
2017-01-24
treewide: silence type mismatch warnings in *printf()
Ivan Shapovalov
2017-01-24
Tester: always fail on OpenCL and CLBlast internal errors
Ivan Shapovalov
2017-01-24
FillCache: perform compilation for each precision separately
Ivan Shapovalov
2017-01-24
Routine: fix semi-warm routine construction (when binary is in cache)
Ivan Shapovalov
2017-01-24
src/clpp11.hpp: check pointers before clRelease*()
Ivan Shapovalov
2017-01-24
src/clpp11.hpp: do not store program source/binary in Program
Ivan Shapovalov
2017-01-24
samples: add CL_USE_DEPRECATED_OPENCL_1_*_APIS where needed
Ivan Shapovalov
2017-01-20
treewide: include clpp11.hpp first to silence deprecation warnings
Ivan Shapovalov
2017-01-20
Routine: use PrecisionSupported<>() instead of duplicating the check
Ivan Shapovalov
2017-01-19
Added tuning results for NVIDIA GTX 1080 and Intel Core i7-4790K
Cedric Nugteren
2017-01-07
Updated the link to cl.hpp in the Khronos registry for the samples
Cedric Nugteren
2017-01-07
Always enables cl_khr_fp64 when running double-precision, not just for OpenCL...
Cedric Nugteren
2017-01-03
Added tuning results for the AMD Turks GPU and the Intel Core i7-2670QM CPU
Cedric Nugteren
2016-12-18
Fixed a bug when using offsets in the direct GEMM kernels
Cedric Nugteren
2016-11-29
Made Intel GPUs always use the indirect version of the GEMM kernel
Cedric Nugteren
2016-11-27
Updated to version 0.10.0
Cedric Nugteren
2016-11-27
Made it possible to use the command-line environmental variables for each exe...
Cedric Nugteren
2016-11-27
Merge branch 'better_defaults' into development
Cedric Nugteren
2016-11-26
Improved the default parameters for cases with non-common parameters across a...
Cedric Nugteren
2016-11-24
Merge pull request #125 from CNugteren/netlib_blas_api
Cedric Nugteren
2016-11-23
Made the Netlib SGEMM example also optionally compiled
Cedric Nugteren
2016-11-23
Fixed a vector-size related bug in the CLBlast Netlib API
Cedric Nugteren
2016-11-23
Made compilation of the Netlib CBLAS API conditional
Cedric Nugteren
2016-11-23
Fixed a bug in the HSCAL routine
Cedric Nugteren
2016-11-22
Minor changes to ensure full compatibility with the Netlib CBLAS API
Cedric Nugteren
2016-11-20
Made functions with scalar-buffers as output properly return values
Cedric Nugteren
2016-11-20
Added performance results for the Skylake ULT GT2 GPU
Cedric Nugteren
2016-11-20
Now correctly tests for validaty of the B matrix in the TRMM routine
Cedric Nugteren
2016-11-20
Forced OpenCL 1.1 compilation and disabled a deprecation warning
Cedric Nugteren
2016-11-20
Fixed a bug in the TRMM routine caused by overwriting input data before consu...
Cedric Nugteren
2016-11-19
Generating FP16 performance graphs now uses FP32 as a reference for comparison
Cedric Nugteren
[next]