index : debian-clblast
Debian package for CLBlast.
gspr@nonempty.org
path: root/src
Age | Commit message | Author
2017-02-27 | Fixed half-precision bugs in HTBMV/HTPMV/HTRMV/HSYR2K/HTRMM related to incorr... | Cedric Nugteren
2017-02-26 | Split the GEMM kernel further up to prevent C1091 in MSVC | Cedric Nugteren
2017-02-26 | Merge branch 'development' into triangular_solvers | Cedric Nugteren
2017-02-26 | Fixed an out-of-bounds memory access when filling a matrix with a constant | Cedric Nugteren
2017-02-26 | Removed half-precision support from the TRSM routine; too unstable | Cedric Nugteren
2017-02-26 | Fixes division in the kernel for inversion of complex numbers | Cedric Nugteren
2017-02-25 | Added PrepareData function for TRSM to create proper test input | Cedric Nugteren
2017-02-24 | Implemented a simple row-major to col-major problem conversion for TRSM | Cedric Nugteren
2017-02-22 | Fixed a few issues with the TRSM routine; some tests still failing | Cedric Nugteren
2017-02-19 | Added data-preparation function for the TRSV tests and special nan/inf checks... | Cedric Nugteren
2017-02-18 | Added tuning parameters for the AMD RX480 GPU (Ellesmere) | Cedric Nugteren
2017-02-18 | Fixed the naming of the C API of OverrideParameters and fixed the description | Cedric Nugteren
2017-02-16 | Added a C interface to the OverrideParameters function; added some in-line co... | Cedric Nugteren
2017-02-16 | Added input-sanity checks for the OverrideParameters function | Cedric Nugteren
2017-02-13 | Added first version of the OverrideParameters function | Cedric Nugteren
2017-02-13 | Fixed a small bug in GEMV: unused kernel in parameter list | Cedric Nugteren
2017-02-12 | Split the database into several smaller cached per-kernel databases (in prepa... | Cedric Nugteren
2017-02-12 | Made RemoveBySubset from the cache work with references to keys | Cedric Nugteren
2017-02-11 | Added an option to remove items from the caches, optionally by a subset of 2 ... | Cedric Nugteren
2017-02-08 | Added tuning results for Titan X (Pascal version) | Cedric Nugteren
2017-02-05 | Merge branch 'development' into triangular_solvers | Cedric Nugteren
2017-02-05 | Fixed complex version of the TRSV kernel | Cedric Nugteren
2017-02-04 | Improved substition kernels a bit; added complex support | Cedric Nugteren
2017-02-04 | Completed a first STRSV implementation | Cedric Nugteren
2017-02-04 | Added row-major support for TRSV | Cedric Nugteren
2017-01-29 | Added first (incomplete) version of TRSV routine | Cedric Nugteren
2017-01-24 | Database: pass Device instead of Queue for clarity | Ivan Shapovalov
2017-01-24 | Routine: cache the database instance as well | Ivan Shapovalov
2017-01-24 | Database: ref-count the internal map for caching | Ivan Shapovalov
2017-01-24 | Routine, Cache: generalize, reduce amount of copying in fast path | Ivan Shapovalov
2017-01-24 | FillCache: perform compilation for each precision separately | Ivan Shapovalov
2017-01-24 | Routine: fix semi-warm routine construction (when binary is in cache) | Ivan Shapovalov
2017-01-24 | src/clpp11.hpp: check pointers before clRelease*() | Ivan Shapovalov
2017-01-24 | src/clpp11.hpp: do not store program source/binary in Program | Ivan Shapovalov
2017-01-20 | treewide: include clpp11.hpp first to silence deprecation warnings | Ivan Shapovalov
2017-01-20 | Routine: use PrecisionSupported<>() instead of duplicating the check | Ivan Shapovalov
2017-01-20 | Added prototype for the TRSV routine | Cedric Nugteren
2017-01-20 | Set number of decimals for floating-point printing for error reporting | Cedric Nugteren
2017-01-19 | Added tuning results for NVIDIA GTX 1080 and Intel Core i7-4790K | Cedric Nugteren
2017-01-18 | Added first version of the TRSM routine based on the diagonal invert kernel | Cedric Nugteren
2017-01-15 | Added a first version of the diagonal block invert routine in preparation of ... | Cedric Nugteren
2017-01-15 | Prints additional information in verbose/debug mode | Cedric Nugteren
2017-01-07 | Always enables cl_khr_fp64 when running double-precision, not just for OpenCL... | Cedric Nugteren
2017-01-03 | Added tuning results for the AMD Turks GPU and the Intel Core i7-2670QM CPU | Cedric Nugteren
2016-12-18 | Prepared for the addition of the TRSM triangular solver kernel | Cedric Nugteren
2016-12-18 | Fixed a bug when using offsets in the direct GEMM kernels | Cedric Nugteren
2016-11-29 | Made Intel GPUs always use the indirect version of the GEMM kernel | Cedric Nugteren
2016-11-27 | Made it possible to use the command-line environmental variables for each exe... | Cedric Nugteren
2016-11-26 | Improved the default parameters for cases with non-common parameters across a... | Cedric Nugteren
2016-11-24 | Merge pull request #125 from CNugteren/netlib_blas_api | Cedric Nugteren