index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2017-03-19
Fixed a compilation issue for GCC/MSVC
Cedric Nugteren
2017-03-19
Added an (optional) non-direct implementation of the batched GEMM routine
Cedric Nugteren
2017-03-19
Added batched versions of the pad/copy/transpose kernels
Cedric Nugteren
2017-03-14
Added the possibility to tune batched kernels
Cedric Nugteren
2017-03-12
Fixed a linker issue for Clang
Cedric Nugteren
2017-03-11
Added initial naive version of the batched GEMM routine based on the direct G...
Cedric Nugteren
2017-03-10
Added API and test infrastructure for the batched GEMM routine
Cedric Nugteren
2017-03-10
Merge pull request #141 from CNugteren/axpy_batched
Cedric Nugteren
2017-03-10
Small fix for a file that isn't currently compiled anymore
Cedric Nugteren
2017-03-10
Added proper testing of the alpha parameter; finalized the batched AXPY imple...
Cedric Nugteren
2017-03-10
Fixed a small compilation bug for MSVC related to a floating-point constant
Cedric Nugteren
2017-03-08
Implemented a batched version of the AXPY kernel
Cedric Nugteren
2017-03-08
Make batched routines based on offsets instead of a vector of cl_mem objects ...
Cedric Nugteren
2017-03-05
Minor fixes to the client w.r.t. the addition of the batch count
Cedric Nugteren
2017-03-05
Added first naive version of the batched AXPY routine
Cedric Nugteren
2017-03-05
Adjusted the test-infrastructure to support testing of batched-versions of ro...
Cedric Nugteren
2017-03-05
Changed the way the test-data is generated: now using a single MT generator a...
Cedric Nugteren
2017-03-05
Prepared generator for batched routines; added batched AXPY routine interface
Cedric Nugteren
2017-03-04
Fixed a missing include for the tests
Cedric Nugteren
2017-03-04
Added tuning results for the Radeon HD6750M GPU (Apple OpenCL)
Cedric Nugteren
2017-03-04
Added a proper data-preparation function for the TRSM tests
Cedric Nugteren
2017-03-01
Added proper support for the b_offset argument in TRSM
Cedric Nugteren
2017-03-01
Made a double to float cast explicit for MSVC compatibility (C2397)
Cedric Nugteren
2017-02-27
Added L2 error computation and checking for half-precision tests
Cedric Nugteren
2017-02-27
Fixed half-precision bugs in HTBMV/HTPMV/HTRMV/HSYR2K/HTRMM related to incorr...
Cedric Nugteren
2017-02-26
Updated the README documentation
Cedric Nugteren
2017-02-26
Merge pull request #138 from CNugteren/triangular_solvers
Cedric Nugteren
2017-02-26
Split the GEMM kernel further up to prevent C1091 in MSVC
Cedric Nugteren
2017-02-26
Minor fix to the generator script
Cedric Nugteren
2017-02-26
Merge branch 'development' into triangular_solvers
Cedric Nugteren
2017-02-26
Added a guard against invalid buffer sizes in the prepare-data functions for ...
Cedric Nugteren
2017-02-26
Fixed an out-of-bounds memory access when filling a matrix with a constant
Cedric Nugteren
2017-02-26
Removed half-precision support from the TRSM routine; too unstable
Cedric Nugteren
2017-02-26
Improved the correctness tests for complex numbers in case either real or ima...
Cedric Nugteren
2017-02-26
Fixes division in the kernel for inversion of complex numbers
Cedric Nugteren
2017-02-25
Added documentation for the TRSV and TRSM routines
Cedric Nugteren
2017-02-25
Removed the invert routine from the tests
Cedric Nugteren
2017-02-25
Added PrepareData function for TRSM to create proper test input
Cedric Nugteren
2017-02-24
Implemented a simple row-major to col-major problem conversion for TRSM
Cedric Nugteren
2017-02-22
Fixed a few issues with the TRSM routine; some tests still failing
Cedric Nugteren
2017-02-19
Added data-preparation function for the TRSV tests and special nan/inf checks...
Cedric Nugteren
2017-02-18
Added tuning parameters for the AMD RX480 GPU (Ellesmere)
Cedric Nugteren
2017-02-18
Merge pull request #137 from CNugteren/custom_parameters
Cedric Nugteren
2017-02-18
Changed the override-parameters test such that it is compatible with more dev...
Cedric Nugteren
2017-02-18
Fixed small typo in the documentation
Cedric Nugteren
2017-02-18
Added documentation for the OverrideParameters function
Cedric Nugteren
2017-02-18
Fixed the naming of the C API of OverrideParameters and fixed the description
Cedric Nugteren
2017-02-18
Added missing documentation for the fill and clear cache functions
Cedric Nugteren
2017-02-16
Added a C interface to the OverrideParameters function; added some in-line co...
Cedric Nugteren
2017-02-16
Added input-sanity checks for the OverrideParameters function
Cedric Nugteren
[prev]
[next]