index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
test
/
routines
Age
Commit message (
Expand
)
Author
2017-05-11
Bug-fix in the half-precision test of the amax routine
Cedric Nugteren
2017-04-23
Fixed a compiler warning message
Cedric Nugteren
2017-04-13
Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now w...
Cedric Nugteren
2017-04-11
Made compilation of the cuBLAS wrapper work properly
Cedric Nugteren
2017-04-10
Added reference implementations for performance-testing against cuBLAS
Cedric Nugteren
2017-04-03
Fixes the CUDA wrapper (now actually tested on a system with CUDA)
Cedric Nugteren
2017-04-02
Factored out inclusion of clBLAS and CBLAS from the test-routine files
Cedric Nugteren
2017-04-02
Factored out inclusion of clBLAS and CBLAS from the test-routine files
Cedric Nugteren
2017-04-01
Separated host-device and device-host memory copies from execution of the CBL...
Cedric Nugteren
2017-03-10
Added API and test infrastructure for the batched GEMM routine
Cedric Nugteren
2017-03-10
Small fix for a file that isn't currently compiled anymore
Cedric Nugteren
2017-03-10
Added proper testing of the alpha parameter; finalized the batched AXPY imple...
Cedric Nugteren
2017-03-08
Make batched routines based on offsets instead of a vector of cl_mem objects ...
Cedric Nugteren
2017-03-05
Minor fixes to the client w.r.t. the addition of the batch count
Cedric Nugteren
2017-03-05
Added first naive version of the batched AXPY routine
Cedric Nugteren
2017-03-05
Adjusted the test-infrastructure to support testing of batched-versions of ro...
Cedric Nugteren
2017-03-05
Changed the way the test-data is generated: now using a single MT generator a...
Cedric Nugteren
2017-03-04
Fixed a missing include for the tests
Cedric Nugteren
2017-03-04
Added a proper data-preparation function for the TRSM tests
Cedric Nugteren
2017-02-26
Added a guard against invalid buffer sizes in the prepare-data functions for ...
Cedric Nugteren
2017-02-25
Added PrepareData function for TRSM to create proper test input
Cedric Nugteren
2017-02-19
Added data-preparation function for the TRSV tests and special nan/inf checks...
Cedric Nugteren
2017-01-20
Added prototype for the TRSV routine
Cedric Nugteren
2017-01-18
Added first version of the TRSM routine based on the diagonal invert kernel
Cedric Nugteren
2017-01-15
Added a first version of the diagonal block invert routine in preparation of ...
Cedric Nugteren
2016-12-18
Prepared for the addition of the TRSM triangular solver kernel
Cedric Nugteren
2016-11-17
Added a proper half-precision reference for testing of xomatcopy
Cedric Nugteren
2016-09-22
Fixed a bug waiting for an invalid event in case of a non-succesfull CLBlast ...
Cedric Nugteren
2016-06-28
Made it possible to build the OMATCOPY test and client in case only clBLAS is...
CNugteren
2016-06-18
Moved all headers into the source tree, changed headers to .hpp extension
Cedric Nugteren
2016-06-16
Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and...
Cedric Nugteren
2016-05-26
Added half-precision tests for the clBLAS reference through conversion to sin...
Cedric Nugteren
2016-05-08
Fixed an issue with computing the GFLOPS numbers for the xGEMM performance te...
cnugteren
2016-04-27
All CLBlast enum constants now have the same raw values as in the cblas standard
Cedric Nugteren
2016-04-20
Added support for the iSAMAX/iDAMAX/iCAMAX/iZAMAX routines
cnugteren
2016-04-14
Added support for the SASUM/DASUM/ScASUM/DzASUM routines
cnugteren
2016-04-02
Added support for testing (performance and correctness) against a CPU BLAS li...
cnugteren
2016-03-28
Added preliminary support for the xNRM2 routines
Cedric Nugteren
2016-03-06
Added preliminary support for xHPR2 and xSPR2 routines
Cedric Nugteren
2016-03-02
Added preliminary support for xHER2 and xSYR2 routines
Cedric Nugteren
2016-02-28
Added support for xHER, xHPR, xSYR, and xSPR routines
Cedric Nugteren
2016-02-20
Added support for xGERU and xGERC routines
Cedric Nugteren
2016-02-20
Added XGER routine, kernel, and tuner
Cedric Nugteren
2015-09-26
Added TRMV/TBMV/TPMV routines
CNugteren
2015-09-19
Added SBMV and SPMV routines
CNugteren
2015-09-19
Added the HPMV routine
CNugteren
2015-09-19
Added the HBMV routine
CNugteren
2015-09-18
Added first version of banded matrix-vector multiplication
CNugteren
2015-09-14
Added xDOT/xDOTU/xDOTC dot-product routines
CNugteren
2015-08-22
Added the XSWAP, XSCAL and XCOPY level-1 routines
CNugteren
[next]