index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
test
/
routines
Age
Commit message (
Expand
)
Author
2017-11-19
Fixed a variety of warnings and an error for MSVC2013 compilation
Cedric Nugteren
2017-11-08
Fixed an FP16 issue in the homatcopy test; added a comment about improper tes...
Cedric Nugteren
2017-11-02
Integrated the GEMM routine tuner for kernel selection; added first tuning re...
Cedric Nugteren
2017-10-25
Fixed small bug in (unused) invert tester
Cedric Nugteren
2017-10-15
Fixed a small copy-paste typo
Cedric Nugteren
2017-10-15
Modified test interfaces such that they support either OpenCL or CUDA
Cedric Nugteren
2017-10-15
Fixes for the CUDA API: first tests pass and the client runs
Cedric Nugteren
2017-10-15
Prepared test and client infrastructure for use with the CUDA API
Cedric Nugteren
2017-10-01
GEMM tests now test both the in-direct and the direct kernels seperately
Cedric Nugteren
2017-08-31
Fixed a bug in im2col confusing first and second workgroup size; made im2col ...
Cedric Nugteren
2017-08-23
Made the im2col client properly handle the arguments
Cedric Nugteren
2017-08-19
Implemented proper im2col reference function and completd tests
Cedric Nugteren
2017-08-12
Merge branch 'master' into im_to_col
Cedric Nugteren
2017-08-12
Moved some utility functions to a test-specific utility compilation-unit
Cedric Nugteren
2017-07-16
First step towards supporting im2col in the test infrastructure
Cedric Nugteren
2017-07-12
Relaxed requirement on a_ld and b_ld for batched GEMM
Cedric Nugteren
2017-06-26
Fixed and suppresses several warnings for MSVC
Cedric Nugteren
2017-05-11
Bug-fix in the half-precision test of the amax routine
Cedric Nugteren
2017-04-23
Fixed a compiler warning message
Cedric Nugteren
2017-04-13
Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now w...
Cedric Nugteren
2017-04-11
Made compilation of the cuBLAS wrapper work properly
Cedric Nugteren
2017-04-10
Added reference implementations for performance-testing against cuBLAS
Cedric Nugteren
2017-04-03
Fixes the CUDA wrapper (now actually tested on a system with CUDA)
Cedric Nugteren
2017-04-02
Factored out inclusion of clBLAS and CBLAS from the test-routine files
Cedric Nugteren
2017-04-02
Factored out inclusion of clBLAS and CBLAS from the test-routine files
Cedric Nugteren
2017-04-01
Separated host-device and device-host memory copies from execution of the CBL...
Cedric Nugteren
2017-03-10
Added API and test infrastructure for the batched GEMM routine
Cedric Nugteren
2017-03-10
Small fix for a file that isn't currently compiled anymore
Cedric Nugteren
2017-03-10
Added proper testing of the alpha parameter; finalized the batched AXPY imple...
Cedric Nugteren
2017-03-08
Make batched routines based on offsets instead of a vector of cl_mem objects ...
Cedric Nugteren
2017-03-05
Minor fixes to the client w.r.t. the addition of the batch count
Cedric Nugteren
2017-03-05
Added first naive version of the batched AXPY routine
Cedric Nugteren
2017-03-05
Adjusted the test-infrastructure to support testing of batched-versions of ro...
Cedric Nugteren
2017-03-05
Changed the way the test-data is generated: now using a single MT generator a...
Cedric Nugteren
2017-03-04
Fixed a missing include for the tests
Cedric Nugteren
2017-03-04
Added a proper data-preparation function for the TRSM tests
Cedric Nugteren
2017-02-26
Added a guard against invalid buffer sizes in the prepare-data functions for ...
Cedric Nugteren
2017-02-25
Added PrepareData function for TRSM to create proper test input
Cedric Nugteren
2017-02-19
Added data-preparation function for the TRSV tests and special nan/inf checks...
Cedric Nugteren
2017-01-20
Added prototype for the TRSV routine
Cedric Nugteren
2017-01-18
Added first version of the TRSM routine based on the diagonal invert kernel
Cedric Nugteren
2017-01-15
Added a first version of the diagonal block invert routine in preparation of ...
Cedric Nugteren
2016-12-18
Prepared for the addition of the TRSM triangular solver kernel
Cedric Nugteren
2016-11-17
Added a proper half-precision reference for testing of xomatcopy
Cedric Nugteren
2016-09-22
Fixed a bug waiting for an invalid event in case of a non-succesfull CLBlast ...
Cedric Nugteren
2016-06-28
Made it possible to build the OMATCOPY test and client in case only clBLAS is...
CNugteren
2016-06-18
Moved all headers into the source tree, changed headers to .hpp extension
Cedric Nugteren
2016-06-16
Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and...
Cedric Nugteren
2016-05-26
Added half-precision tests for the clBLAS reference through conversion to sin...
Cedric Nugteren
2016-05-08
Fixed an issue with computing the GFLOPS numbers for the xGEMM performance te...
cnugteren
[next]