debian-clblast
Debian package for CLBlast.
gspr@nonempty.org
path: root/test/performance
Age
Commit message
Author
2018-01-31
Created the API and stubs for the HAD (hadamard-product) routines
Cedric Nugteren
2018-01-14
Small improvements to benchmarking for cuBLAS
Cedric Nugteren
2018-01-07
Added API and tests for new GemmStridedBatched routine
Cedric Nugteren
2018-01-03
Added a queue argument to the get-size function when running the tests/clients
Cedric Nugteren
2017-11-22
Made parameter override in the clients a command-line argument and added supp...
Cedric Nugteren
2017-11-21
Implemented first version of reading JSON files from disk in the client to ov...
Cedric Nugteren
2017-10-15
Prepared test and client infrastructure for use with the CUDA API
Cedric Nugteren
2017-10-01
GEMM tests now test both the in-direct and the direct kernels seperately
Cedric Nugteren
2017-08-23
Made the im2col client properly handle the arguments
Cedric Nugteren
2017-08-12
Merge branch 'master' into im_to_col
Cedric Nugteren
2017-08-12
Moved some utility functions to a test-specific utility compilation-unit
Cedric Nugteren
2017-07-16
First step towards supporting im2col in the test infrastructure
Cedric Nugteren
2017-04-17
Fixed a namespace clash with CUDA FP16 for the half-datatype
Cedric Nugteren
2017-04-13
Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now w...
Cedric Nugteren
2017-04-03
In-lined the float2 and double2 types to avoid collision with CUDA's definitions
Cedric Nugteren
2017-04-02
Layed the groundwork for cuBLAS comparisons in the clients
Cedric Nugteren
2017-04-01
Separated host-device and device-host memory copies from execution of the CBL...
Cedric Nugteren
2017-03-19
Fixed a compilation issue for GCC/MSVC
Cedric Nugteren
2017-03-12
Fixed a linker issue for Clang
Cedric Nugteren
2017-03-10
Added API and test infrastructure for the batched GEMM routine
Cedric Nugteren
2017-03-08
Make batched routines based on offsets instead of a vector of cl_mem objects ...
Cedric Nugteren
2017-03-05
Minor fixes to the client w.r.t. the addition of the batch count
Cedric Nugteren
2017-03-05
Adjusted the test-infrastructure to support testing of batched-versions of ro...
Cedric Nugteren
2017-03-05
Changed the way the test-data is generated: now using a single MT generator a...
Cedric Nugteren
2017-03-05
Prepared generator for batched routines; added batched AXPY routine interface
Cedric Nugteren
2017-02-26
Removed half-precision support from the TRSM routine; too unstable
Cedric Nugteren
2017-02-05
Merge branch 'development' into triangular_solvers
Cedric Nugteren
2017-01-20
treewide: include clpp11.hpp first to silence deprecation warnings
Ivan Shapovalov
2017-01-15
Added a first version of the diagonal block invert routine in preparation of ...
Cedric Nugteren
2016-11-27
Made it possible to use the command-line environmental variables for each exe...
Cedric Nugteren
2016-10-22
Moved files around a bit; created a utilities subfolder
Cedric Nugteren
2016-09-27
Now generates test/client/tuner data using a fixed seed to enable reproducabi...
Cedric Nugteren
2016-07-06
Added an option to the performance clients to do a warm-up run before timing
Cedric Nugteren
2016-06-27
Moved the performance graph scripts to the 'scripts' subfolder
Cedric Nugteren
2016-06-19
Renamed all C++ source files to .cpp to match the .hpp extension better
Cedric Nugteren
2016-06-18
Moved all headers into the source tree, changed headers to .hpp extension
Cedric Nugteren
2016-06-16
Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and...
Cedric Nugteren
2016-05-25
Added possibility to run the performance client with half-precision
Cedric Nugteren
2016-05-18
Merged in latest changes from 0.7.1 release
Cedric Nugteren
2016-04-20
Added support for the iSAMAX/iDAMAX/iCAMAX/iZAMAX routines
cnugteren
2016-04-20
Added prototype for ixAMAX routines
cnugteren
2016-04-14
Added support for the SASUM/DASUM/ScASUM/DzASUM routines
cnugteren
2016-04-13
Added prototype for xASUM routines
cnugteren
2016-04-02
Added support for testing (performance and correctness) against a CPU BLAS li...
cnugteren
2016-03-30
Merge branch 'level1_routines' into development
cnugteren
2016-03-30
Added prototypes for the xROTM and xROTMG routines
Cedric Nugteren
2016-03-30
Added prototypes for the xROT and xROTG functions
Cedric Nugteren
2016-03-28
Added preliminary support for the xNRM2 routines
Cedric Nugteren
2016-03-25
Added prototypes for ScNRM2/DzNRM2 routines
Cedric Nugteren
2016-03-25
Added prototypes for SNRM2/DNRM2 routines
Cedric Nugteren