index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
test
/
correctness
Age
Commit message (
Expand
)
Author
2018-05-17
Fixed a few issues with canary region testing
Cedric Nugteren
2018-05-17
Added a canary region for overflow detection to the correctness tests
Cedric Nugteren
2018-04-15
Fixed some failing tests for GEMM and batched GEMM routines
Cedric Nugteren
2018-03-15
Fixed breaking preprocessor test on certain platforms due to empty kernel string
Cedric Nugteren
2018-01-31
Created the API and stubs for the HAD (hadamard-product) routines
Cedric Nugteren
2018-01-11
Added test for the RetrieveParameters function
Cedric Nugteren
2018-01-11
Fixed bug in override parameters test
Cedric Nugteren
2018-01-07
Added API and tests for new GemmStridedBatched routine
Cedric Nugteren
2018-01-03
Added a queue argument to the get-size function when running the tests/clients
Cedric Nugteren
2017-12-10
Split GEMM kernel in 4 files instead of 3 due to MSVC 2013 string length limit
Cedric Nugteren
2017-12-09
Completed kernel modifications for pre-processor of all other kernels
Cedric Nugteren
2017-12-09
Made the pre-processor run by default for ARM and Qualcomm GPUs
Cedric Nugteren
2017-12-09
Fixed defines parsing and substituting in pre-processor; fixed some variable ...
Cedric Nugteren
2017-12-05
Improved array-to-register promotion, now handling function calls as well
Cedric Nugteren
2017-12-03
Added GEMM (direct and in-direct) to the pre-processor testing; modified the ...
Cedric Nugteren
2017-12-03
Reformated transpose kernels for the pre-processor; extended the amount of tests
Cedric Nugteren
2017-11-30
Improved the pre-processor's handling of defines; added a special nested defi...
Cedric Nugteren
2017-11-30
Integrated pre-processor in compilation flow, default is still disabled
Cedric Nugteren
2017-11-29
Extended the preprocessor tests to include CopyFast and CopyPad
Cedric Nugteren
2017-11-28
Improved the pre-processor tester, added GEMV and GER kernels
Cedric Nugteren
2017-11-25
Added stub for a preprocessor and a corresponding compilation test
Cedric Nugteren
2017-11-13
Moved square-difference utility function for use in the tuners
Cedric Nugteren
2017-10-28
Merge branch 'master' into android_support
Cedric Nugteren
2017-10-15
Prepared test and client infrastructure for use with the CUDA API
Cedric Nugteren
2017-10-09
Fixed the Python generator script w.r.t. the recent change of testing direct/...
Cedric Nugteren
2017-10-01
GEMM tests now test both the in-direct and the direct kernels seperately
Cedric Nugteren
2017-09-26
Added missing headers
Cedric Nugteren
2017-09-24
Updated database override function to work with the new database storage format
Cedric Nugteren
2017-09-23
Made database-caching no longer dependent on device name but on device/platfo...
Cedric Nugteren
2017-08-19
Implemented proper im2col reference function and completd tests
Cedric Nugteren
2017-08-12
Merge branch 'master' into im_to_col
Cedric Nugteren
2017-08-12
Moved some utility functions to a test-specific utility compilation-unit
Cedric Nugteren
2017-07-16
First step towards supporting im2col in the test infrastructure
Cedric Nugteren
2017-07-12
Fixed batched tests when testing for invalid sizes against clBLAS
Cedric Nugteren
2017-07-09
Changed printf-statements with %zu into std::cout to fix MSVC 2013 compatibility
Cedric Nugteren
2017-07-09
Disabled UNIX-style terminal color printing under Windows
Cedric Nugteren
2017-06-27
Moved and inlined some static member variables and disabled spurious clang wa...
Cedric Nugteren
2017-06-27
Undo of earlier move of TestBlas::kTransposes constant to fix MSVC 2013 compi...
Cedric Nugteren
2017-06-25
Moved static variable declarations from .cpp to .hpp to resolve some Clang wa...
Cedric Nugteren
2017-04-17
Fixed a namespace clash with CUDA FP16 for the half-datatype
Cedric Nugteren
2017-04-16
Merge branch 'development' into benchmarking
Cedric Nugteren
2017-04-14
Added a new Xaxpy kernel in between the regular and fast version in
Cedric Nugteren
2017-04-11
Made compilation of the cuBLAS wrapper work properly
Cedric Nugteren
2017-04-10
Added reference implementations for performance-testing against cuBLAS
Cedric Nugteren
2017-04-03
In-lined the float2 and double2 types to avoid collision with CUDA's definitions
Cedric Nugteren
2017-04-02
Layed the groundwork for cuBLAS comparisons in the clients
Cedric Nugteren
2017-04-01
Separated host-device and device-host memory copies from execution of the CBL...
Cedric Nugteren
2017-03-20
Fixed a GCC/MSVC compilation issue
Cedric Nugteren
2017-03-12
Fixed a linker issue for Clang
Cedric Nugteren
2017-03-10
Added API and test infrastructure for the batched GEMM routine
Cedric Nugteren
[next]