debian-clblast
Debian package for CLBlast.
gspr@nonempty.org
path: root/test
Age
Commit message
Author
2017-10-08
Moved the remaining OpenCL specific host code to the clpp11.h header where it belongs
Cedric Nugteren
2017-10-07
Synchronizes clpp11.h with CLCudaAPI 9.0
Cedric Nugteren
2017-10-01
GEMM tests now test both the in-direct and the direct kernels seperately
Cedric Nugteren
2017-09-24
Updated database override function to work with the new database storage format
Cedric Nugteren
2017-09-23
Added extra benchmarks to verify new database caching keys performance
Cedric Nugteren
2017-09-23
Made database-caching no longer dependent on device name but on device/platform IDs
Cedric Nugteren
2017-09-22
Added OpenCL properties printing to the diagnostics helper
Cedric Nugteren
2017-09-19
Added first version of a small CLBlast diagnostics helper
Cedric Nugteren
2017-08-31
Fixed a bug in im2col confusing first and second workgroup size; made im2col kernel 2d instead of 3d
Cedric Nugteren
2017-08-23
Made the im2col client properly handle the arguments
Cedric Nugteren
2017-08-19
Implemented proper im2col reference function and completd tests
Cedric Nugteren
2017-08-12
Merge branch 'master' into im_to_col
Cedric Nugteren
2017-08-12
Moved some utility functions to a test-specific utility compilation-unit
Cedric Nugteren
2017-07-16
First step towards supporting im2col in the test infrastructure
Cedric Nugteren
2017-07-12
Fixed batched tests when testing for invalid sizes against clBLAS
Cedric Nugteren
2017-07-12
Relaxed requirement on a_ld and b_ld for batched GEMM
Cedric Nugteren
2017-07-09
Changed printf-statements with %zu into std::cout to fix MSVC 2013 compatibility
Cedric Nugteren
2017-07-09
Disabled UNIX-style terminal color printing under Windows
Cedric Nugteren
2017-06-27
Moved and inlined some static member variables and disabled spurious clang warnings
Cedric Nugteren
2017-06-27
Undo of earlier move of TestBlas::kTransposes constant to fix MSVC 2013 compilation
Cedric Nugteren
2017-06-26
Fixed and suppresses several warnings for MSVC
Cedric Nugteren
2017-06-25
Moved static variable declarations from .cpp to .hpp to resolve some Clang warnings
Cedric Nugteren
2017-06-25
Fixed some Clang and MSVC warnings
Cedric Nugteren
2017-05-11
Bug-fix in the half-precision test of the amax routine
Cedric Nugteren
2017-04-23
Fixed a compiler warning message
Cedric Nugteren
2017-04-17
Fixed a namespace clash with CUDA FP16 for the half-datatype
Cedric Nugteren
2017-04-16
Merge branch 'development' into benchmarking
Cedric Nugteren
2017-04-16
Finalized support for performance testing against cuBLAS
Cedric Nugteren
2017-04-14
Added a new Xaxpy kernel in between the regular and fast version in
Cedric Nugteren
2017-04-13
Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now works
Cedric Nugteren
2017-04-11
Made compilation of the cuBLAS wrapper work properly
Cedric Nugteren
2017-04-10
Added reference implementations for performance-testing against cuBLAS
Cedric Nugteren
2017-04-06
Completed the cuBLAS wrapper
Cedric Nugteren
2017-04-06
Fixed some size_t to int conversion warnings for the CBLAS interface
Cedric Nugteren
2017-04-05
Added a first version of a cuBLAS wrapper (WIP)
Cedric Nugteren
2017-04-03
Fixes the CUDA wrapper (now actually tested on a system with CUDA)
Cedric Nugteren
2017-04-03
In-lined the float2 and double2 types to avoid collision with CUDA's definitions
Cedric Nugteren
2017-04-02
Layed the groundwork for cuBLAS comparisons in the clients
Cedric Nugteren
2017-04-02
Factored out inclusion of clBLAS and CBLAS from the test-routine files
Cedric Nugteren
2017-04-02
Factored out inclusion of clBLAS and CBLAS from the test-routine files
Cedric Nugteren
2017-04-01
Separated host-device and device-host memory copies from execution of the CBLAS reference code; for fair timing and code de-duplication
Cedric Nugteren
2017-03-20
Fixed a GCC/MSVC compilation issue
Cedric Nugteren
2017-03-19
Fixed a compilation issue for GCC/MSVC
Cedric Nugteren
2017-03-12
Fixed a linker issue for Clang
Cedric Nugteren
2017-03-10
Added API and test infrastructure for the batched GEMM routine
Cedric Nugteren
2017-03-10
Small fix for a file that isn't currently compiled anymore
Cedric Nugteren
2017-03-10
Added proper testing of the alpha parameter; finalized the batched AXPY implementation
Cedric Nugteren
2017-03-08
Make batched routines based on offsets instead of a vector of cl_mem objects - undoing many earlier changes
Cedric Nugteren
2017-03-05
Minor fixes to the client w.r.t. the addition of the batch count
Cedric Nugteren
2017-03-05
Added first naive version of the batched AXPY routine
Cedric Nugteren