path: root/test
Age  Commit message  Author
2017-08-19  Implemented proper im2col reference function and completd tests  Cedric Nugteren
2017-08-12  Merge branch 'master' into im_to_col  Cedric Nugteren
2017-08-12  Moved some utility functions to a test-specific utility compilation-unit  Cedric Nugteren
2017-07-16  First step towards supporting im2col in the test infrastructure  Cedric Nugteren
2017-07-12  Fixed batched tests when testing for invalid sizes against clBLAS  Cedric Nugteren
2017-07-12  Relaxed requirement on a_ld and b_ld for batched GEMM  Cedric Nugteren
2017-07-09  Changed printf-statements with %zu into std::cout to fix MSVC 2013 compatibility  Cedric Nugteren
2017-07-09  Disabled UNIX-style terminal color printing under Windows  Cedric Nugteren
2017-06-27  Moved and inlined some static member variables and disabled spurious clang wa...  Cedric Nugteren
2017-06-27  Undo of earlier move of TestBlas::kTransposes constant to fix MSVC 2013 compi...  Cedric Nugteren
2017-06-26  Fixed and suppresses several warnings for MSVC  Cedric Nugteren
2017-06-25  Moved static variable declarations from .cpp to .hpp to resolve some Clang wa...  Cedric Nugteren
2017-06-25  Fixed some Clang and MSVC warnings  Cedric Nugteren
2017-05-11  Bug-fix in the half-precision test of the amax routine  Cedric Nugteren
2017-04-23  Fixed a compiler warning message  Cedric Nugteren
2017-04-17  Fixed a namespace clash with CUDA FP16 for the half-datatype  Cedric Nugteren
2017-04-16  Merge branch 'development' into benchmarking  Cedric Nugteren
2017-04-16  Finalized support for performance testing against cuBLAS  Cedric Nugteren
2017-04-14  Added a new Xaxpy kernel in between the regular and fast version in  Cedric Nugteren
2017-04-13  Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now w...  Cedric Nugteren
2017-04-11  Made compilation of the cuBLAS wrapper work properly  Cedric Nugteren
2017-04-10  Added reference implementations for performance-testing against cuBLAS  Cedric Nugteren
2017-04-06  Completed the cuBLAS wrapper  Cedric Nugteren
2017-04-06  Fixed some size_t to int conversion warnings for the CBLAS interface  Cedric Nugteren
2017-04-05  Added a first version of a cuBLAS wrapper (WIP)  Cedric Nugteren
2017-04-03  Fixes the CUDA wrapper (now actually tested on a system with CUDA)  Cedric Nugteren
2017-04-03  In-lined the float2 and double2 types to avoid collision with CUDA's definitions  Cedric Nugteren
2017-04-02  Layed the groundwork for cuBLAS comparisons in the clients  Cedric Nugteren
2017-04-02  Factored out inclusion of clBLAS and CBLAS from the test-routine files  Cedric Nugteren
2017-04-02  Factored out inclusion of clBLAS and CBLAS from the test-routine files  Cedric Nugteren
2017-04-01  Separated host-device and device-host memory copies from execution of the CBL...  Cedric Nugteren
2017-03-20  Fixed a GCC/MSVC compilation issue  Cedric Nugteren
2017-03-19  Fixed a compilation issue for GCC/MSVC  Cedric Nugteren
2017-03-12  Fixed a linker issue for Clang  Cedric Nugteren
2017-03-10  Added API and test infrastructure for the batched GEMM routine  Cedric Nugteren
2017-03-10  Small fix for a file that isn't currently compiled anymore  Cedric Nugteren
2017-03-10  Added proper testing of the alpha parameter; finalized the batched AXPY imple...  Cedric Nugteren
2017-03-08  Make batched routines based on offsets instead of a vector of cl_mem objects ...  Cedric Nugteren
2017-03-05  Minor fixes to the client w.r.t. the addition of the batch count  Cedric Nugteren
2017-03-05  Added first naive version of the batched AXPY routine  Cedric Nugteren
2017-03-05  Adjusted the test-infrastructure to support testing of batched-versions of ro...  Cedric Nugteren
2017-03-05  Changed the way the test-data is generated: now using a single MT generator a...  Cedric Nugteren
2017-03-05  Prepared generator for batched routines; added batched AXPY routine interface  Cedric Nugteren
2017-03-04  Fixed a missing include for the tests  Cedric Nugteren
2017-03-04  Added a proper data-preparation function for the TRSM tests  Cedric Nugteren
2017-03-01  Made a double to float cast explicit for MSVC compatibility (C2397)  Cedric Nugteren
2017-02-27  Added L2 error computation and checking for half-precision tests  Cedric Nugteren
2017-02-26  Merge branch 'development' into triangular_solvers  Cedric Nugteren
2017-02-26  Added a guard against invalid buffer sizes in the prepare-data functions for ...  Cedric Nugteren
2017-02-26  Removed half-precision support from the TRSM routine; too unstable  Cedric Nugteren