summaryrefslogtreecommitdiff
path: root/src/utilities
AgeCommit message (Collapse)Author
2017-07-16First step towards supporting im2col in the test infrastructureCedric Nugteren
2017-07-12Relaxed requirement on a_ld and b_ld for batched GEMMCedric Nugteren
2017-05-26Fixed a compilation issue under MSVC 2013Cedric Nugteren
2017-04-17Fixed a namespace clash with CUDA FP16 for the half-datatypeCedric Nugteren
2017-04-13Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now ↵Cedric Nugteren
works
2017-04-10Merge branch 'development' into cublas_referenceCedric Nugteren
Conflicts: scripts/generator/generator.py
2017-04-07Added a special override database for the Apple CPU implementation on OS X: ↵Cedric Nugteren
this makes the test work, it does not focus on good performance
2017-04-02Layed the groundwork for cuBLAS comparisons in the clientsCedric Nugteren
2017-04-01Separated host-device and device-host memory copies from execution of the ↵Cedric Nugteren
CBLAS reference code; for fair timing and code de-duplication
2017-03-10Added proper testing of the alpha parameter; finalized the batched AXPY ↵Cedric Nugteren
implementation
2017-03-10Fixed a small compilation bug for MSVC related to a floating-point constantCedric Nugteren
2017-03-08Make batched routines based on offsets instead of a vector of cl_mem objects ↵Cedric Nugteren
- undoing many earlier changes
2017-03-05Minor fixes to the client w.r.t. the addition of the batch countCedric Nugteren
2017-03-05Adjusted the test-infrastructure to support testing of batched-versions of ↵Cedric Nugteren
routines
2017-03-05Changed the way the test-data is generated: now using a single MT generator ↵Cedric Nugteren
and distribution for all data
2017-03-04Added a proper data-preparation function for the TRSM testsCedric Nugteren
2017-02-25Added PrepareData function for TRSM to create proper test inputCedric Nugteren
2017-02-19Added data-preparation function for the TRSV tests and special nan/inf ↵Cedric Nugteren
checks in the error checking to make the tests pass
2017-02-05Merge branch 'development' into triangular_solversCedric Nugteren
2017-01-20treewide: include clpp11.hpp first to silence deprecation warningsIvan Shapovalov
Otherwise, cl.h gets included through clblast.h before clpp11.hpp.
2017-01-20Set number of decimals for floating-point printing for error reportingCedric Nugteren
2017-01-18Added first version of the TRSM routine based on the diagonal invert kernelCedric Nugteren
2017-01-15Added a first version of the diagonal block invert routine in preparation of ↵Cedric Nugteren
TRSM
2016-11-27Made it possible to use the command-line environmental variables for each ↵Cedric Nugteren
executable and without re-running CMake
2016-11-24Merge pull request #125 from CNugteren/netlib_blas_apiCedric Nugteren
Netlib CBLAS API for CLBlast
2016-11-20Fixed a bug in the TRMM routine caused by overwriting input data before ↵Cedric Nugteren
consuming everything
2016-10-25Fixed some issues preventing the Netlib CBLAS API from linking correctlyCedric Nugteren
2016-10-23Removed PUBLIC_API from the C++ exception classesCedric Nugteren
2016-10-22Moved files around a bit; created a utilities subfolderCedric Nugteren