Age | Commit message (Expand) | Author |
2018-01-31 | Created the API and stubs for the HAD (hadamard-product) routines | Cedric Nugteren |
2018-01-07 | Added API and tests for new GemmStridedBatched routine | Cedric Nugteren |
2017-10-01 | GEMM tests now test both the in-direct and the direct kernels seperately | Cedric Nugteren |
2017-07-16 | First step towards supporting im2col in the test infrastructure | Cedric Nugteren |
2017-04-17 | Fixed a namespace clash with CUDA FP16 for the half-datatype | Cedric Nugteren |
2017-04-03 | In-lined the float2 and double2 types to avoid collision with CUDA's definitions | Cedric Nugteren |
2017-03-10 | Added API and test infrastructure for the batched GEMM routine | Cedric Nugteren |
2017-03-05 | Prepared generator for batched routines; added batched AXPY routine interface | Cedric Nugteren |
2017-02-26 | Removed half-precision support from the TRSM routine; too unstable | Cedric Nugteren |
2017-01-15 | Added a first version of the diagonal block invert routine in preparation of ... | Cedric Nugteren |
2016-11-27 | Made it possible to use the command-line environmental variables for each exe... | Cedric Nugteren |
2016-06-19 | Renamed all C++ source files to .cpp to match the .hpp extension better | Cedric Nugteren |
2016-06-18 | Moved all headers into the source tree, changed headers to .hpp extension | Cedric Nugteren |
2016-06-16 | Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and... | Cedric Nugteren |
2016-05-25 | Added possibility to run the performance client with half-precision | Cedric Nugteren |
2016-04-20 | Added prototype for ixAMAX routines | cnugteren |
2016-04-13 | Added prototype for xASUM routines | cnugteren |
2016-03-30 | Merge branch 'level1_routines' into development | cnugteren |
2016-03-30 | Added prototypes for the xROTM and xROTMG routines | Cedric Nugteren |
2016-03-30 | Added prototypes for the xROT and xROTG functions | Cedric Nugteren |
2016-03-25 | Added prototypes for ScNRM2/DzNRM2 routines | Cedric Nugteren |
2016-03-25 | Added prototypes for SNRM2/DNRM2 routines | Cedric Nugteren |
2016-02-20 | Set a proper default precision for the CLBlast clients | Cedric Nugteren |
2015-09-18 | Added generated main functions for correctness/performance tests for level 2 ... | CNugteren |
2015-09-14 | Added xDOT/xDOTU/xDOTC dot-product routines | CNugteren |
2015-08-22 | Added the XSWAP, XSCAL and XCOPY level-1 routines | CNugteren |
2015-07-31 | Added HEMV routine | CNugteren |
2015-07-31 | Added SYMV routine | CNugteren |
2015-07-12 | Added subfolders for the level1/2/3 routines | CNugteren |
2015-07-12 | Added the HEMM routine, tester, and client | CNugteren |
2015-07-10 | Added the HER2K routine, tester, and client | CNugteren |
2015-07-10 | Added the HERK routine, tester, and client | CNugteren |
2015-07-10 | The clients now distinguish between the memory and alpha/beta data-type | CNugteren |
2015-07-02 | Added the TRMM routine, tester, and client | CNugteren |
2015-06-29 | Re-organized the performance-client infrastructure to avoid code duplication | CNugteren |
2015-06-26 | Added the SYR2K routine, tester, and client | CNugteren |
2015-06-26 | Added symmetric matrix support for the ABC performance tester | CNugteren |
2015-06-24 | Added the SYRK routine, tester, and client | CNugteren |
2015-06-23 | Updated bandwidth computation for GEMM and SYMM | CNugteren |
2015-06-21 | Fixed support for complex data-types for GEMM and SYMM clients | CNugteren |
2015-06-13 | Added initial version of GEMV including tester and performance client | CNugteren |
2015-05-30 | Initial commit of preview version | CNugteren |