summaryrefslogtreecommitdiff
path: root/test/performance/routines
AgeCommit message (Expand)Author
2018-10-23Added groundwork for col2im algorithm plus first non-working version of kerne...Cedric Nugteren
2018-07-29Removed complex numbers support for CONVGEMMCedric Nugteren
2018-05-06Added convgemm skeleton, test infrastructure, and first reference implementationCedric Nugteren
2018-01-31Created the API and stubs for the HAD (hadamard-product) routinesCedric Nugteren
2018-01-07Added API and tests for new GemmStridedBatched routineCedric Nugteren
2017-10-01GEMM tests now test both the in-direct and the direct kernels seperatelyCedric Nugteren
2017-07-16First step towards supporting im2col in the test infrastructureCedric Nugteren
2017-04-17Fixed a namespace clash with CUDA FP16 for the half-datatypeCedric Nugteren
2017-04-03In-lined the float2 and double2 types to avoid collision with CUDA's definitionsCedric Nugteren
2017-03-10Added API and test infrastructure for the batched GEMM routineCedric Nugteren
2017-03-05Prepared generator for batched routines; added batched AXPY routine interfaceCedric Nugteren
2017-02-26Removed half-precision support from the TRSM routine; too unstableCedric Nugteren
2017-01-15Added a first version of the diagonal block invert routine in preparation of ...Cedric Nugteren
2016-11-27Made it possible to use the command-line environmental variables for each exe...Cedric Nugteren
2016-06-19Renamed all C++ source files to .cpp to match the .hpp extension betterCedric Nugteren
2016-06-18Moved all headers into the source tree, changed headers to .hpp extensionCedric Nugteren
2016-06-16Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and...Cedric Nugteren
2016-05-25Added possibility to run the performance client with half-precisionCedric Nugteren
2016-04-20Added prototype for ixAMAX routinescnugteren
2016-04-13Added prototype for xASUM routinescnugteren
2016-03-30Merge branch 'level1_routines' into developmentcnugteren
2016-03-30Added prototypes for the xROTM and xROTMG routinesCedric Nugteren
2016-03-30Added prototypes for the xROT and xROTG functionsCedric Nugteren
2016-03-25Added prototypes for ScNRM2/DzNRM2 routinesCedric Nugteren
2016-03-25Added prototypes for SNRM2/DNRM2 routinesCedric Nugteren
2016-02-20Set a proper default precision for the CLBlast clientsCedric Nugteren
2015-09-18Added generated main functions for correctness/performance tests for level 2 ...CNugteren
2015-09-14Added xDOT/xDOTU/xDOTC dot-product routinesCNugteren
2015-08-22Added the XSWAP, XSCAL and XCOPY level-1 routinesCNugteren
2015-07-31Added HEMV routineCNugteren
2015-07-31Added SYMV routineCNugteren
2015-07-12Added subfolders for the level1/2/3 routinesCNugteren
2015-07-12Added the HEMM routine, tester, and clientCNugteren
2015-07-10Added the HER2K routine, tester, and clientCNugteren
2015-07-10Added the HERK routine, tester, and clientCNugteren
2015-07-10The clients now distinguish between the memory and alpha/beta data-typeCNugteren
2015-07-02Added the TRMM routine, tester, and clientCNugteren
2015-06-29Re-organized the performance-client infrastructure to avoid code duplicationCNugteren
2015-06-26Added the SYR2K routine, tester, and clientCNugteren
2015-06-26Added symmetric matrix support for the ABC performance testerCNugteren
2015-06-24Added the SYRK routine, tester, and clientCNugteren
2015-06-23Updated bandwidth computation for GEMM and SYMMCNugteren
2015-06-21Fixed support for complex data-types for GEMM and SYMM clientsCNugteren
2015-06-13Added initial version of GEMV including tester and performance clientCNugteren
2015-05-30Initial commit of preview versionCNugteren