path: root/scripts
Age | Commit message | Author
2017-04-02 | Various tweaks to the new benchmark script | Cedric Nugteren
2017-04-01 | Tuned the plots for a tight-layout for in papers and presentations | Cedric Nugteren
2017-03-26 | Replaced the R graph scripts with Python/Matplotlib benchmark scripts | Cedric Nugteren
2017-03-10 | Added API and test infrastructure for the batched GEMM routine | Cedric Nugteren
2017-03-08 | Make batched routines based on offsets instead of a vector of cl_mem objects ... | Cedric Nugteren
2017-03-05 | Added first naive version of the batched AXPY routine | Cedric Nugteren
2017-03-05 | Prepared generator for batched routines; added batched AXPY routine interface | Cedric Nugteren
2017-02-26 | Minor fix to the generator script | Cedric Nugteren
2017-02-26 | Merge branch 'development' into triangular_solvers | Cedric Nugteren
2017-02-26 | Removed half-precision support from the TRSM routine; too unstable | Cedric Nugteren
2017-02-18 | Added documentation for the OverrideParameters function | Cedric Nugteren
2017-02-18 | Added missing documentation for the fill and clear cache functions | Cedric Nugteren
2017-02-16 | Added a C interface to the OverrideParameters function; added some in-line co... | Cedric Nugteren
2017-02-16 | Added input-sanity checks for the OverrideParameters function | Cedric Nugteren
2017-02-13 | Added first version of the OverrideParameters function | Cedric Nugteren
2017-02-05 | Merge branch 'development' into triangular_solvers | Cedric Nugteren
2017-01-24 | FillCache: perform compilation for each precision separately | Ivan Shapovalov
2017-01-20 | Added prototype for the TRSV routine | Cedric Nugteren
2017-01-03 | Added tuning results for the AMD Turks GPU and the Intel Core i7-2670QM CPU | Cedric Nugteren
2016-12-18 | Prepared for the addition of the TRSM triangular solver kernel | Cedric Nugteren
2016-11-27 | Made it possible to use the command-line environmental variables for each exe... | Cedric Nugteren
2016-11-26 | Improved the default parameters for cases with non-common parameters across a... | Cedric Nugteren
2016-11-24 | Merge pull request #125 from CNugteren/netlib_blas_api | Cedric Nugteren
2016-11-23 | Fixed a vector-size related bug in the CLBlast Netlib API | Cedric Nugteren
2016-11-22 | Minor changes to ensure full compatibility with the Netlib CBLAS API | Cedric Nugteren
2016-11-20 | Made functions with scalar-buffers as output properly return values | Cedric Nugteren
2016-11-19 | Generating FP16 performance graphs now uses FP32 as a reference for comparison | Cedric Nugteren
2016-10-25 | Renamed the include and source files of the Netlib CBLAS API | Cedric Nugteren
2016-10-25 | Removed the clblast namespace from the Netlib C API source file to ensure pro... | Cedric Nugteren
2016-10-25 | Fixed some issues preventing the Netlib CBLAS API from linking correctly | Cedric Nugteren
2016-10-25 | Made the Netlib CBLAS API use the same enums with prefixes as the regular C A... | Cedric Nugteren
2016-10-25 | Sets the proper sizes for the buffers for the Netlib CBLAS API | Cedric Nugteren
2016-10-25 | Added initial version of a Netlib CBLAS implementation. TODO: Set correct buf... | Cedric Nugteren
2016-10-25 | Merge branch 'development' into netlib_blas_api | Cedric Nugteren
2016-10-22 | All enums in the C API are now prefixed with CLBlast to avoid potential name ... | Cedric Nugteren
2016-10-22 | Added extra error codes to reflect the more detailed error reporting of OpenC... | Cedric Nugteren
2016-10-22 | Routine: get rid of ::SetUp() | Ivan Shapovalov
2016-10-22 | treewide: use C++ exceptions properly | Ivan Shapovalov
2016-10-16 | Merge branch 'development' into netlib_blas_api | Cedric Nugteren
2016-10-14 | Fixed an issue with a growing database: the database is now a global variable... | Cedric Nugteren
2016-10-10 | Changed the storage location of the database to a separate Github repository | Cedric Nugteren
2016-10-10 | Added fresh performance graphs for GeForce 750Ti; removed old GTX480 results | Cedric Nugteren
2016-10-08 | Added benchmark script for small matrix sizes, testing the direct GEMM kernels | Cedric Nugteren
2016-10-05 | Made non-standard types void-pointers in the Netlib BLAS interface | Cedric Nugteren
2016-10-05 | Added first version of Netlib BLAS API header | Cedric Nugteren
2016-09-12 | Added XgemvFastRot and Xgemm 16-bit tuning results: just defaults which are n... | Cedric Nugteren
2016-09-11 | Complete re-write of the database script. Changed Pandas for the much faster ... | Cedric Nugteren
2016-09-10 | Updated database based on exhaustive tuning results for GEMM for the R9 M370X... | Cedric Nugteren
2016-09-10 | Updated the database script to remove duplicate entries: keeps only the best-... | Cedric Nugteren
2016-09-04 | Refactored the Python C++ generator script; now confirms to the PEP8 styleguide | Cedric Nugteren