summaryrefslogtreecommitdiff
path: root/include
AgeCommit message (Expand)Author
2016-04-01Added a wrapper for CBLAS libraries for performance/correctness testingcnugteren
2016-03-30Merge branch 'level1_routines' into developmentcnugteren
2016-03-30Added prototypes for the xROTM and xROTMG routinesCedric Nugteren
2016-03-30Added prototypes for the xROT and xROTG functionsCedric Nugteren
2016-03-30Made event an optional argument in the CLBlast C++ APICedric Nugteren
2016-03-30Added missing newline to the end of the public API fileCedric Nugteren
2016-03-30Fixed properly passing of OpenCL events to CLBlast functionsCedric Nugteren
2016-03-28Added preliminary support for the xNRM2 routinesCedric Nugteren
2016-03-25Added prototypes for ScNRM2/DzNRM2 routinesCedric Nugteren
2016-03-25Added prototypes for SNRM2/DNRM2 routinesCedric Nugteren
2016-03-23Fixed the C-api export to be able to properly build a DLL on WindowsCedric Nugteren
2016-03-19Added __declspec(dllexport) to create a DLL on WindowsCedric Nugteren
2016-03-14Made the library thread-safe by guarding the kernel cache with a mutexCedric Nugteren
2016-03-12Added tuning results for the newest xGER family kernelsCedric Nugteren
2016-03-12Added tuning results for the ARM Mali-T628 GPUCedric Nugteren
2016-03-06Added preliminary support for xHPR2 and xSPR2 routinesCedric Nugteren
2016-03-02Added preliminary support for xHER2 and xSYR2 routinesCedric Nugteren
2016-02-28Added tuning results for Intel Iris Pro and AMD R9 M370XCedric Nugteren
2016-02-28Added support for xHER, xHPR, xSYR, and xSPR routinesCedric Nugteren
2016-02-28Fixed a compilation issue under AppleClangCedric Nugteren
2016-02-20Set a proper default precision for the CLBlast clientsCedric Nugteren
2016-02-20Added support for xGERU and xGERC routinesCedric Nugteren
2016-02-20Added XGER routine, kernel, and tunerCedric Nugteren
2016-02-07Added tuning parameters for various devices using the new database scriptCedric Nugteren
2016-02-07Added dictionary with short and long OpenCL vendor names to fix issues with I...Cedric Nugteren
2016-02-06Fixed a linker error in the performance client under GCCCNugteren
2016-01-30Updated to version 4.0 of the CLCudaAPI headerCedric Nugteren
2016-01-30Added first auto-generated database headers from the Python database; only K4...Cedric Nugteren
2015-10-23Added alpha and beta to tuner meta-dataCNugteren
2015-10-12Routine names are now all default arguments defined in the headerCNugteren
2015-09-26Added TRMV/TBMV/TPMV routinesCNugteren
2015-09-26Made buffer copying a const-method for the sourceCNugteren
2015-09-19Added SBMV and SPMV routinesCNugteren
2015-09-19Added the HPMV routineCNugteren
2015-09-19Added infrastructure for packed matricesCNugteren
2015-09-19Added the HBMV routineCNugteren
2015-09-18Improved the organization and performance of level 2 routinesCNugteren
2015-09-18Added first version of banded matrix-vector multiplicationCNugteren
2015-09-17Added interface of all level 2 routinesCNugteren
2015-09-17Added script to generate API interface and implementation automaticallyCNugteren
2015-09-14Added xDOT/xDOTU/xDOTC dot-product routinesCNugteren
2015-09-14Added extra temporary buffer to tuners in preparation of Xdot routinesCNugteren
2015-09-14Added support for the dot buffer and offset argumentCNugteren
2015-08-22Added the XSWAP, XSCAL and XCOPY level-1 routinesCNugteren
2015-08-20Merge pull request #23 from CNugteren/tuner_databaseCedric Nugteren
2015-08-19Add check for supported precision to the tunersCNugteren
2015-08-19Moved precision tester to utilitiesCNugteren
2015-08-19Added precision to the JSON outputCNugteren
2015-08-13Added all supported routines to the C APICNugteren
2015-08-13Added initial version of C API with just one routineCNugteren