path: root/scripts
Age | Commit message | Author
2016-04-27 | Added prototypes for non-BLAS routines: xSUM and IxMAX (non-absolute counterparts of xASUM and IxAMAX) | Cedric Nugteren
2016-04-27 | Moved all cache-related functions to a separate file; added a ClearCompiledProgramCache function to clear the cache | Cedric Nugteren
2016-04-27 | All CLBlast enum constants now have the same raw values as in the cblas standard | Cedric Nugteren
2016-04-20 | Added support for the iSAMAX/iDAMAX/iCAMAX/iZAMAX routines | cnugteren
2016-04-20 | Added prototype for ixAMAX routines | cnugteren
2016-04-14 | Added support for the SASUM/DASUM/ScASUM/DzASUM routines | cnugteren
2016-04-13 | Added prototype for xASUM routines | cnugteren
2016-04-11 | Fixed the way the defaults are calculated in the database; added warning for non-matching tuner arguments | cnugteren
2016-04-09 | Events are now properly implemented using event waiting list and asking the user to wait for event completion | cnugteren
2016-04-02 | Added support for testing (performance and correctness) against a CPU BLAS library | cnugteren
2016-04-01 | Added a wrapper for CBLAS libraries for performance/correctness testing | cnugteren
2016-03-30 | Merge branch 'level1_routines' into development | cnugteren
2016-03-30 | Added prototypes for the xROTM and xROTMG routines | Cedric Nugteren
2016-03-30 | Added prototypes for the xROT and xROTG functions | Cedric Nugteren
2016-03-30 | Made event an optional argument in the CLBlast C++ API | Cedric Nugteren
2016-03-28 | Added preliminary support for the xNRM2 routines | Cedric Nugteren
2016-03-25 | Added prototypes for ScNRM2/DzNRM2 routines | Cedric Nugteren
2016-03-25 | Added prototypes for SNRM2/DNRM2 routines | Cedric Nugteren
2016-03-23 | Fixed the C-api export to be able to properly build a DLL on Windows | Cedric Nugteren
2016-03-19 | Added __declspec(dllexport) to create a DLL on Windows | Cedric Nugteren
2016-03-06 | Added preliminary support for xHPR2 and xSPR2 routines | Cedric Nugteren
2016-03-02 | Added preliminary support for xHER2 and xSYR2 routines | Cedric Nugteren
2016-02-28 | Added tuning results for Intel Iris Pro and AMD R9 M370X | Cedric Nugteren
2016-02-28 | Added support for xHER, xHPR, xSYR, and xSPR routines | Cedric Nugteren
2016-02-20 | Set a proper default precision for the CLBlast clients | Cedric Nugteren
2016-02-20 | Added support for xGERU and xGERC routines | Cedric Nugteren
2016-02-20 | Added XGER routine, kernel, and tuner | Cedric Nugteren
2016-02-07 | Various fixes to the database script | Cedric Nugteren
2016-02-07 | Added dictionary with short and long OpenCL vendor names to fix issues with Intel having multiple names | Cedric Nugteren
2016-02-07 | Made the tuning database an optional external download | Cedric Nugteren
2016-02-06 | Made the database script compatible with Python 3 | CNugteren
2016-01-30 | Added first auto-generated database headers from the Python database; only K40 and Iris supported now | Cedric Nugteren
2016-01-24 | Minor improvements to the database script, including proper file paths | Cedric Nugteren
2016-01-24 | Added Python function to compute defaults for a particular device/vendor combination | Cedric Nugteren
2015-10-28 | Added tuning data for Tesla K40 | CNugteren
2015-10-25 | Added initial tuning database with Intel Iris data | CNugteren
2015-10-25 | Updated tuning database script according to the new JSON format | CNugteren
2015-10-25 | Moved the tuner database script to a separate folder | CNugteren
2015-09-26 | Added TRMV/TBMV/TPMV routines | CNugteren
2015-09-19 | Added SBMV and SPMV routines | CNugteren
2015-09-19 | Added the HPMV routine | CNugteren
2015-09-19 | Added the HBMV routine | CNugteren
2015-09-18 | Added first version of banded matrix-vector multiplication | CNugteren
2015-09-18 | Added generated main functions for correctness/performance tests for level 2 routines | CNugteren
2015-09-17 | Added interface of all level 2 routines | CNugteren
2015-09-17 | Added script to generate API interface and implementation automatically | CNugteren