path: root/include/internal
Age | Commit message | Author
2016-01-30 | Added first auto-generated database headers from the Python database; only K40 and Iris supported now | Cedric Nugteren
2015-10-23 | Added alpha and beta to tuner meta-data | CNugteren
2015-10-12 | Routine names are now all default arguments defined in the header | CNugteren
2015-09-26 | Added TRMV/TBMV/TPMV routines | CNugteren
2015-09-26 | Made buffer copying a const-method for the source | CNugteren
2015-09-19 | Added SBMV and SPMV routines | CNugteren
2015-09-19 | Added the HPMV routine | CNugteren
2015-09-19 | Added infrastructure for packed matrices | CNugteren
2015-09-19 | Added the HBMV routine | CNugteren
2015-09-18 | Improved the organization and performance of level 2 routines | CNugteren
2015-09-18 | Added first version of banded matrix-vector multiplication | CNugteren
2015-09-14 | Added xDOT/xDOTU/xDOTC dot-product routines | CNugteren
2015-09-14 | Added extra temporary buffer to tuners in preparation of Xdot routines | CNugteren
2015-09-14 | Added support for the dot buffer and offset argument | CNugteren
2015-08-22 | Added the XSWAP, XSCAL and XCOPY level-1 routines | CNugteren
2015-08-19 | Add check for supported precision to the tuners | CNugteren
2015-08-19 | Moved precision tester to utilities | CNugteren
2015-08-19 | Added precision to the JSON output | CNugteren
2015-08-13 | Added argument m,n,k metadata to JSON files | CNugteren
2015-08-09 | Refactored the tuners, added JSON output | CNugteren
2015-08-04 | Added distinguished names for GEMV inherited HEMV/SYMV | CNugteren
2015-07-31 | Added HEMV routine | CNugteren
2015-07-31 | Added SYMV routine | CNugteren
2015-07-27 | Now using the new Claduc C++11 OpenCL header | CNugteren
2015-07-22 | Set the correct name for AMD OpenCL devices | CNugteren
2015-07-22 | Updated GEMM tuning results for Tahiti | CNugteren
2015-07-22 | Added workgroup shuffle option to transpose kernel for AMD GPUs | CNugteren
2015-07-19 | Kernel caching is now based on a routine's name | CNugteren
2015-07-19 | The kernel source string is now a routine's member variable | CNugteren
2015-07-19 | Fixed complex performance on Intel Iris | CNugteren
2015-07-13 | Updated interface of the PadCopyTransposeMatrix method | CNugteren
2015-07-12 | Added subfolders for the level1/2/3 routines | CNugteren
2015-07-12 | Added the HEMM routine, tester, and client | CNugteren
2015-07-10 | Added the HER2K routine, tester, and client | CNugteren
2015-07-10 | Added the HERK routine, tester, and client | CNugteren
2015-07-08 | Added option to set the imaginary part of the diagonal to zero | CNugteren
2015-07-02 | Added the TRMM routine, tester, and client | CNugteren
2015-07-01 | Added the unit/non-unit diagonal enum | CNugteren
2015-06-28 | Added buffer structure and sizes to arguments | CNugteren
2015-06-26 | Added the SYR2K routine, tester, and client | CNugteren
2015-06-24 | Added the SYRK routine, tester, and client | CNugteren
2015-06-23 | Added a condition to update only lower/upper triangular parts in the un-pad kernels | CNugteren
2015-06-20 | Automatically skips tests with unsupported precision | CNugteren
2015-06-20 | Distinguish between a short smoke test and a full test | CNugteren
2015-06-20 | Added additional absolute error checking when testing | CNugteren
2015-06-19 | Added const-ref accessors to all CL++11 classes | CNugteren
2015-06-18 | Now returns program from database by reference | CNugteren
2015-06-16 | Added support for complex conjugate transpose | CNugteren
2015-06-15 | Added tuning for DGEMV on Iris and SGEMV on K40m | CNugteren
2015-06-14 | Split the three variations of the GEMV kernel for maximal tuning freedom | CNugteren