summaryrefslogtreecommitdiff
path: root/src/routines
AgeCommit message (Expand)Author
2016-03-06Fixed a bug in the GER-family of routines due to incorrect division of the wo...Cedric Nugteren
2016-03-06Added preliminary support for xHPR2 and xSPR2 routinesCedric Nugteren
2016-03-02Added preliminary support for xHER2 and xSYR2 routinesCedric Nugteren
2016-02-28Fixed a couple of correctness bugs in the Xher kernelsCedric Nugteren
2016-02-28Added support for xHER, xHPR, xSYR, and xSPR routinesCedric Nugteren
2016-02-20Added support for xGERU and xGERC routinesCedric Nugteren
2016-02-20Added XGER routine, kernel, and tunerCedric Nugteren
2016-02-08Separated the GEMM kernel in two parts to reduce string length for MSVCCedric Nugteren
2016-02-08Split-up the XGEMV kernel in two partsCedric Nugteren
2016-01-30Added first auto-generated database headers from the Python database; only K4...Cedric Nugteren
2015-10-12Routine names are now all default arguments defined in the headerCNugteren
2015-10-12Moved level3 kernel files to a subfolderCNugteren
2015-09-26Added TRMV/TBMV/TPMV routinesCNugteren
2015-09-19Added SBMV and SPMV routinesCNugteren
2015-09-19Added the HPMV routineCNugteren
2015-09-19Added the HBMV routineCNugteren
2015-09-18Improved the organization and performance of level 2 routinesCNugteren
2015-09-18Added first version of banded matrix-vector multiplicationCNugteren
2015-09-14Added xDOT/xDOTU/xDOTC dot-product routinesCNugteren
2015-08-22Added the XSWAP, XSCAL and XCOPY level-1 routinesCNugteren
2015-08-22Re-organized level1 xaxpy kernelCNugteren
2015-08-04Added distinguished names for GEMV inherited HEMV/SYMVCNugteren
2015-07-31Added HEMV routineCNugteren
2015-07-31Added SYMV routineCNugteren
2015-07-27Now using the new Claduc C++11 OpenCL headerCNugteren
2015-07-19Kernel caching is now based on a routine's nameCNugteren
2015-07-19The kernel source string is now a routine's member variableCNugteren
2015-07-15Skips pre/post processing kernels if not neededCNugteren
2015-07-13Updated interface of the PadCopyTransposeMatrix methodCNugteren
2015-07-12Added subfolders for the level1/2/3 routinesCNugteren
2015-07-12Added the HEMM routine, tester, and clientCNugteren
2015-07-10Added the HER2K routine, tester, and clientCNugteren
2015-07-10Added the HERK routine, tester, and clientCNugteren
2015-07-08Added option to set the imaginary part of the diagonal to zeroCNugteren
2015-07-02Added the TRMM routine, tester, and clientCNugteren
2015-07-01Fixed typos in SYMMCNugteren
2015-06-26Added the SYR2K routine, tester, and clientCNugteren
2015-06-25Clarified commentCNugteren
2015-06-24Added the SYRK routine, tester, and clientCNugteren
2015-06-23Added a condition to update only lower/upper triangular parts in the un-pad k...CNugteren
2015-06-18Now returns program from database by referenceCNugteren
2015-06-16Added support for conjugate transpose in GEMVCNugteren
2015-06-16Added support for complex conjugate transposeCNugteren
2015-06-14Split the three variations of the GEMV kernel for maximal tuning freedomCNugteren
2015-06-14Fixed number of threads launched for GEMVCNugteren
2015-06-14Fixed number of threads launched for AXPYCNugteren
2015-06-13Added a fast GEMV kernel with vector loads, no tail, and fewer if-statementsCNugteren
2015-06-13Added initial version of GEMV including tester and performance clientCNugteren
2015-05-30Initial commit of preview versionCNugteren