summaryrefslogtreecommitdiff
path: root/src/tuning
AgeCommit message (Expand)Author
2015-09-18Added first version of banded matrix-vector multiplicationCNugteren
2015-09-14Added xDOT/xDOTU/xDOTC dot-product routinesCNugteren
2015-09-14Added extra temporary buffer to tuners in preparation of Xdot routinesCNugteren
2015-08-22Re-organized level1 xaxpy kernelCNugteren
2015-08-09Refactored the tuners, added JSON outputCNugteren
2015-07-22Added workgroup shuffle option to transpose kernel for AMD GPUsCNugteren
2015-07-19The kernel source string is now a routine's member variableCNugteren
2015-06-16Added support for conjugate transpose in GEMVCNugteren
2015-06-16Updated the tuners to set the conjugate argumentCNugteren
2015-06-14Split the three variations of the GEMV kernel for maximal tuning freedomCNugteren
2015-06-13Added a fast GEMV kernel with vector loads, no tail, and fewer if-statementsCNugteren
2015-06-13Improved GEMV kernel with local memory and a tunable WPTCNugteren
2015-06-13Added initial version of GEMV including tester and performance clientCNugteren
2015-06-10Added initial naive version of Xgemv kernelCNugteren
2015-05-30Initial commit of preview versionCNugteren