summaryrefslogtreecommitdiff
path: root/src
AgeCommit message (Expand)Author
2015-08-13Fixed a complex data-type bug in the transpose kernelCNugteren
2015-08-04Added distinguished names for GEMV inherited HEMV/SYMVCNugteren
2015-08-03Abstracted loading of matrix A for GEMV kernelCNugteren
2015-07-31Added HEMV routineCNugteren
2015-07-31Added SYMV routineCNugteren
2015-07-27Now using the new Claduc C++11 OpenCL headerCNugteren
2015-07-22Added workgroup shuffle option to transpose kernel for AMD GPUsCNugteren
2015-07-21Transpose kernel now uses vectorized local memory loads and storesCNugteren
2015-07-19Triangular GEMM kernels are only compiled when neededCNugteren
2015-07-19Kernel caching is now based on a routine's nameCNugteren
2015-07-19The kernel source string is now a routine's member variableCNugteren
2015-07-16Fixed a bug when using the Xgemm kernel without local memoryCNugteren
2015-07-16Using mad() instruction for AMD devices like clBLAS doesCNugteren
2015-07-15Skips pre/post processing kernels if not neededCNugteren
2015-07-13Updated interface of the PadCopyTransposeMatrix methodCNugteren
2015-07-12Added subfolders for the level1/2/3 routinesCNugteren
2015-07-12Added the HEMM routine, tester, and clientCNugteren
2015-07-10Disabled prototype of TRSMCNugteren
2015-07-10Added the HER2K routine, tester, and clientCNugteren
2015-07-10Added the HERK routine, tester, and clientCNugteren
2015-07-08Added option to set the imaginary part of the diagonal to zeroCNugteren
2015-07-07Added option to set the imaginary part of the diagonal to zeroCNugteren
2015-07-02Added the TRMM routine, tester, and clientCNugteren
2015-07-02Added a set-to-one function for kernelsCNugteren
2015-07-01Added the unit/non-unit diagonal enumCNugteren
2015-07-01Fixed typos in SYMMCNugteren
2015-06-30Added the TRMM and TRSM interfaceCNugteren
2015-06-26Added the SYR2K routine, tester, and clientCNugteren
2015-06-25Clarified commentCNugteren
2015-06-24Added the SYRK routine, tester, and clientCNugteren
2015-06-23Added a lower/upper triangular version of the GEMM kernelCNugteren
2015-06-23Added a condition to update only lower/upper triangular parts in the un-pad k...CNugteren
2015-06-21Added prototypes of SYRK and SYR2KCNugteren
2015-06-20Distinguish between a short smoke test and a full testCNugteren
2015-06-20Added additional absolute error checking when testingCNugteren
2015-06-18Now returns program from database by referenceCNugteren
2015-06-16Added support for conjugate transpose in GEMVCNugteren
2015-06-16Updated the tuners to set the conjugate argumentCNugteren
2015-06-16Added support for CGEMM/ZGEMM and CSYMM/ZSYMMCNugteren
2015-06-16Added support for complex conjugate transposeCNugteren
2015-06-15Fixed a bug in AXPBY defines for complex data-typesCNugteren
2015-06-14Split the three variations of the GEMV kernel for maximal tuning freedomCNugteren
2015-06-14Fixed number of threads launched for GEMVCNugteren
2015-06-14Fixed number of threads launched for AXPYCNugteren
2015-06-13Added a fast GEMV kernel with vector loads, no tail, and fewer if-statementsCNugteren
2015-06-13Refactored the GEMV kernelCNugteren
2015-06-13Improved GEMV kernel with local memory and a tunable WPTCNugteren
2015-06-13Added initial version of GEMV including tester and performance clientCNugteren
2015-06-10Added initial naive version of Xgemv kernelCNugteren
2015-05-30Initial commit of preview versionCNugteren