Age | Commit message (Expand) | Author |
2015-07-27 | Now using the new Claduc C++11 OpenCL header | CNugteren |
2015-07-22 | Added workgroup shuffle option to transpose kernel for AMD GPUs | CNugteren |
2015-07-21 | Transpose kernel now uses vectorized local memory loads and stores | CNugteren |
2015-07-19 | Triangular GEMM kernels are only compiled when needed | CNugteren |
2015-07-19 | Kernel caching is now based on a routine's name | CNugteren |
2015-07-19 | The kernel source string is now a routine's member variable | CNugteren |
2015-07-16 | Fixed a bug when using the Xgemm kernel without local memory | CNugteren |
2015-07-16 | Using mad() instruction for AMD devices like clBLAS does | CNugteren |
2015-07-15 | Skips pre/post processing kernels if not needed | CNugteren |
2015-07-13 | Updated interface of the PadCopyTransposeMatrix method | CNugteren |
2015-07-12 | Added subfolders for the level1/2/3 routines | CNugteren |
2015-07-12 | Added the HEMM routine, tester, and client | CNugteren |
2015-07-10 | Disabled prototype of TRSM | CNugteren |
2015-07-10 | Added the HER2K routine, tester, and client | CNugteren |
2015-07-10 | Added the HERK routine, tester, and client | CNugteren |
2015-07-08 | Added option to set the imaginary part of the diagonal to zero | CNugteren |
2015-07-07 | Added option to set the imaginary part of the diagonal to zero | CNugteren |
2015-07-02 | Added the TRMM routine, tester, and client | CNugteren |
2015-07-02 | Added a set-to-one function for kernels | CNugteren |
2015-07-01 | Added the unit/non-unit diagonal enum | CNugteren |
2015-07-01 | Fixed typos in SYMM | CNugteren |
2015-06-30 | Added the TRMM and TRSM interface | CNugteren |
2015-06-26 | Added the SYR2K routine, tester, and client | CNugteren |
2015-06-25 | Clarified comment | CNugteren |
2015-06-24 | Added the SYRK routine, tester, and client | CNugteren |
2015-06-23 | Added a lower/upper triangular version of the GEMM kernel | CNugteren |
2015-06-23 | Added a condition to update only lower/upper triangular parts in the un-pad k... | CNugteren |
2015-06-21 | Added prototypes of SYRK and SYR2K | CNugteren |
2015-06-20 | Distinguish between a short smoke test and a full test | CNugteren |
2015-06-20 | Added additional absolute error checking when testing | CNugteren |
2015-06-18 | Now returns program from database by reference | CNugteren |
2015-06-16 | Added support for conjugate transpose in GEMV | CNugteren |
2015-06-16 | Updated the tuners to set the conjugate argument | CNugteren |
2015-06-16 | Added support for CGEMM/ZGEMM and CSYMM/ZSYMM | CNugteren |
2015-06-16 | Added support for complex conjugate transpose | CNugteren |
2015-06-15 | Fixed a bug in AXPBY defines for complex data-types | CNugteren |
2015-06-14 | Split the three variations of the GEMV kernel for maximal tuning freedom | CNugteren |
2015-06-14 | Fixed number of threads launched for GEMV | CNugteren |
2015-06-14 | Fixed number of threads launched for AXPY | CNugteren |
2015-06-13 | Added a fast GEMV kernel with vector loads, no tail, and fewer if-statements | CNugteren |
2015-06-13 | Refactored the GEMV kernel | CNugteren |
2015-06-13 | Improved GEMV kernel with local memory and a tunable WPT | CNugteren |
2015-06-13 | Added initial version of GEMV including tester and performance client | CNugteren |
2015-06-10 | Added initial naive version of Xgemv kernel | CNugteren |
2015-05-30 | Initial commit of preview version | CNugteren |