Age | Commit message | Author |
2016-11-22 | Minor changes to ensure full compatibility with the Netlib CBLAS API | Cedric Nugteren |
2016-11-20 | Made functions with scalar-buffers as output properly return values | Cedric Nugteren |
2016-10-25 | Renamed the include and source files of the Netlib CBLAS API | Cedric Nugteren |
2016-10-25 | Fixed some issues preventing the Netlib CBLAS API from linking correctly | Cedric Nugteren |
2016-10-25 | Made the Netlib CBLAS API use the same enums with prefixes as the regular C A... | Cedric Nugteren |
2016-10-25 | Added initial version of a Netlib CBLAS implementation. TODO: Set correct buf... | Cedric Nugteren |
2016-10-25 | Merge branch 'development' into netlib_blas_api | Cedric Nugteren |
2016-10-22 | All enums in the C API are now prefixed with CLBlast to avoid potential name ... | Cedric Nugteren |
2016-10-22 | Added extra error codes to reflect the more detailed error reporting of OpenC... | Cedric Nugteren |
2016-10-22 | treewide: use C++ exceptions properly | Ivan Shapovalov |
2016-10-16 | Merge branch 'development' into netlib_blas_api | Cedric Nugteren |
2016-10-15 | Added documentation and minor refactoring for the recent support of static li... | Cedric Nugteren |
2016-10-14 | Fixes for static lib compilation on Windows | Shehzan Mohammed |
2016-10-10 | Added support for compiling the library, the client, and the samples under MS... | Cedric Nugteren |
2016-10-05 | Made non-standard types void-pointers in the Netlib BLAS interface | Cedric Nugteren |
2016-10-05 | Added first version of Netlib BLAS API header | Cedric Nugteren |
2016-06-30 | Added declspec(dllexport) to ClearCache and FillCache, and added declspec(dll... | Cedric Nugteren |
2016-06-18 | Moved all headers into the source tree, changed headers to .hpp extension | Cedric Nugteren |
2016-06-18 | Clean-up of the routine class, moved RunKernel to the routine/common file | Cedric Nugteren |
2016-06-18 | Removed the template from the Routine base-class | Cedric Nugteren |
2016-06-17 | Removed the precision argument from the routines in favor of a single templat... | Cedric Nugteren |
2016-06-17 | Removed the interface to the cache functions from the Routine class, calls th... | Cedric Nugteren |
2016-06-17 | Moved the RunKernel and PadCopyTransposeMatrix functions out of the Routine c... | Cedric Nugteren |
2016-06-17 | Moved the ErrorIn function from the Routine class to the utilities header | Cedric Nugteren |
2016-06-17 | Moved the test-for-valid-buffers function from the Routine class to separate ... | Cedric Nugteren |
2016-06-16 | Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and... | Cedric Nugteren |
2016-06-15 | Added some constness to variables related to the GEMM routines | Cedric Nugteren |
2016-06-14 | Moved device vendor and type checks to a common header | Cedric Nugteren |
2016-06-08 | Added global memory synchronisation for better cache performance on ARM Mali ... | Cedric Nugteren |
2016-06-01 | Added tuning parameters for 'GRID K520' and 'HD Graphics Skylake ULT GT2' | Cedric Nugteren |
2016-05-26 | Added half-precision tests for the clBLAS reference through conversion to sin... | Cedric Nugteren |
2016-05-25 | Added level-3 half-precision routines HGEMM/HSYMM/HSYRK/HSYR2K/HTRMM | Cedric Nugteren |
2016-05-22 | Added level-2 half-precision routines HGER/HSYR/HSPR/HSYR2/HSPR2 | Cedric Nugteren |
2016-05-22 | Fixed tuning results for half-precision; added first results for the xGER ker... | Cedric Nugteren |
2016-05-22 | Prepared the GER kernels and tuner for half-precision support | Cedric Nugteren |
2016-05-22 | Added level-2 half-precision routines HGEMV/HGBMV/HHEMV/HHBMV/HHPMV/HSYMV/HSB... | Cedric Nugteren |
2016-05-22 | Added first tuning results for the half-precision xGEMV kernels | Cedric Nugteren |
2016-05-22 | Prepared the GEMV kernels and tuner for half-precision support | Cedric Nugteren |
2016-05-22 | Added level-1 half-precision routines HSWAP/HSCAL/HCOPY/HAXPY/HDOT/HNRM2/HASU... | Cedric Nugteren |
2016-05-22 | Added first tuning results for the half-precision xDOT kernels | Cedric Nugteren |
2016-05-18 | Merged in latest changes from 0.7.1 release | Cedric Nugteren |
2016-05-16 | Added half precision tuning results for supporting kernels (pad, copy, transp... | Cedric Nugteren |
2016-05-15 | Added header with conversions from and to half-precision floating-point | Cedric Nugteren |
2016-05-14 | Set kernel arguments for AXPY as constant memory buffers, making it possible ... | Cedric Nugteren |
2016-05-13 | Initial experimental version of the half-precision HAXPY routine | Cedric Nugteren |
2016-05-12 | Initial changes in preparation for half-precision fp16 support | Cedric Nugteren |
2016-05-02 | Added tuning results for AMD Hawaii (R9 290X) | Cedric Nugteren |
2016-05-01 | Added tuning results for AMD Pitcairn (R9 270X) | Cedric Nugteren |
2016-05-01 | Updated tuning database for reduction/dot kernels based on the new tuner; par... | Cedric Nugteren |
2016-05-01 | Changed the index buffer of IxAMAX routines to unsigned int for proper buffer... | Cedric Nugteren |