path: root/include
Age | Commit message | Author
2016-10-15 | Added documentation and minor refactoring for the recent support of static li... | Cedric Nugteren
2016-10-14 | Fixes for static lib compilation on Windows | Shehzan Mohammed
2016-10-10 | Added support for compiling the library, the client, and the samples under MS... | Cedric Nugteren
2016-10-05 | Made non-standard types void-pointers in the Netlib BLAS interface | Cedric Nugteren
2016-10-05 | Added first version of Netlib BLAS API header | Cedric Nugteren
2016-06-30 | Added declspec(dllexport) to ClearCache and FillCache, and added declspec(dll... | Cedric Nugteren
2016-06-18 | Moved all headers into the source tree, changed headers to .hpp extension | Cedric Nugteren
2016-06-18 | Clean-up of the routine class, moved RunKernel to the routine/common file | Cedric Nugteren
2016-06-18 | Removed the template from the Routine base-class | Cedric Nugteren
2016-06-17 | Removed the precision argument from the routines in favor of a single templat... | Cedric Nugteren
2016-06-17 | Removed the interface to the cache functions from the Routine class, calls th... | Cedric Nugteren
2016-06-17 | Moved the RunKernel and PadCopyTransposeMatrix functions out of the Routine c... | Cedric Nugteren
2016-06-17 | Moved the ErrorIn function from the Routine class to the utilities header | Cedric Nugteren
2016-06-17 | Moved the test-for-valid-buffers function from the Routine class to separate ... | Cedric Nugteren
2016-06-16 | Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and... | Cedric Nugteren
2016-06-15 | Added some constness to variables related to the GEMM routines | Cedric Nugteren
2016-06-14 | Moved device vendor and type checks to a common header | Cedric Nugteren
2016-06-08 | Added global memory synchronisation for better cache performance on ARM Mali ... | Cedric Nugteren
2016-06-01 | Added tuning parameters for 'GRID K520' and 'HD Graphics Skylake ULT GT2' | Cedric Nugteren
2016-05-26 | Added half-precision tests for the clBLAS reference through conversion to sin... | Cedric Nugteren
2016-05-25 | Added level-3 half-precision routines HGEMM/HSYMM/HSYRK/HSYR2K/HTRMM | Cedric Nugteren
2016-05-22 | Added level-2 half-precision routines HGER/HSYR/HSPR/HSYR2/HSPR2 | Cedric Nugteren
2016-05-22 | Fixed tuning results for half-precision; added first results for the xGER ker... | Cedric Nugteren
2016-05-22 | Prepared the GER kernels and tuner for half-precision support | Cedric Nugteren
2016-05-22 | Added level-2 half-precision routines HGEMV/HGBMV/HHEMV/HHBMV/HHPMV/HSYMV/HSB... | Cedric Nugteren
2016-05-22 | Added first tuning results for the half-precision xGEMV kernels | Cedric Nugteren
2016-05-22 | Prepared the GEMV kernels and tuner for half-precision support | Cedric Nugteren
2016-05-22 | Added level-1 half-precision routines HSWAP/HSCAL/HCOPY/HAXPY/HDOT/HNRM2/HASU... | Cedric Nugteren
2016-05-22 | Added first tuning results for the half-precision xDOT kernels | Cedric Nugteren
2016-05-18 | Merged in latest changes from 0.7.1 release | Cedric Nugteren
2016-05-16 | Added half precision tuning results for supporting kernels (pad, copy, transp... | Cedric Nugteren
2016-05-15 | Added header with conversions from and to half-precision floating-point | Cedric Nugteren
2016-05-14 | Set kernel arguments for AXPY as constant memory buffers, making it possible ... | Cedric Nugteren
2016-05-13 | Initial experimental version of the half-precision HAXPY routine | Cedric Nugteren
2016-05-12 | Initial changes in preparation for half-precision fp16 support | Cedric Nugteren
2016-05-02 | Added tuning results for AMD Hawaii (R9 290X) | Cedric Nugteren
2016-05-01 | Added tuning results for AMD Pitcairn (R9 270X) | Cedric Nugteren
2016-05-01 | Updated tuning database for reduction/dot kernels based on the new tuner; par... | Cedric Nugteren
2016-05-01 | Changed the index buffer of IxAMAX routines to unsigned int for proper buffer... | Cedric Nugteren
2016-05-01 | Added a program cache (per-context) next to the per-device binary cache | Cedric Nugteren
2016-04-30 | Added non-absolute minimum counter-part IxMIN of the BLAS routine IxAMAX | Cedric Nugteren
2016-04-29 | Added FillCache: a function to pre-compile all kernels for a specific device | Cedric Nugteren
2016-04-28 | Fixed the cache to store binaries instead of OpenCL programs | Cedric Nugteren
2016-04-27 | Added non-absolute counter-parts xSUM and IxMAX of the BLAS routines xASUM an... | Cedric Nugteren
2016-04-27 | Added prototypes for non-BLAS routines: xSUM and IxMAX (non-absolute counterp... | Cedric Nugteren
2016-04-27 | Moved all cache-related functions to a separate file; added a ClearCompiledPr... | Cedric Nugteren
2016-04-27 | Added a '-verbose' option to the test binaries to report errors in more detai... | Cedric Nugteren
2016-04-27 | All CLBlast enum constants now have the same raw values as in the cblas standard | Cedric Nugteren
2016-04-20 | Added support for the iSAMAX/iDAMAX/iCAMAX/iZAMAX routines | cnugteren
2016-04-20 | Added prototype for ixAMAX routines | cnugteren