Age  Commit message  Author
2016-06-18  Moved all headers into the source tree, changed headers to .hpp extension  Cedric Nugteren
2016-06-18  Clean-up of the routine class, moved RunKernel to the routine/common file  Cedric Nugteren
2016-06-18  Removed the template from the Routine base-class  Cedric Nugteren
2016-06-17  Removed the precision argument from the routines in favor of a single templat...  Cedric Nugteren
2016-06-17  Removed the interface to the cache functions from the Routine class, calls th...  Cedric Nugteren
2016-06-17  Moved the RunKernel and PadCopyTransposeMatrix functions out of the Routine c...  Cedric Nugteren
2016-06-17  Moved the ErrorIn function from the Routine class to the utilities header  Cedric Nugteren
2016-06-17  Moved the test-for-valid-buffers function from the Routine class to separate ...  Cedric Nugteren
2016-06-16  Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and...  Cedric Nugteren
2016-06-15  Added some constness to variables related to the GEMM routines  Cedric Nugteren
2016-06-14  Re-organised the level-3 supporting kernels (copy, pad, transpose, convert) a...  Cedric Nugteren
2016-06-14  Moved device vendor and type checks to a common header  Cedric Nugteren
2016-06-14  Added support for FP16 on ARM Mali-T628 (officially not supported)  Cedric Nugteren
2016-06-13  Improved API documentation and added documentation for level-2 and level-3 ro...  Cedric Nugteren
2016-06-10  Added documentation for the matrix-update level-2 family of routines  Cedric Nugteren
2016-06-08  Added global memory synchronisation for better cache performance on ARM Mali ...  Cedric Nugteren
2016-06-08  Made the CPU BLAS library the default reference to test against in favor of c...  Cedric Nugteren
2016-06-06  Fixed the RPATH settings for linking on OS X  Cedric Nugteren
2016-06-06  Made the tests for invalid buffer sizes also verbose in verbose mode  Cedric Nugteren
2016-06-02  Added return value to the test binaries (0: success, 1: failure), allowing it...  Cedric Nugteren
2016-06-01  Added tuning parameters for 'GRID K520' and 'HD Graphics Skylake ULT GT2'  Cedric Nugteren
2016-05-31  Made use of CMake's built-in unit testing, allowing all tests to be run using...  Cedric Nugteren
2016-05-30  Increased the verbosity of the -verbose option in the correctness tests  Cedric Nugteren
2016-05-30  Separated the performance tests (clients) from the correctness tests in CMake  Cedric Nugteren
2016-05-30  Merge branch 'half_precision' into development  Cedric Nugteren
2016-05-26  Added half-precision tests for the clBLAS reference through conversion to sin...  Cedric Nugteren
2016-05-26  Added half-precision tests for the CBLAS reference through conversion to sing...  Cedric Nugteren
2016-05-25  Added possibility to run the performance client with half-precision  Cedric Nugteren
2016-05-25  Added level-3 half-precision routines HGEMM/HSYMM/HSYRK/HSYR2K/HTRMM  Cedric Nugteren
2016-05-24  Added proper argument handling and displaying for half-precision data-types  Cedric Nugteren
2016-05-23  Updated README with information on half-precision support  Cedric Nugteren
2016-05-22  Added level-2 half-precision routines HGER/HSYR/HSPR/HSYR2/HSPR2  Cedric Nugteren
2016-05-22  Fixed tuning results for half-precision; added first results for the xGER ker...  Cedric Nugteren
2016-05-22  Prepared the GER kernels and tuner for half-precision support  Cedric Nugteren
2016-05-22  Added level-2 half-precision routines HGEMV/HGBMV/HHEMV/HHBMV/HHPMV/HSYMV/HSB...  Cedric Nugteren
2016-05-22  Added first tuning results for the half-precision xGEMV kernels  Cedric Nugteren
2016-05-22  Prepared the GEMV kernels and tuner for half-precision support  Cedric Nugteren
2016-05-22  Added level-1 half-precision routines HSWAP/HSCAL/HCOPY/HAXPY/HDOT/HNRM2/HASU...  Cedric Nugteren
2016-05-22  Added first tuning results for the half-precision xDOT kernels  Cedric Nugteren
2016-05-22  Added half-precision support for all level 1 routines  Cedric Nugteren
2016-05-18  Merged in latest changes from 0.7.1 release  Cedric Nugteren
2016-05-18  Prepared the changelog for the next release  Cedric Nugteren
2016-05-18  Updated to version 0.7.1  Cedric Nugteren
2016-05-18  Fixes for Visual Studio  CNugteren
2016-05-18  Fixes for CMake policy CMP0054  Cedric Nugteren
2016-05-17  Made MSVC link the run-time libraries statically  Cedric Nugteren
2016-05-17  Fixed warning CMP0054  Cedric Nugteren
2016-05-16  Added half precision tuning results for supporting kernels (pad, copy, transp...  Cedric Nugteren
2016-05-16  Prepared GEMM and supporting kernels and tuners for half-precision support  Cedric Nugteren
2016-05-15  Added an example of using the half-precision HAXPY routine  Cedric Nugteren