summaryrefslogtreecommitdiff
AgeCommit message (Collapse)Author
2016-06-08Made the CPU BLAS library the default reference to test against in favor of ↵Cedric Nugteren
clBLAS
2016-06-06Fixed the RPATH settings for linking on OS XCedric Nugteren
2016-06-06Made the tests for invalid buffer sizes also verbose in verbose modeCedric Nugteren
2016-06-02Added return value to the test binaries (0: success, 1: failure), allowing ↵Cedric Nugteren
it to work under CTest properly
2016-06-01Added tuning parameters for 'GRID K520' and 'HD Graphics Skylake ULT GT2'Cedric Nugteren
2016-05-31Made use of CMake's built-in unit testing, allowing all tests to be run ↵Cedric Nugteren
using 'make test'
2016-05-30Increased the verbosity of the -verbose option in the correctness testsCedric Nugteren
2016-05-30Separated the performance tests (clients) from the correctness tests in CMakeCedric Nugteren
2016-05-30Merge branch 'half_precision' into developmentCedric Nugteren
2016-05-26Added half-precision tests for the clBLAS reference through conversion to ↵Cedric Nugteren
single-precision
2016-05-26Added half-precision tests for the CBLAS reference through conversion to ↵Cedric Nugteren
single-precison
2016-05-25Added possibility to run the performance client with half-precisionCedric Nugteren
2016-05-25Added level-3 half-precision routines HGEMM/HSYMM/HSYRK/HSYR2K/HTRMMCedric Nugteren
2016-05-24Added proper argument handling and displaying for half-precision data-typesCedric Nugteren
2016-05-23Updated README with information on half-precision supportCedric Nugteren
2016-05-22Added level-2 half-precision routines HGER/HSYR/HSPR/HSYR2/HSPR2Cedric Nugteren
2016-05-22Fixed tuning results for half-precision; added first results for the xGER ↵Cedric Nugteren
kernels
2016-05-22Prepared the GER kernels and tuner for half-precision supportCedric Nugteren
2016-05-22Added level-2 half-precision routines ↵Cedric Nugteren
HGEMV/HGBMV/HHEMV/HHBMV/HHPMV/HSYMV/HSBMV/HSPMV/HTRMV/HTBMV/HTPMV
2016-05-22Added first tuning results for the half-precision xGEMV kernelsCedric Nugteren
2016-05-22Prepared the GEMV kernels and tuner for half-precision supportCedric Nugteren
2016-05-22Added level-1 half-precision routines ↵Cedric Nugteren
HSWAP/HSCAL/HCOPY/HAXPY/HDOT/HNRM2/HASUM/HSUM/iHAMAX/iHMAX/iHMIN
2016-05-22Added first tuning results for the half-precision xDOT kernelsCedric Nugteren
2016-05-22Added half-precision support for all level 1 routinesCedric Nugteren
2016-05-18Merged in latest changes from 0.7.1 releaseCedric Nugteren
2016-05-18Prepared the changelog for the next releaseCedric Nugteren
2016-05-18Updated to version 0.7.1Cedric Nugteren
2016-05-18Fixes for Visual StudioCNugteren
2016-05-18Fixes for CMake policy CMP0054Cedric Nugteren
2016-05-17Made MSVC link the run-time libraries staticallyCedric Nugteren
2016-05-17Fixed warning CMP0054Cedric Nugteren
2016-05-16Added half precision tuning results for supporting kernels (pad, copy, ↵Cedric Nugteren
transpose, padtranspose)
2016-05-16Prepared GEMM and supporting kernels and tuners for half-precision supportCedric Nugteren
2016-05-15Added an example of using the half-precision HAXPY routineCedric Nugteren
2016-05-15Added header with conversions from and to half-precision floating-pointCedric Nugteren
2016-05-15Updated the performance graph for the Radeon M370X AMD GPUcnugteren
2016-05-15Added new tuning results for SGEMM and updated the performance graph for the ↵cnugteren
Radeon M370X AMD GPU
2016-05-15Removed comparison to CBLAS for the graph scriptscnugteren
2016-05-15Fixed a bug in the xGEMM routine related to the event incorrectly setcnugteren
2016-05-15Fixed the arguments in the performance graphs to reflect the changes in enum ↵cnugteren
values
2016-05-15Added support for staggered/shuffled offsets for GEMM to improve performance ↵cnugteren
for large power-of-2 kernels on AMD GPUs
2016-05-14Set kernel arguments for AXPY as constant memory buffers, making it possible ↵Cedric Nugteren
to transfer half-precision values as well
2016-05-13Initial experimental version of the half-precision HAXPY routineCedric Nugteren
2016-05-12Initial changes in preparation for half-precision fp16 supportCedric Nugteren
2016-05-10Fixed links in the READMECedric Nugteren
2016-05-08Prepared the changelog for the next releaseCedric Nugteren
2016-05-08Fixes for compilation of the tests under Visual Studio 2015CNugteren
2016-05-08Updated to version 0.7.0Cedric Nugteren
2016-05-08Fixed an issue where the xAMAX tester would incorrectly report failures when ↵cnugteren
testing against CBLAS
2016-05-08Fixed an issue where the xNRM2 and xASUM testers would incorrectly report ↵cnugteren
failures for complex inputs