Age | Commit message | Author |
2016-06-17 | Moved the RunKernel and PadCopyTransposeMatrix functions out of the Routine c... | Cedric Nugteren |
2016-06-17 | Moved the ErrorIn function from the Routine class to the utilities header | Cedric Nugteren |
2016-06-17 | Moved the test-for-valid-buffers function from the Routine class to separate ... | Cedric Nugteren |
2016-06-16 | Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and... | Cedric Nugteren |
2016-06-15 | Added some constness to variables related to the GEMM routines | Cedric Nugteren |
2016-06-14 | Re-organised the level-3 supporting kernels (copy, pad, transpose, convert) a... | Cedric Nugteren |
2016-06-14 | Moved device vendor and type checks to a common header | Cedric Nugteren |
2016-06-14 | Added support for FP16 on ARM Mali-T628 (officially not supported) | Cedric Nugteren |
2016-06-13 | Improved API documentation and added documentation for level-2 and level-3 ro... | Cedric Nugteren |
2016-06-10 | Added documentation for the matrix-update level-2 family of routines | Cedric Nugteren |
2016-06-08 | Added global memory synchronisation for better cache performance on ARM Mali ... | Cedric Nugteren |
2016-06-08 | Made the CPU BLAS library the default reference to test against in favor of c... | Cedric Nugteren |
2016-06-06 | Fixed the RPATH settings for linking on OS X | Cedric Nugteren |
2016-06-06 | Made the tests for invalid buffer sizes also verbose in verbose mode | Cedric Nugteren |
2016-06-02 | Added return value to the test binaries (0: success, 1: failure), allowing it... | Cedric Nugteren |
2016-06-01 | Added tuning parameters for 'GRID K520' and 'HD Graphics Skylake ULT GT2' | Cedric Nugteren |
2016-05-31 | Made use of CMake's built-in unit testing, allowing all tests to be run using... | Cedric Nugteren |
2016-05-30 | Increased the verbosity of the -verbose option in the correctness tests | Cedric Nugteren |
2016-05-30 | Separated the performance tests (clients) from the correctness tests in CMake | Cedric Nugteren |
2016-05-30 | Merge branch 'half_precision' into development | Cedric Nugteren |
2016-05-26 | Added half-precision tests for the clBLAS reference through conversion to sin... | Cedric Nugteren |
2016-05-26 | Added half-precision tests for the CBLAS reference through conversion to sing... | Cedric Nugteren |
2016-05-25 | Added possibility to run the performance client with half-precision | Cedric Nugteren |
2016-05-25 | Added level-3 half-precision routines HGEMM/HSYMM/HSYRK/HSYR2K/HTRMM | Cedric Nugteren |
2016-05-24 | Added proper argument handling and displaying for half-precision data-types | Cedric Nugteren |
2016-05-23 | Updated README with information on half-precision support | Cedric Nugteren |
2016-05-22 | Added level-2 half-precision routines HGER/HSYR/HSPR/HSYR2/HSPR2 | Cedric Nugteren |
2016-05-22 | Fixed tuning results for half-precision; added first results for the xGER ker... | Cedric Nugteren |
2016-05-22 | Prepared the GER kernels and tuner for half-precision support | Cedric Nugteren |
2016-05-22 | Added level-2 half-precision routines HGEMV/HGBMV/HHEMV/HHBMV/HHPMV/HSYMV/HSB... | Cedric Nugteren |
2016-05-22 | Added first tuning results for the half-precision xGEMV kernels | Cedric Nugteren |
2016-05-22 | Prepared the GEMV kernels and tuner for half-precision support | Cedric Nugteren |
2016-05-22 | Added level-1 half-precision routines HSWAP/HSCAL/HCOPY/HAXPY/HDOT/HNRM2/HASU... | Cedric Nugteren |
2016-05-22 | Added first tuning results for the half-precision xDOT kernels | Cedric Nugteren |
2016-05-22 | Added half-precision support for all level 1 routines | Cedric Nugteren |
2016-05-18 | Merged in latest changes from 0.7.1 release | Cedric Nugteren |
2016-05-18 | Prepared the changelog for the next release | Cedric Nugteren |
2016-05-18 | Updated to version 0.7.1 | Cedric Nugteren |
2016-05-18 | Fixes for Visual Studio | CNugteren |
2016-05-18 | Fixes for CMake policy CMP0054 | Cedric Nugteren |
2016-05-17 | Made MSVC link the run-time libraries statically | Cedric Nugteren |
2016-05-17 | Fixed warning CMP0054 | Cedric Nugteren |
2016-05-16 | Added half precision tuning results for supporting kernels (pad, copy, transp... | Cedric Nugteren |
2016-05-16 | Prepared GEMM and supporting kernels and tuners for half-precision support | Cedric Nugteren |
2016-05-15 | Added an example of using the half-precision HAXPY routine | Cedric Nugteren |
2016-05-15 | Added header with conversions from and to half-precision floating-point | Cedric Nugteren |
2016-05-15 | Updated the performance graph for the Radeon M370X AMD GPU | cnugteren |
2016-05-15 | Added new tuning results for SGEMM and updated the performance graph for the ... | cnugteren |
2016-05-15 | Removed comparison to CBLAS for the graph scripts | cnugteren |
2016-05-15 | Fixed a bug in the xGEMM routine related to the event incorrectly set | cnugteren |