Age | Commit message (Expand) | Author |
2016-08-21 | Updated the changelog; refactored the database-get-bests code a bit | Cedric Nugteren |
2016-08-15 | Updated the database script to calculate the relative best performance of tun... | Cedric Nugteren |
2016-08-09 | Improved the speed of the new common-best defaults method for the database ge... | Cedric Nugteren |
2016-08-07 | Added a first version of the database's common-best default calculation | Cedric Nugteren |
2016-07-25 | Moved the XgemvFast and XgemvFastRot tuning database into a separate file | Cedric Nugteren |
2016-07-24 | Refactored the Python database script: separated functionality in modules, no... | Cedric Nugteren |
2016-07-03 | Added tuning results for GTX670, GTX750, and GTX1070 (thanks to gcp) | Cedric Nugteren |
2016-07-02 | Prints the current pandas version and reports the minimum required version | Cedric Nugteren |
2016-06-30 | Added declspec(dllexport) to ClearCache and FillCache, and added declspec(dll... | Cedric Nugteren |
2016-06-27 | Moved the performance graph scripts to the 'scripts' subfolder | Cedric Nugteren |
2016-06-19 | Minor fix to the database script | Cedric Nugteren |
2016-06-19 | Renamed all C++ source files to .cpp to match the .hpp extension better | Cedric Nugteren |
2016-06-18 | Moved all headers into the source tree, changed headers to .hpp extension | Cedric Nugteren |
2016-06-18 | Clean-up of the routine class, moved RunKernel to the routine/common file | Cedric Nugteren |
2016-06-16 | Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and... | Cedric Nugteren |
2016-06-13 | Improved API documentation and added documentation for level-2 and level-3 ro... | Cedric Nugteren |
2016-06-10 | Added documentation for the matrix-update level-2 family of routines | Cedric Nugteren |
2016-06-02 | Added return value to the test binaries (0: success, 1: failure), allowing it... | Cedric Nugteren |
2016-05-26 | Added half-precision tests for the clBLAS reference through conversion to sin... | Cedric Nugteren |
2016-05-26 | Added half-precision tests for the CBLAS reference through conversion to sing... | Cedric Nugteren |
2016-05-25 | Added possibility to run the performance client with half-precision | Cedric Nugteren |
2016-05-25 | Added level-3 half-precision routines HGEMM/HSYMM/HSYRK/HSYR2K/HTRMM | Cedric Nugteren |
2016-05-22 | Added level-2 half-precision routines HGER/HSYR/HSPR/HSYR2/HSPR2 | Cedric Nugteren |
2016-05-22 | Added level-2 half-precision routines HGEMV/HGBMV/HHEMV/HHBMV/HHPMV/HSYMV/HSB... | Cedric Nugteren |
2016-05-22 | Added level-1 half-precision routines HSWAP/HSCAL/HCOPY/HAXPY/HDOT/HNRM2/HASU... | Cedric Nugteren |
2016-05-18 | Merged in latest changes from 0.7.1 release | Cedric Nugteren |
2016-05-13 | Initial experimental version of the half-precision HAXPY routine | Cedric Nugteren |
2016-05-12 | Initial changes in preparation for half-precision fp16 support | Cedric Nugteren |
2016-05-08 | Fixed an issue where the xAMAX tester would incorrectly report failures when ... | cnugteren |
2016-05-08 | Fixed an issue where the xNRM2 and xASUM testers would incorrectly report fai... | cnugteren |
2016-05-08 | Added preliminary generated API documentation | Cedric Nugteren |
2016-05-04 | Fixed an issue with linking against the ATLAS BLAS library | Cedric Nugteren |
2016-05-01 | Added tuning results for AMD Pitcairn (R9 270X) | Cedric Nugteren |
2016-05-01 | Updated tuning database for reduction/dot kernels based on the new tuner; par... | Cedric Nugteren |
2016-05-01 | Changed the index buffer of IxAMAX routines to unsigned int for proper buffer... | Cedric Nugteren |
2016-04-30 | Added non-aboslute minimum counter-part IxMIN of the BLAS routine IxAMAX | Cedric Nugteren |
2016-04-29 | Added FillCache: a function to pre-compile all kernels for a specific device | Cedric Nugteren |
2016-04-27 | Added non-absolute counter-parts xSUM and IxMAX of the BLAS routines xASUM an... | Cedric Nugteren |
2016-04-27 | Added prototypes for non-BLAS routines: xSUM and IxMAX (non-absolute counterp... | Cedric Nugteren |
2016-04-27 | Moved all cache-related functions to a separate file; added a ClearCompiledPr... | Cedric Nugteren |
2016-04-27 | All CLBlast enum constants now have the same raw values as in the cblas standard | Cedric Nugteren |
2016-04-20 | Added support for the iSAMAX/iDAMAX/iCAMAX/iZAMAX routines | cnugteren |
2016-04-20 | Added prototype for ixAMAX routines | cnugteren |
2016-04-14 | Added support for the SASUM/DASUM/ScASUM/DzASUM routines | cnugteren |
2016-04-13 | Added prototype for xASUM routines | cnugteren |
2016-04-11 | Fixed the way the defaults are calculated in the database; added warning for ... | cnugteren |
2016-04-09 | Events are now properly implemented using event waiting list and asking the u... | cnugteren |
2016-04-02 | Added support for testing (performance and correctness) against a CPU BLAS li... | cnugteren |
2016-04-01 | Added a wrapper for CBLAS libraries for performance/correctness testing | cnugteren |
2016-03-30 | Merge branch 'level1_routines' into development | cnugteren |