summaryrefslogtreecommitdiff
path: root/README.md
AgeCommit message (Collapse)Author
2016-09-27Added an option to run tuned kernels multiple times to average execution ↵Cedric Nugteren
times; requires CLTune 2.5.0
2016-09-21It is now possible to set the OpenCL compiler options through an ↵Cedric Nugteren
environmental variable
2016-09-20Fixed link in README.mdMarco Hutter
The GitHub link could be https://github.com/gpu (without "s"), but the website should be OK, too
2016-09-13Updated to version 0.9.0Cedric Nugteren
2016-09-13Renamed the DEFAULT_DEVICE and DEFAULT_PLATFORM env variables to be in line ↵Cedric Nugteren
with recent usages of CLBLAST_DEVICE and CLBLAST_PLATFORM
2016-09-03Added tuning results for Intel Broadwell 5500 GT2 GPUCedric Nugteren
2016-08-20Merge branch 'development' of github.com:CNugteren/CLBlast into developmentCedric Nugteren
Conflicts: README.md
2016-08-20Merge branch 'master' of https://github.com/dvasschemacq/CLBlast into ↵Cedric Nugteren
dvasschemacq-master Conflicts: src/kernels/level1/xaxpy.opencl src/kernels/level2/xgemv.opencl src/kernels/level2/xgemv_fast.opencl src/kernels/level2/xger.opencl src/kernels/level2/xher.opencl src/kernels/level2/xher2.opencl src/kernels/level3/xgemm_part2.opencl
2016-07-28Minor update regarding the previous CMake export/install target changesCedric Nugteren
2016-07-10Added tuning results for AMD Oland and for Intel Graphics HD 530Cedric Nugteren
2016-07-06Added a VERBOSE mode to debug performance: now prints details about ↵Cedric Nugteren
compilation and kernel execution to screen
2016-07-03Added tuning results for GTX670, GTX750, and GTX1070 (thanks to gcp)Cedric Nugteren
2016-06-27Updated the README in various placesCedric Nugteren
2016-06-27Added vcvarsall to AppVeyor and added AppVeyor icons to READMECedric Nugteren
2016-06-27Added Appveyor Windows CI supportCedric Nugteren
2016-06-27Increased coverage of Travis CI automatic buildsCedric Nugteren
2016-06-27Moved the performance graph scripts to the 'scripts' subfolderCedric Nugteren
2016-06-19Added tuning results for 'Intel(R) HD Graphics Haswell Ultrabook GT2 Mobile' ↵Cedric Nugteren
(thanks to OursDesCavernes)
2016-06-19Renamed all C++ source files to .cpp to match the .hpp extension betterCedric Nugteren
2016-06-16Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, ↵Cedric Nugteren
and/or transposing
2016-06-08Made the CPU BLAS library the default reference to test against in favor of ↵Cedric Nugteren
clBLAS
2016-06-01Added tuning parameters for 'GRID K520' and 'HD Graphics Skylake ULT GT2'Cedric Nugteren
2016-05-30Separated the performance tests (clients) from the correctness tests in CMakeCedric Nugteren
2016-05-25Added level-3 half-precision routines HGEMM/HSYMM/HSYRK/HSYR2K/HTRMMCedric Nugteren
2016-05-24Added proper argument handling and displaying for half-precision data-typesCedric Nugteren
2016-05-23Updated README with information on half-precision supportCedric Nugteren
2016-05-18Merged in latest changes from 0.7.1 releaseCedric Nugteren
2016-05-10Fixed links in the READMECedric Nugteren
2016-05-08Updated to version 0.7.0Cedric Nugteren
2016-05-07Added an option to the tests to control whether to test against clBLAS or a ↵Cedric Nugteren
CPU BLAS library
2016-05-02Added tuning results for AMD Hawaii (R9 290X)Cedric Nugteren
2016-04-30Added non-aboslute minimum counter-part IxMIN of the BLAS routine IxAMAXCedric Nugteren
2016-04-27Added non-absolute counter-parts xSUM and IxMAX of the BLAS routines xASUM ↵Cedric Nugteren
and IxAMAX
2016-04-20Added support for the iSAMAX/iDAMAX/iCAMAX/iZAMAX routinescnugteren
2016-04-14Added support for the SASUM/DASUM/ScASUM/DzASUM routinescnugteren
2016-04-09Events are now properly implemented using event waiting list and asking the ↵cnugteren
user to wait for event completion
2016-04-03Updated the documentation in light of the support for a reference CPU BLAS ↵cnugteren
library
2016-03-31Updated the documentationcnugteren
2016-03-13Updated the README fileCedric Nugteren
2016-03-06Added preliminary support for xHPR2 and xSPR2 routinesCedric Nugteren
2016-02-28Added tuning results for Intel Iris Pro and AMD R9 M370XCedric Nugteren
2016-02-28Added support for xHER, xHPR, xSYR, and xSPR routinesCedric Nugteren
2016-02-20Added support for xGERU and xGERC routinesCedric Nugteren
2016-02-07Added tuning parameters for various devices using the new database scriptCedric Nugteren
2016-01-30Prepared for MSVC supportCedric Nugteren
2015-09-26Added TRMV/TBMV/TPMV routinesCNugteren
2015-09-19Added SBMV and SPMV routinesCNugteren
2015-09-19Added the HPMV routineCNugteren
2015-09-19Added the HBMV routineCNugteren
2015-09-18Added first version of banded matrix-vector multiplicationCNugteren