summaryrefslogtreecommitdiff
path: root/CMakeLists.txt
AgeCommit message (Collapse)Author
2017-11-15Added kernel timing functionality to the utilitiesCedric Nugteren
2017-11-08Updated to CLBlast version 1.2.0Cedric Nugteren
2017-11-07Merge pull request #212 from CNugteren/kernel_selection_tunerCedric Nugteren
GEMM kernel selection tuner
2017-11-02Integrated the GEMM routine tuner for kernel selection; added first tuning ↵Cedric Nugteren
results
2017-10-29Made it possible to compile the CLBlast performance clients for Android with ↵Cedric Nugteren
the NDK
2017-10-28Added initial version of a GEMM kernel selection tunerCedric Nugteren
2017-10-28Moved timing function to a separate fileCedric Nugteren
2017-10-27Added a DTRSM C++ interface exampleCedric Nugteren
2017-10-23Use GNUInstallDirs to determine install pathsMatthias Vogelgesang
The GNUInstallDirs module* provides variables that match the install directories for GNU Software and allows users to override them. Without hardcoding paths packagers can choose library paths according to distribution policies (i.e. lib, lib64, lib<arch>, ...). * https://cmake.org/cmake/help/v3.0/module/GNUInstallDirs.html
2017-10-20Merge pull request #204 from CNugteren/cuda_apiCedric Nugteren
Cuda API to CLBlast
2017-10-15Prepared test and client infrastructure for use with the CUDA APICedric Nugteren
2017-10-14Added an option to choose whether to override the MSVC flags from /MT to /MD ↵Cedric Nugteren
(default ON)
2017-10-14Added DAXPY example for the CUDA APICedric Nugteren
2017-10-14Added first untested CUDA sampleCedric Nugteren
2017-10-11Added first (untested) version of a CUDA APICedric Nugteren
2017-10-08Moved non-routine-specific API functions and includes to separate filesCedric Nugteren
2017-09-30Updated to version 1.1.0Cedric Nugteren
2017-09-19Added first version of a small CLBlast diagnostics helperCedric Nugteren
2017-09-10Added the new vendor-architecture-name hierarchy to the tuners as wellCedric Nugteren
2017-09-08Introduced the notion of a device-architecture for the database and added ↵Cedric Nugteren
device and architecture name mappings
2017-09-06Split the database files over multiple directories and files; first step ↵Cedric Nugteren
towards separate compilation
2017-08-12Merge branch 'master' into im_to_colCedric Nugteren
2017-08-12Fixed .hpp -> .h typo in CMakeListsCedric Nugteren
2017-08-12Properly set the common test utilities in the CMake filesCedric Nugteren
2017-08-12Moved some utility functions to a test-specific utility compilation-unitCedric Nugteren
2017-08-11Updated CMakeLists to include header files such that IDEs can locate themCedric Nugteren
2017-08-08Updated to version 1.0.1 (bugfix release)Cedric Nugteren
2017-07-30Updated to version 1.0.0Cedric Nugteren
2017-07-30Fixes warnings for Clang & AppleClangCedric Nugteren
2017-07-30Fixes to AppVeyor and Travis scriptsCedric Nugteren
2017-07-12Removed spurious warning for Clang < 3.9Cedric Nugteren
2017-07-02Added interface and stubs for the im2col routineCedric Nugteren
2017-06-29Added batched GEMM example programCedric Nugteren
2017-06-27Moved and inlined some static member variables and disabled spurious clang ↵Cedric Nugteren
warnings
2017-06-26Fixed and suppresses several warnings for MSVCCedric Nugteren
2017-06-26Reduced optimization level for the (non-performance critical) host-code to ↵Cedric Nugteren
speed-up compilation
2017-05-02Updated to version 0.11.0Cedric Nugteren
2017-04-16Finalized support for performance testing against cuBLASCedric Nugteren
2017-04-03Added proper CMake searching for CUDA and cuBLASCedric Nugteren
2017-04-02Layed the groundwork for cuBLAS comparisons in the clientsCedric Nugteren
2017-03-10Added API and test infrastructure for the batched GEMM routineCedric Nugteren
2017-03-05Added first naive version of the batched AXPY routineCedric Nugteren
2017-02-26Merge branch 'development' into triangular_solversCedric Nugteren
2017-02-25Removed the invert routine from the testsCedric Nugteren
2017-02-14Added simple tests for the OverrideParameters functionCedric Nugteren
2017-02-05Merge branch 'development' into triangular_solversCedric Nugteren
2017-01-20Added prototype for the TRSV routineCedric Nugteren
2017-01-15Added a first version of the diagonal block invert routine in preparation of ↵Cedric Nugteren
TRSM
2017-01-07Updated the link to cl.hpp in the Khronos registry for the samplesCedric Nugteren
2016-12-18Prepared for the addition of the TRSM triangular solver kernelCedric Nugteren