summaryrefslogtreecommitdiff
path: root/CMakeLists.txt
AgeCommit message (Expand)Author
2021-01-19Updated to version 1.5.2Cedric Nugteren
2020-03-08Added sample to play around with XAMAX routineCedric Nugteren
2020-02-18Updated to version 1.5.1Cedric Nugteren
2018-12-31Added convgemm to the CLBlast database, added initial parameters for Skylake GPUCedric Nugteren
2018-12-18Added first version of a tuner for the ConvGemm direct kernelCedric Nugteren
2018-12-04Updated to version 1.5.0Cedric Nugteren
2018-10-23Added groundwork for col2im algorithm plus first non-working version of kerne...Cedric Nugteren
2018-09-16Merge branch 'master' into convgemm_multi_kernelCedric Nugteren
2018-08-07Name change of setting to NETLIB_PERSISTENT_OPENCLCedric Nugteren
2018-08-05Added an option to compile the Netlib API with static OpenCL device and contextCedric Nugteren
2018-07-29Merge branch 'master' into CLBlast-267-convgemmCedric Nugteren
2018-07-27Renamed AMD SI workaround definesCedric Nugteren
2018-07-25Added workaround for weird AMD SI Hainan bugCedric Nugteren
2018-07-14Updated to CLBlast version 1.4.1Cedric Nugteren
2018-06-03Merge branch 'master' into CLBlast-267-convgemmCedric Nugteren
2018-06-03Updated to CLBlast version 1.4.0Cedric Nugteren
2018-06-03Fixes for CUDA version of CLBlastCedric Nugteren
2018-06-02Added MKL as an alternative for CBLAS for correctness and performance compari...Cedric Nugteren
2018-05-19Merge branch 'master' into CLBlast-267-convgemmCedric Nugteren
2018-05-19Fixed compilation issuesCedric Nugteren
2018-05-19The GEMM routine tuner now loads kernel JSON tuning results from disk if avai...Cedric Nugteren
2018-05-06Added convgemm skeleton, test infrastructure, and first reference implementationCedric Nugteren
2018-04-20Fixes for the CUDA APICedric Nugteren
2018-03-06First version of the tuning API, added interface for copy-kernel, added sampleCedric Nugteren
2018-02-11Added CLBlast version numbering to the compiled libraryCedric Nugteren
2018-01-31Created the API and stubs for the HAD (hadamard-product) routinesCedric Nugteren
2018-01-29Updated to CLBlast version 1.3.0Cedric Nugteren
2018-01-29Fixed a compilation error of the kernel-preprocessor test under MSVCCedric Nugteren
2018-01-15Factored out the generic parts of the GEMM routine tunerCedric Nugteren
2018-01-11Added test for the RetrieveParameters functionCedric Nugteren
2018-01-07Added API and tests for new GemmStridedBatched routineCedric Nugteren
2017-12-27Split the database into multiple small compilation unitsCedric Nugteren
2017-12-24Fixes for the CUDA backend of CLBlastCedric Nugteren
2017-12-24Fixed linking of the preprocessor test for MSVCCedric Nugteren
2017-12-23Updated the database to use the new TRSV and Invert tunersCedric Nugteren
2017-12-23Added TRSV block-size tunerCedric Nugteren
2017-12-19Added skeleton for a tuner for the invert kernelCedric Nugteren
2017-11-25Added stub for a preprocessor and a corresponding compilation testCedric Nugteren
2017-11-19Made the tuners be compiled by defaultCedric Nugteren
2017-11-19Added a library with common tuner sources to speed-up compilationCedric Nugteren
2017-11-17Moved compilation function to separate file; removed dependency of tuners of ...Cedric Nugteren
2017-11-16Removed dependency on CLTuneCedric Nugteren
2017-11-15Added first version of integrated and re-written auto-tunerCedric Nugteren
2017-11-15Added kernel timing functionality to the utilitiesCedric Nugteren
2017-11-08Updated to CLBlast version 1.2.0Cedric Nugteren
2017-11-07Merge pull request #212 from CNugteren/kernel_selection_tunerCedric Nugteren
2017-11-02Integrated the GEMM routine tuner for kernel selection; added first tuning re...Cedric Nugteren
2017-10-29Made it possible to compile the CLBlast performance clients for Android with ...Cedric Nugteren
2017-10-28Added initial version of a GEMM kernel selection tunerCedric Nugteren
2017-10-28Moved timing function to a separate fileCedric Nugteren