summaryrefslogtreecommitdiff
AgeCommit message (Expand)Author
2016-09-21It is now possible to set the OpenCL compiler options through an environmenta...Cedric Nugteren
2016-09-21Merge branch 'master' into developmentCedric Nugteren
2016-09-20Merge pull request #100 from gpu/masterCedric Nugteren
2016-09-20Fixed link in README.mdMarco Hutter
2016-09-13Merge pull request #99 from CNugteren/developmentCedric Nugteren
2016-09-13Updated to version 0.9.0Cedric Nugteren
2016-09-13Renamed the DEFAULT_DEVICE and DEFAULT_PLATFORM env variables to be in line w...Cedric Nugteren
2016-09-13Merge pull request #98 from intelfx/no-ignored-attributesCedric Nugteren
2016-09-13CMakeLists.txt: use -Wno-ignored-attributes to silence unfixable warningsIvan Shapovalov
2016-09-12Split the XGEMM kernel further up: now in 3 parts. This is done because MSVC ...Cedric Nugteren
2016-09-12Merge branch 'database_rewrite' into developmentCedric Nugteren
2016-09-12Added XgemvFastRot and Xgemm 16-bit tuning results: just defaults which are n...Cedric Nugteren
2016-09-11Complete re-write of the database script. Changed Pandas for the much faster ...Cedric Nugteren
2016-09-10Merge branch 'xgemm_tuner_exhaustive' into developmentCedric Nugteren
2016-09-10Updated database based on exhaustive tuning results for GEMM for the R9 M370X...Cedric Nugteren
2016-09-10Updated the database script to remove duplicate entries: keeps only the best-...Cedric Nugteren
2016-09-06Split GEMM tuning in two parts: a small set of tuning parameters which is exp...Cedric Nugteren
2016-09-04Refactored the Python C++ generator script; now confirms to the PEP8 styleguideCedric Nugteren
2016-09-04The GEMM kernel no longer adds beta*C in case beta is zero; this would cause ...Cedric Nugteren
2016-09-03Added tuning results for Intel Broadwell 5500 GT2 GPUCedric Nugteren
2016-09-03Updated tuning results for Haswell GT2 Mobile GPU; fixed database script to h...Cedric Nugteren
2016-08-27Merge pull request #93 from intelfx/test-read-environmentCedric Nugteren
2016-08-27test/correctness: read platform and device from environmentIvan Shapovalov
2016-08-22Merge branch 'database_defaults' into developmentCedric Nugteren
2016-08-21Also changed the default-default for unknown device types to use the same met...Cedric Nugteren
2016-08-21Increased the ratio of GEMM tuning results to explore; reduced the tuning sea...Cedric Nugteren
2016-08-21Updated the changelog; refactored the database-get-bests code a bitCedric Nugteren
2016-08-20Merge branch 'development' of github.com:CNugteren/CLBlast into developmentCedric Nugteren
2016-08-20Merge branch 'dvasschemacq-master' into developmentCedric Nugteren
2016-08-20Merge branch 'master' of https://github.com/dvasschemacq/CLBlast into dvassch...Cedric Nugteren
2016-08-18Adapt opencl files for 1.1 OpenCLD. Van Assche
2016-08-15Updated the database script to calculate the relative best performance of tun...Cedric Nugteren
2016-08-09Improved the speed of the new common-best defaults method for the database ge...Cedric Nugteren
2016-08-07Added a first version of the database's common-best default calculationCedric Nugteren
2016-07-28Minor update regarding the previous CMake export/install target changesCedric Nugteren
2016-07-28Merge pull request #86 from intelfx/cmakeCedric Nugteren
2016-07-28.appveyor.yml: move {OPENCL,CLBLAST}_ROOT out of source treeIvan Shapovalov
2016-07-28.travis.yml: use OpenCL ICD Loader and headers shipped by distroIvan Shapovalov
2016-07-28CMakeLists.txt: use target_include_directories()Ivan Shapovalov
2016-07-28CMakeLists.txt: provide a find_package() config for dependent projectsIvan Shapovalov
2016-07-26Merge branch 'gemv_performance' into developmentCedric Nugteren
2016-07-25Removed all old tuning results for the XgemvFastRot kernel; re-added for a co...Cedric Nugteren
2016-07-25Moved the XgemvFast and XgemvFastRot tuning database into a separate fileCedric Nugteren
2016-07-24Merge branch 'development' into gemv_performanceCedric Nugteren
2016-07-24Minor improvements after merging in groundwork for custom tuning parameters a...Cedric Nugteren
2016-07-24Merge pull request #84 from intelfx/device-specific-kernelsCedric Nugteren
2016-07-24Refactored the Python database script: separated functionality in modules, no...Cedric Nugteren
2016-07-23Fixe a bug in the new XgemvFastRot kernel related to local memory sizeCedric Nugteren
2016-07-23Further improvements to the XgemvFastRot kernel, properly enables coalescing nowCedric Nugteren
2016-07-23Improved the XgemvFastRot kernel by tiled loading of the input matrix A, enab...Cedric Nugteren