Age | Commit message (Expand) | Author |
2016-08-15 | Updated the database script to calculate the relative best performance of tun... | Cedric Nugteren |
2016-08-09 | Improved the speed of the new common-best defaults method for the database ge... | Cedric Nugteren |
2016-08-07 | Added a first version of the database's common-best default calculation | Cedric Nugteren |
2016-07-28 | Minor update regarding the previous CMake export/install target changes | Cedric Nugteren |
2016-07-28 | Merge pull request #86 from intelfx/cmake | Cedric Nugteren |
2016-07-28 | .appveyor.yml: move {OPENCL,CLBLAST}_ROOT out of source tree | Ivan Shapovalov |
2016-07-28 | .travis.yml: use OpenCL ICD Loader and headers shipped by distro | Ivan Shapovalov |
2016-07-28 | CMakeLists.txt: use target_include_directories() | Ivan Shapovalov |
2016-07-28 | CMakeLists.txt: provide a find_package() config for dependent projects | Ivan Shapovalov |
2016-07-26 | Merge branch 'gemv_performance' into development | Cedric Nugteren |
2016-07-25 | Removed all old tuning results for the XgemvFastRot kernel; re-added for a co... | Cedric Nugteren |
2016-07-25 | Moved the XgemvFast and XgemvFastRot tuning database into a separate file | Cedric Nugteren |
2016-07-24 | Merge branch 'development' into gemv_performance | Cedric Nugteren |
2016-07-24 | Minor improvements after merging in groundwork for custom tuning parameters a... | Cedric Nugteren |
2016-07-24 | Merge pull request #84 from intelfx/device-specific-kernels | Cedric Nugteren |
2016-07-24 | Refactored the Python database script: separated functionality in modules, no... | Cedric Nugteren |
2016-07-23 | Fixe a bug in the new XgemvFastRot kernel related to local memory size | Cedric Nugteren |
2016-07-23 | Further improvements to the XgemvFastRot kernel, properly enables coalescing now | Cedric Nugteren |
2016-07-23 | Improved the XgemvFastRot kernel by tiled loading of the input matrix A, enab... | Cedric Nugteren |
2016-07-22 | clblast::Database, clblast::Routine: implement "database overlays" provided b... | Ivan Shapovalov |
2016-07-22 | clblast::RunKernel, cl::Kernel: unify variants with/without waitForEvents, su... | Ivan Shapovalov |
2016-07-22 | cl::Kernel: skip NULL entries in waitForEvents | Ivan Shapovalov |
2016-07-22 | clblast::RunKernel, cl::Kernel: take const vector as waitForEvents | Ivan Shapovalov |
2016-07-22 | xgemm: do not hardcode kernel requirements for internal matrix layout | Ivan Shapovalov |
2016-07-22 | CMakeLists.txt: use ${clblast_SOURCE_DIR} instead of ${CMAKE_SOURCE_DIR} | Ivan Shapovalov |
2016-07-16 | Fixed some more types and type conversions in the clpp11 interface to OpenCL | Cedric Nugteren |
2016-07-16 | Merge pull request #80 from gcp/getdevinfo_fixes | Cedric Nugteren |
2016-07-16 | Removed an unused variable from the copy-transpose-pad function | Cedric Nugteren |
2016-07-13 | Make sure the passed types are large enough. | Gian-Carlo Pascutto |
2016-07-10 | Now passing alpha/beta to the kernel as arguments as before fp16 support; in ... | Cedric Nugteren |
2016-07-10 | Added tuning results for AMD Oland and for Intel Graphics HD 530 | Cedric Nugteren |
2016-07-10 | Fixed a bug related to the cache and retrieval of programs based on the OpenC... | Cedric Nugteren |
2016-07-08 | Cache now compares cl_context instead of a pointer to a context; added verbos... | Cedric Nugteren |
2016-07-06 | Added a VERBOSE mode to debug performance: now prints details about compilati... | Cedric Nugteren |
2016-07-06 | Added an option to the performance clients to do a warm-up run before timing | Cedric Nugteren |
2016-07-04 | Fixed a linking issue with the tuners on Visual Studio | CNugteren |
2016-07-03 | Added tuning results for GTX670, GTX750, and GTX1070 (thanks to gcp) | Cedric Nugteren |
2016-07-03 | Merge pull request #76 from gcp/fix_local_mem_size | Cedric Nugteren |
2016-07-02 | Ensure clGetKernelWorkGroupInfo return value fits. | Gian-Carlo Pascutto |
2016-07-02 | Prints the current pandas version and reports the minimum required version | Cedric Nugteren |
2016-07-02 | Fixed some memory leaks related to events not properly cleaned-up | Cedric Nugteren |
2016-06-30 | Added declspec(dllexport) to ClearCache and FillCache, and added declspec(dll... | Cedric Nugteren |
2016-06-29 | Updated to version 6.0 of the CLCudaAPI header | Cedric Nugteren |
2016-06-28 | Prepared the changelog for the next release | Cedric Nugteren |
2016-06-28 | Updated to version 0.8.0 | Cedric Nugteren |
2016-06-28 | Changed the AppVeyor buildscript to use nmake instead of 'cmake --build' (2) | Cedric Nugteren |
2016-06-28 | Changed the AppVeyor buildscript to use nmake instead of 'cmake --build' | Cedric Nugteren |
2016-06-28 | Fixes bug in AppVeyor with install directory (2) | Cedric Nugteren |
2016-06-28 | Fixes bug in AppVeyor with install directory | Cedric Nugteren |
2016-06-28 | Added configuration for AppVeyor to keep the results of the builds as an 'art... | Cedric Nugteren |