Age | Commit message (Expand) | Author |
2016-07-24 | Merge branch 'development' into gemv_performance | Cedric Nugteren |
2016-07-24 | Minor improvements after merging in groundwork for custom tuning parameters a... | Cedric Nugteren |
2016-07-24 | Merge pull request #84 from intelfx/device-specific-kernels | Cedric Nugteren |
2016-07-24 | Refactored the Python database script: separated functionality in modules, no... | Cedric Nugteren |
2016-07-23 | Fixe a bug in the new XgemvFastRot kernel related to local memory size | Cedric Nugteren |
2016-07-23 | Further improvements to the XgemvFastRot kernel, properly enables coalescing now | Cedric Nugteren |
2016-07-23 | Improved the XgemvFastRot kernel by tiled loading of the input matrix A, enab... | Cedric Nugteren |
2016-07-22 | clblast::Database, clblast::Routine: implement "database overlays" provided b... | Ivan Shapovalov |
2016-07-22 | clblast::RunKernel, cl::Kernel: unify variants with/without waitForEvents, su... | Ivan Shapovalov |
2016-07-22 | cl::Kernel: skip NULL entries in waitForEvents | Ivan Shapovalov |
2016-07-22 | clblast::RunKernel, cl::Kernel: take const vector as waitForEvents | Ivan Shapovalov |
2016-07-22 | xgemm: do not hardcode kernel requirements for internal matrix layout | Ivan Shapovalov |
2016-07-22 | CMakeLists.txt: use ${clblast_SOURCE_DIR} instead of ${CMAKE_SOURCE_DIR} | Ivan Shapovalov |
2016-07-16 | Fixed some more types and type conversions in the clpp11 interface to OpenCL | Cedric Nugteren |
2016-07-16 | Merge pull request #80 from gcp/getdevinfo_fixes | Cedric Nugteren |
2016-07-16 | Removed an unused variable from the copy-transpose-pad function | Cedric Nugteren |
2016-07-13 | Make sure the passed types are large enough. | Gian-Carlo Pascutto |
2016-07-10 | Now passing alpha/beta to the kernel as arguments as before fp16 support; in ... | Cedric Nugteren |
2016-07-10 | Added tuning results for AMD Oland and for Intel Graphics HD 530 | Cedric Nugteren |
2016-07-10 | Fixed a bug related to the cache and retrieval of programs based on the OpenC... | Cedric Nugteren |
2016-07-08 | Cache now compares cl_context instead of a pointer to a context; added verbos... | Cedric Nugteren |
2016-07-06 | Added a VERBOSE mode to debug performance: now prints details about compilati... | Cedric Nugteren |
2016-07-06 | Added an option to the performance clients to do a warm-up run before timing | Cedric Nugteren |
2016-07-04 | Fixed a linking issue with the tuners on Visual Studio | CNugteren |
2016-07-03 | Added tuning results for GTX670, GTX750, and GTX1070 (thanks to gcp) | Cedric Nugteren |
2016-07-03 | Merge pull request #76 from gcp/fix_local_mem_size | Cedric Nugteren |
2016-07-02 | Ensure clGetKernelWorkGroupInfo return value fits. | Gian-Carlo Pascutto |
2016-07-02 | Prints the current pandas version and reports the minimum required version | Cedric Nugteren |
2016-07-02 | Fixed some memory leaks related to events not properly cleaned-up | Cedric Nugteren |
2016-06-30 | Added declspec(dllexport) to ClearCache and FillCache, and added declspec(dll... | Cedric Nugteren |
2016-06-29 | Updated to version 6.0 of the CLCudaAPI header | Cedric Nugteren |
2016-06-28 | Prepared the changelog for the next release | Cedric Nugteren |
2016-06-28 | Updated to version 0.8.0 | Cedric Nugteren |
2016-06-28 | Changed the AppVeyor buildscript to use nmake instead of 'cmake --build' (2) | Cedric Nugteren |
2016-06-28 | Changed the AppVeyor buildscript to use nmake instead of 'cmake --build' | Cedric Nugteren |
2016-06-28 | Fixes bug in AppVeyor with install directory (2) | Cedric Nugteren |
2016-06-28 | Fixes bug in AppVeyor with install directory | Cedric Nugteren |
2016-06-28 | Added configuration for AppVeyor to keep the results of the builds as an 'art... | Cedric Nugteren |
2016-06-28 | Made it possible to build the clients and tests on Windows using Visual Studio | CNugteren |
2016-06-28 | Made it possible to build the OMATCOPY test and client in case only clBLAS is... | CNugteren |
2016-06-27 | Updated the README in various places | Cedric Nugteren |
2016-06-27 | Fixes for the AppVeyor Windows build | Cedric Nugteren |
2016-06-27 | Added vcvarsall to AppVeyor and added AppVeyor icons to README | Cedric Nugteren |
2016-06-27 | Fixed a bug in the Appveyor script | Cedric Nugteren |
2016-06-27 | Added Appveyor Windows CI support | Cedric Nugteren |
2016-06-27 | Increased coverage of Travis CI automatic builds | Cedric Nugteren |
2016-06-27 | Moved the performance graph scripts to the 'scripts' subfolder | Cedric Nugteren |
2016-06-27 | Added fp16 to the alltuners target | Cedric Nugteren |
2016-06-27 | Changed the symbol for error-code skipped tests to distinguish from succesful... | Cedric Nugteren |
2016-06-27 | Increased the verbosity of the '-verbose' option for the correctness tests, n... | Cedric Nugteren |