summaryrefslogtreecommitdiff
AgeCommit message (Expand)Author
2016-07-10Added tuning results for AMD Oland and for Intel Graphics HD 530Cedric Nugteren
2016-07-10Fixed a bug related to the cache and retrieval of programs based on the OpenC...Cedric Nugteren
2016-07-08Cache now compares cl_context instead of a pointer to a context; added verbos...Cedric Nugteren
2016-07-06Added a VERBOSE mode to debug performance: now prints details about compilati...Cedric Nugteren
2016-07-06Added an option to the performance clients to do a warm-up run before timingCedric Nugteren
2016-07-04Fixed a linking issue with the tuners on Visual StudioCNugteren
2016-07-03Added tuning results for GTX670, GTX750, and GTX1070 (thanks to gcp)Cedric Nugteren
2016-07-03Merge pull request #76 from gcp/fix_local_mem_sizeCedric Nugteren
2016-07-02Ensure clGetKernelWorkGroupInfo return value fits.Gian-Carlo Pascutto
2016-07-02Prints the current pandas version and reports the minimum required versionCedric Nugteren
2016-07-02Fixed some memory leaks related to events not properly cleaned-upCedric Nugteren
2016-06-30Added declspec(dllexport) to ClearCache and FillCache, and added declspec(dll...Cedric Nugteren
2016-06-29Updated to version 6.0 of the CLCudaAPI headerCedric Nugteren
2016-06-28Prepared the changelog for the next releaseCedric Nugteren
2016-06-28Updated to version 0.8.0Cedric Nugteren
2016-06-28Changed the AppVeyor buildscript to use nmake instead of 'cmake --build' (2)Cedric Nugteren
2016-06-28Changed the AppVeyor buildscript to use nmake instead of 'cmake --build'Cedric Nugteren
2016-06-28Fixes bug in AppVeyor with install directory (2)Cedric Nugteren
2016-06-28Fixes bug in AppVeyor with install directoryCedric Nugteren
2016-06-28Added configuration for AppVeyor to keep the results of the builds as an 'art...Cedric Nugteren
2016-06-28Made it possible to build the clients and tests on Windows using Visual StudioCNugteren
2016-06-28Made it possible to build the OMATCOPY test and client in case only clBLAS is...CNugteren
2016-06-27Updated the README in various placesCedric Nugteren
2016-06-27Fixes for the AppVeyor Windows buildCedric Nugteren
2016-06-27Added vcvarsall to AppVeyor and added AppVeyor icons to READMECedric Nugteren
2016-06-27Fixed a bug in the Appveyor scriptCedric Nugteren
2016-06-27Added Appveyor Windows CI supportCedric Nugteren
2016-06-27Increased coverage of Travis CI automatic buildsCedric Nugteren
2016-06-27Moved the performance graph scripts to the 'scripts' subfolderCedric Nugteren
2016-06-27Added fp16 to the alltuners targetCedric Nugteren
2016-06-27Changed the symbol for error-code skipped tests to distinguish from succesful...Cedric Nugteren
2016-06-27Increased the verbosity of the '-verbose' option for the correctness tests, n...Cedric Nugteren
2016-06-19Added tuning results for 'Intel(R) HD Graphics Haswell Ultrabook GT2 Mobile' ...Cedric Nugteren
2016-06-19Minor fix to the database scriptCedric Nugteren
2016-06-19Merge pull request #69 from CNugteren/refactoringCedric Nugteren
2016-06-19Renamed all C++ source files to .cpp to match the .hpp extension betterCedric Nugteren
2016-06-18Moved all headers into the source tree, changed headers to .hpp extensionCedric Nugteren
2016-06-18Clean-up of the routine class, moved RunKernel to the routine/common fileCedric Nugteren
2016-06-18Removed the template from the Routine base-classCedric Nugteren
2016-06-17Removed the precision argument from the routines in favor of a single templat...Cedric Nugteren
2016-06-17Removed the interface to the cache functions from the Routine class, calls th...Cedric Nugteren
2016-06-17Moved the RunKernel and PadCopyTransposeMatrix functions out of the Routine c...Cedric Nugteren
2016-06-17Moved the ErrorIn function from the Routine class to the utilities headerCedric Nugteren
2016-06-17Moved the test-for-valid-buffers function from the Routine class to separate ...Cedric Nugteren
2016-06-16Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and...Cedric Nugteren
2016-06-15Added some constness to variables related to the GEMM routinesCedric Nugteren
2016-06-14Re-organised the level-3 supporting kernels (copy, pad, transpose, convert) a...Cedric Nugteren
2016-06-14Moved device vendor and type checks to a common headerCedric Nugteren
2016-06-14Added support for FP16 on ARM Mali-T628 (officially not supported)Cedric Nugteren
2016-06-13Improved API documentation and added documentation for level-2 and level-3 ro...Cedric Nugteren