summaryrefslogtreecommitdiff
AgeCommit message (Collapse)Author
2016-07-02Fixed some memory leaks related to events not properly cleaned-upCedric Nugteren
2016-06-30Added declspec(dllexport) to ClearCache and FillCache, and added ↵Cedric Nugteren
declspec(dllimport) when not building the library
2016-06-29Updated to version 6.0 of the CLCudaAPI headerCedric Nugteren
2016-06-28Prepared the changelog for the next releaseCedric Nugteren
2016-06-28Updated to version 0.8.0Cedric Nugteren
2016-06-28Changed the AppVeyor buildscript to use nmake instead of 'cmake --build' (2)Cedric Nugteren
2016-06-28Changed the AppVeyor buildscript to use nmake instead of 'cmake --build'Cedric Nugteren
2016-06-28Fixes bug in AppVeyor with install directory (2)Cedric Nugteren
2016-06-28Fixes bug in AppVeyor with install directoryCedric Nugteren
2016-06-28Added configuration for AppVeyor to keep the results of the builds as an ↵Cedric Nugteren
'artifact'
2016-06-28Made it possible to build the clients and tests on Windows using Visual StudioCNugteren
2016-06-28Made it possible to build the OMATCOPY test and client in case only clBLAS ↵CNugteren
is present
2016-06-27Updated the README in various placesCedric Nugteren
2016-06-27Fixes for the AppVeyor Windows buildCedric Nugteren
2016-06-27Added vcvarsall to AppVeyor and added AppVeyor icons to READMECedric Nugteren
2016-06-27Fixed a bug in the Appveyor scriptCedric Nugteren
2016-06-27Added Appveyor Windows CI supportCedric Nugteren
2016-06-27Increased coverage of Travis CI automatic buildsCedric Nugteren
2016-06-27Moved the performance graph scripts to the 'scripts' subfolderCedric Nugteren
2016-06-27Added fp16 to the alltuners targetCedric Nugteren
2016-06-27Changed the symbol for error-code skipped tests to distinguish from ↵Cedric Nugteren
succesfull error-code checks in the correctness tests
2016-06-27Increased the verbosity of the '-verbose' option for the correctness tests, ↵Cedric Nugteren
now printing when a library is called
2016-06-19Added tuning results for 'Intel(R) HD Graphics Haswell Ultrabook GT2 Mobile' ↵Cedric Nugteren
(thanks to OursDesCavernes)
2016-06-19Minor fix to the database scriptCedric Nugteren
2016-06-19Merge pull request #69 from CNugteren/refactoringCedric Nugteren
Refactoring of the Routine class and file-renaming
2016-06-19Renamed all C++ source files to .cpp to match the .hpp extension betterCedric Nugteren
2016-06-18Moved all headers into the source tree, changed headers to .hpp extensionCedric Nugteren
2016-06-18Clean-up of the routine class, moved RunKernel to the routine/common fileCedric Nugteren
2016-06-18Removed the template from the Routine base-classCedric Nugteren
2016-06-17Removed the precision argument from the routines in favor of a single ↵Cedric Nugteren
templated function
2016-06-17Removed the interface to the cache functions from the Routine class, calls ↵Cedric Nugteren
them directly now
2016-06-17Moved the RunKernel and PadCopyTransposeMatrix functions out of the Routine ↵Cedric Nugteren
class
2016-06-17Moved the ErrorIn function from the Routine class to the utilities headerCedric Nugteren
2016-06-17Moved the test-for-valid-buffers function from the Routine class to separate ↵Cedric Nugteren
functions in a separate file
2016-06-16Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, ↵Cedric Nugteren
and/or transposing
2016-06-15Added some constness to variables related to the GEMM routinesCedric Nugteren
2016-06-14Re-organised the level-3 supporting kernels (copy, pad, transpose, convert) ↵Cedric Nugteren
and renamed files and functions appropriately
2016-06-14Moved device vendor and type checks to a common headerCedric Nugteren
2016-06-14Added support for FP16 on ARM Mali-T628 (officially not supported)Cedric Nugteren
2016-06-13Improved API documentation and added documentation for level-2 and level-3 ↵Cedric Nugteren
routines
2016-06-10Added documentation for the matrix-update level-2 family of routinesCedric Nugteren
2016-06-08Added global memory synchronisation for better cache performance on ARM Mali ↵Cedric Nugteren
GPUs
2016-06-08Made the CPU BLAS library the default reference to test against in favor of ↵Cedric Nugteren
clBLAS
2016-06-06Fixed the RPATH settings for linking on OS XCedric Nugteren
2016-06-06Made the tests for invalid buffer sizes also verbose in verbose modeCedric Nugteren
2016-06-02Added return value to the test binaries (0: success, 1: failure), allowing ↵Cedric Nugteren
it to work under CTest properly
2016-06-01Added tuning parameters for 'GRID K520' and 'HD Graphics Skylake ULT GT2'Cedric Nugteren
2016-05-31Made use of CMake's built-in unit testing, allowing all tests to be run ↵Cedric Nugteren
using 'make test'
2016-05-30Increased the verbosity of the -verbose option in the correctness testsCedric Nugteren
2016-05-30Separated the performance tests (clients) from the correctness tests in CMakeCedric Nugteren