summaryrefslogtreecommitdiff
path: root/scripts
AgeCommit message (Expand)Author
2017-09-16Improved compilation time of the tuner databaseCedric Nugteren
2017-09-14Added architecture layer in the tuning database for better performance on uns...Cedric Nugteren
2017-09-12Added database compress and de-compress functionsCedric Nugteren
2017-09-11Database now works with new format of clblast_[property]Cedric Nugteren
2017-09-06Split the database files over multiple directories and files; first step towa...Cedric Nugteren
2017-07-02Added interface and stubs for the im2col routineCedric Nugteren
2017-06-25Fixed some Clang and MSVC warningsCedric Nugteren
2017-06-21Fixes some compilation issues related to the database structure changeCedric Nugteren
2017-06-20Changed the structure of the database to reduce compilation time and save memoryCedric Nugteren
2017-05-24changing "wb" to "w" when saving json file (text mode) - compatibility for Py...Grigori Fursin
2017-05-12Added the IxAMIN routines: absolute minimum version of IxAMAXCedric Nugteren
2017-05-11Minor naming fixes to the benchmark scriptCedric Nugteren
2017-04-23Added an option to the database script to remove tuning results from the data...Cedric Nugteren
2017-04-23Re-added Titan X (Pascal) tuning results based on more averaging when tuningCedric Nugteren
2017-04-21Merge branch 'development' into benchmarkingCedric Nugteren
2017-04-21Removed the words SUMMARY from the title of the benchmark script when benchma...Cedric Nugteren
2017-04-20Updated the settings for the batched benchmarksCedric Nugteren
2017-04-17Fixed a namespace clash with CUDA FP16 for the half-datatypeCedric Nugteren
2017-04-17Added proper handling of mismatched arguments in the database scriptCedric Nugteren
2017-04-16Set proper settings for the benchmarks of batched routinesCedric Nugteren
2017-04-16Merge branch 'development' into benchmarkingCedric Nugteren
2017-04-16Added settings for benchmarking batched routinesCedric Nugteren
2017-04-14Added a benchmark-all script to run multiple benchmarks automaticallyCedric Nugteren
2017-04-14Tuned the num-runs settings for the benchmarksCedric Nugteren
2017-04-14Added output-folder for benchmarking and removed the requirement on XCedric Nugteren
2017-04-14Made the number of runs a benchmark-specific setting in the benchmark scriptsCedric Nugteren
2017-04-13Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now w...Cedric Nugteren
2017-04-11Made compilation of the cuBLAS wrapper work properlyCedric Nugteren
2017-04-10Merge branch 'development' into cublas_referenceCedric Nugteren
2017-04-10Removed const-vector-of-const-objects from the database class to remain accor...Cedric Nugteren
2017-04-06Completed the cuBLAS wrapperCedric Nugteren
2017-04-05Added a first version of a cuBLAS wrapper (WIP)Cedric Nugteren
2017-04-03In-lined the float2 and double2 types to avoid collision with CUDA's definitionsCedric Nugteren
2017-04-02Various tweaks to the new benchmark scriptCedric Nugteren
2017-04-01Tuned the plots for a tight-layout for in papers and presentationsCedric Nugteren
2017-03-26Replaced the R graph scripts with Python/Matplotlib benchmark scriptsCedric Nugteren
2017-03-10Added API and test infrastructure for the batched GEMM routineCedric Nugteren
2017-03-08Make batched routines based on offsets instead of a vector of cl_mem objects ...Cedric Nugteren
2017-03-05Added first naive version of the batched AXPY routineCedric Nugteren
2017-03-05Prepared generator for batched routines; added batched AXPY routine interfaceCedric Nugteren
2017-02-26Minor fix to the generator scriptCedric Nugteren
2017-02-26Merge branch 'development' into triangular_solversCedric Nugteren
2017-02-26Removed half-precision support from the TRSM routine; too unstableCedric Nugteren
2017-02-18Added documentation for the OverrideParameters functionCedric Nugteren
2017-02-18Added missing documentation for the fill and clear cache functionsCedric Nugteren
2017-02-16Added a C interface to the OverrideParameters function; added some in-line co...Cedric Nugteren
2017-02-16Added input-sanity checks for the OverrideParameters functionCedric Nugteren
2017-02-13Added first version of the OverrideParameters functionCedric Nugteren
2017-02-05Merge branch 'development' into triangular_solversCedric Nugteren
2017-01-24FillCache: perform compilation for each precision separatelyIvan Shapovalov