path: root/src/utilities/utilities.hpp
Age  Commit message  Author
2023-05-07  AMAX/AMIN integer testing and bug fixes (#457)  Cedric Nugteren
2019-05-11  Added a function to set the OpenCL kernel standard, either 1.1 or 1.2  Cedric Nugteren
2019-05-08  Changed back to cl_intel_subgroups as suggested  Cedric Nugteren
2019-05-07  Added a host-code check to make sure the avc_motion_estimation is available  Cedric Nugteren
2018-11-12  Add kernel_mode option to im2col, col2im, and convgemm functions  Koichi Akabe
2018-10-30  Fix col2im implementation  Koichi Akabe
2018-06-03  Merge branch 'master' into CLBlast-267-convgemm  Cedric Nugteren
2018-05-23  Added an option in the clients to output timing statistics: minimum, mean, an...  Cedric Nugteren
2018-05-19  Merge branch 'master' into CLBlast-267-convgemm  Cedric Nugteren
2018-05-17  Added a canary region for overflow detection to the tuners  Cedric Nugteren
2018-05-06  Added convgemm skeleton, test infrastructure, and first reference implementation  Cedric Nugteren
2018-04-24  Added a define to enable subgroup shuffling if supported by the device  Cedric Nugteren
2017-12-10  Fixed a missing include  Cedric Nugteren
2017-11-25  Moved string splitting functions; added string character removal function  Cedric Nugteren
2017-11-22  Made parameter override in the clients a command-line argument and added supp...  Cedric Nugteren
2017-11-15  Added first version of integrated and re-written auto-tuner  Cedric Nugteren
2017-11-13  Moved square-difference utility function for use in the tuners  Cedric Nugteren
2017-10-11  Added first (untested) version of a CUDA API  Cedric Nugteren
2017-10-08  Moved the remaining OpenCL specific host code to the clpp11.h header where it...  Cedric Nugteren
2017-09-16  Fixed a compilation error and warning under MacOS  Cedric Nugteren
2017-09-14  Added architecture layer in the tuning database for better performance on uns...  Cedric Nugteren
2017-09-10  Added the new vendor-architecture-name hierarchy to the tuners as well  Cedric Nugteren
2017-09-08  Introduced the notion of a device-architecture for the database and added dev...  Cedric Nugteren
2017-08-24  Merge branch 'master' into im_to_col  Cedric Nugteren
2017-08-23  Made the im2col client properly handle the arguments  Cedric Nugteren
2017-08-21  Merge pull request #173 from mcian/PSO_params  Cedric Nugteren
2017-08-21  Remove multistrategy and related functions  mcian
2017-08-12  Merge branch 'master' into im_to_col  Cedric Nugteren
2017-08-12  Moved some utility functions to a test-specific utility compilation-unit  Cedric Nugteren
2017-07-23  Code refactoring  mcian
2017-07-17  Add PSO parameters support and search strategy selection from command line  mcian
2017-07-16  First step towards supporting im2col in the test infrastructure  Cedric Nugteren
2017-05-26  Fixed a compilation issue under MSVC 2013  Cedric Nugteren
2017-04-17  Fixed a namespace clash with CUDA FP16 for the half-datatype  Cedric Nugteren
2017-04-13  Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now w...  Cedric Nugteren
2017-04-02  Layed the groundwork for cuBLAS comparisons in the clients  Cedric Nugteren
2017-04-01  Separated host-device and device-host memory copies from execution of the CBL...  Cedric Nugteren
2017-03-10  Added proper testing of the alpha parameter; finalized the batched AXPY imple...  Cedric Nugteren
2017-03-08  Make batched routines based on offsets instead of a vector of cl_mem objects ...  Cedric Nugteren
2017-03-05  Minor fixes to the client w.r.t. the addition of the batch count  Cedric Nugteren
2017-03-05  Adjusted the test-infrastructure to support testing of batched-versions of ro...  Cedric Nugteren
2017-03-05  Changed the way the test-data is generated: now using a single MT generator a...  Cedric Nugteren
2017-03-04  Added a proper data-preparation function for the TRSM tests  Cedric Nugteren
2017-02-25  Added PrepareData function for TRSM to create proper test input  Cedric Nugteren
2017-02-19  Added data-preparation function for the TRSV tests and special nan/inf checks...  Cedric Nugteren
2017-02-05  Merge branch 'development' into triangular_solvers  Cedric Nugteren
2017-01-20  treewide: include clpp11.hpp first to silence deprecation warnings  Ivan Shapovalov
2017-01-18  Added first version of the TRSM routine based on the diagonal invert kernel  Cedric Nugteren
2017-01-15  Added a first version of the diagonal block invert routine in preparation of ...  Cedric Nugteren
2016-11-27  Made it possible to use the command-line environmental variables for each exe...  Cedric Nugteren