summaryrefslogtreecommitdiff
path: root/src/utilities
AgeCommit message (Expand)Author
2017-11-30Integrated pre-processor in compilation flow, default is still disabledCedric Nugteren
2017-11-25Moved string splitting functions; added string character removal functionCedric Nugteren
2017-11-22Made parameter override in the clients a command-line argument and added supp...Cedric Nugteren
2017-11-19Added compilation timing and better compilation error reportingCedric Nugteren
2017-11-19Revived the GEMM routine tuner; minor formatting changesCedric Nugteren
2017-11-17Moved compilation function to separate file; removed dependency of tuners of ...Cedric Nugteren
2017-11-15Added first version of integrated and re-written auto-tunerCedric Nugteren
2017-11-15Added kernel timing functionality to the utilitiesCedric Nugteren
2017-11-15Added exception handle with catch-allCedric Nugteren
2017-11-13Made the exception dispatch function optionally silentCedric Nugteren
2017-11-13Moved square-difference utility function for use in the tunersCedric Nugteren
2017-11-07Merge pull request #212 from CNugteren/kernel_selection_tunerCedric Nugteren
2017-11-02Integrated the GEMM routine tuner for kernel selection; added first tuning re...Cedric Nugteren
2017-10-30Added collecting and printing of scores for the kernel-selection tunerCedric Nugteren
2017-10-29Added Android support using the GNU C++ STL library and the GCC toolchainCedric Nugteren
2017-10-28Merge branch 'master' into android_supportCedric Nugteren
2017-10-28Added initial version of a GEMM kernel selection tunerCedric Nugteren
2017-10-28Moved timing function to a separate fileCedric Nugteren
2017-10-15Various fixes to make the first CUDA examples workCedric Nugteren
2017-10-12CUDA API now takes context and device in instead of streamCedric Nugteren
2017-10-11Added first (untested) version of a CUDA APICedric Nugteren
2017-10-09Removed include of clpp11.hpp in places other than utilities.hppCedric Nugteren
2017-10-08Moved the remaining OpenCL specific host code to the clpp11.h header where it...Cedric Nugteren
2017-10-07Synchronizes clpp11.h with CLCudaAPI 9.0Cedric Nugteren
2017-09-26Added Android header for compilation with gnustl STLCedric Nugteren
2017-09-16Fixed a compilation error and warning under MacOSCedric Nugteren
2017-09-14Added architecture layer in the tuning database for better performance on uns...Cedric Nugteren
2017-09-10Added the new vendor-architecture-name hierarchy to the tuners as wellCedric Nugteren
2017-09-08Introduced the notion of a device-architecture for the database and added dev...Cedric Nugteren
2017-08-24Merge branch 'master' into im_to_colCedric Nugteren
2017-08-23Made the im2col client properly handle the argumentsCedric Nugteren
2017-08-21Merge pull request #173 from mcian/PSO_paramsCedric Nugteren
2017-08-21Remove multistrategy and related functionsmcian
2017-08-12Merge branch 'master' into im_to_colCedric Nugteren
2017-08-12Moved some utility functions to a test-specific utility compilation-unitCedric Nugteren
2017-07-23Code refactoringmcian
2017-07-17Add PSO parameters support and search strategy selection from command linemcian
2017-07-16First step towards supporting im2col in the test infrastructureCedric Nugteren
2017-07-12Relaxed requirement on a_ld and b_ld for batched GEMMCedric Nugteren
2017-05-26Fixed a compilation issue under MSVC 2013Cedric Nugteren
2017-04-17Fixed a namespace clash with CUDA FP16 for the half-datatypeCedric Nugteren
2017-04-13Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now w...Cedric Nugteren
2017-04-10Merge branch 'development' into cublas_referenceCedric Nugteren
2017-04-07Added a special override database for the Apple CPU implementation on OS X: t...Cedric Nugteren
2017-04-02Layed the groundwork for cuBLAS comparisons in the clientsCedric Nugteren
2017-04-01Separated host-device and device-host memory copies from execution of the CBL...Cedric Nugteren
2017-03-10Added proper testing of the alpha parameter; finalized the batched AXPY imple...Cedric Nugteren
2017-03-10Fixed a small compilation bug for MSVC related to a floating-point constantCedric Nugteren
2017-03-08Make batched routines based on offsets instead of a vector of cl_mem objects ...Cedric Nugteren
2017-03-05Minor fixes to the client w.r.t. the addition of the batch countCedric Nugteren