summaryrefslogtreecommitdiff
path: root/scripts/generator
AgeCommit message (Expand)Author
2018-02-02Fixed the XHAD documentationCedric Nugteren
2018-01-31Created the API and stubs for the HAD (hadamard-product) routinesCedric Nugteren
2018-01-11Added a RetrieveParameters function to inspect tuning parametersCedric Nugteren
2018-01-07Added API and tests for new GemmStridedBatched routineCedric Nugteren
2018-01-06Fixed a minor nullptr related issue in the code generatorCedric Nugteren
2018-01-06Added CUDA interface to get temporary-buffer size for GEMM routineCedric Nugteren
2018-01-04Added a CUDA version of the GEMM temp-buffer optional argumentCedric Nugteren
2018-01-04Updated the generator script to automatically generate the temp-buffer codeCedric Nugteren
2017-12-28Added interface to compute the required temporary buffer size for GEMMCedric Nugteren
2017-10-14Various fixes to make the host code and sample compile with the CUDA APICedric Nugteren
2017-10-12CUDA API now takes context and device in instead of streamCedric Nugteren
2017-10-11Added first (untested) version of a CUDA APICedric Nugteren
2017-10-09Fixed the Python generator script w.r.t. the recent change of testing direct/...Cedric Nugteren
2017-10-08Moved non-routine-specific API functions and includes to separate filesCedric Nugteren
2017-07-02Added interface and stubs for the im2col routineCedric Nugteren
2017-06-25Fixed some Clang and MSVC warningsCedric Nugteren
2017-06-21Fixes some compilation issues related to the database structure changeCedric Nugteren
2017-05-12Added the IxAMIN routines: absolute minimum version of IxAMAXCedric Nugteren
2017-04-17Fixed a namespace clash with CUDA FP16 for the half-datatypeCedric Nugteren
2017-04-13Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now w...Cedric Nugteren
2017-04-11Made compilation of the cuBLAS wrapper work properlyCedric Nugteren
2017-04-10Merge branch 'development' into cublas_referenceCedric Nugteren
2017-04-10Removed const-vector-of-const-objects from the database class to remain accor...Cedric Nugteren
2017-04-06Completed the cuBLAS wrapperCedric Nugteren
2017-04-05Added a first version of a cuBLAS wrapper (WIP)Cedric Nugteren
2017-04-03In-lined the float2 and double2 types to avoid collision with CUDA's definitionsCedric Nugteren
2017-03-10Added API and test infrastructure for the batched GEMM routineCedric Nugteren
2017-03-08Make batched routines based on offsets instead of a vector of cl_mem objects ...Cedric Nugteren
2017-03-05Added first naive version of the batched AXPY routineCedric Nugteren
2017-03-05Prepared generator for batched routines; added batched AXPY routine interfaceCedric Nugteren
2017-02-26Minor fix to the generator scriptCedric Nugteren
2017-02-26Merge branch 'development' into triangular_solversCedric Nugteren
2017-02-26Removed half-precision support from the TRSM routine; too unstableCedric Nugteren
2017-02-18Added documentation for the OverrideParameters functionCedric Nugteren
2017-02-18Added missing documentation for the fill and clear cache functionsCedric Nugteren
2017-02-16Added a C interface to the OverrideParameters function; added some in-line co...Cedric Nugteren
2017-02-16Added input-sanity checks for the OverrideParameters functionCedric Nugteren
2017-02-13Added first version of the OverrideParameters functionCedric Nugteren
2017-02-05Merge branch 'development' into triangular_solversCedric Nugteren
2017-01-24FillCache: perform compilation for each precision separatelyIvan Shapovalov
2017-01-20Added prototype for the TRSV routineCedric Nugteren
2016-12-18Prepared for the addition of the TRSM triangular solver kernelCedric Nugteren
2016-11-27Made it possible to use the command-line environmental variables for each exe...Cedric Nugteren
2016-11-23Fixed a vector-size related bug in the CLBlast Netlib APICedric Nugteren
2016-11-22Minor changes to ensure full compatibility with the Netlib CBLAS APICedric Nugteren
2016-11-20Made functions with scalar-buffers as output properly return valuesCedric Nugteren
2016-10-25Renamed the include and source files of the Netlib CBLAS APICedric Nugteren
2016-10-25Removed the clblast namespace from the Netlib C API source file to ensure pro...Cedric Nugteren
2016-10-25Fixed some issues preventing the Netlib CBLAS API from linking correctlyCedric Nugteren
2016-10-25Made the Netlib CBLAS API use the same enums with prefixes as the regular C A...Cedric Nugteren