summaryrefslogtreecommitdiff
path: root/scripts/generator
AgeCommit message (Expand)Author
2017-04-17Fixed a namespace clash with CUDA FP16 for the half-datatypeCedric Nugteren
2017-04-13Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now w...Cedric Nugteren
2017-04-11Made compilation of the cuBLAS wrapper work properlyCedric Nugteren
2017-04-10Merge branch 'development' into cublas_referenceCedric Nugteren
2017-04-10Removed const-vector-of-const-objects from the database class to remain accor...Cedric Nugteren
2017-04-06Completed the cuBLAS wrapperCedric Nugteren
2017-04-05Added a first version of a cuBLAS wrapper (WIP)Cedric Nugteren
2017-04-03In-lined the float2 and double2 types to avoid collision with CUDA's definitionsCedric Nugteren
2017-03-10Added API and test infrastructure for the batched GEMM routineCedric Nugteren
2017-03-08Make batched routines based on offsets instead of a vector of cl_mem objects ...Cedric Nugteren
2017-03-05Added first naive version of the batched AXPY routineCedric Nugteren
2017-03-05Prepared generator for batched routines; added batched AXPY routine interfaceCedric Nugteren
2017-02-26Minor fix to the generator scriptCedric Nugteren
2017-02-26Merge branch 'development' into triangular_solversCedric Nugteren
2017-02-26Removed half-precision support from the TRSM routine; too unstableCedric Nugteren
2017-02-18Added documentation for the OverrideParameters functionCedric Nugteren
2017-02-18Added missing documentation for the fill and clear cache functionsCedric Nugteren
2017-02-16Added a C interface to the OverrideParameters function; added some in-line co...Cedric Nugteren
2017-02-16Added input-sanity checks for the OverrideParameters functionCedric Nugteren
2017-02-13Added first version of the OverrideParameters functionCedric Nugteren
2017-02-05Merge branch 'development' into triangular_solversCedric Nugteren
2017-01-24FillCache: perform compilation for each precision separatelyIvan Shapovalov
2017-01-20Added prototype for the TRSV routineCedric Nugteren
2016-12-18Prepared for the addition of the TRSM triangular solver kernelCedric Nugteren
2016-11-27Made it possible to use the command-line environmental variables for each exe...Cedric Nugteren
2016-11-23Fixed a vector-size related bug in the CLBlast Netlib APICedric Nugteren
2016-11-22Minor changes to ensure full compatibility with the Netlib CBLAS APICedric Nugteren
2016-11-20Made functions with scalar-buffers as output properly return valuesCedric Nugteren
2016-10-25Renamed the include and source files of the Netlib CBLAS APICedric Nugteren
2016-10-25Removed the clblast namespace from the Netlib C API source file to ensure pro...Cedric Nugteren
2016-10-25Fixed some issues preventing the Netlib CBLAS API from linking correctlyCedric Nugteren
2016-10-25Made the Netlib CBLAS API use the same enums with prefixes as the regular C A...Cedric Nugteren
2016-10-25Sets the proper sizes for the buffers for the Netlib CBLAS APICedric Nugteren
2016-10-25Added initial version of a Netlib CBLAS implementation. TODO: Set correct buf...Cedric Nugteren
2016-10-25Merge branch 'development' into netlib_blas_apiCedric Nugteren
2016-10-22All enums in the C API are now prefixed with CLBlast to avoid potential name ...Cedric Nugteren
2016-10-22Added extra error codes to reflect the more detailed error reporting of OpenC...Cedric Nugteren
2016-10-22Routine: get rid of ::SetUp()Ivan Shapovalov
2016-10-22treewide: use C++ exceptions properlyIvan Shapovalov
2016-10-05Made non-standard types void-pointers in the Netlib BLAS interfaceCedric Nugteren
2016-10-05Added first version of Netlib BLAS API headerCedric Nugteren
2016-09-04Refactored the Python C++ generator script; now confirms to the PEP8 styleguideCedric Nugteren
2016-06-30Added declspec(dllexport) to ClearCache and FillCache, and added declspec(dll...Cedric Nugteren
2016-06-19Renamed all C++ source files to .cpp to match the .hpp extension betterCedric Nugteren
2016-06-18Moved all headers into the source tree, changed headers to .hpp extensionCedric Nugteren
2016-06-18Clean-up of the routine class, moved RunKernel to the routine/common fileCedric Nugteren
2016-06-16Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and...Cedric Nugteren
2016-06-13Improved API documentation and added documentation for level-2 and level-3 ro...Cedric Nugteren
2016-06-10Added documentation for the matrix-update level-2 family of routinesCedric Nugteren
2016-06-02Added return value to the test binaries (0: success, 1: failure), allowing it...Cedric Nugteren