path: root/scripts/generator/generator/cpp.py
Age | Commit message | Author
2023-05-07 | AMAX/AMIN integer testing and bug fixes (#457) | Cedric Nugteren
* Fixed a bug in XAMAX/XMIN routines that caused the increment and offset to be included in the result
* Perform proper integer-output testing in XAMAX tests
* A few changes towards getting it ready for a PR
* Also fix compilation for clBLAS and cuBLAS references
* Fix a bug that would only use the real part of complex numbers in the amax/amin routines
* A few small fixes related to the AMAX tests
2018-08-05 | Added an option to compile the Netlib API with static OpenCL device and context | Cedric Nugteren
2018-01-07 | Added API and tests for new GemmStridedBatched routine | Cedric Nugteren
2018-01-06 | Fixed a minor nullptr related issue in the code generator | Cedric Nugteren
2018-01-04 | Added a CUDA version of the GEMM temp-buffer optional argument | Cedric Nugteren
2018-01-04 | Updated the generator script to automatically generate the temp-buffer code | Cedric Nugteren
2017-10-14 | Various fixes to make the host code and sample compile with the CUDA API | Cedric Nugteren
2017-10-12 | CUDA API now takes context and device in instead of stream | Cedric Nugteren
2017-10-11 | Added first (untested) version of a CUDA API | Cedric Nugteren
2017-10-09 | Fixed the Python generator script w.r.t. the recent change of testing direct/in-direct GEMM kernels separately | Cedric Nugteren
2017-06-25 | Fixed some Clang and MSVC warnings | Cedric Nugteren
2017-04-13 | Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now works | Cedric Nugteren
2017-04-11 | Made compilation of the cuBLAS wrapper work properly | Cedric Nugteren
2017-04-06 | Completed the cuBLAS wrapper | Cedric Nugteren
2017-04-05 | Added a first version of a cuBLAS wrapper (WIP) | Cedric Nugteren
2017-04-03 | In-lined the float2 and double2 types to avoid collision with CUDA's definitions | Cedric Nugteren
2017-03-05 | Prepared generator for batched routines; added batched AXPY routine interface | Cedric Nugteren
2016-11-27 | Made it possible to use the command-line environmental variables for each executable and without re-running CMake | Cedric Nugteren
2016-11-22 | Minor changes to ensure full compatibility with the Netlib CBLAS API | Cedric Nugteren
2016-11-20 | Made functions with scalar-buffers as output properly return values | Cedric Nugteren
2016-10-25 | Renamed the include and source files of the Netlib CBLAS API | Cedric Nugteren
2016-10-25 | Removed the clblast namespace from the Netlib C API source file to ensure proper linking | Cedric Nugteren
2016-10-25 | Made the Netlib CBLAS API use the same enums with prefixes as the regular C API of CLBlast | Cedric Nugteren
2016-10-25 | Added initial version of a Netlib CBLAS implementation. TODO: Set correct buffer sizes | Cedric Nugteren
2016-10-25 | Merge branch 'development' into netlib_blas_api | Cedric Nugteren
Conflicts:
* scripts/generator/generator.py
* scripts/generator/generator/routine.py
2016-10-22 | All enums in the C API are now prefixed with CLBlast to avoid potential name clashes with other projects | Cedric Nugteren
2016-10-22 | Routine: get rid of ::SetUp() | Ivan Shapovalov
Since we now use C++ exceptions inside the implementation (and exceptions can be thrown from constructors), there is no need for a separate Routine::SetUp() function. For this, we also change the way how the kernel source string is constructed. The kernel-specific source code is now passed to the Routine ctor via an initializer_list of C strings to avoid unnecessary data copying while also working around C1091 of MSVC 2013.
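The constructor-based setup described in this commit message can be sketched as follows. This is only an illustration under stated assumptions: the class below is a hypothetical stand-in, not the library's actual `Routine`, and its error check and member names are invented for the example. It shows the two ideas from the message: the source arrives as an `initializer_list` of C strings that is concatenated once, and a failure is reported by throwing from the constructor, so no separate set-up call is needed.

```cpp
#include <cstddef>
#include <cstring>
#include <initializer_list>
#include <stdexcept>
#include <string>

// Hypothetical stand-in for a routine class (not the library's real one).
class Routine {
 public:
  // Takes the kernel source as separate C strings. Each piece is assumed to be
  // a non-null, null-terminated string; the pieces are only copied once, into
  // the final concatenated source.
  Routine(std::initializer_list<const char*> source_parts) {
    std::size_t total = 0;
    for (const char* part : source_parts) { total += std::strlen(part); }
    // Construction fails by throwing, so no half-initialized object can exist.
    if (total == 0) { throw std::invalid_argument("empty kernel source"); }
    source_.reserve(total);
    for (const char* part : source_parts) { source_ += part; }
  }

  const std::string& source() const noexcept { return source_; }

 private:
  std::string source_;
};
```

With this shape, `Routine r{"#define WGS 64\n", "__kernel void copy() {}\n"};` either yields a fully usable object or throws, and each string literal stays short because the source is passed in pieces.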
2016-10-22 | treewide: use C++ exceptions properly | Ivan Shapovalov
Since the codebase is designed around proper C++ idioms such as RAII, it makes sense to only use C++ exceptions internally instead of mixing exceptions and error codes. The exceptions are now caught at top level to preserve compatibility with the existing error code-based API. Note that we deliberately do not catch C++ runtime errors (such as `std::bad_alloc`) nor logic errors (aka failed assertions) because no actual handling can ever happen for such errors. However, in the C interface we do catch _all_ exceptions (...) and convert them into a wild-card error code.
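The boundary handling described in this commit message can be sketched roughly as below. Everything named here is an assumption made for illustration (`StatusCode`, `LibraryError`, `ScaleInPlace`, `ScaleCpp`, `ScaleC`, and the size limit); none of it is taken from the library. The sketch shows internals that report failure only by throwing, a C++ entry point that translates only the library's own exception type, and a C entry point that additionally catches everything and maps it to a wild-card code.

```cpp
#include <exception>
#include <stdexcept>

// Hypothetical status codes; the values are made up for this example.
enum class StatusCode { kSuccess = 0, kInvalidArgument = -1, kUnknownError = -2 };

// Hypothetical library exception that carries the status code to report.
class LibraryError : public std::exception {
 public:
  LibraryError(StatusCode status, const char* message)
      : status_(status), message_(message) {}
  StatusCode status() const noexcept { return status_; }
  const char* what() const noexcept override { return message_; }
 private:
  StatusCode status_;
  const char* message_;  // expected to point at a string literal
};

constexpr int kMaxElements = 1 << 20;  // arbitrary limit for the example

// Internal implementation: no error codes, failures are thrown.
void ScaleInPlace(float* data, int n, float alpha) {
  if (data == nullptr || n < 0) {
    throw LibraryError(StatusCode::kInvalidArgument, "invalid buffer or size");
  }
  // A standard logic error, standing in for a failed internal assertion.
  if (n > kMaxElements) { throw std::length_error("too many elements"); }
  for (int i = 0; i < n; ++i) { data[i] *= alpha; }
}

// C++ entry point: translates the library's own exceptions only. Standard
// exceptions such as std::bad_alloc or logic errors propagate to the caller.
StatusCode ScaleCpp(float* data, int n, float alpha) {
  try {
    ScaleInPlace(data, n, alpha);
    return StatusCode::kSuccess;
  } catch (const LibraryError& e) {
    return e.status();
  }
}

// C entry point: no exception may cross into C code, so anything that is not
// a LibraryError is converted into the wild-card error code.
extern "C" int ScaleC(float* data, int n, float alpha) {
  try {
    ScaleInPlace(data, n, alpha);
    return static_cast<int>(StatusCode::kSuccess);
  } catch (const LibraryError& e) {
    return static_cast<int>(e.status());
  } catch (...) {
    return static_cast<int>(StatusCode::kUnknownError);
  }
}
```

In this sketch a null buffer yields `kInvalidArgument` from both entry points, while an oversized `n` yields `kUnknownError` from `ScaleC` and an uncaught `std::length_error` from `ScaleCpp`.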
2016-10-05 | Added first version of Netlib BLAS API header | Cedric Nugteren
2016-09-04 | Refactored the Python C++ generator script; now confirms to the PEP8 styleguide | Cedric Nugteren