summaryrefslogtreecommitdiff
path: root/src/clblast.cpp
AgeCommit message (Expand)Author
2018-11-12Add kernel_mode option to im2col, col2im, and convgemm functionsKoichi Akabe
2018-10-23Added groundwork for col2im algorithm plus first non-working version of kerne...Cedric Nugteren
2018-07-29Removed complex numbers support for CONVGEMMCedric Nugteren
2018-05-06Added convgemm skeleton, test infrastructure, and first reference implementationCedric Nugteren
2018-05-05Added interface of batched convolution as GEMMCedric Nugteren
2018-04-13Made GEMM rotation expectations kernel-specificCedric Nugteren
2018-01-31Created the API and stubs for the HAD (hadamard-product) routinesCedric Nugteren
2018-01-07Added API and tests for new GemmStridedBatched routineCedric Nugteren
2018-01-06Added CUDA interface to get temporary-buffer size for GEMM routineCedric Nugteren
2018-01-04Updated the generator script to automatically generate the temp-buffer codeCedric Nugteren
2017-12-30Added optional temp-buffer argument to C++ interface of GEMMCedric Nugteren
2017-12-28Added interface to compute the required temporary buffer size for GEMMCedric Nugteren
2017-10-08Moved non-routine-specific API functions and includes to separate filesCedric Nugteren
2017-10-07Synchronizes clpp11.h with CLCudaAPI 9.0Cedric Nugteren
2017-10-01Allow OverrideParameters function to work before a kernel was first usedCedric Nugteren
2017-09-24Updated database override function to work with the new database storage formatCedric Nugteren
2017-09-23Made database-caching no longer dependent on device name but on device/platfo...Cedric Nugteren
2017-09-16Improved compilation time of the tuner databaseCedric Nugteren
2017-09-14Added architecture layer in the tuning database for better performance on uns...Cedric Nugteren
2017-09-06Split the database files over multiple directories and files; first step towa...Cedric Nugteren
2017-07-02Added interface and stubs for the im2col routineCedric Nugteren
2017-06-21Fixes some compilation issues related to the database structure changeCedric Nugteren
2017-05-26Fixes inability to run GEMM on multiple identical GPUs (issue #155)Kirill Mavreshko
2017-05-12Added the IxAMIN routines: absolute minimum version of IxAMAXCedric Nugteren
2017-04-10Removed const-vector-of-const-objects from the database class to remain accor...Cedric Nugteren
2017-03-10Added API and test infrastructure for the batched GEMM routineCedric Nugteren
2017-03-08Make batched routines based on offsets instead of a vector of cl_mem objects ...Cedric Nugteren
2017-03-05Added first naive version of the batched AXPY routineCedric Nugteren
2017-03-05Prepared generator for batched routines; added batched AXPY routine interfaceCedric Nugteren
2017-02-26Merge branch 'development' into triangular_solversCedric Nugteren
2017-02-26Removed half-precision support from the TRSM routine; too unstableCedric Nugteren
2017-02-16Added a C interface to the OverrideParameters function; added some in-line co...Cedric Nugteren
2017-02-16Added input-sanity checks for the OverrideParameters functionCedric Nugteren
2017-02-13Added first version of the OverrideParameters functionCedric Nugteren
2017-02-05Merge branch 'development' into triangular_solversCedric Nugteren
2017-01-24Routine, Cache: generalize, reduce amount of copying in fast pathIvan Shapovalov
2017-01-24FillCache: perform compilation for each precision separatelyIvan Shapovalov
2017-01-20treewide: include clpp11.hpp first to silence deprecation warningsIvan Shapovalov
2017-01-20Added prototype for the TRSV routineCedric Nugteren
2016-12-18Prepared for the addition of the TRSM triangular solver kernelCedric Nugteren
2016-10-22Routine: get rid of ::SetUp()Ivan Shapovalov
2016-10-22treewide: use C++ exceptions properlyIvan Shapovalov
2016-06-30Added declspec(dllexport) to ClearCache and FillCache, and added declspec(dll...Cedric Nugteren
2016-06-19Renamed all C++ source files to .cpp to match the .hpp extension betterCedric Nugteren