index : debian-clblast
Debian package for CLBlast.
gspr@nonempty.org
path: root/src/clblast.cpp
Age | Commit message | Author
2018-01-06 | Added CUDA interface to get temporary-buffer size for GEMM routine | Cedric Nugteren
2018-01-04 | Updated the generator script to automatically generate the temp-buffer code | Cedric Nugteren
2017-12-30 | Added optional temp-buffer argument to C++ interface of GEMM | Cedric Nugteren
2017-12-28 | Added interface to compute the required temporary buffer size for GEMM | Cedric Nugteren
2017-10-08 | Moved non-routine-specific API functions and includes to separate files | Cedric Nugteren
2017-10-07 | Synchronizes clpp11.h with CLCudaAPI 9.0 | Cedric Nugteren
2017-10-01 | Allow OverrideParameters function to work before a kernel was first used | Cedric Nugteren
2017-09-24 | Updated database override function to work with the new database storage format | Cedric Nugteren
2017-09-23 | Made database-caching no longer dependent on device name but on device/platfo... | Cedric Nugteren
2017-09-16 | Improved compilation time of the tuner database | Cedric Nugteren
2017-09-14 | Added architecture layer in the tuning database for better performance on uns... | Cedric Nugteren
2017-09-06 | Split the database files over multiple directories and files; first step towa... | Cedric Nugteren
2017-07-02 | Added interface and stubs for the im2col routine | Cedric Nugteren
2017-06-21 | Fixes some compilation issues related to the database structure change | Cedric Nugteren
2017-05-26 | Fixes inability to run GEMM on multiple identical GPUs (issue #155) | Kirill Mavreshko
2017-05-12 | Added the IxAMIN routines: absolute minimum version of IxAMAX | Cedric Nugteren
2017-04-10 | Removed const-vector-of-const-objects from the database class to remain accor... | Cedric Nugteren
2017-03-10 | Added API and test infrastructure for the batched GEMM routine | Cedric Nugteren
2017-03-08 | Make batched routines based on offsets instead of a vector of cl_mem objects ... | Cedric Nugteren
2017-03-05 | Added first naive version of the batched AXPY routine | Cedric Nugteren
2017-03-05 | Prepared generator for batched routines; added batched AXPY routine interface | Cedric Nugteren
2017-02-26 | Merge branch 'development' into triangular_solvers | Cedric Nugteren
2017-02-26 | Removed half-precision support from the TRSM routine; too unstable | Cedric Nugteren
2017-02-16 | Added a C interface to the OverrideParameters function; added some in-line co... | Cedric Nugteren
2017-02-16 | Added input-sanity checks for the OverrideParameters function | Cedric Nugteren
2017-02-13 | Added first version of the OverrideParameters function | Cedric Nugteren
2017-02-05 | Merge branch 'development' into triangular_solvers | Cedric Nugteren
2017-01-24 | Routine, Cache: generalize, reduce amount of copying in fast path | Ivan Shapovalov
2017-01-24 | FillCache: perform compilation for each precision separately | Ivan Shapovalov
2017-01-20 | treewide: include clpp11.hpp first to silence deprecation warnings | Ivan Shapovalov
2017-01-20 | Added prototype for the TRSV routine | Cedric Nugteren
2016-12-18 | Prepared for the addition of the TRSM triangular solver kernel | Cedric Nugteren
2016-10-22 | Routine: get rid of ::SetUp() | Ivan Shapovalov
2016-10-22 | treewide: use C++ exceptions properly | Ivan Shapovalov
2016-06-30 | Added declspec(dllexport) to ClearCache and FillCache, and added declspec(dll... | Cedric Nugteren
2016-06-19 | Renamed all C++ source files to .cpp to match the .hpp extension better | Cedric Nugteren