debian-clblast
Debian package for CLBlast.
gspr@nonempty.org
path: root/src/database/database.cpp
Age
Commit message
Author
2018-12-31
Added convgemm to the CLBlast database, added initial parameters for Skylake GPU
Cedric Nugteren
2018-05-30
Widened Apple OpenCL check, added way to debug too-large-workgroups issue
Cedric Nugteren
2018-05-29
Added Apple OpenCL TRSV block size override; removed failing old Intel GPU te...
Cedric Nugteren
2018-01-06
Fixed a performance overhead in database creation: it is again a static varia...
Cedric Nugteren
2017-12-26
Made the database-vector a non-static member
Cedric Nugteren
2017-12-23
Updated the database to use the new TRSV and Invert tuners
Cedric Nugteren
2017-11-02
Integrated the GEMM routine tuner for kernel selection; added first tuning re...
Cedric Nugteren
2017-09-23
Made program and binary databases dependent on the routine parameters on top ...
Cedric Nugteren
2017-09-16
Updated README with proper AMD device names; fixed device look-up for names o...
Cedric Nugteren
2017-09-16
Fixed a compilation error and warning under MacOS
Cedric Nugteren
2017-09-16
Improved compilation time of the tuner database
Cedric Nugteren
2017-09-14
Added architecture layer in the tuning database for better performance on uns...
Cedric Nugteren
2017-09-10
Added the new vendor-architecture-name hierarchy to the tuners as well
Cedric Nugteren
2017-09-08
Introduced the notion of a device-architecture for the database and added dev...
Cedric Nugteren
2017-09-06
Split the database files over multiple directories and files; first step towa...
Cedric Nugteren
2017-09-04
Removed an assumption that the 'default' tuning parameters have to be stored ...
Cedric Nugteren
2017-06-20
Changed the structure of the database to reduce compilation time and save memory
Cedric Nugteren
2017-04-10
Fixed a compilation issue under MSVC and GCC
Cedric Nugteren
2017-04-10
Removed const-vector-of-const-objects from the database class to remain accor...
Cedric Nugteren
2017-04-07
Added a special override database for the Apple CPU implementation on OS X: t...
Cedric Nugteren
2017-02-26
Merge branch 'development' into triangular_solvers
Cedric Nugteren
2017-02-16
Added input-sanity checks for the OverrideParameters function
Cedric Nugteren
2017-02-12
Split the database into several smaller cached per-kernel databases (in prepa...
Cedric Nugteren
2017-02-05
Merge branch 'development' into triangular_solvers
Cedric Nugteren
2017-02-04
Completed a first STRSV implementation
Cedric Nugteren
2017-01-24
Database: pass Device instead of Queue for clarity
Ivan Shapovalov
2017-01-24
Database: ref-count the internal map for caching
Ivan Shapovalov
2017-01-15
Added a first version of the diagonal block invert routine in preparation of ...
Cedric Nugteren
2016-10-22
Moved files around a bit; created a utilities subfolder
Cedric Nugteren
2016-10-22
treewide: use C++ exceptions properly
Ivan Shapovalov
2016-10-14
Fixed an issue with a growing database: the database is now a global variable...
Cedric Nugteren
2016-10-10
First fixes towards compilation on Visual Studio 2013
Cedric Nugteren
2016-10-06
Added a kernel selection database to select between the direct and indirect G...
Cedric Nugteren
2016-09-25
Separated the tuning parameters of the new direct GEMM kernel from the indire...
Cedric Nugteren
2016-09-12
Added XgemvFastRot and Xgemm 16-bit tuning results: just defaults which are n...
Cedric Nugteren
2016-07-25
Removed all old tuning results for the XgemvFastRot kernel; re-added for a co...
Cedric Nugteren
2016-07-25
Moved the XgemvFast and XgemvFastRot tuning database into a separate file
Cedric Nugteren
2016-07-24
Minor improvements after merging in groundwork for custom tuning parameters a...
Cedric Nugteren
2016-07-22
clblast::Database, clblast::Routine: implement "database overlays" provided b...
Ivan Shapovalov
2016-06-19
Renamed all C++ source files to .cpp to match the .hpp extension better
Cedric Nugteren