index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
/
routine.cpp
Age
Commit message (
Expand
)
Author
2017-12-28
Added interface to compute the required temporary buffer size for GEMM
Cedric Nugteren
2017-12-09
Made the pre-processor run by default for ARM and Qualcomm GPUs
Cedric Nugteren
2017-11-30
Integrated pre-processor in compilation flow, default is still disabled
Cedric Nugteren
2017-11-11
Factored out the creation of the OpenCL header and the program compilation
Cedric Nugteren
2017-11-07
Merge pull request #212 from CNugteren/kernel_selection_tuner
Cedric Nugteren
2017-11-02
Integrated the GEMM routine tuner for kernel selection; added first tuning re...
Cedric Nugteren
2017-10-29
Added platform ID to the binary program cache to prevent issues with multi-pl...
Cedric Nugteren
2017-10-14
Added OpenCL to CUDA translation header for the kernels
Cedric Nugteren
2017-10-08
Moved the remaining OpenCL specific host code to the clpp11.h header where it...
Cedric Nugteren
2017-10-07
Synchronizes clpp11.h with CLCudaAPI 9.0
Cedric Nugteren
2017-09-24
Updated database override function to work with the new database storage format
Cedric Nugteren
2017-09-23
Made program and binary databases dependent on the routine parameters on top ...
Cedric Nugteren
2017-09-23
Made database-caching no longer dependent on device name but on device/platfo...
Cedric Nugteren
2017-09-06
Split the database files over multiple directories and files; first step towa...
Cedric Nugteren
2017-07-08
Made the inline keyword in kernels optional currently only enabled for NVIDIA...
Cedric Nugteren
2017-05-26
Fixes inability to run GEMM on multiple identical GPUs (issue #155)
Kirill Mavreshko
2017-04-10
Removed const-vector-of-const-objects from the database class to remain accor...
Cedric Nugteren
2017-02-26
Merge branch 'development' into triangular_solvers
Cedric Nugteren
2017-02-13
Added first version of the OverrideParameters function
Cedric Nugteren
2017-02-12
Split the database into several smaller cached per-kernel databases (in prepa...
Cedric Nugteren
2017-01-24
Database: pass Device instead of Queue for clarity
Ivan Shapovalov
2017-01-24
Routine: cache the database instance as well
Ivan Shapovalov
2017-01-24
Routine, Cache: generalize, reduce amount of copying in fast path
Ivan Shapovalov
2017-01-24
Routine: fix semi-warm routine construction (when binary is in cache)
Ivan Shapovalov
2017-01-20
Routine: use PrecisionSupported<>() instead of duplicating the check
Ivan Shapovalov
2016-10-22
Routine: get rid of ::SetUp()
Ivan Shapovalov
2016-10-22
treewide: use C++ exceptions properly
Ivan Shapovalov
2016-10-14
Fixed an issue with a growing database: the database is now a global variable...
Cedric Nugteren
2016-09-21
It is now possible to set the OpenCL compiler options through an environmenta...
Cedric Nugteren
2016-07-22
clblast::Database, clblast::Routine: implement "database overlays" provided b...
Ivan Shapovalov
2016-07-06
Added a VERBOSE mode to debug performance: now prints details about compilati...
Cedric Nugteren
2016-06-19
Renamed all C++ source files to .cpp to match the .hpp extension better
Cedric Nugteren