path: root/src/database
Age | Commit message | Author
2017-09-16 | Fixed a compilation error and warning under MacOS | Cedric Nugteren
2017-09-16 | Improved compilation time of the tuner database | Cedric Nugteren
2017-09-14 | Added architecture layer in the tuning database for better performance on unseen devices | Cedric Nugteren
2017-09-10 | Added the new vendor-architecture-name hierarchy to the tuners as well | Cedric Nugteren
2017-09-08 | Introduced the notion of a device-architecture for the database and added device and architecture name mappings | Cedric Nugteren
2017-09-06 | Split the database files over multiple directories and files; first step towards separate compilation | Cedric Nugteren
2017-09-04 | Removed an assumption that the 'default' tuning parameters have to be stored last; this is no longer needed | Cedric Nugteren
2017-07-30 | Added tuning results for the Qualcomm Adreno 330 GPU | Cedric Nugteren
2017-07-17 | Add new threshold for ARM | mcian
2017-06-25 | Fixed some Clang and MSVC warnings | Cedric Nugteren
2017-06-21 | Fixes some compilation issues related to the database structure change | Cedric Nugteren
2017-06-20 | Changed the structure of the database to reduce compilation time and save memory | Cedric Nugteren
2017-06-18 | Added tuning results for the Core i7-920 CPU | Cedric Nugteren
2017-06-01 | Added tuning results for GeForce GT 650M (thanks to bzcheeseman) | Cedric Nugteren
2017-05-11 | Added tuning results for the AMD Radeon Fiji GPU | Cedric Nugteren
2017-04-23 | Re-added Titan X (Pascal) tuning results based on more averaging when tuning | Cedric Nugteren
2017-04-22 | Fixed the direct vs indirect setting for NVIDIA GPUs | Cedric Nugteren
2017-04-21 | Increased the default number of runs for GEMV tuning; updated GEMV tuning results for Iris Pro | Cedric Nugteren
2017-04-20 | Tuned the direct versus indirect GEMM kernel trade-off point for NVIDIA GPUs | Cedric Nugteren
2017-04-10 | Fixed a compilation issue under MSVC and GCC | Cedric Nugteren
2017-04-10 | Removed const-vector-of-const-objects from the database class to remain according to the C++11 standard | Cedric Nugteren
2017-04-07 | Added a special override database for the Apple CPU implementation on OS X: this makes the test work, it does not focus on good performance | Cedric Nugteren
2017-03-04 | Added tuning results for the Radeon HD6750M GPU (Apple OpenCL) | Cedric Nugteren
2017-02-26 | Merge branch 'development' into triangular_solvers | Cedric Nugteren
2017-02-18 | Added tuning parameters for the AMD RX480 GPU (Ellesmere) | Cedric Nugteren
2017-02-16 | Added input-sanity checks for the OverrideParameters function | Cedric Nugteren
2017-02-12 | Split the database into several smaller cached per-kernel databases (in preparation of per-kernel database overrides) | Cedric Nugteren
2017-02-08 | Added tuning results for Titan X (Pascal version) | Cedric Nugteren
2017-02-05 | Merge branch 'development' into triangular_solvers | Cedric Nugteren
2017-02-04 | Completed a first STRSV implementation | Cedric Nugteren
2017-01-24 | Database: pass Device instead of Queue for clarity | Ivan Shapovalov
2017-01-24 | Database: ref-count the internal map for caching | Ivan Shapovalov
2017-01-19 | Added tuning results for NVIDIA GTX 1080 and Intel Core i7-4790K | Cedric Nugteren
2017-01-15 | Added a first version of the diagonal block invert routine in preparation of TRSM | Cedric Nugteren
2017-01-03 | Added tuning results for the AMD Turks GPU and the Intel Core i7-2670QM CPU | Cedric Nugteren
2016-11-29 | Made Intel GPUs always use the indirect version of the GEMM kernel | Cedric Nugteren
2016-11-26 | Improved the default parameters for cases with non-common parameters across all devices | Cedric Nugteren
2016-11-19 | Changed the GEMM kernel selection parameters for Skylake GPUs to always favour the regular kernel | Cedric Nugteren
2016-11-15 | Updated the tuning results for the Intel Skylake ULT GT2 GPU | Cedric Nugteren
2016-10-24 | Added tuning results for GeForce GTX TITAN Black | Cedric Nugteren
2016-10-22 | Added tuning results for the AMD Tonga GPU | Cedric Nugteren
2016-10-22 | Moved files around a bit; created a utilities subfolder | Cedric Nugteren
2016-10-22 | treewide: use C++ exceptions properly | Ivan Shapovalov
    Since the codebase is designed around proper C++ idioms such as RAII, it makes sense to only use C++ exceptions internally instead of mixing exceptions and error codes. The exceptions are now caught at top level to preserve compatibility with the existing error code-based API. Note that we deliberately do not catch C++ runtime errors (such as `std::bad_alloc`) nor logic errors (aka failed assertions) because no actual handling can ever happen for such errors. However, in the C interface we do catch _all_ exceptions (...) and convert them into a wild-card error code.
2016-10-14 | Fixed an issue with a growing database: the database is now a global variable in a namespace and its container uses const-pointers to the actual data | Cedric Nugteren
2016-10-13 | Added tuning results for Intel HD Graphics IvyBridge GPU | Cedric Nugteren
2016-10-10 | Fixed an issue with const members of structs in the database | Cedric Nugteren
2016-10-10 | First fixes towards compilation on Visual Studio 2013 | Cedric Nugteren
2016-10-10 | Updated the tuning results for the GTX 750 Ti GPU | Cedric Nugteren
2016-10-10 | Changed the thresholds for the direct/indirect GEMM kernels for NVIDIA and Intel GPUs | Cedric Nugteren
2016-10-08 | Fixed a performance bug for Intel Iris Pro GPUs due to incorrect tuning results | Cedric Nugteren
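
The body of the 2016-10-22 commit "treewide: use C++ exceptions properly" describes a boundary pattern: internal code only throws, the C++ API converts library exceptions into error codes, and the C interface additionally catches everything. The following is a minimal sketch of that pattern, not the code from the commit. All names (`StatusCode`, `LibraryError`, `ScaleInternal`, `Scale`, `CScale`) and the numeric codes are hypothetical and chosen only for illustration.

```cpp
#include <exception>
#include <new>
#include <string>
#include <utility>

// Hypothetical status codes (assumption: not the library's real enum or values).
enum class StatusCode { kSuccess = 0, kInvalidArgument = -1, kUnexpectedError = -2 };

// Hypothetical library exception that carries the status code to report.
class LibraryError : public std::exception {
 public:
  LibraryError(StatusCode status, std::string message)
      : status_(status), message_(std::move(message)) {}
  StatusCode status() const noexcept { return status_; }
  const char* what() const noexcept override { return message_.c_str(); }
 private:
  StatusCode status_;
  std::string message_;
};

// Internal routine: reports failure only by throwing, never by return code.
void ScaleInternal(int n) {
  if (n < 0) { throw LibraryError(StatusCode::kInvalidArgument, "n must be non-negative"); }
  if (n == 0) { throw std::bad_alloc(); }  // stand-in for an error the library cannot handle
}

// C++ API boundary: converts library exceptions into status codes.
// Other exceptions (such as std::bad_alloc) are deliberately left to propagate.
StatusCode Scale(int n) {
  try {
    ScaleInternal(n);
    return StatusCode::kSuccess;
  } catch (const LibraryError& e) {
    return e.status();
  }
}

// C API boundary: no exception may cross into C callers, so anything that is
// not a library exception is mapped to a single wild-card error code.
extern "C" int CScale(int n) {
  try {
    ScaleInternal(n);
    return static_cast<int>(StatusCode::kSuccess);
  } catch (const LibraryError& e) {
    return static_cast<int>(e.status());
  } catch (...) {
    return static_cast<int>(StatusCode::kUnexpectedError);
  }
}
```

In this sketch, a C++ caller of `Scale(0)` sees the `std::bad_alloc` itself, while a C caller of `CScale(0)` receives the wild-card code, which mirrors the distinction the commit message draws between the two interfaces.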