summaryrefslogtreecommitdiff
path: root/src/database
AgeCommit message (Collapse)Author
2017-04-22Fixed the direct vs indirect setting for NVIDIA GPUsCedric Nugteren
2017-04-21Increased the default number of runs for GEMV tuning; updated GEMV tuning ↵Cedric Nugteren
results for Iris Pro
2017-04-20Tuned the direct versus indirect GEMM kernel trade-off point for NVIDIA GPUsCedric Nugteren
2017-04-10Fixed a compilation issue under MSVC and GCCCedric Nugteren
2017-04-10Removed const-vector-of-const-objects from the database class to remain ↵Cedric Nugteren
according to the C++11 standard
2017-04-07Added a special override database for the Apple CPU implementation on OS X: ↵Cedric Nugteren
this makes the test work, it does not focus on good performance
2017-03-04Added tuning results for the Radeon HD6750M GPU (Apple OpenCL)Cedric Nugteren
2017-02-26Merge branch 'development' into triangular_solversCedric Nugteren
2017-02-18Added tuning parameters for the AMD RX480 GPU (Ellesmere)Cedric Nugteren
2017-02-16Added input-sanity checks for the OverrideParameters functionCedric Nugteren
2017-02-12Split the database into several smaller cached per-kernel databases (in ↵Cedric Nugteren
preparation of per-kernel database overrides)
2017-02-08Added tuning results for Titan X (Pascal version)Cedric Nugteren
2017-02-05Merge branch 'development' into triangular_solversCedric Nugteren
2017-02-04Completed a first STRSV implementationCedric Nugteren
2017-01-24Database: pass Device instead of Queue for clarityIvan Shapovalov
2017-01-24Database: ref-count the internal map for cachingIvan Shapovalov
2017-01-19Added tuning results for NVIDIA GTX 1080 and Intel Core i7-4790KCedric Nugteren
2017-01-15Added a first version of the diagonal block invert routine in preparation of ↵Cedric Nugteren
TRSM
2017-01-03Added tuning results for the AMD Turks GPU and the Intel Core i7-2670QM CPUCedric Nugteren
2016-11-29Made Intel GPUs always use the indirect version of the GEMM kernelCedric Nugteren
2016-11-26Improved the default parameters for cases with non-common parameters across ↵Cedric Nugteren
all devices
2016-11-19Changed the GEMM kernel selection parameters for Skylake GPUs to always ↵Cedric Nugteren
favour the regular kernel
2016-11-15Updated the tuning results for the Intel Skylake ULT GT2 GPUCedric Nugteren
2016-10-24Added tuning results for GeForce GTX TITAN BlackCedric Nugteren
2016-10-22Added tuning results for the AMD Tonga GPUCedric Nugteren
2016-10-22Moved files around a bit; created a utilities subfolderCedric Nugteren
2016-10-22treewide: use C++ exceptions properlyIvan Shapovalov
Since the codebase is designed around proper C++ idioms such as RAII, it makes sense to only use C++ exceptions internally instead of mixing exceptions and error codes. The exceptions are now caught at top level to preserve compatibility with the existing error code-based API. Note that we deliberately do not catch C++ runtime errors (such as `std::bad_alloc`) nor logic errors (aka failed assertions) because no actual handling can ever happen for such errors. However, in the C interface we do catch _all_ exceptions (...) and convert them into a wild-card error code.
2016-10-14Fixed an issue with a growing database: the database is now a global ↵Cedric Nugteren
variable in a namespace and its container uses const-pointers to the actual data
2016-10-13Added tuning results for Intel HD Graphics IvyBridge GPUCedric Nugteren
2016-10-10Fixed an issue with const members of structs in the databaseCedric Nugteren
2016-10-10First fixes towards compilation on Visual Studio 2013Cedric Nugteren
2016-10-10Updated the tuning results for the GTX 750 Ti GPUCedric Nugteren
2016-10-10Changed the thresholds for the direct/indirect GEMM kernels for NVIDIA and ↵Cedric Nugteren
Intel GPUs
2016-10-08Fixed a performance bug for Intel Iris Pro GPUs due to incorrect tuning resultsCedric Nugteren
2016-10-06Added first tuning results for the single-kernel direct GEMM implementationCedric Nugteren
2016-10-06Added a kernel selection database to select between the direct and indirect ↵Cedric Nugteren
GEMM kernels
2016-10-01Added padding to the local memory of the GEMM direct kernelCedric Nugteren
2016-09-25Added a first version of a tuner for the GEMM direct kernel; collapsed MWGD, ↵Cedric Nugteren
NWGD and KWGD into one WGD parameter
2016-09-25Separated the tuning parameters of the new direct GEMM kernel from the ↵Cedric Nugteren
indirect version
2016-09-12Added XgemvFastRot and Xgemm 16-bit tuning results: just defaults which are ↵Cedric Nugteren
now automatically taken from 32-bit if there are no entries at all
2016-09-11Complete re-write of the database script. Changed Pandas for the much faster ↵Cedric Nugteren
and convienient plain JSON/dict data-type
2016-09-10Updated database based on exhaustive tuning results for GEMM for the R9 ↵Cedric Nugteren
M370X GPU
2016-09-10Updated the database script to remove duplicate entries: keeps only the ↵Cedric Nugteren
best-performing cases for a specific parameters combination
2016-09-03Added tuning results for Intel Broadwell 5500 GT2 GPUCedric Nugteren
2016-09-03Updated tuning results for Haswell GT2 Mobile GPU; fixed database script to ↵Cedric Nugteren
handle duplicate entries of different runs
2016-08-21Also changed the default-default for unknown device types to use the same ↵Cedric Nugteren
method as for known device groups
2016-08-15Updated the database script to calculate the relative best performance of ↵Cedric Nugteren
tuning results common for a device/vendor type
2016-07-25Removed all old tuning results for the XgemvFastRot kernel; re-added for a ↵Cedric Nugteren
couple of devices
2016-07-25Moved the XgemvFast and XgemvFastRot tuning database into a separate fileCedric Nugteren
2016-07-24Merge branch 'development' into gemv_performanceCedric Nugteren