index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
scripts
Age
Commit message (
Expand
)
Author
2017-01-24
FillCache: perform compilation for each precision separately
Ivan Shapovalov
2017-01-03
Added tuning results for the AMD Turks GPU and the Intel Core i7-2670QM CPU
Cedric Nugteren
2016-11-27
Made it possible to use the command-line environmental variables for each exe...
Cedric Nugteren
2016-11-26
Improved the default parameters for cases with non-common parameters across a...
Cedric Nugteren
2016-11-24
Merge pull request #125 from CNugteren/netlib_blas_api
Cedric Nugteren
2016-11-23
Fixed a vector-size related bug in the CLBlast Netlib API
Cedric Nugteren
2016-11-22
Minor changes to ensure full compatibility with the Netlib CBLAS API
Cedric Nugteren
2016-11-20
Made functions with scalar-buffers as output properly return values
Cedric Nugteren
2016-11-19
Generating FP16 performance graphs now uses FP32 as a reference for comparison
Cedric Nugteren
2016-10-25
Renamed the include and source files of the Netlib CBLAS API
Cedric Nugteren
2016-10-25
Removed the clblast namespace from the Netlib C API source file to ensure pro...
Cedric Nugteren
2016-10-25
Fixed some issues preventing the Netlib CBLAS API from linking correctly
Cedric Nugteren
2016-10-25
Made the Netlib CBLAS API use the same enums with prefixes as the regular C A...
Cedric Nugteren
2016-10-25
Sets the proper sizes for the buffers for the Netlib CBLAS API
Cedric Nugteren
2016-10-25
Added initial version of a Netlib CBLAS implementation. TODO: Set correct buf...
Cedric Nugteren
2016-10-25
Merge branch 'development' into netlib_blas_api
Cedric Nugteren
2016-10-22
All enums in the C API are now prefixed with CLBlast to avoid potential name ...
Cedric Nugteren
2016-10-22
Added extra error codes to reflect the more detailed error reporting of OpenC...
Cedric Nugteren
2016-10-22
Routine: get rid of ::SetUp()
Ivan Shapovalov
2016-10-22
treewide: use C++ exceptions properly
Ivan Shapovalov
2016-10-16
Merge branch 'development' into netlib_blas_api
Cedric Nugteren
2016-10-14
Fixed an issue with a growing database: the database is now a global variable...
Cedric Nugteren
2016-10-10
Changed the storage location of the database to a separate Github repository
Cedric Nugteren
2016-10-10
Added fresh performance graphs for GeForce 750Ti; removed old GTX480 results
Cedric Nugteren
2016-10-08
Added benchmark script for small matrix sizes, testing the direct GEMM kernels
Cedric Nugteren
2016-10-05
Made non-standard types void-pointers in the Netlib BLAS interface
Cedric Nugteren
2016-10-05
Added first version of Netlib BLAS API header
Cedric Nugteren
2016-09-12
Added XgemvFastRot and Xgemm 16-bit tuning results: just defaults which are n...
Cedric Nugteren
2016-09-11
Complete re-write of the database script. Changed Pandas for the much faster ...
Cedric Nugteren
2016-09-10
Updated database based on exhaustive tuning results for GEMM for the R9 M370X...
Cedric Nugteren
2016-09-10
Updated the database script to remove duplicate entries: keeps only the best-...
Cedric Nugteren
2016-09-04
Refactored the Python C++ generator script; now confirms to the PEP8 styleguide
Cedric Nugteren
2016-09-03
Added tuning results for Intel Broadwell 5500 GT2 GPU
Cedric Nugteren
2016-09-03
Updated tuning results for Haswell GT2 Mobile GPU; fixed database script to h...
Cedric Nugteren
2016-08-21
Also changed the default-default for unknown device types to use the same met...
Cedric Nugteren
2016-08-21
Updated the changelog; refactored the database-get-bests code a bit
Cedric Nugteren
2016-08-15
Updated the database script to calculate the relative best performance of tun...
Cedric Nugteren
2016-08-09
Improved the speed of the new common-best defaults method for the database ge...
Cedric Nugteren
2016-08-07
Added a first version of the database's common-best default calculation
Cedric Nugteren
2016-07-25
Moved the XgemvFast and XgemvFastRot tuning database into a separate file
Cedric Nugteren
2016-07-24
Refactored the Python database script: separated functionality in modules, no...
Cedric Nugteren
2016-07-03
Added tuning results for GTX670, GTX750, and GTX1070 (thanks to gcp)
Cedric Nugteren
2016-07-02
Prints the current pandas version and reports the minimum required version
Cedric Nugteren
2016-06-30
Added declspec(dllexport) to ClearCache and FillCache, and added declspec(dll...
Cedric Nugteren
2016-06-27
Moved the performance graph scripts to the 'scripts' subfolder
Cedric Nugteren
2016-06-19
Minor fix to the database script
Cedric Nugteren
2016-06-19
Renamed all C++ source files to .cpp to match the .hpp extension better
Cedric Nugteren
2016-06-18
Moved all headers into the source tree, changed headers to .hpp extension
Cedric Nugteren
2016-06-18
Clean-up of the routine class, moved RunKernel to the routine/common file
Cedric Nugteren
2016-06-16
Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and...
Cedric Nugteren
[next]