index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
scripts
Age
Commit message (
Expand
)
Author
2017-04-02
Various tweaks to the new benchmark script
Cedric Nugteren
2017-04-01
Tuned the plots for a tight-layout for in papers and presentations
Cedric Nugteren
2017-03-26
Replaced the R graph scripts with Python/Matplotlib benchmark scripts
Cedric Nugteren
2017-03-10
Added API and test infrastructure for the batched GEMM routine
Cedric Nugteren
2017-03-08
Make batched routines based on offsets instead of a vector of cl_mem objects ...
Cedric Nugteren
2017-03-05
Added first naive version of the batched AXPY routine
Cedric Nugteren
2017-03-05
Prepared generator for batched routines; added batched AXPY routine interface
Cedric Nugteren
2017-02-26
Minor fix to the generator script
Cedric Nugteren
2017-02-26
Merge branch 'development' into triangular_solvers
Cedric Nugteren
2017-02-26
Removed half-precision support from the TRSM routine; too unstable
Cedric Nugteren
2017-02-18
Added documentation for the OverrideParameters function
Cedric Nugteren
2017-02-18
Added missing documentation for the fill and clear cache functions
Cedric Nugteren
2017-02-16
Added a C interface to the OverrideParameters function; added some in-line co...
Cedric Nugteren
2017-02-16
Added input-sanity checks for the OverrideParameters function
Cedric Nugteren
2017-02-13
Added first version of the OverrideParameters function
Cedric Nugteren
2017-02-05
Merge branch 'development' into triangular_solvers
Cedric Nugteren
2017-01-24
FillCache: perform compilation for each precision separately
Ivan Shapovalov
2017-01-20
Added prototype for the TRSV routine
Cedric Nugteren
2017-01-03
Added tuning results for the AMD Turks GPU and the Intel Core i7-2670QM CPU
Cedric Nugteren
2016-12-18
Prepared for the addition of the TRSM triangular solver kernel
Cedric Nugteren
2016-11-27
Made it possible to use the command-line environmental variables for each exe...
Cedric Nugteren
2016-11-26
Improved the default parameters for cases with non-common parameters across a...
Cedric Nugteren
2016-11-24
Merge pull request #125 from CNugteren/netlib_blas_api
Cedric Nugteren
2016-11-23
Fixed a vector-size related bug in the CLBlast Netlib API
Cedric Nugteren
2016-11-22
Minor changes to ensure full compatibility with the Netlib CBLAS API
Cedric Nugteren
2016-11-20
Made functions with scalar-buffers as output properly return values
Cedric Nugteren
2016-11-19
Generating FP16 performance graphs now uses FP32 as a reference for comparison
Cedric Nugteren
2016-10-25
Renamed the include and source files of the Netlib CBLAS API
Cedric Nugteren
2016-10-25
Removed the clblast namespace from the Netlib C API source file to ensure pro...
Cedric Nugteren
2016-10-25
Fixed some issues preventing the Netlib CBLAS API from linking correctly
Cedric Nugteren
2016-10-25
Made the Netlib CBLAS API use the same enums with prefixes as the regular C A...
Cedric Nugteren
2016-10-25
Sets the proper sizes for the buffers for the Netlib CBLAS API
Cedric Nugteren
2016-10-25
Added initial version of a Netlib CBLAS implementation. TODO: Set correct buf...
Cedric Nugteren
2016-10-25
Merge branch 'development' into netlib_blas_api
Cedric Nugteren
2016-10-22
All enums in the C API are now prefixed with CLBlast to avoid potential name ...
Cedric Nugteren
2016-10-22
Added extra error codes to reflect the more detailed error reporting of OpenC...
Cedric Nugteren
2016-10-22
Routine: get rid of ::SetUp()
Ivan Shapovalov
2016-10-22
treewide: use C++ exceptions properly
Ivan Shapovalov
2016-10-16
Merge branch 'development' into netlib_blas_api
Cedric Nugteren
2016-10-14
Fixed an issue with a growing database: the database is now a global variable...
Cedric Nugteren
2016-10-10
Changed the storage location of the database to a separate Github repository
Cedric Nugteren
2016-10-10
Added fresh performance graphs for GeForce 750Ti; removed old GTX480 results
Cedric Nugteren
2016-10-08
Added benchmark script for small matrix sizes, testing the direct GEMM kernels
Cedric Nugteren
2016-10-05
Made non-standard types void-pointers in the Netlib BLAS interface
Cedric Nugteren
2016-10-05
Added first version of Netlib BLAS API header
Cedric Nugteren
2016-09-12
Added XgemvFastRot and Xgemm 16-bit tuning results: just defaults which are n...
Cedric Nugteren
2016-09-11
Complete re-write of the database script. Changed Pandas for the much faster ...
Cedric Nugteren
2016-09-10
Updated database based on exhaustive tuning results for GEMM for the R9 M370X...
Cedric Nugteren
2016-09-10
Updated the database script to remove duplicate entries: keeps only the best-...
Cedric Nugteren
2016-09-04
Refactored the Python C++ generator script; now confirms to the PEP8 styleguide
Cedric Nugteren
[next]