debian-clblast
Debian package for CLBlast.
gspr@nonempty.org
path: root/doc
Age
Commit message
Author
2018-06-03
Added list of tuners to be run by 'alltuners' target
Cedric Nugteren
2018-05-19
The GEMM routine tuner now loads kernel JSON tuning results from disk if avai...
Cedric Nugteren
2018-05-17
Added documentation on some details of the GEMM implementation
Cedric Nugteren
2018-04-07
Added tuning results for NVIDIA GeForce 970
Cedric Nugteren
2018-04-07
Added tuning results for NVIDIA GeForce 920MX
Cedric Nugteren
2018-03-30
Updated the roadmap
Cedric Nugteren
2018-03-11
Merge pull request #262 from CNugteren/CLBlast-237-tuning-api
Cedric Nugteren
2018-03-10
Added initial glossary
Cedric Nugteren
2018-03-10
Updated the documentation for the tuner API
Cedric Nugteren
2018-03-03
Updated documentation and build badges
Cedric Nugteren
2018-03-03
Fixed some small issues regarding PR#253
Cedric Nugteren
2018-03-03
Added C API for getting GEMM temp buffer size
sivagnanamn
2018-02-26
Added a note on preventing segfaults with OpenCL using the AMD APP SDK
Cedric Nugteren
2018-02-25
Fixed Ubuntu PPA package name
Cedric Nugteren
2018-02-25
Added API documentation for two missing C++ functions
Cedric Nugteren
2018-02-24
Split the documentation and updated where needed
Cedric Nugteren
2018-02-24
Renamed the API documentation
Cedric Nugteren
2018-02-20
Fix of multiple duplicates in documentation
Kirill Mavreshko
2018-02-02
Fixed the XHAD documentation
Cedric Nugteren
2018-01-31
Created the API and stubs for the HAD (hadamard-product) routines
Cedric Nugteren
2018-01-07
Added API and tests for new GemmStridedBatched routine
Cedric Nugteren
2018-01-04
Updated the generator script to automatically generate the temp-buffer code
Cedric Nugteren
2017-07-02
Added interface and stubs for the im2col routine
Cedric Nugteren
2017-05-12
Added the IxAMIN routines: absolute minimum version of IxAMAX
Cedric Nugteren
2017-05-12
Removed the included performance reports; README now redirects to the new ext...
Cedric Nugteren
2017-03-10
Added API and test infrastructure for the batched GEMM routine
Cedric Nugteren
2017-03-08
Make batched routines based on offsets instead of a vector of cl_mem objects ...
Cedric Nugteren
2017-03-05
Added first naive version of the batched AXPY routine
Cedric Nugteren
2017-03-05
Prepared generator for batched routines; added batched AXPY routine interface
Cedric Nugteren
2017-02-26
Merge branch 'development' into triangular_solvers
Cedric Nugteren
2017-02-26
Removed half-precision support from the TRSM routine; too unstable
Cedric Nugteren
2017-02-18
Added documentation for the OverrideParameters function
Cedric Nugteren
2017-02-18
Added missing documentation for the fill and clear cache functions
Cedric Nugteren
2017-01-20
Added prototype for the TRSV routine
Cedric Nugteren
2016-12-18
Prepared for the addition of the TRSM triangular solver kernel
Cedric Nugteren
2016-11-20
Added performance results for the Skylake ULT GT2 GPU
Cedric Nugteren
2016-10-22
All enums in the C API are now prefixed with CLBlast to avoid potential name ...
Cedric Nugteren
2016-10-10
Updated the performance graphs for Intel Iris Pro GPU and AMD Radeon M370X GPU
Cedric Nugteren
2016-10-10
Added fresh performance graphs for GeForce 750Ti; removed old GTX480 results
Cedric Nugteren
2016-06-16
Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and...
Cedric Nugteren
2016-06-13
Improved API documentation and added documentation for level-2 and level-3 ro...
Cedric Nugteren
2016-06-10
Added documentation for the matrix-update level-2 family of routines
Cedric Nugteren
2016-05-25
Added level-3 half-precision routines HGEMM/HSYMM/HSYRK/HSYR2K/HTRMM
Cedric Nugteren
2016-05-22
Added level-2 half-precision routines HGER/HSYR/HSPR/HSYR2/HSPR2
Cedric Nugteren
2016-05-22
Added level-2 half-precision routines HGEMV/HGBMV/HHEMV/HHBMV/HHPMV/HSYMV/HSB...
Cedric Nugteren
2016-05-22
Added level-1 half-precision routines HSWAP/HSCAL/HCOPY/HAXPY/HDOT/HNRM2/HASU...
Cedric Nugteren
2016-05-18
Merged in latest changes from 0.7.1 release
Cedric Nugteren
2016-05-13
Initial experimental version of the half-precision HAXPY routine
Cedric Nugteren
2016-05-08
Added preliminary generated API documentation
Cedric Nugteren
2016-03-12
Added performance graphs for Intel Iris and Radeon M370X
Cedric Nugteren