index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
doc
Age
Commit message (
Collapse
)
Author
2018-09-07
Added xCONVGEMM as im2col plus a batched GEMM kernel
Cedric Nugteren
2018-07-29
Removed complex numbers support for CONVGEMM
Cedric Nugteren
2018-07-29
Merge branch 'master' into CLBlast-267-convgemm
Cedric Nugteren
2018-07-14
Updated to CLBlast version 1.4.1
Cedric Nugteren
2018-07-13
Added tuning results for Intel i5-4970S
Cedric Nugteren
2018-07-13
Added tuning results for GeForce GTX 1070 Ti
Cedric Nugteren
2018-07-13
Added tuning results for HD Graphics 6000 Broadwell GT3
Cedric Nugteren
2018-06-03
Merge branch 'master' into CLBlast-267-convgemm
Cedric Nugteren
2018-06-03
Added list of tuners to be run by 'alltuners' target
Cedric Nugteren
2018-05-19
Merge branch 'master' into CLBlast-267-convgemm
Cedric Nugteren
2018-05-19
The GEMM routine tuner now loads kernel JSON tuning results from disk if ↵
Cedric Nugteren
available; now run part of alltuners target
2018-05-17
Added documentation on some details of the GEMM implementation
Cedric Nugteren
2018-05-09
Updated the documentation for convgemm to include data layout (NCHW)
Cedric Nugteren
2018-05-06
Added convgemm skeleton, test infrastructure, and first reference implementation
Cedric Nugteren
2018-05-05
Added interface of batched convolution as GEMM
Cedric Nugteren
2018-04-07
Added tuning results for NVIDIA GeForce 970
Cedric Nugteren
2018-04-07
Added tuning results for NVIDIA GeForce 920MX
Cedric Nugteren
2018-03-30
Updated the roadmap
Cedric Nugteren
2018-03-11
Merge pull request #262 from CNugteren/CLBlast-237-tuning-api
Cedric Nugteren
CLBlast #237: Tuning API
2018-03-10
Added initial glossary
Cedric Nugteren
2018-03-10
Updated the documentation for the tuner API
Cedric Nugteren
2018-03-03
Updated documentation and build badges
Cedric Nugteren
2018-03-03
Fixed some small issues regarding PR#253
Cedric Nugteren
2018-03-03
Added C API for getting GEMM temp buffer size
sivagnanamn
2018-02-26
Added a note on preventing segfaults with OpenCL using the AMD APP SDK
Cedric Nugteren
2018-02-25
Fixed Ubuntu PPA package name
Cedric Nugteren
2018-02-25
Added API documentation for two missing C++ functions
Cedric Nugteren
2018-02-24
Split the documentation and updated where needed
Cedric Nugteren
2018-02-24
Renamed the API documentation
Cedric Nugteren
2018-02-20
Fix of multiple duplicates in documentation
Kirill Mavreshko
2018-02-02
Fixed the XHAD documentation
Cedric Nugteren
2018-01-31
Created the API and stubs for the HAD (hadamard-product) routines
Cedric Nugteren
2018-01-07
Added API and tests for new GemmStridedBatched routine
Cedric Nugteren
2018-01-04
Updated the generator script to automatically generate the temp-buffer code
Cedric Nugteren
2017-07-02
Added interface and stubs for the im2col routine
Cedric Nugteren
2017-05-12
Added the IxAMIN routines: absolute minimum version of IxAMAX
Cedric Nugteren
2017-05-12
Removed the included performance reports; README now redirects to the new ↵
Cedric Nugteren
external website
2017-03-10
Added API and test infrastructure for the batched GEMM routine
Cedric Nugteren
2017-03-08
Make batched routines based on offsets instead of a vector of cl_mem objects ↵
Cedric Nugteren
- undoing many earlier changes
2017-03-05
Added first naive version of the batched AXPY routine
Cedric Nugteren
2017-03-05
Prepared generator for batched routines; added batched AXPY routine interface
Cedric Nugteren
2017-02-26
Merge branch 'development' into triangular_solvers
Cedric Nugteren
2017-02-26
Removed half-precision support from the TRSM routine; too unstable
Cedric Nugteren
2017-02-18
Added documentation for the OverrideParameters function
Cedric Nugteren
2017-02-18
Added missing documentation for the fill and clear cache functions
Cedric Nugteren
2017-01-20
Added prototype for the TRSV routine
Cedric Nugteren
2016-12-18
Prepared for the addition of the TRSM triangular solver kernel
Cedric Nugteren
2016-11-20
Added performance results for the Skylake ULT GT2 GPU
Cedric Nugteren
2016-10-22
All enums in the C API are now prefixed with CLBlast to avoid potential name ↵
Cedric Nugteren
clashes with other projects
2016-10-10
Updated the performance graphs for Intel Iris Pro GPU and AMD Radeon M370X GPU
Cedric Nugteren
[next]