index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
scripts
/
generator
Age
Commit message (
Collapse
)
Author
2018-09-16
Merge branch 'master' into convgemm_multi_kernel
Cedric Nugteren
2018-08-05
Added an option to compile the Netlib API with static OpenCL device and context
Cedric Nugteren
2018-07-29
Removed complex numbers support for CONVGEMM
Cedric Nugteren
2018-07-29
Merge branch 'master' into CLBlast-267-convgemm
Cedric Nugteren
2018-07-13
Added tuning results for HD Graphics 6000 Broadwell GT3
Cedric Nugteren
2018-05-09
Updated the documentation for convgemm to include data layout (NCHW)
Cedric Nugteren
2018-05-06
Added convgemm skeleton, test infrastructure, and first reference implementation
Cedric Nugteren
2018-05-05
Added interface of batched convolution as GEMM
Cedric Nugteren
2018-03-27
merged
kodonell
2018-03-27
got the generator thing working
kodonell
2018-03-10
Updated the documentation for the tuner API
Cedric Nugteren
2018-03-10
Fixed a few things for the new tuning API
Cedric Nugteren
2018-03-03
Fixed some small issues regarding PR#253
Cedric Nugteren
2018-03-03
Added C API for getting GEMM temp buffer size
sivagnanamn
2018-02-25
Generated PyCLBlast docstrings
Cedric Nugteren
2018-02-25
Some style improvements in the pyclblast code generator
Cedric Nugteren
2018-02-25
Added API documentation for two missing C++ functions
Cedric Nugteren
2018-02-24
Renamed the API documentation
Cedric Nugteren
2018-02-21
Fixed duplication of parameter descriptions by the doc generator
Kirill Mavreshko
2018-02-18
Prepared PyCLBlast for release as a package on PyPi
Cedric Nugteren
2018-02-18
Added all other level 1/2/3 routines to pyclblast
Cedric Nugteren
2018-02-18
Added GEMM to the Python wrapper
Cedric Nugteren
2018-02-14
First agenerated version (clblastXswap only for now) of the pyclblast wrapper
Cedric Nugteren
2018-02-02
Fixed the XHAD documentation
Cedric Nugteren
2018-01-31
Created the API and stubs for the HAD (hadamard-product) routines
Cedric Nugteren
2018-01-11
Added a RetrieveParameters function to inspect tuning parameters
Cedric Nugteren
2018-01-07
Added API and tests for new GemmStridedBatched routine
Cedric Nugteren
2018-01-06
Fixed a minor nullptr related issue in the code generator
Cedric Nugteren
2018-01-06
Added CUDA interface to get temporary-buffer size for GEMM routine
Cedric Nugteren
2018-01-04
Added a CUDA version of the GEMM temp-buffer optional argument
Cedric Nugteren
2018-01-04
Updated the generator script to automatically generate the temp-buffer code
Cedric Nugteren
2017-12-28
Added interface to compute the required temporary buffer size for GEMM
Cedric Nugteren
2017-10-14
Various fixes to make the host code and sample compile with the CUDA API
Cedric Nugteren
2017-10-12
CUDA API now takes context and device in instead of stream
Cedric Nugteren
2017-10-11
Added first (untested) version of a CUDA API
Cedric Nugteren
2017-10-09
Fixed the Python generator script w.r.t. the recent change of testing ↵
Cedric Nugteren
direct/in-direct GEMM kernels separately
2017-10-08
Moved non-routine-specific API functions and includes to separate files
Cedric Nugteren
2017-07-02
Added interface and stubs for the im2col routine
Cedric Nugteren
2017-06-25
Fixed some Clang and MSVC warnings
Cedric Nugteren
2017-06-21
Fixes some compilation issues related to the database structure change
Cedric Nugteren
2017-05-12
Added the IxAMIN routines: absolute minimum version of IxAMAX
Cedric Nugteren
2017-04-17
Fixed a namespace clash with CUDA FP16 for the half-datatype
Cedric Nugteren
2017-04-13
Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now ↵
Cedric Nugteren
works
2017-04-11
Made compilation of the cuBLAS wrapper work properly
Cedric Nugteren
2017-04-10
Merge branch 'development' into cublas_reference
Cedric Nugteren
Conflicts: scripts/generator/generator.py
2017-04-10
Removed const-vector-of-const-objects from the database class to remain ↵
Cedric Nugteren
according to the C++11 standard
2017-04-06
Completed the cuBLAS wrapper
Cedric Nugteren
2017-04-05
Added a first version of a cuBLAS wrapper (WIP)
Cedric Nugteren
2017-04-03
In-lined the float2 and double2 types to avoid collision with CUDA's definitions
Cedric Nugteren
2017-03-10
Added API and test infrastructure for the batched GEMM routine
Cedric Nugteren
[next]