index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
include
Age
Commit message (
Expand
)
Author
2017-05-12
Added the IxAMIN routines: absolute minimum version of IxAMAX
Cedric Nugteren
2017-04-17
Fixed a namespace clash with CUDA FP16 for the half-datatype
Cedric Nugteren
2017-04-07
Added a special override database for the Apple CPU implementation on OS X: t...
Cedric Nugteren
2017-03-10
Added API and test infrastructure for the batched GEMM routine
Cedric Nugteren
2017-03-08
Make batched routines based on offsets instead of a vector of cl_mem objects ...
Cedric Nugteren
2017-03-05
Added first naive version of the batched AXPY routine
Cedric Nugteren
2017-03-05
Prepared generator for batched routines; added batched AXPY routine interface
Cedric Nugteren
2017-02-26
Merge branch 'development' into triangular_solvers
Cedric Nugteren
2017-02-26
Removed half-precision support from the TRSM routine; too unstable
Cedric Nugteren
2017-02-18
Fixed the naming of the C API of OverrideParameters and fixed the description
Cedric Nugteren
2017-02-16
Added a C interface to the OverrideParameters function; added some in-line co...
Cedric Nugteren
2017-02-16
Added input-sanity checks for the OverrideParameters function
Cedric Nugteren
2017-02-13
Added first version of the OverrideParameters function
Cedric Nugteren
2016-11-22
Minor changes to ensure full compatibility with the Netlib CBLAS API
Cedric Nugteren
2016-11-20
Made functions with scalar-buffers as output properly return values
Cedric Nugteren
2016-10-25
Renamed the include and source files of the Netlib CBLAS API
Cedric Nugteren
2016-10-25
Fixed some issues preventing the Netlib CBLAS API from linking correctly
Cedric Nugteren
2016-10-25
Made the Netlib CBLAS API use the same enums with prefixes as the regular C A...
Cedric Nugteren
2016-10-25
Added initial version of a Netlib CBLAS implementation. TODO: Set correct buf...
Cedric Nugteren
2016-10-25
Merge branch 'development' into netlib_blas_api
Cedric Nugteren
2016-10-22
All enums in the C API are now prefixed with CLBlast to avoid potential name ...
Cedric Nugteren
2016-10-22
Added extra error codes to reflect the more detailed error reporting of OpenC...
Cedric Nugteren
2016-10-22
treewide: use C++ exceptions properly
Ivan Shapovalov
2016-10-16
Merge branch 'development' into netlib_blas_api
Cedric Nugteren
2016-10-15
Added documentation and minor refactoring for the recent support of static li...
Cedric Nugteren
2016-10-14
Fixes for static lib compilation on Windows
Shehzan Mohammed
2016-10-10
Added support for compiling the library, the client, and the samples under MS...
Cedric Nugteren
2016-10-05
Made non-standard types void-pointers in the Netlib BLAS interface
Cedric Nugteren
2016-10-05
Added first version of Netlib BLAS API header
Cedric Nugteren
2016-06-30
Added declspec(dllexport) to ClearCache and FillCache, and added declspec(dll...
Cedric Nugteren
2016-06-18
Moved all headers into the source tree, changed headers to .hpp extension
Cedric Nugteren
2016-06-18
Clean-up of the routine class, moved RunKernel to the routine/common file
Cedric Nugteren
2016-06-18
Removed the template from the Routine base-class
Cedric Nugteren
2016-06-17
Removed the precision argument from the routines in favor of a single templat...
Cedric Nugteren
2016-06-17
Removed the interface to the cache functions from the Routine class, calls th...
Cedric Nugteren
2016-06-17
Moved the RunKernel and PadCopyTransposeMatrix functions out of the Routine c...
Cedric Nugteren
2016-06-17
Moved the ErrorIn function from the Routine class to the utilities header
Cedric Nugteren
2016-06-17
Moved the test-for-valid-buffers function from the Routine class to separate ...
Cedric Nugteren
2016-06-16
Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and...
Cedric Nugteren
2016-06-15
Added some constness to variables related to the GEMM routines
Cedric Nugteren
2016-06-14
Moved device vendor and type checks to a common header
Cedric Nugteren
2016-06-08
Added global memory synchronisation for better cache performance on ARM Mali ...
Cedric Nugteren
2016-06-01
Added tuning parameters for 'GRID K520' and 'HD Graphics Skylake ULT GT2'
Cedric Nugteren
2016-05-26
Added half-precision tests for the clBLAS reference through conversion to sin...
Cedric Nugteren
2016-05-25
Added level-3 half-precision routines HGEMM/HSYMM/HSYRK/HSYR2K/HTRMM
Cedric Nugteren
2016-05-22
Added level-2 half-precision routines HGER/HSYR/HSPR/HSYR2/HSPR2
Cedric Nugteren
2016-05-22
Fixed tuning results for half-precision; added first results for the xGER ker...
Cedric Nugteren
2016-05-22
Prepared the GER kernels and tuner for half-precision support
Cedric Nugteren
2016-05-22
Added level-2 half-precision routines HGEMV/HGBMV/HHEMV/HHBMV/HHPMV/HSYMV/HSB...
Cedric Nugteren
2016-05-22
Added first tuning results for the half-precision xGEMV kernels
Cedric Nugteren
[next]