Age | Commit message (Collapse) | Author |
|
(thanks to OursDesCavernes)
|
|
|
|
|
|
|
|
|
|
templated function
|
|
them directly now
|
|
class
|
|
functions in a separate file
|
|
and/or transposing
|
|
|
|
and renamed files and functions appropriately
|
|
|
|
|
|
GPUs
|
|
single-precision
|
|
|
|
|
|
|
|
kernels
|
|
|
|
HGEMV/HGBMV/HHEMV/HHBMV/HHPMV/HSYMV/HSBMV/HSPMV/HTRMV/HTBMV/HTPMV
|
|
|
|
|
|
HSWAP/HSCAL/HCOPY/HAXPY/HDOT/HNRM2/HASUM/HSUM/iHAMAX/iHMAX/iHMIN
|
|
|
|
|
|
|
|
transpose, padtranspose)
|
|
|
|
|
|
to transfer half-precision values as well
|
|
|
|
|
|
|
|
submatrices
|
|
|
|
buffersize checking
|
|
|
|
|
|
|
|
|
|
and IxAMAX
|
|
counterparts of xASUM and IxAMAX)
|
|
ClearCompiledProgramCache function to clear the cache
|
|
|
|
|
|
|
|
|
|
|