index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
/
kernels
/
level1
Age
Commit message (
Expand
)
Author
2018-02-02
Implemented the XHAD Hadamard product routine
Cedric Nugteren
2017-12-09
Completed kernel modifications for pre-processor of all other kernels
Cedric Nugteren
2017-12-03
Added GEMM (direct and in-direct) to the pre-processor testing; modified the ...
Cedric Nugteren
2017-11-29
Reformatted unrollable kernel loops and added the new promote_to_registers pr...
Cedric Nugteren
2017-11-25
Implemented first simple pre-processor: defines parser and loop unrolling bas...
Cedric Nugteren
2017-07-08
Made the inline keyword in kernels optional currently only enabled for NVIDIA...
Cedric Nugteren
2017-05-12
Added the IxAMIN routines: absolute minimum version of IxAMAX
Cedric Nugteren
2017-04-14
Added a new Xaxpy kernel in between the regular and fast version in
Cedric Nugteren
2017-03-10
Added proper testing of the alpha parameter; finalized the batched AXPY imple...
Cedric Nugteren
2017-03-08
Implemented a batched version of the AXPY kernel
Cedric Nugteren
2017-03-08
Make batched routines based on offsets instead of a vector of cl_mem objects ...
Cedric Nugteren
2016-08-20
Merge branch 'master' of https://github.com/dvasschemacq/CLBlast into dvassch...
Cedric Nugteren
2016-08-18
Adapt opencl files for 1.1 OpenCL
D. Van Assche
2016-07-10
Now passing alpha/beta to the kernel as arguments as before fp16 support; in ...
Cedric Nugteren
2016-05-14
Set kernel arguments for AXPY as constant memory buffers, making it possible ...
Cedric Nugteren
2016-05-13
Initial experimental version of the half-precision HAXPY routine
Cedric Nugteren
2016-05-08
Fixed errors in xAXPY and xSCAL tests on AMD hardware
cnugteren
2016-04-30
Added non-aboslute minimum counter-part IxMIN of the BLAS routine IxAMAX
Cedric Nugteren
2016-04-27
Added non-absolute counter-parts xSUM and IxMAX of the BLAS routines xASUM an...
Cedric Nugteren
2016-04-20
Added support for the iSAMAX/iDAMAX/iCAMAX/iZAMAX routines
cnugteren
2016-04-14
Added support for the SASUM/DASUM/ScASUM/DzASUM routines
cnugteren
2016-03-30
Fixed the nrm2 kernel for complex data-types
cnugteren
2016-03-28
Added preliminary support for the xNRM2 routines
Cedric Nugteren
2015-09-14
Added xDOT/xDOTU/xDOTC dot-product routines
CNugteren
2015-08-22
Added the XSWAP, XSCAL and XCOPY level-1 routines
CNugteren
2015-08-22
Re-organized level1 xaxpy kernel
CNugteren