index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
Age
Commit message (
Expand
)
Author
2016-03-06
Added preliminary support for xHPR2 and xSPR2 routines
Cedric Nugteren
2016-03-02
Added preliminary support for xHER2 and xSYR2 routines
Cedric Nugteren
2016-02-28
Fixed a couple of correctness bugs in the Xher kernels
Cedric Nugteren
2016-02-28
Added support for xHER, xHPR, xSYR, and xSPR routines
Cedric Nugteren
2016-02-20
Set a proper default precision for the CLBlast clients
Cedric Nugteren
2016-02-20
Added support for xGERU and xGERC routines
Cedric Nugteren
2016-02-20
Added XGER routine, kernel, and tuner
Cedric Nugteren
2016-02-08
Separated the GEMM kernel in two parts to reduce string length for MSVC
Cedric Nugteren
2016-02-08
Split-up the XGEMV kernel in two parts
Cedric Nugteren
2016-02-07
Added dictionary with short and long OpenCL vendor names to fix issues with I...
Cedric Nugteren
2016-02-06
Reduced the maximum workgroup-size for GEMV kernels further
CNugteren
2016-02-06
Reduced unrolling factor in xgemv kernel to reduce compilation times
CNugteren
2016-01-30
Fixes for compilation under Visual Studio
CNugteren
2016-01-30
Added first auto-generated database headers from the Python database; only K4...
Cedric Nugteren
2015-10-28
Now sets local memory size in xgemv tuner properly
CNugteren
2015-10-25
Fixed an arguments-related bug in the GEMV tuner
CNugteren
2015-10-25
Moved the tuner database script to a separate folder
CNugteren
2015-10-13
Added guards for routine-specific level-3 pad kernels
CNugteren
2015-10-12
Routine names are now all default arguments defined in the header
CNugteren
2015-10-12
Moved level3 kernel files to a subfolder
CNugteren
2015-09-26
Added TRMV/TBMV/TPMV routines
CNugteren
2015-09-19
Added SBMV and SPMV routines
CNugteren
2015-09-19
Added the HPMV routine
CNugteren
2015-09-19
Added infrastructure for packed matrices
CNugteren
2015-09-19
Added the HBMV routine
CNugteren
2015-09-18
Improved the organization and performance of level 2 routines
CNugteren
2015-09-18
Added first version of banded matrix-vector multiplication
CNugteren
2015-09-17
Added interface of all level 2 routines
CNugteren
2015-09-17
Added script to generate API interface and implementation automatically
CNugteren
2015-09-14
Added xDOT/xDOTU/xDOTC dot-product routines
CNugteren
2015-09-14
Added extra temporary buffer to tuners in preparation of Xdot routines
CNugteren
2015-09-14
Added support for the dot buffer and offset argument
CNugteren
2015-08-22
Added the XSWAP, XSCAL and XCOPY level-1 routines
CNugteren
2015-08-22
Re-organized level1 xaxpy kernel
CNugteren
2015-08-20
Merge pull request #23 from CNugteren/tuner_database
Cedric Nugteren
2015-08-20
Added initial version of tuner-database Python script
CNugteren
2015-08-19
Moved precision tester to utilities
CNugteren
2015-08-19
Added hotfix 8eeb7f721ff8811521147cfe5ae9796164286b77
CNugteren
2015-08-13
Merge pull request #21 from CNugteren/c_api
Cedric Nugteren
2015-08-13
Added all supported routines to the C API
CNugteren
2015-08-13
Fixed a complex data-type bug in the transpose kernel
CNugteren
2015-08-13
Added initial version of C API with just one routine
CNugteren
2015-08-09
Refactored the tuners, added JSON output
CNugteren
2015-08-04
Added distinguished names for GEMV inherited HEMV/SYMV
CNugteren
2015-08-03
Abstracted loading of matrix A for GEMV kernel
CNugteren
2015-07-31
Added HEMV routine
CNugteren
2015-07-31
Added SYMV routine
CNugteren
2015-07-27
Now using the new Claduc C++11 OpenCL header
CNugteren
2015-07-22
Added workgroup shuffle option to transpose kernel for AMD GPUs
CNugteren
2015-07-21
Transpose kernel now uses vectorized local memory loads and stores
CNugteren
[next]