index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
Age
Commit message (
Expand
)
Author
2016-05-02
Fixed the calculation of the required buffer sizes in case of subvectors and ...
Cedric Nugteren
2016-05-01
Made the default xDOT tuning size smaller
Cedric Nugteren
2016-05-01
Changed the index buffer of IxAMAX routines to unsigned int for proper buffer...
Cedric Nugteren
2016-05-01
Added a program cache (per-context) next to the per-device binary cache
Cedric Nugteren
2016-04-30
Added non-aboslute minimum counter-part IxMIN of the BLAS routine IxAMAX
Cedric Nugteren
2016-04-29
Added FillCache: a function to pre-compile all kernels for a specific device
Cedric Nugteren
2016-04-28
Fixed the cache to store binaries instead of OpenCL programs
Cedric Nugteren
2016-04-27
Added non-absolute counter-parts xSUM and IxMAX of the BLAS routines xASUM an...
Cedric Nugteren
2016-04-27
Added prototypes for non-BLAS routines: xSUM and IxMAX (non-absolute counterp...
Cedric Nugteren
2016-04-27
Moved all cache-related functions to a separate file; added a ClearCompiledPr...
Cedric Nugteren
2016-04-20
Added support for the iSAMAX/iDAMAX/iCAMAX/iZAMAX routines
cnugteren
2016-04-20
Added prototype for ixAMAX routines
cnugteren
2016-04-14
Updated the reduction-kernel tuner to also tune the epilogue
cnugteren
2016-04-14
Added support for the SASUM/DASUM/ScASUM/DzASUM routines
cnugteren
2016-04-13
Added prototype for xASUM routines
cnugteren
2016-04-09
Events are now properly implemented using event waiting list and asking the u...
cnugteren
2016-04-04
Removed redundant queue synchronisation statements
cnugteren
2016-04-01
Added a wrapper for CBLAS libraries for performance/correctness testing
cnugteren
2016-03-30
Merge branch 'level1_routines' into development
cnugteren
2016-03-30
Fixed the nrm2 kernel for complex data-types
cnugteren
2016-03-30
Added prototypes for the xROTM and xROTMG routines
Cedric Nugteren
2016-03-30
Added prototypes for the xROT and xROTG functions
Cedric Nugteren
2016-03-30
Fixed properly passing of OpenCL events to CLBlast functions
Cedric Nugteren
2016-03-28
Added preliminary support for the xNRM2 routines
Cedric Nugteren
2016-03-25
Added prototypes for ScNRM2/DzNRM2 routines
Cedric Nugteren
2016-03-25
Added prototypes for SNRM2/DNRM2 routines
Cedric Nugteren
2016-03-23
Fixed the C-api export to be able to properly build a DLL on Windows
Cedric Nugteren
2016-03-19
Added __declspec(dllexport) to create a DLL on Windows
Cedric Nugteren
2016-03-14
Made the library thread-safe by guarding the kernel cache with a mutex
Cedric Nugteren
2016-03-06
Fixed a bug in the GER-family of routines due to incorrect division of the wo...
Cedric Nugteren
2016-03-06
Added preliminary support for xHPR2 and xSPR2 routines
Cedric Nugteren
2016-03-02
Added preliminary support for xHER2 and xSYR2 routines
Cedric Nugteren
2016-02-28
Fixed a couple of correctness bugs in the Xher kernels
Cedric Nugteren
2016-02-28
Added support for xHER, xHPR, xSYR, and xSPR routines
Cedric Nugteren
2016-02-20
Set a proper default precision for the CLBlast clients
Cedric Nugteren
2016-02-20
Added support for xGERU and xGERC routines
Cedric Nugteren
2016-02-20
Added XGER routine, kernel, and tuner
Cedric Nugteren
2016-02-08
Separated the GEMM kernel in two parts to reduce string length for MSVC
Cedric Nugteren
2016-02-08
Split-up the XGEMV kernel in two parts
Cedric Nugteren
2016-02-07
Added dictionary with short and long OpenCL vendor names to fix issues with I...
Cedric Nugteren
2016-02-06
Reduced the maximum workgroup-size for GEMV kernels further
CNugteren
2016-02-06
Reduced unrolling factor in xgemv kernel to reduce compilation times
CNugteren
2016-01-30
Fixes for compilation under Visual Studio
CNugteren
2016-01-30
Added first auto-generated database headers from the Python database; only K4...
Cedric Nugteren
2015-10-28
Now sets local memory size in xgemv tuner properly
CNugteren
2015-10-25
Fixed an arguments-related bug in the GEMV tuner
CNugteren
2015-10-25
Moved the tuner database script to a separate folder
CNugteren
2015-10-13
Added guards for routine-specific level-3 pad kernels
CNugteren
2015-10-12
Routine names are now all default arguments defined in the header
CNugteren
2015-10-12
Moved level3 kernel files to a subfolder
CNugteren
[next]