debian-clblast
Debian package for CLBlast.
gspr@nonempty.org
path: root/include
Age
Commit message
Author
2016-05-02
Added tuning results for AMD Hawaii (R9 290X)
Cedric Nugteren
2016-05-01
Added tuning results for AMD Pitcairn (R9 270X)
Cedric Nugteren
2016-05-01
Updated tuning database for reduction/dot kernels based on the new tuner; par...
Cedric Nugteren
2016-05-01
Changed the index buffer of IxAMAX routines to unsigned int for proper buffer...
Cedric Nugteren
2016-05-01
Added a program cache (per-context) next to the per-device binary cache
Cedric Nugteren
2016-04-30
Added non-absolute minimum counter-part IxMIN of the BLAS routine IxAMAX
Cedric Nugteren
2016-04-29
Added FillCache: a function to pre-compile all kernels for a specific device
Cedric Nugteren
2016-04-28
Fixed the cache to store binaries instead of OpenCL programs
Cedric Nugteren
2016-04-27
Added non-absolute counter-parts xSUM and IxMAX of the BLAS routines xASUM an...
Cedric Nugteren
2016-04-27
Added prototypes for non-BLAS routines: xSUM and IxMAX (non-absolute counterp...
Cedric Nugteren
2016-04-27
Moved all cache-related functions to a separate file; added a ClearCompiledPr...
Cedric Nugteren
2016-04-27
Added a '-verbose' option to the test binaries to report errors in more detai...
Cedric Nugteren
2016-04-27
All CLBlast enum constants now have the same raw values as in the cblas standard
Cedric Nugteren
2016-04-20
Added support for the iSAMAX/iDAMAX/iCAMAX/iZAMAX routines
cnugteren
2016-04-20
Added prototype for ixAMAX routines
cnugteren
2016-04-14
Added support for the SASUM/DASUM/ScASUM/DzASUM routines
cnugteren
2016-04-13
Added prototype for xASUM routines
cnugteren
2016-04-11
Fixed the way the defaults are calculated in the database; added warning for ...
cnugteren
2016-04-09
Events are now properly implemented using event waiting list and asking the u...
cnugteren
2016-04-02
Added support for testing (performance and correctness) against a CPU BLAS li...
cnugteren
2016-04-01
Added a wrapper for CBLAS libraries for performance/correctness testing
cnugteren
2016-03-30
Merge branch 'level1_routines' into development
cnugteren
2016-03-30
Added prototypes for the xROTM and xROTMG routines
Cedric Nugteren
2016-03-30
Added prototypes for the xROT and xROTG functions
Cedric Nugteren
2016-03-30
Made event an optional argument in the CLBlast C++ API
Cedric Nugteren
2016-03-30
Added missing newline to the end of the public API file
Cedric Nugteren
2016-03-30
Fixed properly passing of OpenCL events to CLBlast functions
Cedric Nugteren
2016-03-28
Added preliminary support for the xNRM2 routines
Cedric Nugteren
2016-03-25
Added prototypes for ScNRM2/DzNRM2 routines
Cedric Nugteren
2016-03-25
Added prototypes for SNRM2/DNRM2 routines
Cedric Nugteren
2016-03-23
Fixed the C-api export to be able to properly build a DLL on Windows
Cedric Nugteren
2016-03-19
Added __declspec(dllexport) to create a DLL on Windows
Cedric Nugteren
2016-03-14
Made the library thread-safe by guarding the kernel cache with a mutex
Cedric Nugteren
2016-03-12
Added tuning results for the newest xGER family kernels
Cedric Nugteren
2016-03-12
Added tuning results for the ARM Mali-T628 GPU
Cedric Nugteren
2016-03-06
Added preliminary support for xHPR2 and xSPR2 routines
Cedric Nugteren
2016-03-02
Added preliminary support for xHER2 and xSYR2 routines
Cedric Nugteren
2016-02-28
Added tuning results for Intel Iris Pro and AMD R9 M370X
Cedric Nugteren
2016-02-28
Added support for xHER, xHPR, xSYR, and xSPR routines
Cedric Nugteren
2016-02-28
Fixed a compilation issue under AppleClang
Cedric Nugteren
2016-02-20
Set a proper default precision for the CLBlast clients
Cedric Nugteren
2016-02-20
Added support for xGERU and xGERC routines
Cedric Nugteren
2016-02-20
Added XGER routine, kernel, and tuner
Cedric Nugteren
2016-02-07
Added tuning parameters for various devices using the new database script
Cedric Nugteren
2016-02-07
Added dictionary with short and long OpenCL vendor names to fix issues with I...
Cedric Nugteren
2016-02-06
Fixed a linker error in the performance client under GCC
CNugteren
2016-01-30
Updated to version 4.0 of the CLCudaAPI header
Cedric Nugteren
2016-01-30
Added first auto-generated database headers from the Python database; only K4...
Cedric Nugteren
2015-10-23
Added alpha and beta to tuner meta-data
CNugteren
2015-10-12
Routine names are now all default arguments defined in the header
CNugteren