index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2016-05-08
Fixed errors in xAXPY and xSCAL tests on AMD hardware
cnugteren
2016-05-08
Fixed an issue with computing the GFLOPS numbers for the xGEMM performance te...
cnugteren
2016-05-08
Added preliminary generated API documentation
Cedric Nugteren
2016-05-07
Added an option to the tests to control whether to test against clBLAS or a C...
Cedric Nugteren
2016-05-05
Added printing of indices when testing in verbose mode
Cedric Nugteren
2016-05-05
Merge pull request #57 from dividiti/development
Cedric Nugteren
2016-05-05
Locate the C BLAS library before the F77 one.
Anton Lokhmotov
2016-05-04
Fixed an issue with linking against the ATLAS BLAS library
Cedric Nugteren
2016-05-02
Added tuning results for AMD Hawaii (R9 290X)
Cedric Nugteren
2016-05-02
Fixed the calculation of the required buffer sizes in case of subvectors and ...
Cedric Nugteren
2016-05-01
Added tuning results for AMD Pitcairn (R9 270X)
Cedric Nugteren
2016-05-01
Updated tuning database for reduction/dot kernels based on the new tuner; par...
Cedric Nugteren
2016-05-01
Made the default xDOT tuning size smaller
Cedric Nugteren
2016-05-01
Changed the index buffer of IxAMAX routines to unsigned int for proper buffer...
Cedric Nugteren
2016-05-01
Added a program cache (per-context) next to the per-device binary cache
Cedric Nugteren
2016-04-30
Added non-aboslute minimum counter-part IxMIN of the BLAS routine IxAMAX
Cedric Nugteren
2016-04-29
Added an example to demonstrate the use of the ClearCache and FillCache funct...
Cedric Nugteren
2016-04-29
Added FillCache: a function to pre-compile all kernels for a specific device
Cedric Nugteren
2016-04-29
Added sample C programs for the SASUM and DGEMV routines
Cedric Nugteren
2016-04-28
Fixed the cache to store binaries instead of OpenCL programs
Cedric Nugteren
2016-04-27
Added non-absolute counter-parts xSUM and IxMAX of the BLAS routines xASUM an...
Cedric Nugteren
2016-04-27
Added missing namespace to the SGEMM example
Cedric Nugteren
2016-04-27
Added prototypes for non-BLAS routines: xSUM and IxMAX (non-absolute counterp...
Cedric Nugteren
2016-04-27
Moved all cache-related functions to a separate file; added a ClearCompiledPr...
Cedric Nugteren
2016-04-27
Relaxed the absolute error margin for floating-point value comparisons to 1e-4
Cedric Nugteren
2016-04-27
Added a '-verbose' option to the test binaries to report errors in more detai...
Cedric Nugteren
2016-04-27
All CLBlast enum constants now have the same raw values as in the cblas standard
Cedric Nugteren
2016-04-20
Merge branch 'level1_routines' into development
cnugteren
2016-04-20
Added support for the iSAMAX/iDAMAX/iCAMAX/iZAMAX routines
cnugteren
2016-04-20
Added prototype for ixAMAX routines
cnugteren
2016-04-14
Updated the reduction-kernel tuner to also tune the epilogue
cnugteren
2016-04-14
Added support for the SASUM/DASUM/ScASUM/DzASUM routines
cnugteren
2016-04-13
Added prototype for xASUM routines
cnugteren
2016-04-11
Fixed the way the defaults are calculated in the database; added warning for ...
cnugteren
2016-04-09
Events are now properly implemented using event waiting list and asking the u...
cnugteren
2016-04-04
Properly set warning flags for Clang
cnugteren
2016-04-04
Removed redundant queue synchronisation statements
cnugteren
2016-04-03
Merge branch 'cpu_blas' into development
cnugteren
2016-04-03
Updated the documentation in light of the support for a reference CPU BLAS li...
cnugteren
2016-04-03
Added support for detection of CPU BLAS libraries OpenBLAS, BLIS and Accelera...
cnugteren
2016-04-02
Added support for testing (performance and correctness) against a CPU BLAS li...
cnugteren
2016-04-01
Added a wrapper for CBLAS libraries for performance/correctness testing
cnugteren
2016-03-31
Create a first version of CPU BLAS detection in CMake
cnugteren
2016-03-31
Updated the documentation
cnugteren
2016-03-30
Merge branch 'level1_routines' into development
cnugteren
2016-03-30
Fixed the nrm2 kernel for complex data-types
cnugteren
2016-03-30
CMake now downloads the cl.hpp header from the Khronos website when building ...
cnugteren
2016-03-30
Added prototypes for the xROTM and xROTMG routines
Cedric Nugteren
2016-03-30
Added prototypes for the xROT and xROTG functions
Cedric Nugteren
2016-03-30
Made event an optional argument in the CLBlast C++ API
Cedric Nugteren
[next]