index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2016-05-15
Added an example of using the half-precision HAXPY routine
Cedric Nugteren
2016-05-15
Added header with conversions from and to half-precision floating-point
Cedric Nugteren
2016-05-14
Set kernel arguments for AXPY as constant memory buffers, making it possible ...
Cedric Nugteren
2016-05-13
Initial experimental version of the half-precision HAXPY routine
Cedric Nugteren
2016-05-12
Initial changes in preparation for half-precision fp16 support
Cedric Nugteren
2016-05-10
Fixed links in the README
Cedric Nugteren
2016-05-08
Prepared the changelog for the next release
Cedric Nugteren
2016-05-08
Fixes for compilation of the tests under Visual Studio 2015
CNugteren
2016-05-08
Updated to version 0.7.0
Cedric Nugteren
2016-05-08
Fixed an issue where the xAMAX tester would incorrectly report failures when ...
cnugteren
2016-05-08
Fixed an issue where the xNRM2 and xASUM testers would incorrectly report fai...
cnugteren
2016-05-08
Fixed errors in xAXPY and xSCAL tests on AMD hardware
cnugteren
2016-05-08
Fixed an issue with computing the GFLOPS numbers for the xGEMM performance te...
cnugteren
2016-05-08
Added preliminary generated API documentation
Cedric Nugteren
2016-05-07
Added an option to the tests to control whether to test against clBLAS or a C...
Cedric Nugteren
2016-05-05
Added printing of indices when testing in verbose mode
Cedric Nugteren
2016-05-05
Merge pull request #57 from dividiti/development
Cedric Nugteren
2016-05-05
Locate the C BLAS library before the F77 one.
Anton Lokhmotov
2016-05-04
Fixed an issue with linking against the ATLAS BLAS library
Cedric Nugteren
2016-05-02
Added tuning results for AMD Hawaii (R9 290X)
Cedric Nugteren
2016-05-02
Fixed the calculation of the required buffer sizes in case of subvectors and ...
Cedric Nugteren
2016-05-01
Added tuning results for AMD Pitcairn (R9 270X)
Cedric Nugteren
2016-05-01
Updated tuning database for reduction/dot kernels based on the new tuner; par...
Cedric Nugteren
2016-05-01
Made the default xDOT tuning size smaller
Cedric Nugteren
2016-05-01
Changed the index buffer of IxAMAX routines to unsigned int for proper buffer...
Cedric Nugteren
2016-05-01
Added a program cache (per-context) next to the per-device binary cache
Cedric Nugteren
2016-04-30
Added non-aboslute minimum counter-part IxMIN of the BLAS routine IxAMAX
Cedric Nugteren
2016-04-29
Added an example to demonstrate the use of the ClearCache and FillCache funct...
Cedric Nugteren
2016-04-29
Added FillCache: a function to pre-compile all kernels for a specific device
Cedric Nugteren
2016-04-29
Added sample C programs for the SASUM and DGEMV routines
Cedric Nugteren
2016-04-28
Fixed the cache to store binaries instead of OpenCL programs
Cedric Nugteren
2016-04-27
Added non-absolute counter-parts xSUM and IxMAX of the BLAS routines xASUM an...
Cedric Nugteren
2016-04-27
Added missing namespace to the SGEMM example
Cedric Nugteren
2016-04-27
Added prototypes for non-BLAS routines: xSUM and IxMAX (non-absolute counterp...
Cedric Nugteren
2016-04-27
Moved all cache-related functions to a separate file; added a ClearCompiledPr...
Cedric Nugteren
2016-04-27
Relaxed the absolute error margin for floating-point value comparisons to 1e-4
Cedric Nugteren
2016-04-27
Added a '-verbose' option to the test binaries to report errors in more detai...
Cedric Nugteren
2016-04-27
All CLBlast enum constants now have the same raw values as in the cblas standard
Cedric Nugteren
2016-04-20
Merge branch 'level1_routines' into development
cnugteren
2016-04-20
Added support for the iSAMAX/iDAMAX/iCAMAX/iZAMAX routines
cnugteren
2016-04-20
Added prototype for ixAMAX routines
cnugteren
2016-04-14
Updated the reduction-kernel tuner to also tune the epilogue
cnugteren
2016-04-14
Added support for the SASUM/DASUM/ScASUM/DzASUM routines
cnugteren
2016-04-13
Added prototype for xASUM routines
cnugteren
2016-04-11
Fixed the way the defaults are calculated in the database; added warning for ...
cnugteren
2016-04-09
Events are now properly implemented using event waiting list and asking the u...
cnugteren
2016-04-04
Properly set warning flags for Clang
cnugteren
2016-04-04
Removed redundant queue synchronisation statements
cnugteren
2016-04-03
Merge branch 'cpu_blas' into development
cnugteren
2016-04-03
Updated the documentation in light of the support for a reference CPU BLAS li...
cnugteren
[next]