debian-clblast
Debian package for CLBlast.
gspr@nonempty.org
path: root/CHANGELOG
Age
Commit message
Author
2016-07-10
Now passing alpha/beta to the kernel as arguments as before fp16 support; in ...
Cedric Nugteren
2016-07-08
Cache now compares cl_context instead of a pointer to a context; added verbos...
Cedric Nugteren
2016-07-06
Added an option to the performance clients to do a warm-up run before timing
Cedric Nugteren
2016-07-03
Added tuning results for GTX670, GTX750, and GTX1070 (thanks to gcp)
Cedric Nugteren
2016-07-02
Fixed some memory leaks related to events not properly cleaned-up
Cedric Nugteren
2016-06-30
Added declspec(dllexport) to ClearCache and FillCache, and added declspec(dll...
Cedric Nugteren
2016-06-29
Updated to version 6.0 of the CLCudaAPI header
Cedric Nugteren
2016-06-28
Prepared the changelog for the next release
Cedric Nugteren
2016-06-28
Updated to version 0.8.0
Cedric Nugteren
2016-06-27
Added Appveyor Windows CI support
Cedric Nugteren
2016-06-19
Renamed all C++ source files to .cpp to match the .hpp extension better
Cedric Nugteren
2016-06-16
Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and...
Cedric Nugteren
2016-06-13
Improved API documentation and added documentation for level-2 and level-3 ro...
Cedric Nugteren
2016-06-01
Added tuning parameters for 'GRID K520' and 'HD Graphics Skylake ULT GT2'
Cedric Nugteren
2016-05-31
Made use of CMake's built-in unit testing, allowing all tests to be run using...
Cedric Nugteren
2016-05-30
Increased the verbosity of the -verbose option in the correctness tests
Cedric Nugteren
2016-05-30
Separated the performance tests (clients) from the correctness tests in CMake
Cedric Nugteren
2016-05-30
Merge branch 'half_precision' into development
Cedric Nugteren
2016-05-25
Added level-3 half-precision routines HGEMM/HSYMM/HSYRK/HSYR2K/HTRMM
Cedric Nugteren
2016-05-22
Added level-2 half-precision routines HGER/HSYR/HSPR/HSYR2/HSPR2
Cedric Nugteren
2016-05-22
Added level-2 half-precision routines HGEMV/HGBMV/HHEMV/HHBMV/HHPMV/HSYMV/HSB...
Cedric Nugteren
2016-05-22
Added level-1 half-precision routines HSWAP/HSCAL/HCOPY/HAXPY/HDOT/HNRM2/HASU...
Cedric Nugteren
2016-05-18
Merged in latest changes from 0.7.1 release
Cedric Nugteren
2016-05-18
Prepared the changelog for the next release
Cedric Nugteren
2016-05-18
Updated to version 0.7.1
Cedric Nugteren
2016-05-17
Made MSVC link the run-time libraries statically
Cedric Nugteren
2016-05-15
Added header with conversions from and to half-precision floating-point
Cedric Nugteren
2016-05-15
Fixed a bug in the xGEMM routine related to the event incorrectly set
cnugteren
2016-05-15
Added support for staggered/shuffled offsets for GEMM to improve performance ...
cnugteren
2016-05-08
Prepared the changelog for the next release
Cedric Nugteren
2016-05-08
Updated to version 0.7.0
Cedric Nugteren
2016-05-08
Added preliminary generated API documentation
Cedric Nugteren
2016-05-07
Added an option to the tests to control whether to test against clBLAS or a C...
Cedric Nugteren
2016-05-02
Added tuning results for AMD Hawaii (R9 290X)
Cedric Nugteren
2016-04-30
Added non-absolute minimum counter-part IxMIN of the BLAS routine IxAMAX
Cedric Nugteren
2016-04-28
Fixed the cache to store binaries instead of OpenCL programs
Cedric Nugteren
2016-04-27
Added non-absolute counter-parts xSUM and IxMAX of the BLAS routines xASUM an...
Cedric Nugteren
2016-04-27
Moved all cache-related functions to a separate file; added a ClearCompiledPr...
Cedric Nugteren
2016-04-20
Added support for the iSAMAX/iDAMAX/iCAMAX/iZAMAX routines
cnugteren
2016-04-14
Updated the reduction-kernel tuner to also tune the epilogue
cnugteren
2016-04-03
Updated the documentation in light of the support for a reference CPU BLAS li...
cnugteren
2016-03-31
Updated the documentation
cnugteren
2016-03-23
Fixed the C-api export to be able to properly build a DLL on Windows
Cedric Nugteren
2016-03-14
Made the library thread-safe by guarding the kernel cache with a mutex
Cedric Nugteren
2016-03-13
Prepared the changelog for the next release
Cedric Nugteren
2016-03-13
Updated to version 0.6.0
Cedric Nugteren
2016-03-06
Added preliminary support for xHPR2 and xSPR2 routines
Cedric Nugteren
2016-02-28
Updated the changelog with newly supported level-2 routines
Cedric Nugteren
2016-02-10
Updated the changelog
Cedric Nugteren
2015-10-17
Prepared the changelog for the next release
CNugteren