index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2016-05-30
Merge branch 'half_precision' into development
Cedric Nugteren
2016-05-26
Added half-precision tests for the clBLAS reference through conversion to sin...
Cedric Nugteren
2016-05-26
Added half-precision tests for the CBLAS reference through conversion to sing...
Cedric Nugteren
2016-05-25
Added possibility to run the performance client with half-precision
Cedric Nugteren
2016-05-25
Added level-3 half-precision routines HGEMM/HSYMM/HSYRK/HSYR2K/HTRMM
Cedric Nugteren
2016-05-24
Added proper argument handling and displaying for half-precision data-types
Cedric Nugteren
2016-05-23
Updated README with information on half-precision support
Cedric Nugteren
2016-05-22
Added level-2 half-precision routines HGER/HSYR/HSPR/HSYR2/HSPR2
Cedric Nugteren
2016-05-22
Fixed tuning results for half-precision; added first results for the xGER ker...
Cedric Nugteren
2016-05-22
Prepared the GER kernels and tuner for half-precision support
Cedric Nugteren
2016-05-22
Added level-2 half-precision routines HGEMV/HGBMV/HHEMV/HHBMV/HHPMV/HSYMV/HSB...
Cedric Nugteren
2016-05-22
Added first tuning results for the half-precision xGEMV kernels
Cedric Nugteren
2016-05-22
Prepared the GEMV kernels and tuner for half-precision support
Cedric Nugteren
2016-05-22
Added level-1 half-precision routines HSWAP/HSCAL/HCOPY/HAXPY/HDOT/HNRM2/HASU...
Cedric Nugteren
2016-05-22
Added first tuning results for the half-precision xDOT kernels
Cedric Nugteren
2016-05-22
Added half-precision support for all level 1 routines
Cedric Nugteren
2016-05-18
Merged in latest changes from 0.7.1 release
Cedric Nugteren
2016-05-18
Prepared the changelog for the next release
Cedric Nugteren
2016-05-18
Updated to version 0.7.1
Cedric Nugteren
2016-05-18
Fixes for Visual Studio
CNugteren
2016-05-18
Fixes for CMake policy CMP0054
Cedric Nugteren
2016-05-17
Made MSVC link the run-time libraries statically
Cedric Nugteren
2016-05-17
Fixed warning CMP0054
Cedric Nugteren
2016-05-16
Added half precision tuning results for supporting kernels (pad, copy, transp...
Cedric Nugteren
2016-05-16
Prepared GEMM and supporting kernels and tuners for half-precision support
Cedric Nugteren
2016-05-15
Added an example of using the half-precision HAXPY routine
Cedric Nugteren
2016-05-15
Added header with conversions from and to half-precision floating-point
Cedric Nugteren
2016-05-15
Updated the performance graph for the Radeon M370X AMD GPU
cnugteren
2016-05-15
Added new tuning results for SGEMM and updated the performance graph for the ...
cnugteren
2016-05-15
Removed comparison to CBLAS for the graph scripts
cnugteren
2016-05-15
Fixed a bug in the xGEMM routine related to the event incorrectly set
cnugteren
2016-05-15
Fixed the arguments in the performance graphs to reflect the changes in enum ...
cnugteren
2016-05-15
Added support for staggered/shuffled offsets for GEMM to improve performance ...
cnugteren
2016-05-14
Set kernel arguments for AXPY as constant memory buffers, making it possible ...
Cedric Nugteren
2016-05-13
Initial experimental version of the half-precision HAXPY routine
Cedric Nugteren
2016-05-12
Initial changes in preparation for half-precision fp16 support
Cedric Nugteren
2016-05-10
Fixed links in the README
Cedric Nugteren
2016-05-08
Prepared the changelog for the next release
Cedric Nugteren
2016-05-08
Fixes for compilation of the tests under Visual Studio 2015
CNugteren
2016-05-08
Updated to version 0.7.0
Cedric Nugteren
2016-05-08
Fixed an issue where the xAMAX tester would incorrectly report failures when ...
cnugteren
2016-05-08
Fixed an issue where the xNRM2 and xASUM testers would incorrectly report fai...
cnugteren
2016-05-08
Fixed errors in xAXPY and xSCAL tests on AMD hardware
cnugteren
2016-05-08
Fixed an issue with computing the GFLOPS numbers for the xGEMM performance te...
cnugteren
2016-05-08
Added preliminary generated API documentation
Cedric Nugteren
2016-05-07
Added an option to the tests to control whether to test against clBLAS or a C...
Cedric Nugteren
2016-05-05
Added printing of indices when testing in verbose mode
Cedric Nugteren
2016-05-05
Merge pull request #57 from dividiti/development
Cedric Nugteren
2016-05-05
Locate the C BLAS library before the F77 one.
Anton Lokhmotov
2016-05-04
Fixed an issue with linking against the ATLAS BLAS library
Cedric Nugteren
[next]