debian-clblast
Debian package for CLBlast.
gspr@nonempty.org
Age
Commit message
Author
2017-06-18
Fixed an overflow bug on 32-bit systems when choosing a GEMM kernel
Cedric Nugteren
2017-06-01
Added tuning results for GeForce GT 650M (thanks to bzcheeseman)
Cedric Nugteren
2017-05-27
Merge pull request #158 from CNugteren/msvc_compilation_fixes
Cedric Nugteren
2017-05-27
Update to AppVeyor because of changed Khronos repository (9)
Cedric Nugteren
2017-05-27
Update to AppVeyor because of changed Khronos repository (8)
Cedric Nugteren
2017-05-27
Update to AppVeyor because of changed Khronos repository (7)
Cedric Nugteren
2017-05-27
Update to AppVeyor because of changed Khronos repository (6)
Cedric Nugteren
2017-05-27
Update to AppVeyor because of changed Khronos repository (5)
Cedric Nugteren
2017-05-27
Update to AppVeyor because of changed Khronos repository (4)
Cedric Nugteren
2017-05-27
Update to AppVeyor because of changed Khronos repository (3)
Cedric Nugteren
2017-05-27
Merge pull request #157 from kpot/improved_caching
Cedric Nugteren
2017-05-27
Fixed comment describing the order of program cache fields
Kirill Mavreshko
2017-05-26
Update to AppVeyor because of changed Khronos repository (2)
Cedric Nugteren
2017-05-26
Update to AppVeyor because of changed Khronos repository
Cedric Nugteren
2017-05-26
Fixed a compilation issue under MSVC 2013
Cedric Nugteren
2017-05-26
Fixes inability to run GEMM on multiple identical GPUs (issue #155)
Kirill Mavreshko
2017-05-24
Merge pull request #156 from ctuning/master
Cedric Nugteren
2017-05-24
changing "wb" to "w" when saving json file (text mode) - compatibility for Py...
Grigori Fursin
2017-05-15
Fixed a minor compilation issue of a sample with GCC 4.8
Cedric Nugteren
2017-05-15
Fixed a TRSM issue caused by incorrect block size calculation
Cedric Nugteren
2017-05-14
Fixed a missing synchronization barrier in the invert kernel; fixes TRSM tests
Cedric Nugteren
2017-05-12
Added the IxAMIN routines: absolute minimum version of IxAMAX
Cedric Nugteren
2017-05-12
Fixed a bug in the TRSM routine; tests now pass
Cedric Nugteren
2017-05-12
Removed the included performance reports; README now redirects to the new ext...
Cedric Nugteren
2017-05-11
Added tuning results for the AMD Radeon Fiji GPU
Cedric Nugteren
2017-05-11
Fixes the build-status table in the README
Cedric Nugteren
2017-05-11
Bug-fix in the half-precision test of the amax routine
Cedric Nugteren
2017-05-11
Re-added random tuning for GEMM after accidental removal
Cedric Nugteren
2017-05-11
Minor naming fixes to the benchmark script
Cedric Nugteren
2017-05-11
Merge branch 'master_is_neww_devel_branch'
Cedric Nugteren
2017-05-03
The master branch is now the main 'development' branch
Cedric Nugteren
2017-05-02
Merge pull request #150 from CNugteren/development
Cedric Nugteren
2017-05-02
Updated to version 0.11.0
Cedric Nugteren
2017-04-23
Merge pull request #148 from CNugteren/benchmarking
Cedric Nugteren
2017-04-23
Added an option to the database script to remove tuning results from the data...
Cedric Nugteren
2017-04-23
Re-added Titan X (Pascal) tuning results based on more averaging when tuning
Cedric Nugteren
2017-04-23
Fixed a compiler warning message
Cedric Nugteren
2017-04-22
Increased the default number of runs for the tuner from 2 up to 10 for fast k...
Cedric Nugteren
2017-04-22
Fixed the direct vs indirect setting for NVIDIA GPUs
Cedric Nugteren
2017-04-21
Increased the default number of runs for GEMV tuning; updated GEMV tuning res...
Cedric Nugteren
2017-04-21
Merge branch 'development' into benchmarking
Cedric Nugteren
2017-04-21
Removed the words SUMMARY from the title of the benchmark script when benchma...
Cedric Nugteren
2017-04-20
Updated the settings for the batched benchmarks
Cedric Nugteren
2017-04-20
Tuned the direct versus indirect GEMM kernel trade-off point for NVIDIA GPUs
Cedric Nugteren
2017-04-17
Fixed a namespace clash with CUDA FP16 for the half-datatype
Cedric Nugteren
2017-04-17
Added proper handling of mismatched arguments in the database script
Cedric Nugteren
2017-04-16
Set proper settings for the benchmarks of batched routines
Cedric Nugteren
2017-04-16
Merge branch 'development' into benchmarking
Cedric Nugteren
2017-04-16
Merge pull request #147 from CNugteren/cublas_reference
Cedric Nugteren
2017-04-16
Finalized support for performance testing against cuBLAS
Cedric Nugteren