debian-clblast
Debian package for CLBlast.
gspr@nonempty.org
path: root/CHANGELOG
Age
Commit message
Author
2018-01-06
Updated changelog and roadmap
Cedric Nugteren
2017-12-31
Fixed the issue with AMD's APP compiler not being able to compile the invert ...
Cedric Nugteren
2017-12-27
Split the database into multiple small compilation units
Cedric Nugteren
2017-12-23
Updated the database to use the new TRSV and Invert tuners
Cedric Nugteren
2017-12-20
Added try-except to database script parser to skip invalid files
Cedric Nugteren
2017-12-17
Removed all ARM Mali tuning results; re-added Mali-T760 and Mali-T628 results...
Cedric Nugteren
2017-12-10
Updated roadmap: completed pre-processor implementation
Cedric Nugteren
2017-12-09
Made the pre-processor run by default for ARM and Qualcomm GPUs
Cedric Nugteren
2017-11-24
Added precision check to parameter override for the clients
Cedric Nugteren
2017-11-19
Revived the GEMM routine tuner; minor formatting changes
Cedric Nugteren
2017-11-09
Added tuning results for the GeForce GTX750Ti
Cedric Nugteren
2017-11-08
Updated to CLBlast version 1.2.0
Cedric Nugteren
2017-11-07
Merge pull request #212 from CNugteren/kernel_selection_tuner
Cedric Nugteren
2017-11-02
Integrated the GEMM routine tuner for kernel selection; added first tuning re...
Cedric Nugteren
2017-10-29
Made it possible to compile the CLBlast performance clients for Android with ...
Cedric Nugteren
2017-10-27
Fixed a bug when using the matrix A-offset argument for the TRSM routine
Cedric Nugteren
2017-10-27
Added GEMV synchronisation for the TRSV routine: similar bug as in TRSM
Cedric Nugteren
2017-10-25
Fixed a bug in TRSM routine due to missing event synchronisations after GEMM ...
Cedric Nugteren
2017-10-20
Added tuning parameters for GeForce GTX 580, GeForce GTX 1080Ti, and Core i5-...
Cedric Nugteren
2017-10-16
Added CUDA API documentation
Cedric Nugteren
2017-10-03
Gemm in-direct implementation now uses only 1 larger instead of max 3 optiona...
Cedric Nugteren
2017-10-01
Allow OverrideParameters function to work before a kernel was first used
Cedric Nugteren
2017-09-30
Kernels are now cached based on their routine name and their tuning parameters
Cedric Nugteren
2017-09-30
Updated to version 1.1.0
Cedric Nugteren
2017-09-23
Added extra benchmarks to verify new database caching keys performance
Cedric Nugteren
2017-09-22
Added OpenCL properties printing to the diagnostics helper
Cedric Nugteren
2017-09-16
Added tuning results for Intel Core i7 6770HQ
Cedric Nugteren
2017-09-16
Improved compilation time of the tuner database
Cedric Nugteren
2017-09-14
Added architecture layer in the tuning database for better performance on uns...
Cedric Nugteren
2017-09-04
Removed an assumption that the 'default' tuning parameters have to be stored ...
Cedric Nugteren
2017-08-24
Merge branch 'master' into im_to_col
Cedric Nugteren
2017-08-24
Completed im2col implementation
Cedric Nugteren
2017-08-21
Minor updates after merging in the PSO addition to the tuners
Cedric Nugteren
2017-08-08
Updated to version 1.0.1 (bugfix release)
Cedric Nugteren
2017-07-30
Updated to version 1.0.0
Cedric Nugteren
2017-07-24
Added status badges for correctness tests; updated list of contributors; fixe...
Cedric Nugteren
2017-06-30
Fixed an if-statement in the direct GEMM kernel causing a bug with specific s...
Cedric Nugteren
2017-06-26
Fixed and suppressed several warnings for MSVC
Cedric Nugteren
2017-06-21
Fixes some compilation issues related to the database structure change
Cedric Nugteren
2017-06-01
Added tuning results for GeForce GT 650M (thanks to bzcheeseman)
Cedric Nugteren
2017-05-12
Added the IxAMIN routines: absolute minimum version of IxAMAX
Cedric Nugteren
2017-05-12
Fixed a bug in the TRSM routine; tests now pass
Cedric Nugteren
2017-05-12
Removed the included performance reports; README now redirects to the new ext...
Cedric Nugteren
2017-05-11
Added tuning results for the AMD Radeon Fiji GPU
Cedric Nugteren
2017-05-11
Minor naming fixes to the benchmark script
Cedric Nugteren
2017-05-02
Updated to version 0.11.0
Cedric Nugteren
2017-04-16
Finalized support for performance testing against cuBLAS
Cedric Nugteren
2017-04-10
Updated the changelog with the Apple CPU override
Cedric Nugteren
2017-03-26
Replaced the R graph scripts with Python/Matplotlib benchmark scripts
Cedric Nugteren
2017-03-11
Added initial naive version of the batched GEMM routine based on the direct G...
Cedric Nugteren