index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
CHANGELOG
Age
Commit message (
Expand
)
Author
2018-07-06
Updated changelog
Cedric Nugteren
2018-06-28
Disabled calls to clReleaseProgram under Windows to avoid segfaults when the ...
Cedric Nugteren
2018-06-03
Updated to CLBlast version 1.4.0
Cedric Nugteren
2018-06-02
Added MKL as an alternative for CBLAS for correctness and performance compari...
Cedric Nugteren
2018-05-19
The GEMM routine tuner now loads kernel JSON tuning results from disk if avai...
Cedric Nugteren
2018-05-18
Merge branch 'master' into canary_buffer_overflow_protection
Cedric Nugteren
2018-05-17
Added a canary region for overflow detection to the correctness tests
Cedric Nugteren
2018-05-01
Now stores a shared_ptr to the Program class in the cache
Cedric Nugteren
2018-04-29
Merge pull request #277 from CNugteren/CLBlast-257-intel-subgroups
Cedric Nugteren
2018-04-29
Updated the changelog
Cedric Nugteren
2018-04-26
Fixed an access violation when compiled with Visual Studio upon releasing the...
Cedric Nugteren
2018-04-15
Updated tuning results for the Skylake ULT GT2 GPU with the new kernel
Cedric Nugteren
2018-04-07
Added tuning results for NVIDIA GeForce 920MX
Cedric Nugteren
2018-03-22
Added the OpenCL local memory size constraint to the tuners
Cedric Nugteren
2018-03-10
Updated the documentation for the tuner API
Cedric Nugteren
2018-02-26
Updated the changelog
Cedric Nugteren
2018-02-20
Fixed several issues in the new invert tuner
Cedric Nugteren
2018-02-18
Updated changelog and roadmap: Python package created
Cedric Nugteren
2018-02-02
Implemented the XHAD Hadamard product routine
Cedric Nugteren
2018-01-29
Updated to CLBlast version 1.3.0
Cedric Nugteren
2018-01-11
Added a RetrieveParameters function to inspect tuning parameters
Cedric Nugteren
2018-01-08
Implemented the in-direct version of the strided-batched GEMM kernel
Cedric Nugteren
2018-01-06
Updated changelog and roadmap
Cedric Nugteren
2017-12-31
Fixed the issue with AMD's APP compiler not being able to compile the invert ...
Cedric Nugteren
2017-12-27
Split the database into multiple small compilation units
Cedric Nugteren
2017-12-23
Updated the database to use the new TRSV and Invert tuners
Cedric Nugteren
2017-12-20
Added try-except to database script parser to skip invalid files
Cedric Nugteren
2017-12-17
Removed all ARM Mali tuning results; re-added Mali-T760 and Mali-T628 results...
Cedric Nugteren
2017-12-10
Updated roadmap: completed pre-processor implementation
Cedric Nugteren
2017-12-09
Made the pre-processor run by default for ARM and Qualcomm GPUs
Cedric Nugteren
2017-11-24
Added precision check to parameter override for the clients
Cedric Nugteren
2017-11-19
Revived the GEMM routine tuner; minor formatting changes
Cedric Nugteren
2017-11-09
Added tuning results for the GeForce GTX750Ti
Cedric Nugteren
2017-11-08
Updated to CLBlast version 1.2.0
Cedric Nugteren
2017-11-07
Merge pull request #212 from CNugteren/kernel_selection_tuner
Cedric Nugteren
2017-11-02
Integrated the GEMM routine tuner for kernel selection; added first tuning re...
Cedric Nugteren
2017-10-29
Made it possible to compile the CLBlast performance clients for Android with ...
Cedric Nugteren
2017-10-27
Fixed a bug when using the matrix A-offset argument for the TRSM routine
Cedric Nugteren
2017-10-27
Added GEMV synchronisation for the TRSV routine: similar bug as in TRSM
Cedric Nugteren
2017-10-25
Fixed a bug in TRSM routine due to missing event synchronisations after GEMM ...
Cedric Nugteren
2017-10-20
Added tuning parameters for GeForce GTX 580, GeForce GTX 1080Ti, and Core i5-...
Cedric Nugteren
2017-10-16
Added CUDA API documentation
Cedric Nugteren
2017-10-03
Gemm in-direct implementation now uses only 1 larger instead of max 3 optiona...
Cedric Nugteren
2017-10-01
Allow OverrideParameters function to work before a kernel was first used
Cedric Nugteren
2017-09-30
Kernels are now cached based on their routine name and their tuning parameters
Cedric Nugteren
2017-09-30
Updated to version 1.1.0
Cedric Nugteren
2017-09-23
Added extra benchmarks to verify new database caching keys performance
Cedric Nugteren
2017-09-22
Added OpenCL properties printing to the diagnostics helper
Cedric Nugteren
2017-09-16
Added tuning results for Intel Core i7 6770HQ
Cedric Nugteren
2017-09-16
Improved compilation time of the tuner database
Cedric Nugteren
[next]