index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
CHANGELOG
Age
Commit message (
Expand
)
Author
2018-10-15
Fixed a bug in the XaxpyFaster kernel for specific parameters
Cedric Nugteren
2018-10-14
Merge pull request #319 from CNugteren/convgemm_multi_kernel
Cedric Nugteren
2018-10-13
Updated changelog regarding tuning API change
Cedric Nugteren
2018-09-16
Merge branch 'master' into convgemm_multi_kernel
Cedric Nugteren
2018-09-15
Disabled Intel subgroup shuffling for double-precision
Cedric Nugteren
2018-09-07
Added xCONVGEMM as im2col plus a batched GEMM kernel
Cedric Nugteren
2018-08-07
Name change of setting to NETLIB_PERSISTENT_OPENCL
Cedric Nugteren
2018-08-05
Added an option to compile the Netlib API with static OpenCL device and context
Cedric Nugteren
2018-07-31
Fixed issue with not performing complex conjugation under certain cases when ...
Cedric Nugteren
2018-07-28
The tuners now also check for valid local thread configurations and skip inva...
Cedric Nugteren
2018-07-27
Fixed an issue with AMD GPUs and the new GEMMK == 1 kernel
Cedric Nugteren
2018-07-25
Added code to report the average tuning results
Cedric Nugteren
2018-07-14
Updated to CLBlast version 1.4.1
Cedric Nugteren
2018-07-13
Added tuning results for HD Graphics 6000 Broadwell GT3
Cedric Nugteren
2018-07-06
Updated changelog
Cedric Nugteren
2018-06-28
Disabled calls to clReleaseProgram under Windows to avoid segfaults when the ...
Cedric Nugteren
2018-06-03
Updated to CLBlast version 1.4.0
Cedric Nugteren
2018-06-02
Added MKL as an alternative for CBLAS for correctness and performance compari...
Cedric Nugteren
2018-05-19
The GEMM routine tuner now loads kernel JSON tuning results from disk if avai...
Cedric Nugteren
2018-05-18
Merge branch 'master' into canary_buffer_overflow_protection
Cedric Nugteren
2018-05-17
Added a canary region for overflow detection to the correctness tests
Cedric Nugteren
2018-05-01
Now stores a shared_ptr to the Program class in the cache
Cedric Nugteren
2018-04-29
Merge pull request #277 from CNugteren/CLBlast-257-intel-subgroups
Cedric Nugteren
2018-04-29
Updated the changelog
Cedric Nugteren
2018-04-26
Fixed an access violation when compiled with Visual Studio upon releasing the...
Cedric Nugteren
2018-04-15
Updated tuning results for the Skylake ULT GT2 GPU with the new kernel
Cedric Nugteren
2018-04-07
Added tuning results for NVIDIA GeForce 920MX
Cedric Nugteren
2018-03-22
Added the OpenCL local memory size constraint to the tuners
Cedric Nugteren
2018-03-10
Updated the documentation for the tuner API
Cedric Nugteren
2018-02-26
Updated the changelog
Cedric Nugteren
2018-02-20
Fixed several issues in the new invert tuner
Cedric Nugteren
2018-02-18
Updated changelog and roadmap: Python package created
Cedric Nugteren
2018-02-02
Implemented the XHAD Hadamard product routine
Cedric Nugteren
2018-01-29
Updated to CLBlast version 1.3.0
Cedric Nugteren
2018-01-11
Added a RetrieveParameters function to inspect tuning parameters
Cedric Nugteren
2018-01-08
Implemented the in-direct version of the strided-batched GEMM kernel
Cedric Nugteren
2018-01-06
Updated changelog and roadmap
Cedric Nugteren
2017-12-31
Fixed the issue with AMD's APP compiler not being able to compile the invert ...
Cedric Nugteren
2017-12-27
Split the database into multiple small compilation units
Cedric Nugteren
2017-12-23
Updated the database to use the new TRSV and Invert tuners
Cedric Nugteren
2017-12-20
Added try-except to database script parser to skip invalid files
Cedric Nugteren
2017-12-17
Removed all ARM Mali tuning results; re-added Mali-T760 and Mali-T628 results...
Cedric Nugteren
2017-12-10
Updated roadmap: completed pre-processor implementation
Cedric Nugteren
2017-12-09
Made the pre-processor run by default for ARM and Qualcomm GPUs
Cedric Nugteren
2017-11-24
Added precision check to parameter override for the clients
Cedric Nugteren
2017-11-19
Revived the GEMM routine tuner; minor formatting changes
Cedric Nugteren
2017-11-09
Added tuning results for the GeForce GTX750Ti
Cedric Nugteren
2017-11-08
Updated to CLBlast version 1.2.0
Cedric Nugteren
2017-11-07
Merge pull request #212 from CNugteren/kernel_selection_tuner
Cedric Nugteren
2017-11-02
Integrated the GEMM routine tuner for kernel selection; added first tuning re...
Cedric Nugteren
[next]