debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
Age
Commit message
Author
2018-10-14
Merge pull request #319 from CNugteren/convgemm_multi_kernel
Cedric Nugteren
2018-10-14
Merge pull request #324 from CNugteren/CLBlast-315-tuning-api-improvements
Cedric Nugteren
2018-10-13
Updated changelog regarding tuning API change
Cedric Nugteren
2018-10-13
Made tuning API more flexible: disregards any extra parameter values
Cedric Nugteren
2018-10-13
Updated the documentation for GEMV tuning
Cedric Nugteren
2018-10-11
Merge pull request #323 from CNugteren/CLBlast-322-fix-preprocessor-warnings
Cedric Nugteren
2018-10-10
Fixed pre-processor warnings related to the subgroup shuffling
Cedric Nugteren
2018-09-16
Merge branch 'master' into convgemm_multi_kernel
Cedric Nugteren
2018-09-15
Merge pull request #318 from CNugteren/CLBlast-315-preprocessor-gemmk1-issue
Cedric Nugteren
2018-09-15
Fixed an MSVC compilation error due to large strings
Cedric Nugteren
2018-09-15
Added a kernel-parameter pair table to document the tuning API
Cedric Nugteren
2018-09-15
Fixed an MSVC compilation error due to large strings
Cedric Nugteren
2018-09-15
Disabled Intel subgroup shuffling for double-precision
Cedric Nugteren
2018-09-15
Fixed issues with GEMMK=1 kernel and the pre-processor
Cedric Nugteren
2018-09-15
Added pre-processor test for GEMMK=1 kernel
Cedric Nugteren
2018-09-07
Reduced size of the xCONVGEMM correctness tests
Cedric Nugteren
2018-09-07
Added reference implementation for xCONVGEMM for half-precision
Cedric Nugteren
2018-09-07
Added xCONVGEMM as im2col plus a batched GEMM kernel
Cedric Nugteren
2018-09-03
Merge pull request #316 from ranocha/patch-1
Cedric Nugteren
2018-09-03
Add Julia Wrapper
Hendrik Ranocha
2018-08-14
Merge pull request #312 from CNugteren/CLBlast-311-missing-event-in-trsv-trsm
Cedric Nugteren
2018-08-13
Made last operation in TRSV and TRSM asynchronous, making the events not null
Cedric Nugteren
2018-08-13
Small refactoring of events in TRSV substitution routine
Cedric Nugteren
2018-08-09
Merge pull request #310 from CNugteren/CLBlast-307-netlib-api-static-opencl-vars
Cedric Nugteren
2018-08-07
Name change of setting to NETLIB_PERSISTENT_OPENCL
Cedric Nugteren
2018-08-05
Added an option to compile the Netlib API with static OpenCL device and context
Cedric Nugteren
2018-08-02
Merge pull request #309 from CNugteren/CLBlast-306-omatcopy-conjugate
Cedric Nugteren
2018-07-31
Merge pull request #308 from CNugteren/CLBlast-301-weird-AMD-Hainan-bug
Cedric Nugteren
2018-07-31
Fixed issue with not performing complex conjugation under certain cases when ...
Cedric Nugteren
2018-07-31
Fixed the tests of OMATCOPY to include proper complex conjugation
Cedric Nugteren
2018-07-31
Fixed an error reporting issue related to the canary region
Cedric Nugteren
2018-07-31
Added note about AMD southern islands GPU issue and the required workaround
Cedric Nugteren
2018-07-31
Added Beignet 1.2.1 requirement to the README for IvyBridge GPUs
Cedric Nugteren
2018-07-31
Updated the tuning results for Intel IvyBridge M GT2
Cedric Nugteren
2018-07-30
Merge pull request #305 from CNugteren/CLBlast-303-tuner-check-local-size
Cedric Nugteren
2018-07-29
Fixed a wrong event issue causing error -57
Cedric Nugteren
2018-07-29
Removed complex numbers support for CONVGEMM
Cedric Nugteren
2018-07-29
Merge branch 'master' into CLBlast-267-convgemm
Cedric Nugteren
2018-07-28
Added print statements to indicate the 4 stages of GEMM tuning
Cedric Nugteren
2018-07-28
The tuners now also check for valid local thread configurations and skip inva...
Cedric Nugteren
2018-07-28
Merge pull request #304 from CNugteren/CLBlast-300-fix-staggered-indices-AMD-...
Cedric Nugteren
2018-07-28
Disabled the use of staggered indices on AMD GPUs for the new GEMMK == 1 kern...
Cedric Nugteren
2018-07-27
Fixed an issue with AMD GPUs and the new GEMMK == 1 kernel
Cedric Nugteren
2018-07-27
Fixed a bug: forgot to initialize the shared pointer for the null kernel
Cedric Nugteren
2018-07-27
Renamed AMD SI workaround defines
Cedric Nugteren
2018-07-25
Added workaround for weird AMD SI Hainan bug
Cedric Nugteren
2018-07-25
Added code to report the average tuning results
Cedric Nugteren
2018-07-23
Merge pull request #297 from tyler-utah/master
Cedric Nugteren
2018-07-16
moved a two-line macro to a single line
Tyler Sorensen
2018-07-14
forgot to add test cases back in, oops
Tyler Sorensen