debian-clblast
Debian package for CLBlast.
gspr@nonempty.org
Age
Commit message
Author
2018-12-18
Update the documentation
Koichi Akabe
2018-12-18
Fix the xconvgemm tuner
Koichi Akabe
2018-12-18
Added first version of a tuner for the ConvGemm direct kernel
Cedric Nugteren
2018-12-18
Fix xconvgemm kernel and enable ConvGemmMethod::kSingleKernel
Koichi Akabe
2018-12-17
Merge pull request #342 from vbkaisetsu/fix/im2col-hf-tests
Cedric Nugteren
2018-12-17
Fix half-float+kernel_mode test cases of im2col, col2im, and convgemm
Koichi Akabe
2018-12-04
Updated to version 1.5.0
Cedric Nugteren
2018-12-01
Updated the roadmap document
Cedric Nugteren
2018-12-01
Added a FAQ document
Cedric Nugteren
2018-12-01
Merge pull request #341 from CNugteren/CLBlast-340-GEMMK1-issue-with-unequal-...
Cedric Nugteren
2018-11-30
Fixed an issue for unequal MWG and NWG and the new GEMMK == 1 kernel
Cedric Nugteren
2018-11-19
Merge pull request #335 from vbkaisetsu/patch-1
Cedric Nugteren
2018-11-19
Remove unnecessary attribute of inline function
Koichi Akabe
2018-11-17
Merge pull request #332 from vbkaisetsu/feature/im2col-col2im-flip
Cedric Nugteren
2018-11-12
Add kernel_mode option to im2col, col2im, and convgemm functions
Koichi Akabe
2018-11-09
Merge pull request #331 from CNugteren/CLBlast-270-col2im
Cedric Nugteren
2018-11-07
Changed col2im to append to the existing im-buffer
Cedric Nugteren
2018-11-01
Added new col2im routine to the documentation
Cedric Nugteren
2018-11-01
Fixed half-precision tests for im2col and col2im
Cedric Nugteren
2018-10-31
Merge pull request #330 from vbkaisetsu/CLBlast-270-col2im
Cedric Nugteren
2018-10-30
Fix col2im implementation
Koichi Akabe
2018-10-29
Merge pull request #329 from tholu/patch-1
Cedric Nugteren
2018-10-28
Update FindOpenCL.cmake
Thomas Lutz
2018-10-23
Added groundwork for col2im algorithm plus first non-working version of kerne...
Cedric Nugteren
2018-10-22
Some name changes in im2col code
Cedric Nugteren
2018-10-17
Fixed MSVC's compilation error C1061 due to too many for-loops
Cedric Nugteren
2018-10-17
Fixed a bug with the pre-processing and the AXPY kernel
Cedric Nugteren
2018-10-16
Merge pull request #325 from CNugteren/CLBlast-321-axpy-faster-kernel-bug
Cedric Nugteren
2018-10-15
Fixed a bug in the XaxpyFaster kernel for specific parameters
Cedric Nugteren
2018-10-14
Merge pull request #319 from CNugteren/convgemm_multi_kernel
Cedric Nugteren
2018-10-14
Merge pull request #324 from CNugteren/CLBlast-315-tuning-api-improvements
Cedric Nugteren
2018-10-13
Updated changelog regarding tuning API change
Cedric Nugteren
2018-10-13
Made tuning API more flexible: disregards any extra parameter values
Cedric Nugteren
2018-10-13
Updated the documentation for GEMV tuning
Cedric Nugteren
2018-10-11
Merge pull request #323 from CNugteren/CLBlast-322-fix-preprocessor-warnings
Cedric Nugteren
2018-10-10
Fixed pre-processor warnings related to the subgroup shuffling
Cedric Nugteren
2018-09-16
Merge branch 'master' into convgemm_multi_kernel
Cedric Nugteren
2018-09-15
Merge pull request #318 from CNugteren/CLBlast-315-preprocessor-gemmk1-issue
Cedric Nugteren
2018-09-15
Fixed an MSVC compilation error due to large strings
Cedric Nugteren
2018-09-15
Added a kernel-parameter pair table to document the tuning API
Cedric Nugteren
2018-09-15
Fixed an MSVC compilation error due to large strings
Cedric Nugteren
2018-09-15
Disabled Intel subgroup shuffling for double-precision
Cedric Nugteren
2018-09-15
Fixed issues with GEMMK=1 kernel and the pre-processor
Cedric Nugteren
2018-09-15
Added pre-processor test for GEMMK=1 kernel
Cedric Nugteren
2018-09-07
Reduced size of the xCONVGEMM correctness tests
Cedric Nugteren
2018-09-07
Added reference implementation for xCONVGEMM for half-precision
Cedric Nugteren
2018-09-07
Added xCONVGEMM as im2col plus a batched GEMM kernel
Cedric Nugteren
2018-09-03
Merge pull request #316 from ranocha/patch-1
Cedric Nugteren
2018-09-03
Add Julia Wrapper
Hendrik Ranocha
2018-08-14
Merge pull request #312 from CNugteren/CLBlast-311-missing-event-in-trsv-trsm
Cedric Nugteren