debian-clblast - Debian package for CLBlast.

Age	Commit message (Collapse)	Author
2019-01-26	Merge pull request #348 from ↵	Cedric Nugteren
	CNugteren/CLBlast-334-pyclblast-half-precision-support PyCLBlast half precision support
2019-01-23	Added fp32 to fp16 conversion function in Python to make haxpy example work	Cedric Nugteren

2019-01-22	Added a (non-working) sample of half precision AXPY in Python	Cedric Nugteren

2019-01-22	Updated pyclblast README, updated to 1.2.0 for half-precision support	Cedric Nugteren

2019-01-22	Added experimental support for half-precision in pyclblast	Cedric Nugteren

2019-01-19	Merge pull request #345 from CNugteren/convolution-fixes-and-tuner	Cedric Nugteren
	Convolution with single kernel
2019-01-19	Added documentation on the convgemm routine	Cedric Nugteren

2019-01-19	Added a few more initial Intel tuning parameters for convgemm	Cedric Nugteren

2019-01-05	Added a check to prevent the stride of matrix C being set to 0 for the ↵	Cedric Nugteren
	strided-batched-GEMM routine
2018-12-31	Added convgemm to the CLBlast database, added initial parameters for Skylake GPU	Cedric Nugteren

2018-12-31	Added support for the convgemm tuner in the tuner database	Cedric Nugteren

2018-12-31	Added the forgotten batch dimension to the tuner to get correct kernel ↵	Cedric Nugteren
	executions
2018-12-23	Merge pull request #343 from vbkaisetsu/feature/convgemm-single	Cedric Nugteren
	Fix single kernel version of convgemm
2018-12-22	Merge branch 'master' into convolution-fixes-and-tuner	Cedric Nugteren

2018-12-21	Update changelog	Koichi Akabe

2018-12-18	Update the documentation	Koichi Akabe

2018-12-18	Fix the xconvgemm tuner	Koichi Akabe

2018-12-18	Added first version of a tuner for the ConvGemm direct kernel	Cedric Nugteren

2018-12-18	Fix xconvgemm kernel and enable ConvGemmMethod::kSingleKernel	Koichi Akabe

2018-12-17	Merge pull request #342 from vbkaisetsu/fix/im2col-hf-tests	Cedric Nugteren
	Fix half-float+kernel_mode test cases of im2col, col2im, and convgemm
2018-12-17	Fix half-float+kernel_mode test cases of im2col, col2im, and convgemm	Koichi Akabe

2018-12-04	Updated to version 1.5.0	Cedric Nugteren

2018-12-01	Updated the roadmap document	Cedric Nugteren

2018-12-01	Added a FAQ document	Cedric Nugteren

2018-12-01	Merge pull request #341 from ↵	Cedric Nugteren
	CNugteren/CLBlast-340-GEMMK1-issue-with-unequal-MWG-NWG Fixed an issue for the GEMMK == 1 kernel
2018-11-30	Fixed an issue for unequal MWG and NWG and the new GEMMK == 1 kernel	Cedric Nugteren

2018-11-19	Merge pull request #335 from vbkaisetsu/patch-1	Cedric Nugteren
	Remove unnecessary qualifier of inline function
2018-11-19	Remove unnecessary attribute of inline function	Koichi Akabe

2018-11-17	Merge pull request #332 from vbkaisetsu/feature/im2col-col2im-flip	Cedric Nugteren
	Add im2colflip and col2imflip functions
2018-11-12	Add kernel_mode option to im2col, col2im, and convgemm functions	Koichi Akabe

2018-11-09	Merge pull request #331 from CNugteren/CLBlast-270-col2im	Cedric Nugteren
	Implements col2im routine
2018-11-07	Changed col2im to append to the existing im-buffer	Cedric Nugteren

2018-11-01	Added new col2im routine to the documentation	Cedric Nugteren

2018-11-01	Fixed half-precision tests for im2col and col2im	Cedric Nugteren

2018-10-31	Merge pull request #330 from vbkaisetsu/CLBlast-270-col2im	Cedric Nugteren
	Add col2im function
2018-10-30	Fix col2im implementation	Koichi Akabe

2018-10-29	Merge pull request #329 from tholu/patch-1	Cedric Nugteren
	Update FindOpenCL.cmake
2018-10-28	Update FindOpenCL.cmake	Thomas Lutz
	Add path to ROCm OpenCL as possible location in cmake script
2018-10-23	Added groundwork for col2im algorithm plus first non-working version of ↵	Cedric Nugteren
	kernel and test
2018-10-22	Some name changes in im2col code	Cedric Nugteren

2018-10-17	Fixed MSVC's compilation error C1061 due to too many for-loops	Cedric Nugteren

2018-10-17	Fixed a bug with the pre-processing and the AXPY kernel	Cedric Nugteren

2018-10-16	Merge pull request #325 from CNugteren/CLBlast-321-axpy-faster-kernel-bug	Cedric Nugteren
	Fixed a bug in the XaxpyFaster kernel for specific parameters
2018-10-15	Fixed a bug in the XaxpyFaster kernel for specific parameters	Cedric Nugteren

2018-10-14	Merge pull request #319 from CNugteren/convgemm_multi_kernel	Cedric Nugteren
	First im2col+GEMM implementation of convolution
2018-10-14	Merge pull request #324 from CNugteren/CLBlast-315-tuning-api-improvements	Cedric Nugteren
	Made tuning API more flexible
2018-10-13	Updated changelog regarding tuning API change	Cedric Nugteren

2018-10-13	Made tuning API more flexible: disregards any extra parameter values	Cedric Nugteren

2018-10-13	Updated the documentation for GEMV tuning	Cedric Nugteren

2018-10-11	Merge pull request #323 from CNugteren/CLBlast-322-fix-preprocessor-warnings	Cedric Nugteren
	Fixed pre-processor warnings related to the subgroup shuffling