index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
Age
Commit message (
Expand
)
Author
2019-05-08
Changed back to cl_intel_subgroups as suggested
Cedric Nugteren
2019-05-07
Added a host-code check to make sure the avc_motion_estimation is available
Cedric Nugteren
2019-05-07
Enabled avc_motion_estimation extension for Intel subgroup shuffling
Cedric Nugteren
2019-05-03
Remove assert for extention not available in macOS
Umar Arshad
2019-02-09
Added tuning parameters for Tesla P100 16GB
Cedric Nugteren
2019-02-09
Added tuning parameters for Xeon E5-2630 v3 and v4
Cedric Nugteren
2019-01-23
Added fp32 to fp16 conversion function in Python to make haxpy example work
Cedric Nugteren
2019-01-22
Added a (non-working) sample of half precision AXPY in Python
Cedric Nugteren
2019-01-22
Updated pyclblast README, updated to 1.2.0 for half-precision support
Cedric Nugteren
2019-01-22
Added experimental support for half-precision in pyclblast
Cedric Nugteren
2019-01-19
Merge pull request #345 from CNugteren/convolution-fixes-and-tuner
Cedric Nugteren
2019-01-19
Added a few more initial Intel tuning parameters for convgemm
Cedric Nugteren
2019-01-05
Added a check to prevent the stride of matrix C being set to 0 for the stride...
Cedric Nugteren
2018-12-31
Added convgemm to the CLBlast database, added initial parameters for Skylake GPU
Cedric Nugteren
2018-12-31
Added support for the convgemm tuner in the tuner database
Cedric Nugteren
2018-12-31
Added the forgotten batch dimension to the tuner to get correct kernel execut...
Cedric Nugteren
2018-12-18
Fix the xconvgemm tuner
Koichi Akabe
2018-12-18
Added first version of a tuner for the ConvGemm direct kernel
Cedric Nugteren
2018-12-18
Fix xconvgemm kernel and enable ConvGemmMethod::kSingleKernel
Koichi Akabe
2018-11-30
Fixed an issue for unequal MWG and NWG and the new GEMMK == 1 kernel
Cedric Nugteren
2018-11-19
Remove unnecessary attribute of inline function
Koichi Akabe
2018-11-12
Add kernel_mode option to im2col, col2im, and convgemm functions
Koichi Akabe
2018-11-07
Changed col2im to append to the existing im-buffer
Cedric Nugteren
2018-11-01
Added new col2im routine to the documentation
Cedric Nugteren
2018-10-30
Fix col2im implementation
Koichi Akabe
2018-10-23
Added groundwork for col2im algorithm plus first non-working version of kerne...
Cedric Nugteren
2018-10-22
Some name changes in im2col code
Cedric Nugteren
2018-10-17
Fixed a bug with the pre-processing and the AXPY kernel
Cedric Nugteren
2018-10-15
Fixed a bug in the XaxpyFaster kernel for specific parameters
Cedric Nugteren
2018-10-14
Merge pull request #319 from CNugteren/convgemm_multi_kernel
Cedric Nugteren
2018-10-13
Made tuning API more flexible: disregards any extra parameter values
Cedric Nugteren
2018-10-10
Fixed pre-processor warnings related to the subgroup shuffling
Cedric Nugteren
2018-09-16
Merge branch 'master' into convgemm_multi_kernel
Cedric Nugteren
2018-09-15
Fixed an MSVC compilation error due to large strings
Cedric Nugteren
2018-09-15
Fixed an MSVC compilation error due to large strings
Cedric Nugteren
2018-09-15
Disabled Intel subgroup shuffling for double-precision
Cedric Nugteren
2018-09-15
Fixed issues with GEMMK=1 kernel and the pre-processor
Cedric Nugteren
2018-09-07
Added xCONVGEMM as im2col plus a batched GEMM kernel
Cedric Nugteren
2018-08-13
Made last operation in TRSV and TRSM asynchronous, making the events not null
Cedric Nugteren
2018-08-13
Small refactoring of events in TRSV substitution routine
Cedric Nugteren
2018-08-07
Name change of setting to NETLIB_PERSISTENT_OPENCL
Cedric Nugteren
2018-08-05
Added an option to compile the Netlib API with static OpenCL device and context
Cedric Nugteren
2018-08-02
Merge pull request #309 from CNugteren/CLBlast-306-omatcopy-conjugate
Cedric Nugteren
2018-07-31
Merge pull request #308 from CNugteren/CLBlast-301-weird-AMD-Hainan-bug
Cedric Nugteren
2018-07-31
Fixed issue with not performing complex conjugation under certain cases when ...
Cedric Nugteren
2018-07-31
Updated the tuning results for Intel IvyBridge M GT2
Cedric Nugteren
2018-07-29
Fixed a wrong event issue causing error -57
Cedric Nugteren
2018-07-29
Removed complex numbers support for CONVGEMM
Cedric Nugteren
2018-07-29
Merge branch 'master' into CLBlast-267-convgemm
Cedric Nugteren
2018-07-28
Added print statements to indicate the 4 stages of GEMM tuning
Cedric Nugteren
[prev]
[next]