summaryrefslogtreecommitdiff
path: root/scripts
AgeCommit message (Collapse)Author
2023-06-16Fix pointer error in `pyclblast` on ARM (#490)Yubraj Bhoi
* Fix pointer error in `pyclblast` on ARM Use `ptrdiff_t` instead of `size_t` for pointers. Fix error in `setup.py` * Fix ARM pointer error in `pyclblast` generator Update CHANGELOG file
2023-05-07AMAX/AMIN integer testing and bug fixes (#457)Cedric Nugteren
* Fixed a bug in XAMAX/XMIN routines that caused the increment and offset to be included in the result * Perform proper integer-output testing in XAMAX tests * A few changes towards getting it ready for a PR * Also fix compilation for clBLAS and cuBLAS references * Fix a bug that would only use the real part of complex numbers in the amax/amin routines * A few small fixes related to the AMAX tests
2023-03-25Fix documentation bug w.r.t. ld values and matrix layoutCedric Nugteren
2022-10-13Fix plotting issue with a single row or columnCedric Nugteren
2022-10-13Fix plotting issue in case of 'inf' valuesCedric Nugteren
2020-10-05Allow single graph / subplot on plotWitold Baryluk
`plt.subplots` tries to be special, and return array or not-array depending on a number of subplots. It is not actually helpful, and IMHO bad design. Make it always `ndarray`. The `and not type(axes) is np.ndarray`, is just in case matplotlib decides to make their behavior more uniform. For now work around it. Also, no need for `ndarray.flat` really. Confirmed to work with existing benchmarks (i.e. rows=2, cols=3), and with single graphs (rows=1, cols=1).
2020-10-04Fix a typo in benchmark when running fp 16 vs 32Witold Baryluk
The intention here was to limit the iteration range to common indexes only. Fix that.
2020-10-04Merge pull request #397 from baryluk/patch-1Cedric Nugteren
Fix Python SyntaxWarning
2020-10-04Fix --load_from_disk argument help messageWitold Baryluk
2020-10-04Fix Python SyntaxWarningWitold Baryluk
There is no guarantee that all empty strings objects are the same or share object with `""` literal.
2020-10-02Fix a Python 3 bug in the benchmark scriptCedric Nugteren
2020-05-12Added CLBLAST_VERSION_MAJOR/MINOR/PATCH defines in headers to store version ↵Cedric Nugteren
numbering
2020-05-10Added pyclblast bindings for the 3 batched routinesCedric Nugteren
2020-03-08Update API documentationCedric Nugteren
2019-01-23Added fp32 to fp16 conversion function in Python to make haxpy example workCedric Nugteren
2019-01-22Added experimental support for half-precision in pyclblastCedric Nugteren
2018-12-31Added support for the convgemm tuner in the tuner databaseCedric Nugteren
2018-11-12Add kernel_mode option to im2col, col2im, and convgemm functionsKoichi Akabe
2018-11-07Changed col2im to append to the existing im-bufferCedric Nugteren
2018-10-23Added groundwork for col2im algorithm plus first non-working version of ↵Cedric Nugteren
kernel and test
2018-09-16Merge branch 'master' into convgemm_multi_kernelCedric Nugteren
2018-08-05Added an option to compile the Netlib API with static OpenCL device and contextCedric Nugteren
2018-07-29Removed complex numbers support for CONVGEMMCedric Nugteren
2018-07-29Merge branch 'master' into CLBlast-267-convgemmCedric Nugteren
2018-07-13Added tuning results for HD Graphics 6000 Broadwell GT3Cedric Nugteren
2018-05-09Updated the documentation for convgemm to include data layout (NCHW)Cedric Nugteren
2018-05-06Added convgemm skeleton, test infrastructure, and first reference implementationCedric Nugteren
2018-05-05Added interface of batched convolution as GEMMCedric Nugteren
2018-04-15Updated tuning results for the Skylake ULT GT2 GPU with the new kernelCedric Nugteren
2018-04-10Made it possible to add tuning parameters to the database using the scriptCedric Nugteren
2018-04-10Fixed a bug in the compression part of the database scriptCedric Nugteren
2018-04-08Extended the maximum number of tuning parameters from 14 to 16Cedric Nugteren
2018-04-07Fixed a python3 import error issue with the database scriptCedric Nugteren
2018-03-27mergedkodonell
2018-03-27got the generator thing workingkodonell
2018-03-11Merge pull request #262 from CNugteren/CLBlast-237-tuning-apiCedric Nugteren
CLBlast #237: Tuning API
2018-03-10Made benchmarking script also work for complex numbersCedric Nugteren
2018-03-10Updated the documentation for the tuner APICedric Nugteren
2018-03-10Fixed a few things for the new tuning APICedric Nugteren
2018-03-03Fixed some small issues regarding PR#253Cedric Nugteren
2018-03-03Added C API for getting GEMM temp buffer sizesivagnanamn
2018-02-25Generated PyCLBlast docstringsCedric Nugteren
2018-02-25Some style improvements in the pyclblast code generatorCedric Nugteren
2018-02-25Added API documentation for two missing C++ functionsCedric Nugteren
2018-02-24Renamed the API documentationCedric Nugteren
2018-02-21Fixed duplication of parameter descriptions by the doc generatorKirill Mavreshko
2018-02-18Prepared PyCLBlast for release as a package on PyPiCedric Nugteren
2018-02-18Added all other level 1/2/3 routines to pyclblastCedric Nugteren
2018-02-18Added GEMM to the Python wrapperCedric Nugteren
2018-02-14First agenerated version (clblastXswap only for now) of the pyclblast wrapperCedric Nugteren