debian-clblast
Debian package for CLBlast.
gspr@nonempty.org
Age | Commit message | Author
2016-02-20 | Added support for xGERU and xGERC routines | Cedric Nugteren
2016-02-20 | Added XGER routine, kernel, and tuner | Cedric Nugteren
2016-02-08 | Fixed warnings under MSVC | CNugteren
2016-02-08 | Separated the GEMM kernel in two parts to reduce string length for MSVC | Cedric Nugteren
2016-02-08 | Split-up the XGEMV kernel in two parts | Cedric Nugteren
2016-02-07 | Added tuning parameters for various devices using the new database script | Cedric Nugteren
2016-02-07 | Various fixes to the database script | Cedric Nugteren
2016-02-07 | Added dictionary with short and long OpenCL vendor names to fix issues with I... | Cedric Nugteren
2016-02-07 | Made the tuning database an optional external download | Cedric Nugteren
2016-02-06 | Made the database script compatible with Python 3 | CNugteren
2016-02-06 | Reduced the maximum workgroup-size for GEMV kernels further | CNugteren
2016-02-06 | Changed the order of tuners in the alltuners target | Cedric Nugteren
2016-02-06 | Reduced unrolling factor in xgemv kernel to reduce compilation times | CNugteren
2016-02-06 | Fixed a linker error in the performance client under GCC | CNugteren
2016-01-30 | Fixes for compilation under Visual Studio | CNugteren
2016-01-30 | Prepared for MSVC support | Cedric Nugteren
2016-01-30 | Fixed a bug in the graph scripts (thanks to Victor Pakhomov) | Cedric Nugteren
2016-01-30 | Updated to version 4.0 of the CLCudaAPI header | Cedric Nugteren
2016-01-30 | Merge branch 'tuning_database' into development | Cedric Nugteren
2016-01-30 | Added first auto-generated database headers from the Python database; only K4... | Cedric Nugteren
2016-01-24 | Minor improvements to the database script, including proper file paths | Cedric Nugteren
2016-01-24 | Added Python function to compute defaults for a particular device/vendor comb... | Cedric Nugteren
2016-01-23 | Updated FindOpenCL for Intel Linux OpenCL paths | Cedric Nugteren
2015-10-28 | Added tuning data for Tesla K40 | CNugteren
2015-10-28 | Now sets local memory size in xgemv tuner properly | CNugteren
2015-10-25 | Added initial tuning database with Intel Iris data | CNugteren
2015-10-25 | Updated tuning database script according to the new JSON format | CNugteren
2015-10-25 | Fixed an arguments-related bug in the GEMV tuner | CNugteren
2015-10-25 | Moved the tuner database script to a separate folder | CNugteren
2015-10-23 | Added alpha and beta to tuner meta-data | CNugteren
2015-10-17 | Prepared the changelog for the next release | CNugteren
2015-10-17 | Updated to version 0.5.0 | CNugteren
2015-10-17 | Travis now also build the development branch | CNugteren
2015-10-17 | Merge pull request #28 from CNugteren/kernels_reorganization | Cedric Nugteren
2015-10-13 | Added guards for routine-specific level-3 pad kernels | CNugteren
2015-10-12 | Routine names are now all default arguments defined in the header | CNugteren
2015-10-12 | Moved level3 kernel files to a subfolder | CNugteren
2015-09-26 | Merge pull request #27 from CNugteren/level2_matrix_vector | Cedric Nugteren
2015-09-26 | Added TRMV/TBMV/TPMV routines | CNugteren
2015-09-26 | Made buffer copying a const-method for the source | CNugteren
2015-09-19 | Added SBMV and SPMV routines | CNugteren
2015-09-19 | Added the HPMV routine | CNugteren
2015-09-19 | Added infrastructure for packed matrices | CNugteren
2015-09-19 | Added the HBMV routine | CNugteren
2015-09-18 | Improved the organization and performance of level 2 routines | CNugteren
2015-09-18 | Added first version of banded matrix-vector multiplication | CNugteren
2015-09-18 | Merge pull request #26 from CNugteren/routine_definitions | Cedric Nugteren
2015-09-18 | Added generated main functions for correctness/performance tests for level 2 ... | CNugteren
2015-09-17 | Added interface of all level 2 routines | CNugteren
2015-09-17 | Added script to generate API interface and implementation automatically | CNugteren