debian-clblast
Debian package for CLBlast.
gspr@nonempty.org
| Age | Commit message | Author |
|---|---|---|
| 2015-08-19 | Added Travis build-status to the README | CNugteren |
| 2015-08-19 | Now using apt-get directly in Travis | CNugteren |
| 2015-08-19 | Updated fglrx package in Travis | CNugteren |
| 2015-08-19 | Added OpenCL and Clang to travis | CNugteren |
| 2015-08-18 | Added GCC 4.8 and updated CMake | CNugteren |
| 2015-08-18 | Added initial .travis.yml file | CNugteren |
| 2015-08-13 | Merge pull request #21 from CNugteren/c_api | Cedric Nugteren |
| 2015-08-13 | Added the plain C API | CNugteren |
| 2015-08-13 | Added all supported routines to the C API | CNugteren |
| 2015-08-13 | Fixed a complex data-type bug in the transpose kernel | CNugteren |
| 2015-08-13 | Added SGEMM example using the C API | CNugteren |
| 2015-08-13 | Added initial version of C API with just one routine | CNugteren |
| 2015-08-13 | Added argument m,n,k metadata to JSON files | CNugteren |
| 2015-08-09 | Refactored the tuners, added JSON output | CNugteren |
| 2015-08-04 | Merge pull request #19 from CNugteren/basic_level2_routines | Cedric Nugteren |
| 2015-08-04 | Added distinguished names for GEMV inherited HEMV/SYMV | CNugteren |
| 2015-08-03 | Abstracted loading of matrix A for GEMV kernel | CNugteren |
| 2015-07-31 | Added HEMV and SYMV | CNugteren |
| 2015-07-31 | Added HEMV and SYMV | CNugteren |
| 2015-07-31 | Added HEMV routine | CNugteren |
| 2015-07-31 | Added SYMV routine | CNugteren |
| 2015-07-31 | Merge pull request #18 from CNugteren/correctness_test_refactoring | Cedric Nugteren |
| 2015-07-31 | Refactored the correctness tests | CNugteren |
| 2015-07-31 | Merge pull request #17 from CNugteren/clblas_external | Cedric Nugteren |
| 2015-07-31 | Updated documentation reflecting removal of clBLAS sources | CNugteren |
| 2015-07-31 | Removed clBLAS source code, now requires separate installation | CNugteren |
| 2015-07-27 | Moved the preferred options of clBLAS (no tests) to the CLBlast CMakeLists file | CNugteren |
| 2015-07-27 | Merge pull request #16 from CNugteren/claduc_header | Cedric Nugteren |
| 2015-07-27 | Now using the new Claduc C++11 OpenCL header | CNugteren |
| 2015-07-24 | Prepared the changelog for the next release | CNugteren |
| 2015-07-24 | Updated to version 0.3.0 | CNugteren |
| 2015-07-24 | Merge pull request #14 from CNugteren/amd_performance | Cedric Nugteren |
| 2015-07-24 | Updated the docs to reflect the performance improvements | CNugteren |
| 2015-07-23 | Updated the performance results, added HD7950 | CNugteren |
| 2015-07-22 | Made the graph script robust against diagnostic system messages | CNugteren |
| 2015-07-22 | Set the correct name for AMD OpenCL devices | CNugteren |
| 2015-07-22 | Updated GEMM tuning results for Tahiti | CNugteren |
| 2015-07-22 | Added workgroup shuffle option to transpose kernel for AMD GPUs | CNugteren |
| 2015-07-21 | Transpose kernel now uses vectorized local memory loads and stores | CNugteren |
| 2015-07-19 | Triangular GEMM kernels are only compiled when needed | CNugteren |
| 2015-07-19 | Kernel caching is now based on a routine's name | CNugteren |
| 2015-07-19 | The kernel source string is now a routine's member variable | CNugteren |
| 2015-07-19 | Fixed complex performance on Intel Iris | CNugteren |
| 2015-07-16 | Fixed a bug when using the Xgemm kernel without local memory | CNugteren |
| 2015-07-16 | Using mad() instruction for AMD devices like clBLAS does | CNugteren |
| 2015-07-15 | Merge pull request #13 from CNugteren/bypass_pre_post_processing | Cedric Nugteren |
| 2015-07-15 | Updated changelog with pre/post-processing bypass | CNugteren |
| 2015-07-15 | Changed performance graphs to default to column-major | CNugteren |
| 2015-07-15 | Skips pre/post processing kernels if not needed | CNugteren |
| 2015-07-13 | Updated interface of the PadCopyTransposeMatrix method | CNugteren |