index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
/
utilities
Age
Commit message (
Expand
)
Author
2017-12-24
Fixes for the CUDA backend of CLBlast
Cedric Nugteren
2017-12-23
Added TRSV block-size tuner
Cedric Nugteren
2017-12-17
Removed all ARM Mali tuning results; re-added Mali-T760 and Mali-T628 results...
Cedric Nugteren
2017-12-10
Fixed a missing include
Cedric Nugteren
2017-12-09
Made the pre-processor run by default for ARM and Qualcomm GPUs
Cedric Nugteren
2017-11-30
Integrated pre-processor in compilation flow, default is still disabled
Cedric Nugteren
2017-11-25
Moved string splitting functions; added string character removal function
Cedric Nugteren
2017-11-22
Made parameter override in the clients a command-line argument and added supp...
Cedric Nugteren
2017-11-19
Added compilation timing and better compilation error reporting
Cedric Nugteren
2017-11-19
Revived the GEMM routine tuner; minor formatting changes
Cedric Nugteren
2017-11-17
Moved compilation function to separate file; removed dependency of tuners of ...
Cedric Nugteren
2017-11-15
Added first version of integrated and re-written auto-tuner
Cedric Nugteren
2017-11-15
Added kernel timing functionality to the utilities
Cedric Nugteren
2017-11-15
Added exception handle with catch-all
Cedric Nugteren
2017-11-13
Made the exception dispatch function optionally silent
Cedric Nugteren
2017-11-13
Moved square-difference utility function for use in the tuners
Cedric Nugteren
2017-11-07
Merge pull request #212 from CNugteren/kernel_selection_tuner
Cedric Nugteren
2017-11-02
Integrated the GEMM routine tuner for kernel selection; added first tuning re...
Cedric Nugteren
2017-10-30
Added collecting and printing of scores for the kernel-selection tuner
Cedric Nugteren
2017-10-29
Added Android support using the GNU C++ STL library and the GCC toolchain
Cedric Nugteren
2017-10-28
Merge branch 'master' into android_support
Cedric Nugteren
2017-10-28
Added initial version of a GEMM kernel selection tuner
Cedric Nugteren
2017-10-28
Moved timing function to a separate file
Cedric Nugteren
2017-10-15
Various fixes to make the first CUDA examples work
Cedric Nugteren
2017-10-12
CUDA API now takes context and device in instead of stream
Cedric Nugteren
2017-10-11
Added first (untested) version of a CUDA API
Cedric Nugteren
2017-10-09
Removed include of clpp11.hpp in places other than utilities.hpp
Cedric Nugteren
2017-10-08
Moved the remaining OpenCL specific host code to the clpp11.h header where it...
Cedric Nugteren
2017-10-07
Synchronizes clpp11.h with CLCudaAPI 9.0
Cedric Nugteren
2017-09-26
Added Android header for compilation with gnustl STL
Cedric Nugteren
2017-09-16
Fixed a compilation error and warning under MacOS
Cedric Nugteren
2017-09-14
Added architecture layer in the tuning database for better performance on uns...
Cedric Nugteren
2017-09-10
Added the new vendor-architecture-name hierarchy to the tuners as well
Cedric Nugteren
2017-09-08
Introduced the notion of a device-architecture for the database and added dev...
Cedric Nugteren
2017-08-24
Merge branch 'master' into im_to_col
Cedric Nugteren
2017-08-23
Made the im2col client properly handle the arguments
Cedric Nugteren
2017-08-21
Merge pull request #173 from mcian/PSO_params
Cedric Nugteren
2017-08-21
Remove multistrategy and related functions
mcian
2017-08-12
Merge branch 'master' into im_to_col
Cedric Nugteren
2017-08-12
Moved some utility functions to a test-specific utility compilation-unit
Cedric Nugteren
2017-07-23
Code refactoring
mcian
2017-07-17
Add PSO parameters support and search strategy selection from command line
mcian
2017-07-16
First step towards supporting im2col in the test infrastructure
Cedric Nugteren
2017-07-12
Relaxed requirement on a_ld and b_ld for batched GEMM
Cedric Nugteren
2017-05-26
Fixed a compilation issue under MSVC 2013
Cedric Nugteren
2017-04-17
Fixed a namespace clash with CUDA FP16 for the half-datatype
Cedric Nugteren
2017-04-13
Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now w...
Cedric Nugteren
2017-04-10
Merge branch 'development' into cublas_reference
Cedric Nugteren
2017-04-07
Added a special override database for the Apple CPU implementation on OS X: t...
Cedric Nugteren
2017-04-02
Layed the groundwork for cuBLAS comparisons in the clients
Cedric Nugteren
[next]