index
:
debian-clblast
debian/sid
upstream/latest
Debian package for CLBlast.
gspr@nonempty.org
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2016-10-10
Changed the storage location of the database to a separate Github repository
Cedric Nugteren
2016-10-10
Changed the license to MIT
Cedric Nugteren
2016-10-10
Updated the performance graphs for Intel Iris Pro GPU and AMD Radeon M370X GPU
Cedric Nugteren
2016-10-10
Added fresh performance graphs for GeForce 750Ti; removed old GTX480 results
Cedric Nugteren
2016-10-10
Updated the tuning results for the GTX 750 Ti GPU
Cedric Nugteren
2016-10-10
Merge branch 'gemm_direct' into development
Cedric Nugteren
2016-10-10
Changed the thresholds for the direct/indirect GEMM kernels for NVIDIA and In...
Cedric Nugteren
2016-10-08
Added benchmark script for small matrix sizes, testing the direct GEMM kernels
Cedric Nugteren
2016-10-08
Fixed a performance bug for Intel Iris Pro GPUs due to incorrect tuning results
Cedric Nugteren
2016-10-06
Added first tuning results for the single-kernel direct GEMM implementation
Cedric Nugteren
2016-10-06
Added a kernel selection database to select between the direct and indirect G...
Cedric Nugteren
2016-10-03
Fixed a const-correctness issue with complex conjugation in the GEMM direct k...
Cedric Nugteren
2016-10-03
Added functions to load from off-chip to local memory without vector loads fo...
Cedric Nugteren
2016-10-03
Re-organised GEMM direct kernel and added faster fall-back version for incomp...
Cedric Nugteren
2016-10-02
Set the default number of runs for all kernels to at least 2 runs
Cedric Nugteren
2016-10-02
Specialised the GEMM direct kernel in four ways for transposing/non-transposi...
Cedric Nugteren
2016-10-02
Split the GEMM direct kernel into two files; set the default tuning target to...
Cedric Nugteren
2016-10-01
Added padding to the local memory of the GEMM direct kernel
Cedric Nugteren
2016-10-01
Added default num-runs to the tuner adding averaging over 10 runs as a defaul...
Cedric Nugteren
2016-10-01
Merge branch 'development' into gemm_direct
Cedric Nugteren
2016-09-27
Added an option to run tuned kernels multiple times to average execution time...
Cedric Nugteren
2016-09-27
Updated to version 8.0 of the CLCudaAPI header
Cedric Nugteren
2016-09-27
Fixed the local memory size computation for the GEMM tuners
Cedric Nugteren
2016-09-27
Now generates test/client/tuner data using a fixed seed to enable reproducabi...
Cedric Nugteren
2016-09-27
Added more relaxed error checking for the half-precision tests
Cedric Nugteren
2016-09-27
Merge pull request #103 from dividiti/link_clblas_with_pthread
Cedric Nugteren
2016-09-26
Use cross-platform thread lib idiom instead of *nix-specific pthread.
Anton Lokhmotov
2016-09-26
Link clBLAS together with pthread.
Anton Lokhmotov
2016-09-25
Added a first version of a tuner for the GEMM direct kernel; collapsed MWGD, ...
Cedric Nugteren
2016-09-25
Separated the tuning parameters of the new direct GEMM kernel from the indire...
Cedric Nugteren
2016-09-25
Added a first version of the direct version of GEMM with local memory
Cedric Nugteren
2016-09-25
Fix another issue with the packaging in the AppVeyor script
Cedric Nugteren
2016-09-25
Fix an issue with the packaging in the AppVeyor script
Cedric Nugteren
2016-09-25
Updated AppVeyor script to fix an issue with changes in the latest AppVeyor s...
Cedric Nugteren
2016-09-24
Merge pull request #101 from dividiti/add_ref_includes_to_test_correctness_co...
Cedric Nugteren
2016-09-24
Add path to ref library header when building tests.
Anton Lokhmotov
2016-09-22
Fixed a bug waiting for an invalid event in case of a non-succesfull CLBlast ...
Cedric Nugteren
2016-09-21
Merge branch 'development' into gemm_direct
Cedric Nugteren
2016-09-21
It is now possible to set the OpenCL compiler options through an environmenta...
Cedric Nugteren
2016-09-21
Merge branch 'master' into development
Cedric Nugteren
2016-09-20
Merge pull request #100 from gpu/master
Cedric Nugteren
2016-09-20
Fixed link in README.md
Marco Hutter
2016-09-13
Merge pull request #99 from CNugteren/development
Cedric Nugteren
2016-09-13
Updated to version 0.9.0
Cedric Nugteren
2016-09-13
Renamed the DEFAULT_DEVICE and DEFAULT_PLATFORM env variables to be in line w...
Cedric Nugteren
2016-09-13
Merge pull request #98 from intelfx/no-ignored-attributes
Cedric Nugteren
2016-09-13
CMakeLists.txt: use -Wno-ignored-attributes to silence unfixable warnings
Ivan Shapovalov
2016-09-12
Split the XGEMM kernel further up: now in 3 parts. This is done because MSVC ...
Cedric Nugteren
2016-09-12
Merge branch 'database_rewrite' into development
Cedric Nugteren
2016-09-12
Added XgemvFastRot and Xgemm 16-bit tuning results: just defaults which are n...
Cedric Nugteren
[next]