Age | Commit message | Author
2017-08-11 | Updated CMakeLists to include header files such that IDEs can locate them | Cedric Nugteren
2017-08-08 | Updated to version 1.0.1 (bugfix release) | Cedric Nugteren
2017-07-31 | Merge pull request #180 from mcian/restore_direct_GEMM | Cedric Nugteren
    Restore direct gemm
2017-07-31 | Restore direct GEMM to previous version | mcian
2017-07-31 | Merge pull request #1 from CNugteren/master | mcian
    fork up to date
2017-07-30 | Updated to version 1.0.0 | Cedric Nugteren
2017-07-30 | Fixes for Travis automatic deployment | Cedric Nugteren
2017-07-30 | Fixes warnings for Clang & AppleClang | Cedric Nugteren
2017-07-30 | Fixes to AppVeyor and Travis scripts | Cedric Nugteren
2017-07-30 | Improved deployment procedure of automatic builds | Cedric Nugteren
2017-07-30 | Added tuning results for the Qualcomm Adreno 330 GPU | Cedric Nugteren
2017-07-25 | Minor optimization for the direct GEMM kernel: don't ceil m and n unnecessarily high | Cedric Nugteren
2017-07-24 | Added status badges for correctness tests; updated list of contributors; fixed minor typos | Cedric Nugteren
2017-07-23 | Merge branch 'relax_gemmbatched_ld_requirements' | Cedric Nugteren
2017-07-22 | Merge pull request #176 from CNugteren/inline_keyword_optional | Cedric Nugteren
    Made the inline keyword in kernels optional
2017-07-19 | Merge pull request #175 from mcian/Arm_Threshold | Cedric Nugteren
    Add new threshold for ARM
2017-07-17 | Add new threshold for ARM | mcian
2017-07-12 | Fixed batched tests when testing for invalid sizes against clBLAS | Cedric Nugteren
2017-07-12 | Relaxed requirement on a_ld and b_ld for batched GEMM | Cedric Nugteren
2017-07-12 | Removed spurious warning for Clang < 3.9 | Cedric Nugteren
2017-07-09 | Merge pull request #172 from CNugteren/msvc_improvements | Cedric Nugteren
    Windows & MSVC improvements
2017-07-09 | Changed printf-statements with %zu into std::cout to fix MSVC 2013 compatibility | Cedric Nugteren
2017-07-09 | Disabled UNIX-style terminal color printing under Windows | Cedric Nugteren
2017-07-08 | Made the inline keyword in kernels optional; currently only enabled for NVIDIA and ARM GPUs | Cedric Nugteren
2017-07-01 | Merge branch 'gemm_direct_bug' | Cedric Nugteren
2017-06-30 | Fixed an if-statement in the direct GEMM kernel causing a bug with specific sets of input parameters | Cedric Nugteren
2017-06-29 | Added batched GEMM example program | Cedric Nugteren
2017-06-27 | Moved and inlined some static member variables and disabled spurious clang warnings | Cedric Nugteren
2017-06-27 | Undo of earlier move of TestBlas::kTransposes constant to fix MSVC 2013 compilation | Cedric Nugteren
2017-06-26 | Fixed and suppressed several warnings for MSVC | Cedric Nugteren
2017-06-26 | Reduced optimization level for the (non-performance critical) host-code to speed up compilation | Cedric Nugteren
2017-06-25 | Moved static variable declarations from .cpp to .hpp to resolve some Clang warnings | Cedric Nugteren
2017-06-25 | Added CLBlast paper and presentation references in README | Cedric Nugteren
2017-06-25 | Fixed some Clang and MSVC warnings | Cedric Nugteren
2017-06-25 | Merge branch 'database_compilation_speed' | Cedric Nugteren
2017-06-21 | Fixes some compilation issues related to the database structure change | Cedric Nugteren
2017-06-20 | Changed the structure of the database to reduce compilation time and save memory | Cedric Nugteren
2017-06-18 | Added tuning results for the Core i7-920 CPU | Cedric Nugteren
2017-06-18 | Fixed an overflow bug on 32-bit systems when choosing a GEMM kernel | Cedric Nugteren
2017-06-01 | Added tuning results for GeForce GT 650M (thanks to bzcheeseman) | Cedric Nugteren
2017-05-27 | Merge pull request #158 from CNugteren/msvc_compilation_fixes | Cedric Nugteren
    MSVC compilation fixes
2017-05-27 | Update to AppVeyor because of changed Khronos repository (9) | Cedric Nugteren
2017-05-27 | Update to AppVeyor because of changed Khronos repository (8) | Cedric Nugteren
2017-05-27 | Update to AppVeyor because of changed Khronos repository (7) | Cedric Nugteren
2017-05-27 | Update to AppVeyor because of changed Khronos repository (6) | Cedric Nugteren
2017-05-27 | Update to AppVeyor because of changed Khronos repository (5) | Cedric Nugteren
2017-05-27 | Update to AppVeyor because of changed Khronos repository (4) | Cedric Nugteren
2017-05-27 | Update to AppVeyor because of changed Khronos repository (3) | Cedric Nugteren
2017-05-27 | Merge pull request #157 from kpot/improved_caching | Cedric Nugteren
    Fixes inability to run GEMM on multiple identical GPUs (issue #155)
2017-05-27 | Fixed comment describing the order of program cache fields | Kirill Mavreshko