diff options
Diffstat (limited to 'CHANGELOG')
-rw-r--r-- | CHANGELOG | 12 |
1 files changed, 12 insertions, 0 deletions
@@ -1,4 +1,16 @@ +Development version (next release) +- Updated to version 6.0 of the CLCudaAPI C++11 OpenCL header +- Fixed proper MSVC dllimport and dllexport declarations +- Fixed memory leaks related to events not being released +- Fixed a bug with a size_t and cl_ulong mismatch on 32-bit systems +- Fixed a bug related to the cache and retrieval of programs based on the OpenCL context +- Fixed a performance issue (caused by fp16 support) by optimizing alpha/beta parameter passing to kernels +- Fixed a bug in the OpenCL kernels: now placing __kernel before __attribute__ +- Added an option (-warm_up) to do a warm-up run before timing in the performance clients +- Improved performance significantly of rotated GEMV computations +- Added tuned parameters for various devices (see README) + Version 0.8.0 - Added support for half-precision floating-point (fp16) in the library - Made it possible to compile the performance tests (clients) separately from the correctness tests |