Age | Commit message (Collapse) | Author | |
---|---|---|---|
2017-11-17 | Moved compilation function to separate file; removed dependency of tuners of ↵ | Cedric Nugteren | |
the CLBlast library | |||
2017-11-11 | Factored out the creation of the OpenCL header and the program compilation | Cedric Nugteren | |
2017-10-09 | Removed include of clpp11.hpp in places other than utilities.hpp | Cedric Nugteren | |
2017-08-12 | Moved functions from the header to the .cpp file to prevent compiling the ↵ | Cedric Nugteren | |
same code multiple times | |||
2017-03-19 | Added batched versions of the pad/copy/transpose kernels | Cedric Nugteren | |
2017-02-26 | Merge branch 'development' into triangular_solvers | Cedric Nugteren | |
2017-02-26 | Fixed an out-of-bounds memory access when filling a matrix with a constant | Cedric Nugteren | |
2017-02-12 | Split the database into several smaller cached per-kernel databases (in ↵ | Cedric Nugteren | |
preparation of per-kernel database overrides) | |||
2017-02-05 | Merge branch 'development' into triangular_solvers | Cedric Nugteren | |
2017-01-29 | Added first (incomplete) version of TRSV routine | Cedric Nugteren | |
2017-01-20 | treewide: include clpp11.hpp first to silence deprecation warnings | Ivan Shapovalov | |
Otherwise, cl.h gets included through clblast.h before clpp11.hpp. | |||
2017-01-15 | Added a first version of the diagonal block invert routine in preparation of ↵ | Cedric Nugteren | |
TRSM | |||
2016-10-23 | Fixed a bug in the transpose-matrix function | Cedric Nugteren | |
2016-10-22 | treewide: use C++ exceptions properly | Ivan Shapovalov | |
Since the codebase is designed around proper C++ idioms such as RAII, it makes sense to only use C++ exceptions internally instead of mixing exceptions and error codes. The exceptions are now caught at top level to preserve compatibility with the existing error code-based API. Note that we deliberately do not catch C++ runtime errors (such as `std::bad_alloc`) nor logic errors (aka failed assertions) because no actual handling can ever happen for such errors. However, in the C interface we do catch _all_ exceptions (...) and convert them into a wild-card error code. | |||
2016-07-22 | clblast::RunKernel, cl::Kernel: unify variants with/without waitForEvents, ↵ | Ivan Shapovalov | |
support empty LWS | |||
2016-07-22 | clblast::RunKernel, cl::Kernel: take const vector as waitForEvents | Ivan Shapovalov | |
2016-07-16 | Removed an unused variable from the copy-transpose-pad function | Cedric Nugteren | |
2016-07-10 | Now passing alpha/beta to the kernel as arguments as before fp16 support; in ↵ | Cedric Nugteren | |
case of fp16 arguments are cast on host and in kernel | |||
2016-06-18 | Moved all headers into the source tree, changed headers to .hpp extension | Cedric Nugteren | |