Age | Commit message (Collapse) | Author | |
---|---|---|---|
2017-07-02 | Added interface and stubs for the im2col routine | Cedric Nugteren | |
2017-06-21 | Fixes some compilation issues related to the database structure change | Cedric Nugteren | |
2017-05-26 | Fixes inability to run GEMM on multiple identical GPUs (issue #155) | Kirill Mavreshko | |
2017-05-12 | Added the IxAMIN routines: absolute minimum version of IxAMAX | Cedric Nugteren | |
2017-04-10 | Removed const-vector-of-const-objects from the database class to remain ↵ | Cedric Nugteren | |
according to the C++11 standard | |||
2017-03-10 | Added API and test infrastructure for the batched GEMM routine | Cedric Nugteren | |
2017-03-08 | Make batched routines based on offsets instead of a vector of cl_mem objects ↵ | Cedric Nugteren | |
- undoing many earlier changes | |||
2017-03-05 | Added first naive version of the batched AXPY routine | Cedric Nugteren | |
2017-03-05 | Prepared generator for batched routines; added batched AXPY routine interface | Cedric Nugteren | |
2017-02-26 | Merge branch 'development' into triangular_solvers | Cedric Nugteren | |
2017-02-26 | Removed half-precision support from the TRSM routine; too unstable | Cedric Nugteren | |
2017-02-16 | Added a C interface to the OverrideParameters function; added some in-line ↵ | Cedric Nugteren | |
comments to the API | |||
2017-02-16 | Added input-sanity checks for the OverrideParameters function | Cedric Nugteren | |
2017-02-13 | Added first version of the OverrideParameters function | Cedric Nugteren | |
2017-02-05 | Merge branch 'development' into triangular_solvers | Cedric Nugteren | |
2017-01-24 | Routine, Cache: generalize, reduce amount of copying in fast path | Ivan Shapovalov | |
Implement a generalized Cache<K, V>. Two variants are provided: the first one is based on std::map, using C++14-specific transparent std::less<> and generalized std::map::find() to allow searching by tuple of references. The second one is based on std::vector and O(n) lookup, but remains C++11-compliant. | |||
2017-01-24 | FillCache: perform compilation for each precision separately | Ivan Shapovalov | |
Thus do not prevent filling cache for float if the device does not support e. g. double. | |||
2017-01-20 | treewide: include clpp11.hpp first to silence deprecation warnings | Ivan Shapovalov | |
Otherwise, cl.h gets included through clblast.h before clpp11.hpp. | |||
2017-01-20 | Added prototype for the TRSV routine | Cedric Nugteren | |
2016-12-18 | Prepared for the addition of the TRSM triangular solver kernel | Cedric Nugteren | |
2016-10-22 | Routine: get rid of ::SetUp() | Ivan Shapovalov | |
Since we now use C++ exceptions inside the implementation (and exceptions can be thrown from constructors), there is no need for a separate Routine::SetUp() function. For this, we also change the way how the kernel source string is constructed. The kernel-specific source code is now passed to the Routine ctor via an initializer_list of C strings to avoid unnecessary data copying while also working around C1091 of MSVC 2013. | |||
2016-10-22 | treewide: use C++ exceptions properly | Ivan Shapovalov | |
Since the codebase is designed around proper C++ idioms such as RAII, it makes sense to only use C++ exceptions internally instead of mixing exceptions and error codes. The exceptions are now caught at top level to preserve compatibility with the existing error code-based API. Note that we deliberately do not catch C++ runtime errors (such as `std::bad_alloc`) nor logic errors (aka failed assertions) because no actual handling can ever happen for such errors. However, in the C interface we do catch _all_ exceptions (...) and convert them into a wild-card error code. | |||
2016-06-30 | Added declspec(dllexport) to ClearCache and FillCache, and added ↵ | Cedric Nugteren | |
declspec(dllimport) when not building the library | |||
2016-06-19 | Renamed all C++ source files to .cpp to match the .hpp extension better | Cedric Nugteren | |