path: root/src/routines/levelx
| Age | Commit message | Author |
|---|---|---|
| 2018-04-15 | Fixed some failing tests for GEMM and batched GEMM routines | Cedric Nugteren |
| 2018-04-13 | Made GEMM rotation expectations kernel-specific | Cedric Nugteren |
| 2018-02-02 | Implemented the XHAD Hadamard product routine | Cedric Nugteren |
| 2018-01-31 | Created the API and stubs for the HAD (hadamard-product) routines | Cedric Nugteren |
| 2018-01-26 | Fixed an event synchronisation issue in the batched gemm routines | Cedric Nugteren |
| 2018-01-18 | Made the batched routines also chose direct/indirect kernel like the main GEM... | Cedric Nugteren |
| 2018-01-08 | Implemented the in-direct version of the strided-batched GEMM kernel | Cedric Nugteren |
| 2018-01-07 | Implemented direct version of strided-batched GEMM kernel | Cedric Nugteren |
| 2018-01-07 | Added API and tests for new GemmStridedBatched routine | Cedric Nugteren |
| 2018-01-06 | Reduced duplicate code in the batched GEMM implementation | Cedric Nugteren |
| 2017-12-23 | Split the invert kernel in two parts to prevent error C1091 in MSVC 2013 | Cedric Nugteren |
| 2017-12-23 | Added TRSV block-size tuner | Cedric Nugteren |
| 2017-12-10 | Split GEMM kernel in 4 files instead of 3 due to MSVC 2013 string length limit | Cedric Nugteren |
| 2017-11-02 | Integrated the GEMM routine tuner for kernel selection; added first tuning re... | Cedric Nugteren |
| 2017-10-17 | Made buffers of batched routines read/write (was: read-only) | Cedric Nugteren |
| 2017-09-19 | Fixed type conversion warnings under MSVC 2013 | Cedric Nugteren |
| 2017-08-31 | Fixed a bug in im2col: process only valid channel IDs | Cedric Nugteren |
| 2017-08-31 | Fixed a bug in im2col confusing first and second workgroup size; made im2col ... | Cedric Nugteren |
| 2017-08-24 | Completed im2col implementation | Cedric Nugteren |
| 2017-08-19 | First version of im2col kernel, unoptimized but working | Cedric Nugteren |
| 2017-08-12 | Merge branch 'master' into im_to_col | Cedric Nugteren |
| 2017-07-12 | Relaxed requirement on a_ld and b_ld for batched GEMM | Cedric Nugteren |
| 2017-07-02 | Added interface and stubs for the im2col routine | Cedric Nugteren |
| 2017-03-19 | Added an (optional) non-direct implementation of the batched GEMM routine | Cedric Nugteren |
| 2017-03-11 | Added initial naive version of the batched GEMM routine based on the direct G... | Cedric Nugteren |
| 2017-03-10 | Added API and test infrastructure for the batched GEMM routine | Cedric Nugteren |
| 2017-03-08 | Implemented a batched version of the AXPY kernel | Cedric Nugteren |
| 2017-03-08 | Make batched routines based on offsets instead of a vector of cl_mem objects ... | Cedric Nugteren |
| 2017-03-05 | Added first naive version of the batched AXPY routine | Cedric Nugteren |
| 2017-02-26 | Fixed an out-of-bounds memory access when filling a matrix with a constant | Cedric Nugteren |
| 2017-02-22 | Fixed a few issues with the TRSM routine; some tests still failing | Cedric Nugteren |
| 2017-02-05 | Merge branch 'development' into triangular_solvers | Cedric Nugteren |
| 2017-01-24 | Routine, Cache: generalize, reduce amount of copying in fast path | Ivan Shapovalov |
| 2017-01-18 | Added first version of the TRSM routine based on the diagonal invert kernel | Cedric Nugteren |
| 2017-01-15 | Added a first version of the diagonal block invert routine in preparation of ... | Cedric Nugteren |
| 2016-10-22 | Routine: get rid of ::SetUp() | Ivan Shapovalov |
| 2016-10-22 | treewide: use C++ exceptions properly | Ivan Shapovalov |
| 2016-07-16 | Removed an unused variable from the copy-transpose-pad function | Cedric Nugteren |
| 2016-06-19 | Renamed all C++ source files to .cpp to match the .hpp extension better | Cedric Nugteren |
| 2016-06-18 | Moved all headers into the source tree, changed headers to .hpp extension | Cedric Nugteren |
| 2016-06-18 | Removed the template from the Routine base-class | Cedric Nugteren |
| 2016-06-17 | Removed the precision argument from the routines in favor of a single templat... | Cedric Nugteren |
| 2016-06-17 | Removed the interface to the cache functions from the Routine class, calls th... | Cedric Nugteren |
| 2016-06-17 | Moved the RunKernel and PadCopyTransposeMatrix functions out of the Routine c... | Cedric Nugteren |
| 2016-06-17 | Moved the test-for-valid-buffers function from the Routine class to separate ... | Cedric Nugteren |
| 2016-06-16 | Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and... | Cedric Nugteren |