summaryrefslogtreecommitdiff
path: root/src/routines/level2
AgeCommit message (Expand)Author
2018-08-13Made last operation in TRSV and TRSM asynchronous, making the events not nullCedric Nugteren
2018-08-13Small refactoring of events in TRSV substitution routineCedric Nugteren
2018-05-31Added error-checking for half-empty local work group sizes; fixed a minor TRS...Cedric Nugteren
2018-05-27Added a check to return 'NotImplemented' error code in case of systems with <...Cedric Nugteren
2018-05-27Made FillMatrix and FillVector functions take a configurable local workgroup ...Cedric Nugteren
2017-12-23Updated the database to use the new TRSV and Invert tunersCedric Nugteren
2017-10-27Added GEMV synchronisation for the TRSV routine: similar bug as in TRSMCedric Nugteren
2017-04-07Added some missing const-nessCedric Nugteren
2017-02-27Fixed half-precision bugs in HTBMV/HTPMV/HTRMV/HSYR2K/HTRMM related to incorr...Cedric Nugteren
2017-02-26Merge branch 'development' into triangular_solversCedric Nugteren
2017-02-13Fixed a small bug in GEMV: unused kernel in parameter listCedric Nugteren
2017-02-05Merge branch 'development' into triangular_solversCedric Nugteren
2017-02-04Improved substition kernels a bit; added complex supportCedric Nugteren
2017-02-04Completed a first STRSV implementationCedric Nugteren
2017-02-04Added row-major support for TRSVCedric Nugteren
2017-01-29Added first (incomplete) version of TRSV routineCedric Nugteren
2017-01-24Routine, Cache: generalize, reduce amount of copying in fast pathIvan Shapovalov
2017-01-20Added prototype for the TRSV routineCedric Nugteren
2016-10-22Routine: get rid of ::SetUp()Ivan Shapovalov
2016-10-22treewide: use C++ exceptions properlyIvan Shapovalov
2016-07-25Moved the XgemvFast and XgemvFastRot tuning database into a separate fileCedric Nugteren
2016-07-23Improved the XgemvFastRot kernel by tiled loading of the input matrix A, enab...Cedric Nugteren
2016-07-10Now passing alpha/beta to the kernel as arguments as before fp16 support; in ...Cedric Nugteren
2016-06-19Renamed all C++ source files to .cpp to match the .hpp extension betterCedric Nugteren
2016-06-18Moved all headers into the source tree, changed headers to .hpp extensionCedric Nugteren
2016-06-18Removed the template from the Routine base-classCedric Nugteren
2016-06-17Removed the precision argument from the routines in favor of a single templat...Cedric Nugteren
2016-06-17Removed the interface to the cache functions from the Routine class, calls th...Cedric Nugteren
2016-06-17Moved the RunKernel and PadCopyTransposeMatrix functions out of the Routine c...Cedric Nugteren
2016-06-17Moved the test-for-valid-buffers function from the Routine class to separate ...Cedric Nugteren
2016-05-22Added level-2 half-precision routines HGER/HSYR/HSPR/HSYR2/HSPR2Cedric Nugteren
2016-05-22Prepared the GER kernels and tuner for half-precision supportCedric Nugteren
2016-05-22Added level-2 half-precision routines HGEMV/HGBMV/HHEMV/HHBMV/HHPMV/HSYMV/HSB...Cedric Nugteren
2016-05-22Prepared the GEMV kernels and tuner for half-precision supportCedric Nugteren
2016-04-28Fixed the cache to store binaries instead of OpenCL programsCedric Nugteren
2016-04-09Events are now properly implemented using event waiting list and asking the u...cnugteren
2016-04-04Removed redundant queue synchronisation statementscnugteren
2016-03-06Fixed a bug in the GER-family of routines due to incorrect division of the wo...Cedric Nugteren
2016-03-06Added preliminary support for xHPR2 and xSPR2 routinesCedric Nugteren
2016-03-02Added preliminary support for xHER2 and xSYR2 routinesCedric Nugteren
2016-02-28Fixed a couple of correctness bugs in the Xher kernelsCedric Nugteren
2016-02-28Added support for xHER, xHPR, xSYR, and xSPR routinesCedric Nugteren
2016-02-20Added support for xGERU and xGERC routinesCedric Nugteren
2016-02-20Added XGER routine, kernel, and tunerCedric Nugteren
2016-02-08Split-up the XGEMV kernel in two partsCedric Nugteren
2015-09-26Added TRMV/TBMV/TPMV routinesCNugteren
2015-09-19Added SBMV and SPMV routinesCNugteren
2015-09-19Added the HPMV routineCNugteren
2015-09-19Added the HBMV routineCNugteren
2015-09-18Improved the organization and performance of level 2 routinesCNugteren