Age | Commit message (Collapse) | Author |
|
CBLAS reference code; for fair timing and code de-duplication
|
|
|
|
Added a first batched version of the GEMM routine
|
|
|
|
|
|
|
|
|
|
|
|
GEMM kernel
|
|
|
|
Added the batched version of the AXPY routine
|
|
|
|
implementation
|
|
|
|
|
|
- undoing many earlier changes
|
|
|
|
|
|
routines
|
|
and distribution for all data
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
incorrect constants
|
|
|
|
Added the triangular solvers (TRSV/TRSM)
|
|
|
|
|
|
|
|
tests
|
|
|
|
|
|
imag is much larger than the other
|
|
|
|
|
|
|
|
|
|
|
|
|
|
checks in the error checking to make the tests pass
|
|
|
|
API to override tuning parameters
|
|
devices
|
|
|
|
|
|
|