Age | Commit message (Collapse) | Author |
|
|
|
|
|
Radeon M370X AMD GPU
|
|
|
|
|
|
values
|
|
for large power-of-2 kernels on AMD GPUs
|
|
to transfer half-precision values as well
|
|
|
|
|
|
|
|
|
|
|
|
|
|
testing against CBLAS
|
|
failures for complex inputs
|
|
|
|
tests for non-square matrices
|
|
|
|
CPU BLAS library
|
|
|
|
Locate the C BLAS library before the F77 one.
|
|
|
|
|
|
|
|
submatrices
|
|
|
|
partially repopulated the database
|
|
|
|
buffersize checking
|
|
|
|
|
|
functions
|
|
|
|
|
|
|
|
and IxAMAX
|
|
|
|
counterparts of xASUM and IxAMAX)
|
|
ClearCompiledProgramCache function to clear the cache
|
|
|
|
detail if needed
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
non-matching tuner arguments
|