summaryrefslogtreecommitdiff
path: root/doc
diff options
context:
space:
mode:
authorCedric Nugteren <web@cedricnugteren.nl>2018-06-03 10:42:15 +0200
committerCedric Nugteren <web@cedricnugteren.nl>2018-06-03 10:42:15 +0200
commitfee8df153cdb0ed419252ef3f525aabd1bee875f (patch)
treeaec491680e159c83f07630782038abbdd9bb83a1 /doc
parentbd1715aff99a80d0efe83c896e604377d8fd061b (diff)
Added list of tuners to be run by 'alltuners' target
Diffstat (limited to 'doc')
-rw-r--r--doc/tuning.md68
1 files changed, 68 insertions, 0 deletions
diff --git a/doc/tuning.md b/doc/tuning.md
index b5186ac6..5cf32ca8 100644
--- a/doc/tuning.md
+++ b/doc/tuning.md
@@ -84,6 +84,74 @@ The kernels `gemm` and `gemm_direct` have too many parameters to explore. Theref
There are also several routine-level tuners. They tune inter-kernel parameters and should only be run after the kernels are tuned. However, they do automatically pick up kernel tuning results from the current folder if there are any. An example is the GEMM routine tuner, which determines when to use the direct or the in-direct GEMM kernel.
+Here are all the tuners included in the `make alltuners` target (in the same order) with all their precision arguments:
+
+ ./clblast_tuner_copy_fast -precision 32
+ ./clblast_tuner_copy_fast -precision 64
+ ./clblast_tuner_copy_fast -precision 3232
+ ./clblast_tuner_copy_fast -precision 6464
+ ./clblast_tuner_copy_fast -precision 16
+ ./clblast_tuner_copy_pad -precision 32
+ ./clblast_tuner_copy_pad -precision 64
+ ./clblast_tuner_copy_pad -precision 3232
+ ./clblast_tuner_copy_pad -precision 6464
+ ./clblast_tuner_copy_pad -precision 16
+ ./clblast_tuner_transpose_fast -precision 32
+ ./clblast_tuner_transpose_fast -precision 64
+ ./clblast_tuner_transpose_fast -precision 3232
+ ./clblast_tuner_transpose_fast -precision 6464
+ ./clblast_tuner_transpose_fast -precision 16
+ ./clblast_tuner_transpose_pad -precision 32
+ ./clblast_tuner_transpose_pad -precision 64
+ ./clblast_tuner_transpose_pad -precision 3232
+ ./clblast_tuner_transpose_pad -precision 6464
+ ./clblast_tuner_transpose_pad -precision 16
+ ./clblast_tuner_xaxpy -precision 32
+ ./clblast_tuner_xaxpy -precision 64
+ ./clblast_tuner_xaxpy -precision 3232
+ ./clblast_tuner_xaxpy -precision 6464
+ ./clblast_tuner_xaxpy -precision 16
+ ./clblast_tuner_xdot -precision 32
+ ./clblast_tuner_xdot -precision 64
+ ./clblast_tuner_xdot -precision 3232
+ ./clblast_tuner_xdot -precision 6464
+ ./clblast_tuner_xdot -precision 16
+ ./clblast_tuner_xger -precision 32
+ ./clblast_tuner_xger -precision 64
+ ./clblast_tuner_xger -precision 3232
+ ./clblast_tuner_xger -precision 6464
+ ./clblast_tuner_xger -precision 16
+ ./clblast_tuner_xgemm -precision 32
+ ./clblast_tuner_xgemm -precision 64
+ ./clblast_tuner_xgemm -precision 3232
+ ./clblast_tuner_xgemm -precision 6464
+ ./clblast_tuner_xgemm -precision 16
+ ./clblast_tuner_xgemm_direct -precision 32
+ ./clblast_tuner_xgemm_direct -precision 64
+ ./clblast_tuner_xgemm_direct -precision 3232
+ ./clblast_tuner_xgemm_direct -precision 6464
+ ./clblast_tuner_xgemm_direct -precision 16
+ ./clblast_tuner_xgemv -precision 32
+ ./clblast_tuner_xgemv -precision 64
+ ./clblast_tuner_xgemv -precision 3232
+ ./clblast_tuner_xgemv -precision 6464
+ ./clblast_tuner_xgemv -precision 16
+ ./clblast_tuner_invert -precision 32
+ ./clblast_tuner_invert -precision 64
+ ./clblast_tuner_invert -precision 3232
+ ./clblast_tuner_invert -precision 6464
+ ./clblast_tuner_invert -precision 16
+ ./clblast_tuner_routine_xgemm -precision 32
+ ./clblast_tuner_routine_xgemm -precision 64
+ ./clblast_tuner_routine_xgemm -precision 3232
+ ./clblast_tuner_routine_xgemm -precision 6464
+ ./clblast_tuner_routine_xgemm -precision 16
+ ./clblast_tuner_routine_xtrsv -precision 32
+ ./clblast_tuner_routine_xtrsv -precision 64
+ ./clblast_tuner_routine_xtrsv -precision 3232
+ ./clblast_tuner_routine_xtrsv -precision 6464
+ ./clblast_tuner_routine_xtrsv -precision 16
+
Using the tuning results
-------------