Commit dfb1df5

Tuning script adaptation for the new triton
1 parent c1ccc5e commit dfb1df5

1 file changed: +1 -1 lines changed


benchmarks/kernels/benchmark_moe.py

Lines changed: 1 addition & 1 deletion
@@ -155,7 +155,7 @@ def get_rocm_tuning_space(use_fp16):
     # For now we see better perf with num_stages=0 for all gemm configs we care
     # But keep this explicit so that we do not forget we may need to set it to
     # other values in the future
-    num_stage_range = [0]
+    num_stage_range = [2]
     waves_per_eu_range = [0]
     matrix_instr_nonkdim_range = [16, 32] if use_fp16 else []
     kpack_range = [1, 2] if use_fp16 else []
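
The four ranges visible in this hunk read like axes of a tuning search space. The sketch below is illustrative only and is not code from this commit. It assumes the ranges are combined as a Cartesian product, which the hunk does not show. The helper names `tuning_ranges` and `expand_configs` and the dictionary keys are hypothetical, and the real function may define more ranges than appear here. One detail worth illustrating: `itertools.product` yields nothing if any input is empty, so the empty lists used when `use_fp16` is false have to be dropped before expanding.

```python
from itertools import product


def tuning_ranges(use_fp16):
    # Hypothetical helper: only the four ranges visible in the diff hunk,
    # with num_stages already changed from [0] to [2].
    return {
        "num_stages": [2],
        "waves_per_eu": [0],
        "matrix_instr_nonkdim": [16, 32] if use_fp16 else [],
        "kpack": [1, 2] if use_fp16 else [],
    }


def expand_configs(ranges):
    # Hypothetical helper: drop empty axes first, because product() over
    # any empty iterable produces zero combinations.
    active = {name: values for name, values in ranges.items() if values}
    names = list(active)
    return [dict(zip(names, combo)) for combo in product(*active.values())]


fp16_configs = expand_configs(tuning_ranges(use_fp16=True))
other_configs = expand_configs(tuning_ranges(use_fp16=False))
print(len(fp16_configs), len(other_configs))
```

Under these assumptions, changing `num_stage_range` from `[0]` to `[2]` leaves the number of candidate configs unchanged, since the axis still holds a single value. Only the value passed for that parameter differs.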
