Commit eb574b6
committed
bench: REPS env var for run-all-langs.sh (vllm needs ≥4 to stabilise)
Per the post-mortem in 2026-05-08T01-15-02Z/MATRIX.md §7, vllm has
~10–20 % wire-byte variance across reps from non-deterministic
batching even at temperature=0. The 2-rep median is too few to
collapse that noise. Sglang + llama.cpp are deterministic and
stable at 2 reps.
Adds REPS env var (default 2) to run-all-langs.sh so the next
matrix rerun can pass REPS=4 just for vllm:
REPS=2 packages/bench/scripts/run-all-langs.sh $RUN_ID sglang
REPS=4 packages/bench/scripts/run-all-langs.sh $RUN_ID vllm
REPS=2 packages/bench/scripts/run-all-langs.sh $RUN_ID llama.cpp1 parent 7c12286 commit eb574b6
1 file changed
Lines changed: 13 additions & 7 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
50 | 50 | | |
51 | 51 | | |
52 | 52 | | |
53 | | - | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
54 | 60 | | |
55 | 61 | | |
56 | 62 | | |
57 | 63 | | |
58 | 64 | | |
59 | 65 | | |
60 | | - | |
| 66 | + | |
61 | 67 | | |
62 | 68 | | |
63 | 69 | | |
64 | 70 | | |
65 | 71 | | |
66 | 72 | | |
67 | 73 | | |
68 | | - | |
| 74 | + | |
69 | 75 | | |
70 | 76 | | |
71 | 77 | | |
72 | 78 | | |
73 | 79 | | |
74 | 80 | | |
75 | 81 | | |
76 | | - | |
| 82 | + | |
77 | 83 | | |
78 | 84 | | |
79 | 85 | | |
80 | 86 | | |
81 | 87 | | |
82 | 88 | | |
83 | 89 | | |
84 | | - | |
| 90 | + | |
85 | 91 | | |
86 | 92 | | |
87 | 93 | | |
88 | 94 | | |
89 | 95 | | |
90 | 96 | | |
91 | 97 | | |
92 | | - | |
| 98 | + | |
93 | 99 | | |
94 | 100 | | |
95 | 101 | | |
96 | 102 | | |
97 | 103 | | |
98 | 104 | | |
99 | 105 | | |
100 | | - | |
| 106 | + | |
101 | 107 | | |
102 | 108 | | |
103 | 109 | | |
| |||
0 commit comments