adjust retry partial.to #2175

avtc · 2025-11-04T17:08:17Z

@Qubitium Hi!
I was not able to proceed without second retry, so adjusting back. This is the first MoE layer with experts for GLM-4.5-Air with 1534 samples, "balanced" strategy, on 8 x 3090.

Qubitium · 2025-11-04T19:49:30Z

@avtc Adjust first delay to 0.25s (250ms) and second delay to 0.75s. Same total delay but makes first retry 2x as fast which may satisfy 99% of the cases.

See if you can adjust 0.25 even lower and increase seoncond retry timeout to minimize first retry so it satisify 90% of the cases. Like 0.125 first retry and 0.5s 2nd retry etc.

Run the values and check maybe 4-5 layers.

avtc · 2025-11-04T22:50:24Z

will check tomorrow, I think it will also work

adjust retry partial.to

8081422

adjust delay

6528b28

Qubitium merged commit f4984c8 into ModelCloud:main Nov 6, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

adjust retry partial.to #2175

adjust retry partial.to #2175

Uh oh!

avtc commented Nov 4, 2025 •

edited

Loading

Uh oh!

Qubitium commented Nov 4, 2025 •

edited

Loading

Uh oh!

avtc commented Nov 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

adjust retry partial.to #2175

adjust retry partial.to #2175

Uh oh!

Conversation

avtc commented Nov 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Qubitium commented Nov 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

avtc commented Nov 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

avtc commented Nov 4, 2025 •

edited

Loading

Qubitium commented Nov 4, 2025 •

edited

Loading