reduce all-to-all communication volume when both expert and non-expert are tensor-parallel #455

Re-run triggered June 18, 2024 13:59
Status: Success
Total duration: 1h 37m 55s

hpu-gaudi2.yml

on: pull_request