Skip to content
This repository has been archived by the owner on Mar 19, 2024. It is now read-only.

Fix warm up all reduce call #382

Closed

Commits on Jul 30, 2021

  1. Fix warm up all reduce call

    Summary: The warm up `dist.all_reduce()` call was happening before setting the CUDA device, which meant all workers were using device 0. This resulted in crashes / hangs as mentioned in https://fb.workplace.com/groups/1309000715937050/permalink/1621428588027593/
    
    Differential Revision: D30005438
    
    fbshipit-source-id: ba4fd28bdb9e6142dd0a077ab69c903730ce2353
    mannatsingh authored and facebook-github-bot committed Jul 30, 2021
    Configuration menu
    Copy the full SHA
    fa936e0 View commit details
    Browse the repository at this point in the history