update trace replay mode for post analysis (#20)
Summary:
Pull Request resolved: #20
- Output new traces with sequence number and latency for post analysis (one file per rank, since ranks may be skewed by imbalanced workloads)
- Remove the extra barrier in non-blocking mode
- Add option `--allow-list`/`--allow-ops` to specify which collectives to replay, ignoring the rest of the trace (default is 'all', which replays all supported ops)
- Cleaner summary output at the end (most messages moved to logging.info)
- Correctly complete GPU collectives in non-blocking mode

Reviewed By: srinivas212

Differential Revision: D26730965

fbshipit-source-id: ff07b643039e7140fbafebc270231e99c11c40a2
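The allow-list filtering and per-rank trace output described in the summary could look roughly like the sketch below. It is not the code from this commit: the set of supported ops, the trace entry layout (`{"op": ...}`), the record fields (`seq`, `latency_us`), the output file name, and all function names are assumptions made for illustration. Only the option names `--allow-list`/`--allow-ops` and the default `all` come from the commit message.

```python
import argparse

# Hypothetical set of collectives the replayer supports (not taken from the commit).
SUPPORTED_OPS = {"all_reduce", "all_gather", "all_to_all", "broadcast", "reduce"}


def make_parser():
    """Build a parser exposing the option under both names mentioned in the commit."""
    parser = argparse.ArgumentParser(description="trace replay sketch")
    parser.add_argument(
        "--allow-list", "--allow-ops",
        dest="allow_list",
        default="all",
        help="comma-separated collectives to replay; 'all' replays every supported op",
    )
    return parser


def parse_allow_list(value):
    """Turn the option string into a set of op names, dropping unsupported ones."""
    if value.strip().lower() == "all":
        return set(SUPPORTED_OPS)
    requested = {name.strip() for name in value.split(",") if name.strip()}
    return requested & SUPPORTED_OPS


def filter_trace(trace, allowed):
    """Keep only trace entries whose op is allowed, preserving trace order."""
    return [entry for entry in trace if entry["op"] in allowed]


def build_records(replayed, latencies_us):
    """Attach a sequence number and a measured latency to each replayed op."""
    return [
        {"seq": seq, "op": entry["op"], "latency_us": latency}
        for seq, (entry, latency) in enumerate(zip(replayed, latencies_us))
    ]


def output_path(rank):
    """One output file per rank, so skew between ranks stays visible (name is made up)."""
    return f"replayed_trace_rank{rank}.json"
```

Numbering the records after filtering keeps the sequence numbers dense in each rank's output, which makes it easier to line up the same collective across per-rank files during post analysis. Whether the actual implementation numbers ops before or after filtering is not stated in the commit message.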
1 parent 7d475d0, commit 4a9e7ee
Showing 2 changed files with 97 additions and 51 deletions.