Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add NVTX op tracing for Reducescatter, make base class destructors virtual #3574

Merged
merged 3 commits into from Jun 20, 2022

Conversation

maxhgerlach
Copy link
Collaborator

@maxhgerlach maxhgerlach commented Jun 9, 2022

Checklist before submitting

  • Did you read the contributor guide?
  • Did you update the docs?
  • Did you write any tests to validate this change?
  • Did you update the CHANGELOG, if this change affects users?

Description

In #3299 I forgot to register the "HorovodReducescatter" string with NVTX to use it in named op ranges. This means that Reducescatter ops would appear as "[Unknown]" in Nsight Systems profiles.

Besides that change I've defined virtual destructors for two abstract polymorphic base classes. This should fix a warning that has been mentioned in #3541. Generally I think having them virtual is a safer practice, but currently there probably is no actual problem because smart pointers (std::shared_ptr or std::unique_ptr) are used rather than raw base class pointers.

Signed-off-by: Max H. Gerlach <git@maxgerlach.de>
Signed-off-by: Max H. Gerlach <git@maxgerlach.de>
Signed-off-by: Max H. Gerlach <git@maxgerlach.de>
@maxhgerlach maxhgerlach changed the title Add NVTX op tracing for ReduceScatter, make base class destructors virtual Add NVTX op tracing for Reducescatter, make base class destructors virtual Jun 9, 2022
@maxhgerlach maxhgerlach added this to the v0.25.0 milestone Jun 9, 2022
@github-actions
Copy link

Unit Test Results

     845 files  ±0       845 suites  ±0   9h 22m 52s ⏱️ - 9m 18s
     768 tests ±0       725 ✔️ ±0       43 💤 ±0  0 ±0 
19 476 runs  ±0  13 865 ✔️ ±0  5 611 💤 ±0  0 ±0 

Results for commit 0e863e1. ± Comparison against base commit 1b3452f.

@github-actions
Copy link

Unit Test Results (with flaky tests)

     975 files   -      44       975 suites   - 44   10h 7m 13s ⏱️ - 24m 50s
     768 tests ±       0       723 ✔️ ±    0       43 💤 ±    0  2 ±0 
22 568 runs   - 1 028  15 621 ✔️  - 580  6 945 💤  - 448  2 ±0 

For more details on these failures, see this check.

Results for commit 0e863e1. ± Comparison against base commit 1b3452f.

@EnricoMi
Copy link
Collaborator

LGTM! @tgaddair @romerojosh what do you think?

Copy link
Collaborator

@romerojosh romerojosh left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@EnricoMi EnricoMi mentioned this pull request Jun 19, 2022
@maxhgerlach maxhgerlach merged commit 176251b into horovod:master Jun 20, 2022
@maxhgerlach maxhgerlach mentioned this pull request Jun 20, 2022
4 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants