rohan-varma commented Nov 13, 2020

Stack from ghstack:

Closes #47892. The entire test has a 100s timeout, so the process group used by the test should have a timeout shorter than the 30 min default.

This diff sets the process group timeout to 60s. This helps, for example, when running tests with NCCL_BLOCKING_WAIT: we then get the op timed out error instead of the test itself timing out.
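
A minimal sketch of the idea, not the actual diff: the three durations come from the description above, while `op_timeout_surfaces_first` and `init_test_process_group` are hypothetical names used only for illustration. It assumes the `timeout` keyword of `torch.distributed.init_process_group`, which takes a `datetime.timedelta`.

```python
from datetime import timedelta

# Values taken from the PR description.
TEST_TIMEOUT = timedelta(seconds=100)       # limit on the entire test
DEFAULT_PG_TIMEOUT = timedelta(minutes=30)  # default process group timeout
PG_TIMEOUT = timedelta(seconds=60)          # value this diff sets


def op_timeout_surfaces_first(pg_timeout, test_timeout=TEST_TIMEOUT):
    """Return True if a stuck op would time out before the test itself does."""
    return pg_timeout < test_timeout


def init_test_process_group(backend, init_method, rank, world_size):
    """Hypothetical helper: create the test's process group with the shorter timeout."""
    # Imported lazily so the sketch can be loaded without torch installed.
    import torch.distributed as dist

    dist.init_process_group(
        backend=backend,
        init_method=init_method,
        rank=rank,
        world_size=world_size,
        timeout=PG_TIMEOUT,
    )
```

With the 30 min default, `op_timeout_surfaces_first` is false, so the 100s test limit fires first and hides the op error; with 60s it is true.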

Differential Revision: D24943323

dr-ci bot commented Nov 13, 2020

💊 CI failures summary and remediations

As of commit df13f4c (more details on the Dr. CI page):


💚 💚 Looks good so far! There are no failures yet. 💚 💚


This comment has been revised 11 times.

rohan-varma added a commit that referenced this pull request Nov 13, 2020
Pull Request resolved: #47896

Per title
ghstack-source-id: 116607077

Differential Revision: [D24943323](https://our.internmc.facebook.com/intern/diff/D24943323/)
osalpekar left a comment

Looks like the fmt library build error is coming from the other PR in the stack, but this one looks good!

…buted_test"


Closes #47892. Since we have a 100s timeout on the entire test, we should have a smaller timeout than the default 30 min for the process group used for the test.

This diff sets the timeout to 60s. For example, this is useful when running tests with NCCL_BLOCKING_WAIT so that we get the op timed out error instead of the test itself timing out. 

Differential Revision: [D24943323](https://our.internmc.facebook.com/intern/diff/D24943323/)

[ghstack-poisoned]
rohan-varma added a commit that referenced this pull request Nov 14, 2020
Pull Request resolved: #47896

Per title
ghstack-source-id: 116710141

Differential Revision: [D24943323](https://our.internmc.facebook.com/intern/diff/D24943323/)
@facebook-github-bot

This pull request has been merged in f824854.

Labels: cla signed, Merged, oncall: distributed