-
Notifications
You must be signed in to change notification settings - Fork 25.6k
add sharding option to test framework #45988
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@janeyx99 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
docker_image: "308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-xenial-cuda9.2-cudnn7-py3-gcc7" | ||
- pytorch_linux_test: | ||
name: pytorch_linux_xenial_cuda9_2_cudnn7_py3_gcc7_test1 | ||
name: pytorch_linux_xenial_cuda9_2_cudnn7_py3_gcc7_test |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
is cu92 significantly faster than cuda10/11?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure, but that particular test is not run in CI (not marked important in the CONFIG TREE)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It is run in CI, but only on master (i.e. important tests run on both master and PRs, while others run only on master), but we don't need to shard it, as users usually do not wait for the test results on master.
a5787f1
to
7741264
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@janeyx99 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Codecov Report
@@ Coverage Diff @@
## master #45988 +/- ##
=======================================
Coverage 68.25% 68.25%
=======================================
Files 410 410
Lines 53246 53246
=======================================
Hits 36343 36343
Misses 16903 16903 Continue to review full report at Codecov.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
docker_image: "308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-xenial-cuda9.2-cudnn7-py3-gcc7" | ||
- pytorch_linux_test: | ||
name: pytorch_linux_xenial_cuda9_2_cudnn7_py3_gcc7_test1 | ||
name: pytorch_linux_xenial_cuda9_2_cudnn7_py3_gcc7_test |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It is run in CI, but only on master (i.e. important tests run on both master and PRs, while others run only on master), but we don't need to shard it, as users usually do not wait for the test results on master.
Adding a sharding node to our python CONFIG_TREE