-
Notifications
You must be signed in to change notification settings - Fork 708
[DevX] Skip disabled configs for benchmarking #7868
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/7868
Note: Links to docs will display an error until the docs builds have been completed. ⏳ 16 Pending, 2 Unrelated FailuresAs of commit 1a4cf63 with merge base d68ca28 ( BROKEN TRUNK - The following jobs failed but were present on the merge base:👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
c4a2c5d to
e0efaba
Compare
|
Subscribed @kimishpatel @digantdesai to DevX related work. |
|
@guangy10 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
e0efaba to
1a4cf63
Compare
|
@guangy10 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
Skip disabled configs for benchmarking Co-authored-by: Github Executorch <github_executorch@arm.com>
Skip disabled configs for benchmarking Co-authored-by: Github Executorch <github_executorch@arm.com>
Some benchmark jobs have been failing for a long time, creating misleading signal about what expected and unexpected by looking at those jobs. For example:
The reason is that those configs are either not supported or having bugs to be fixed. There is no way to specify which config for which model should be disabled/bypassed in the past.
This PR adds the ability to do so. Going forward, the nightly benchmark jobs (
android-perfandapple-perf) should always be green, if not, it clearly signals that some configs are failing/flaky and should be fixed.This PR also set a process to track disabled model+config in the GitHub Issue, the test will catch it if trying to disable a config for a model but not providing a link to GitHub Issue. The enforcement is to make sure regressions is tracked properly.
Test:
qnn_q8config is skipped forw2l: https://github.com/pytorch/executorch/actions/runs/12920378484mpsconfig is skipped formobilebert: https://github.com/pytorch/executorch/actions/runs/12920397985