-
Notifications
You must be signed in to change notification settings - Fork 25.6k
Add details about ic, broken, flaky, and unstable checks to merge records #106162
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/106162
Note: Links to docs will display an error until the docs builds have been completed. ✅ 1 Unrelated FailureAs of commit 5bb86ce: UNSTABLE - The following job failed but was likely due to flakiness present on trunk and has been marked as unstable:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
@huydhn has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
As this touches |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Would it be easier to put a tuple of [job id, classification] for each job into rockset instead of separate lists for each classification?
@pytorchbot merge |
Merge startedYour change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
Forward fix in #106203, but I wonder how it escaped testing in the first place |
…ords (pytorch#106162) At the moment, we only record the list of pending and failed check on Rockset merge records. This is enough to compute the force merge KPI(s), but isn't enough for more in-depth analysis on what happened at the time of the merge: * If the number of `ok_failed_checks` is less than `ok_failed_checks_threshold`, the list of `failed_checks` would be empty (expectedly). So Rockset would only record an empty list. * We support retry in PR, so the classifications on Dr.CI could be different than what dev observed at the time of the merge if retry completed successfully ### Testing `python .github/scripts/trymerge.py --comment-id 1654010315 106095 --dry-run` (need to comment out some of the code to actually write a test record to Rockset), then manually verify it with ``` SELECT * FROM commons.merges WHERE pr_num = 106095 ``` to see that `ignore_current_checks`, `broken_trunk_checks`, `flaky_checks`, and `unstable_checks` shows up correctly Pull Request resolved: pytorch#106162 Approved by: https://github.com/clee2000
At the moment, we only record the list of pending and failed check on Rockset merge records. This is enough to compute the force merge KPI(s), but isn't enough for more in-depth analysis on what happened at the time of the merge:
ok_failed_checks
is less thanok_failed_checks_threshold
, the list offailed_checks
would be empty (expectedly). So Rockset would only record an empty list.Testing
python .github/scripts/trymerge.py --comment-id 1654010315 106095 --dry-run
(need to comment out some of the code to actually write a test record to Rockset), then manually verify it withto see that
ignore_current_checks
,broken_trunk_checks
,flaky_checks
, andunstable_checks
shows up correctly