Add details about ic, broken, flaky, and unstable checks to merge records #106162

huydhn · 2023-07-27T20:38:01Z

At the moment, we only record the list of pending and failed check on Rockset merge records. This is enough to compute the force merge KPI(s), but isn't enough for more in-depth analysis on what happened at the time of the merge:

If the number of ok_failed_checks is less than ok_failed_checks_threshold, the list of failed_checks would be empty (expectedly). So Rockset would only record an empty list.
We support retry in PR, so the classifications on Dr.CI could be different than what dev observed at the time of the merge if retry completed successfully

Testing

python .github/scripts/trymerge.py --comment-id 1654010315 106095 --dry-run (need to comment out some of the code to actually write a test record to Rockset), then manually verify it with

SELECT
    *
FROM
    commons.merges
WHERE
    pr_num = 106095

to see that ignore_current_checks, broken_trunk_checks, flaky_checks, and unstable_checks shows up correctly

…e records

pytorch-bot · 2023-07-27T20:38:05Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/106162

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ 1 Unrelated Failure

As of commit 5bb86ce:

UNSTABLE - The following job failed but was likely due to flakiness present on trunk and has been marked as unstable:

linux-focal-rocm5.6-py3.8 / test (default, 1, 3, linux.rocm.gpu, unstable) (gh)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2023-07-27T23:33:45Z

@huydhn has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

huydhn · 2023-07-27T23:37:46Z

As this touches find_matching_merge_rule, I have imported this to run internal tests (if any) just in case.

clee2000

Would it be easier to put a tuple of [job id, classification] for each job into rockset instead of separate lists for each classification?

huydhn · 2023-07-28T07:11:58Z

@pytorchbot merge

pytorchmergebot · 2023-07-28T07:13:40Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

malfet · 2023-07-28T14:12:32Z

Forward fix in #106203, but I wonder how it escaped testing in the first place

huydhn · 2023-07-28T16:35:22Z

Forward fix in #106203, but I wonder how it escaped testing in the first place

I should have known. It was a landrace with my own other change #106095 (ouch)

…ords (pytorch#106162) At the moment, we only record the list of pending and failed check on Rockset merge records. This is enough to compute the force merge KPI(s), but isn't enough for more in-depth analysis on what happened at the time of the merge: * If the number of `ok_failed_checks` is less than `ok_failed_checks_threshold`, the list of `failed_checks` would be empty (expectedly). So Rockset would only record an empty list. * We support retry in PR, so the classifications on Dr.CI could be different than what dev observed at the time of the merge if retry completed successfully ### Testing `python .github/scripts/trymerge.py --comment-id 1654010315 106095 --dry-run` (need to comment out some of the code to actually write a test record to Rockset), then manually verify it with ``` SELECT * FROM commons.merges WHERE pr_num = 106095 ``` to see that `ignore_current_checks`, `broken_trunk_checks`, `flaky_checks`, and `unstable_checks` shows up correctly Pull Request resolved: pytorch#106162 Approved by: https://github.com/clee2000

Add more details about ic, broken, flaky, and unstable checks to merg…

254990f

…e records

huydhn added the test-config/default label Jul 27, 2023

pytorch-bot bot added the topic: not user facing topic category label Jul 27, 2023

Fix typo

5bb86ce

huydhn requested a review from clee2000 July 27, 2023 23:32

huydhn marked this pull request as ready for review July 27, 2023 23:33

huydhn requested a review from a team as a code owner July 27, 2023 23:33

clee2000 approved these changes Jul 27, 2023

View reviewed changes

huydhn changed the title ~~Record details about ic, broken, flaky, and unstable checks to merge records~~ Add details about ic, broken, flaky, and unstable checks to merge records Jul 28, 2023

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Jul 28, 2023

pytorchmergebot added the merging label Jul 28, 2023

huydhn added the suppress-bc-linter Suppresses the failures of API backward-compatibility linter (Lint/bc_linter) label Jul 28, 2023

pytorchmergebot added Merged and removed merging labels Jul 28, 2023

pytorchmergebot closed this in 4fe407a Jul 28, 2023

DanilBaibak mentioned this pull request Jul 28, 2023

DISABLED test_get_classifications_pending_unstable (__main__.TestBypassFailures) #106204

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add details about ic, broken, flaky, and unstable checks to merge records #106162

Add details about ic, broken, flaky, and unstable checks to merge records #106162

Uh oh!

huydhn commented Jul 27, 2023 •

edited

Loading

Uh oh!

pytorch-bot bot commented Jul 27, 2023 •

edited

Loading

Uh oh!

facebook-github-bot commented Jul 27, 2023

Uh oh!

huydhn commented Jul 27, 2023

Uh oh!

clee2000 left a comment

Uh oh!

huydhn commented Jul 28, 2023

Uh oh!

pytorchmergebot commented Jul 28, 2023

Uh oh!

malfet commented Jul 28, 2023

Uh oh!

huydhn commented Jul 28, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Add details about ic, broken, flaky, and unstable checks to merge records #106162

Add details about ic, broken, flaky, and unstable checks to merge records #106162

Uh oh!

Conversation

huydhn commented Jul 27, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Testing

Uh oh!

pytorch-bot bot commented Jul 27, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/106162

✅ 1 Unrelated Failure

Uh oh!

facebook-github-bot commented Jul 27, 2023

Uh oh!

huydhn commented Jul 27, 2023

Uh oh!

clee2000 left a comment

Choose a reason for hiding this comment

Uh oh!

huydhn commented Jul 28, 2023

Uh oh!

pytorchmergebot commented Jul 28, 2023

Merge started

Uh oh!

malfet commented Jul 28, 2023

Uh oh!

huydhn commented Jul 28, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

huydhn commented Jul 27, 2023 •

edited

Loading

pytorch-bot bot commented Jul 27, 2023 •

edited

Loading