Skip to content

Conversation

huydhn
Copy link
Contributor

@huydhn huydhn commented Jun 1, 2023

Per title, after #102426 landed, it makes sense to have a new category for UNSTABLE jobs and handle them accordingly in trymerge.

  • The simple approach is to check for unstable in the check (job) name. I plan to roll this out first and then see if we need to cover the more complicated, but less popular case, of unstable build job. Specifically, an unstable build job has no unstable in its name
  • An unstable job is ignored by trymerge. This is the same behavior we have atm when a job is moved to unstable. It's completely ignored
  • The update to Dr. CI will come later, so that unstable failures would also be hidden like broken trunk or flaky

Testing

Leverage the broken trunk Windows CPU job atm and mark Windows CPU jobs as unstable #102297

@huydhn huydhn added ciflow/trunk Trigger trunk jobs on your pull request test-config/default labels Jun 1, 2023
@pytorch-bot
Copy link

pytorch-bot bot commented Jun 1, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/102784

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 62277eb:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added the topic: not user facing topic category label Jun 1, 2023
@huydhn huydhn requested a review from clee2000 June 1, 2023 22:43
@huydhn huydhn marked this pull request as ready for review June 1, 2023 22:43
@huydhn huydhn requested a review from a team as a code owner June 1, 2023 22:43
huydhn added a commit to pytorch/test-infra that referenced this pull request Jun 2, 2023
After pytorch/pytorch#102426 and
pytorch/pytorch#102784 landed, unstable jobs are
now hidden correctly on HUD https://hud.pytorch.org and also won't block
PR. Previously, this was done by moving unstable jobs to an unstable
workflow. Now unstable jobs will stay in the same workflow, but have
`unstable` in their names.

This is very similar to how `rerun_disabled_tests` are ignored atm.

### Testing


https://torchci-git-fork-huydhn-ignore-unstable-jobs-fbopensource.vercel.app/metrics
@huydhn
Copy link
Contributor Author

huydhn commented Jun 4, 2023

@pytorchbot merge

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

PaliC pushed a commit to pytorch/test-infra that referenced this pull request Jun 5, 2023
After pytorch/pytorch#102426 and
pytorch/pytorch#102784 landed, unstable jobs are
now hidden correctly on HUD https://hud.pytorch.org and also won't block
PR. Previously, this was done by moving unstable jobs to an unstable
workflow. Now unstable jobs will stay in the same workflow, but have
`unstable` in their names.

This is very similar to how `rerun_disabled_tests` are ignored atm.

### Testing


https://torchci-git-fork-huydhn-ignore-unstable-jobs-fbopensource.vercel.app/metrics
huydhn added a commit to pytorch/test-infra that referenced this pull request Jun 7, 2023
Per title, we want to hide unstable jobs in a collapsible section like
flaky and broken trunk. I opt for an unstable section for clarity but we
could also add unstable jobs into flaky section (to reuse the name).
Note that unstable job failures doesn't block merge
pytorch/pytorch#102784
self.assertTrue(len(failed) == 2)

def test_get_classifications_unstable(self, *args: Any) -> None:
pr = GitHubPR("pytorch", "pytorch", 102784)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should never self-reference mocks, as when PR is landed checks might be different... (this is what I'm experiencing right now when trying to regenerate mocks)

malfet added a commit that referenced this pull request Jun 28, 2023
- Add `ngimel` to the list of reviewers to make "test_revert_rules" pass
- Change PR in `test_get_classifications_unstable` from #102784 to #104312  as former do not have unstable jobs after merging.

<!--
copilot:poem
-->
### <samp>🤖 Generated by Copilot at af26e18</samp>

> _Oh we're the crew of the `test_trymerge.py`_
> _We update the rules and the cases on the fly_
> _We heave and we haul on the count of three_
> _We add a new approver for the `super` rule, aye_

cc albanD

[ghstack-poisoned]
pytorchmergebot pushed a commit that referenced this pull request Jun 28, 2023
- Add `ngimel` to the list of reviewers to make "test_revert_rules" pass
- Change PR in `test_get_classifications_unstable` from #102784 to #104312  as former do not have unstable jobs after merging.

<!--
copilot:poem
-->
### <samp>🤖 Generated by Copilot at af26e18</samp>

> _Oh we're the crew of the `test_trymerge.py`_
> _We update the rules and the cases on the fly_
> _We heave and we haul on the count of three_
> _We add a new approver for the `super` rule, aye_

Pull Request resolved: #104343
Approved by: https://github.com/jeanschmidt, https://github.com/albanD
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/trunk Trigger trunk jobs on your pull request Merged test-config/default topic: not user facing topic category

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants