Arm backend: Mark test_block_bottleneck_residual_tosa_BI unit test flaky #5812
This marks the test as flaky so that it is rerun on failure, which keeps CI green for now.
Later we need to investigate whether the accuracy is a real problem or acceptable.
The test's failure rate is about 1-2%, and the reported error is low:
Difference: max: 0.06028306484222412,
abs: 0.06028306484222412,
mean abs error: 9.008637425722554e-05.
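For readers unfamiliar with the three figures in the log line, here is a minimal sketch of how such a summary could be computed. It assumes that "max" is the largest signed difference, "abs" the largest absolute difference, and "mean abs error" the average absolute difference; the function name `error_summary` and the sample data are hypothetical and are not taken from the Arm test harness.

```python
def error_summary(output, reference):
    """Summarize element-wise differences between two equal-length sequences.

    Assumed meaning of the keys (not confirmed by the test harness):
      max      - largest signed difference
      abs      - largest absolute difference
      mean_abs - mean of the absolute differences
    """
    if len(output) != len(reference):
        raise ValueError("output and reference must have the same length")
    if not output:
        raise ValueError("sequences must not be empty")

    diffs = [o - r for o, r in zip(output, reference)]
    abs_diffs = [abs(d) for d in diffs]
    return {
        "max": max(diffs),
        "abs": max(abs_diffs),
        "mean_abs": sum(abs_diffs) / len(abs_diffs),
    }


# Illustrative data only: one element out of 1000 is off by 0.06.
reference = [0.0] * 1000
output = list(reference)
output[123] = 0.06
summary = error_summary(output, reference)
print(summary)  # mean_abs is roughly 6e-05 while max and abs are 0.06
```

With data shaped like this, a large maximum next to a very small mean would point to a few outlier elements rather than a uniformly inaccurate result. Whether that is what happens in this test is exactly what the follow-up investigation needs to establish.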
Change-Id: I161a336223ddf8e589a4bdda06a25f2877ed913a
Signed-off-by: Zingo Andersen <zingo.andersen@arm.com>
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/5812
Note: Links to docs will display an error until the docs builds have been completed.
❌ 1 New Failure, 1 Unrelated Failure
As of commit f3d247e with merge base 393553c:
NEW FAILURE - The following job has failed:
BROKEN TRUNK - The following job failed but was present on the merge base:
👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@pytorchbot label ciflow/trunk
model = ComboBlockBottleneckResidual()
self._test_conv_combo_tosa_MI_pipeline(model, model.get_inputs())

# TODO: Investigate flakyness (MLTORCH-307)
This is the same internal ticket that is already referenced elsewhere for another flaky test (#4390).
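To illustrate what rerunning a flaky test buys, here is a minimal standard-library sketch. The decorator `rerun_on_failure` and the helper `residual_failure_rate` are hypothetical names for illustration; this excerpt does not show the mechanism the PR actually uses. In a pytest suite, the pytest-rerunfailures plugin offers a comparable marker, `@pytest.mark.flaky(reruns=N)`. The estimate also assumes that failures are independent between attempts, which may not hold for this test.

```python
import functools


def rerun_on_failure(reruns):
    """Hypothetical decorator: retry a test up to `reruns` extra times on AssertionError."""

    def decorator(test_fn):
        @functools.wraps(test_fn)
        def wrapper(*args, **kwargs):
            for attempt in range(reruns + 1):
                try:
                    return test_fn(*args, **kwargs)
                except AssertionError:
                    # Re-raise only when the final attempt has failed too.
                    if attempt == reruns:
                        raise

        return wrapper

    return decorator


def residual_failure_rate(fail_rate, reruns):
    """Chance that the first run and every rerun all fail, assuming independent attempts."""
    return fail_rate ** (reruns + 1)


# With the 1-2% failure rate quoted in the PR description and a single rerun:
for p in (0.01, 0.02):
    print(p, residual_failure_rate(p, reruns=1))
```

Under the independence assumption, even one rerun takes a 1-2% failure rate down to a few failures per ten thousand runs. That is why this keeps CI green, and also why it can hide a real accuracy problem until the TODO is followed up.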
@digantdesai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
@digantdesai merged this pull request in ee5d099.