@slow test RFC #55
Conversation
Thanks for this @driazati, this is a great, concise, and meaningful improvement to TVM. I've put a few notes, but they're mostly either wording or straightforward questions.

My personal opinion here is that CI is broken often enough on main that we need to improve that before we can really complain about differences between tests. Alongside this, we still have PRs landing without test coverage, so until we can accurately define what we do and don't test, it seems better to remove friction from contributions, hopefully driving us to ask for more and better tests 😸

We should also be prioritising a release schedule rather than treating TVM as ever-green, which limits our ability to make larger changes between releases; with scheduled releases, a broken main shouldn't impact end users. This change actually moves us closer to this.

TLDR: Supportive + minor comments 😸
[future-possibilities]: #future-possibilities

- Better communication in Jenkins job pages of which tests ran, which did not, and why
- Different levels of tests. `main` is the most frequent step, but longer running tests could be moved out to nightly or even release level testing (though this makes debugging failures more difficult).
One of the concerns raised by @areusch is the change in coverage from removing slow tests; it would be good to investigate the difference in coverage in both Python and C++ to help guide us in re-introducing tests.
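One lightweight way to do that investigation on the Python side would be to run the suite twice under coverage.py (once in full, once with slow tests skipped) and diff the JSON reports. The helper below is a hypothetical sketch, not part of the RFC; it only assumes coverage.py's documented JSON report layout (`files` -> per-file `summary` -> `covered_lines`), and the file paths are made up:

```python
import json


def coverage_delta(full_report, fast_report):
    """Per-file count of lines covered by the full run but not the fast run.

    Both arguments are parsed coverage.py JSON reports (e.g. produced with
    `coverage json -o report.json`). Files absent from the fast report
    count as entirely uncovered there.
    """
    delta = {}
    for path, data in full_report.get("files", {}).items():
        full_lines = data["summary"]["covered_lines"]
        fast_lines = (
            fast_report.get("files", {})
            .get(path, {"summary": {"covered_lines": 0}})
        )["summary"]["covered_lines"]
        if full_lines > fast_lines:
            delta[path] = full_lines - fast_lines
    return delta


# Example with synthetic reports: one file loses 40 covered lines when
# slow tests are skipped, another is unaffected.
full = {"files": {"tvm/pass_a.py": {"summary": {"covered_lines": 100}},
                  "tvm/pass_b.py": {"summary": {"covered_lines": 50}}}}
fast = {"files": {"tvm/pass_a.py": {"summary": {"covered_lines": 60}},
                  "tvm/pass_b.py": {"summary": {"covered_lines": 50}}}}
print(coverage_delta(full, fast))  # {'tvm/pass_a.py': 40}
```

Files that show a large drop would be natural candidates for keeping (or re-introducing) tests on the PR path.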
Right, I think that's where flipping the proposal from the pre-RFC from "disable by time cutoff" -> "start with the slowest tests, manually investigate, and mark them `@slow`" will help out, since we should have a better idea of what's going on with the individual tests that way. Coverage data + proper test reports would help here too; we currently have https://ci.tlcpack.ai/blue/organizations/jenkins/tvm/detail/main/2414/tests but it's broken (it doesn't report failures), and we don't have any tooling to scan across those vertically (i.e. from one commit to the next) to see changes.
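For reference, the shape of such a marker is simple. Here is a minimal sketch using stdlib `unittest` for self-containedness; the RFC's actual implementation would presumably be a pytest marker, and the `RUN_SLOW_TESTS` environment variable name here is an assumption, not TVM's real one:

```python
import os
import unittest


def slow(test_item):
    """Hypothetical @slow marker: skip unless RUN_SLOW_TESTS=1 is set,
    so marked tests run post-merge or nightly, but not on every PR."""
    return unittest.skipUnless(
        os.environ.get("RUN_SLOW_TESTS") == "1",
        "slow test: set RUN_SLOW_TESTS=1 to run",
    )(test_item)


class ExampleTests(unittest.TestCase):
    @slow
    def test_whole_model(self):
        # Stands in for an expensive end-to-end test.
        self.assertEqual(sum(range(1_000_000)), 499999500000)

    def test_fast_unit(self):
        # Always runs, including on PRs.
        self.assertEqual(1 + 1, 2)
```

With the env var unset (the PR case), the runner reports `test_whole_model` as skipped rather than failed, so the skip is visible in the test report.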
I just re-read this, and I think that explicitly stating the roll-out strategy as a manual approach to test investigation would clarify this for me - i.e. there's no automation involved, even though you've demonstrated we could automate it.
Added a bit about implementation in the guide section
rfcs/0055-slow-tests.md
Outdated
Often TVM's tests are running much more work than they actually intend to test in more of an integration test than a unit test. Replacing these types of test with a framework that makes it easier to test TVM passes and functionality in smaller chunks is related but orthogonal to this work. It still has the same issue (coverage
How is coverage still reduced if we put in place the better testing practices?
It's a bit of both, I suppose. If we take an integration test that runs a whole model to test 1 pass and use infrastructure to test just that 1 pass, we're making the test more narrow in what it's trying to achieve, but in the end it's flexing fewer parts of the system (even if those parts 99% of the time don't matter for the tests).
The theory, at least, is that each `Pass` is then tested exhaustively, with some integration tests covering interactions between `Pass`es? Therefore test coverage should overall be better, as we can test more of a single `Pass` and be more specific about `Pass` interactions?
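To illustrate that trade-off outside of TVM's actual `Pass` API (everything below is a toy stand-in, not TVM code): a narrow unit test exercises one rewrite directly and pinpoints failures, while an integration-style test only reaches the rewrite through a full pipeline, flexing more of the system per assertion.

```python
def fold_add_zero(expr):
    """Toy 'pass': simplify ('add', x, 0) -> x in a tuple-based expression tree."""
    if isinstance(expr, tuple) and expr[0] == "add":
        lhs = fold_add_zero(expr[1])
        rhs = fold_add_zero(expr[2])
        if rhs == 0:
            return lhs
        return ("add", lhs, rhs)
    return expr


def run_pipeline(expr, passes):
    """Toy pipeline: apply each pass in order."""
    for p in passes:
        expr = p(expr)
    return expr


# Narrow unit test: targets the pass directly; a failure implicates only it.
def test_fold_add_zero_unit():
    assert fold_add_zero(("add", "x", 0)) == "x"
    assert fold_add_zero(("mul", "x", 0)) == ("mul", "x", 0)  # untouched


# Integration-style test: the pass is exercised only via the pipeline,
# so a failure implicates everything the pipeline runs.
def test_pipeline_integration():
    assert run_pipeline(("add", ("add", "y", 0), 0), [fold_add_zero]) == "y"
```

The narrow test gives exhaustive, attributable coverage of the one rewrite; the pipeline test is what catches interactions between passes.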
Yeah, I see what you mean; it all comes down to implementation/how it's used, but either way it's pretty immaterial to this RFC, so I removed that language.
Also see discussion in the pre-RFC: https://discuss.tvm.apache.org/t/rfc-ci-skip-slow-tests-on-prs/11910 @areusch @Mousius
Co-authored-by: Christopher Sidebottom <chris.sidebottom@arm.com>
@Mousius can you take another look when you have a min?
Thanks for the clarification @driazati, you're clear to land 😸!