
Fail tests without non-empty @cluster annotation #336

Merged

Conversation

@stan-is-hate (Member) commented Jul 7, 2022

This might be a breaking change if you rely on the min_cluster_spec() method. Note that this PR bundles two changes in one.

1. Deprecation of min_cluster_spec() method
It turns out we have two different mechanisms for stating the expected number of nodes for a test: the @cluster annotation, which we know and love/hate, and the min_cluster_spec() method on the test object itself. They are not related, and we mostly use the @cluster annotation, except for a single place where we check min_cluster_spec() and possibly fail. That check makes no sense, because the same condition has already been checked against the @cluster annotation, and the test would fail anyway if it tried to allocate too many nodes.

So I'm deprecating the min_cluster_spec() method and removing both of its usages (the second one is for reporting, and it's also inaccurate). The method itself is still there, so clients that override it and call super() should still be fine. They would probably be fine even if I removed it, since Python is interpreted and an unused method override does no harm, but keeping the stub is cleaner and more obvious.
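
For illustration, here is a minimal sketch of the two mechanisms side by side, using ducktape's @cluster annotation and a min_cluster_spec() override; the test class and method names are hypothetical, and the import paths reflect ducktape's public API as I understand it:

```python
from ducktape.mark.resource import cluster
from ducktape.cluster.cluster_spec import ClusterSpec
from ducktape.tests.test import Test


class ExampleTest(Test):
    # Mechanism 1: the @cluster annotation - the preferred way to declare node usage.
    @cluster(num_nodes=3)
    def test_with_annotation(self):
        pass

    # Mechanism 2: overriding min_cluster_spec() on the test object itself;
    # this is the mechanism being deprecated here.
    def min_cluster_spec(self):
        return ClusterSpec.simple_linux(3)
```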

2. Option to fail tests that don't explicitly specify their cluster requirements
The real functionality of this PR is disallowing tests that don't have a @cluster annotation (or have an empty one) and failing them immediately. The @cluster annotation is our preferred way of doing things right now, and this protects us against tests that implicitly grab the whole cluster.
This behavior is controlled by a new flag, --fail-greedy-tests, which enables the check when provided. We've discussed making this automatic when using --parallel, but ultimately felt that such behavior would surprise users, and not in a good way. We're still open to doing this in the future, just not in this PR.
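
As a rough sketch only (not the runner's actual code, and with hypothetical function and attribute names), the check enabled by --fail-greedy-tests behaves roughly like this:

```python
# Hypothetical helper illustrating the --fail-greedy-tests behavior described above.
def check_cluster_usage(test_context, fail_greedy_tests):
    # Assumed: the runner can tell "no cluster metadata declared" apart from an
    # explicitly empty spec such as @cluster(num_nodes=0).
    spec = getattr(test_context, "expected_cluster_spec", None)
    if fail_greedy_tests and spec is None:
        raise RuntimeError(
            "Test %s has no @cluster annotation. Please make sure all of your tests "
            "declare their cluster usage with the @cluster annotation." % test_context.test_id
        )
    # A declared-but-empty spec (num_nodes=0) passes: the test simply uses no nodes.
```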

This PR also explicitly allows (and adds tests for) an empty cluster spec in the @cluster() annotation (e.g. @cluster(num_nodes=0)), thus allowing tests that don't use any cluster resources. This functionality worked fine before this PR but had no unit tests, so we've made sure it still works after this PR.
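
A hypothetical test illustrating the explicitly empty spec (the class and method names are made up; the @cluster usage matches the annotation discussed above):

```python
from ducktape.mark.resource import cluster
from ducktape.tests.test import Test


class NoNodesTest(Test):
    @cluster(num_nodes=0)
    def test_pure_logic(self):
        # Uses no cluster resources, yet still declares that explicitly,
        # so it passes the --fail-greedy-tests check.
        assert 1 + 1 == 2
```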

@stan-is-hate requested review from lbradstreet and a team on July 7, 2022
@imcdo (Member) commented Jul 8, 2022

Important to note here that this WILL break people who do not use the cluster annotation in their tests. I would argue that this should be behind a flag and the default behavior should stay the same (the cluster annotation really only needs to be enforced when running with parallelism, which isn't the default; without parallelism it's no harm, no foul).

@lbradstreet (Member) commented:

> Important to note here that this WILL break people who do not use the cluster annotation in their tests. I would argue that this should be behind a flag and the default behavior should stay the same (the cluster annotation really only needs to be enforced when running with parallelism, which isn't the default; without parallelism it's no harm, no foul).

It'll take some time to sort these issues out so I think having a flag makes sense.

@stan-is-hate (Member, Author) commented:

I agree with not wanting to break everything, but I don't want to introduce a flag, purely because implementing and unit testing the flag would be a relative chore.
I'd rather use semver instead of a flag: I'll first update the existing Kafka and CP tests to make sure they're fine, and then do another version bump (0.11.0) so that people can roll back to 0.10.0 if this breaks them. A single failed test run probably won't make or break things.
This makes me think the error message should be more explicit though. Rather than 'cluster spec missing', I'll update it to say something like 'cluster spec is missing. Please make sure all of your tests have the @cluster annotation specified - it is now a requirement as of ducktape 0.11.0.'

@stan-is-hate (Member, Author) commented:

@imcdo Not enforcing this without parallelism is a separate question though; I'll look into whether the behavior can be made adjustable based on that.
However, I still think it's a good idea for every test to declare its node usage, since it forces test authors to be more diligent, and I'm also not sure it's a good idea to make the annotation required or not depending on the execution mode.

@imcdo (Member) left a comment:

Looks good thanks stan!

Review comment on tests/runner/resources/test_various_num_nodes.py (outdated, resolved)
@stan-is-hate (Member, Author) commented:

@lbradstreet @imcdo even though this seems to be ready, I'm not going to merge it just yet: a quick search across Kafka and Confluent tests reveals that some tests still use the min_cluster_spec() method (which is removed in this version). I'll either update those tests or pin the ducktape minor version requirement for those repos before merging this.

@stan-is-hate (Member, Author) commented:

Actually, I think a better approach is to keep the min_cluster_size method for compatibility, just hardcode it to return 0 and mark it as deprecated. This way we won't have to push a ton of PRs to multiple AK branches; they'll just continue to work.
The only reason to even worry about these methods in subclasses is that they usually call super(), which would fail if we removed the method from the superclass, so I'm adding it back.
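
A minimal sketch of such a compatibility shim, assuming the deprecation is signaled via Python's warnings module (the actual change may differ):

```python
import warnings


class Test:  # fragment of the base test class, shown here only for illustration
    def min_cluster_size(self):
        """Deprecated; declare node usage via the @cluster annotation instead."""
        warnings.warn(
            "min_cluster_size() is deprecated; use the @cluster annotation",
            DeprecationWarning,
        )
        # Hardcoded so subclasses that call super().min_cluster_size() keep working.
        return 0
```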

@stan-is-hate requested a review from a team on July 13, 2022
@stan-is-hate merged commit d557474 into confluentinc:master on Jul 13, 2022
gousteris pushed a commit to gousteris/ducktape that referenced this pull request Aug 30, 2023