[Core] Separate run, dry-run and skip execution modes #2109

mpkorstanje · 2020-09-03T13:43:35Z

Given a scenario with several steps:

Scenario: 
 Given a passed step
 And a skipped step
 And an pending step
 And an undefined step
 And an ambiguous step

When executing this the outcome of the result is:

a passed step -> PASSED
a skipped step -> SKIPPED
an pending step -> PENDING
an undefined step -> SKIPPED
an ambiguous step -> AMBIGUOUS

This is wrong, because after the first non-passed result all other steps should be skipped. So:

a passed step -> PASSED
a skipped step -> SKIPPED
an pending step -> SKIPPED
an undefined step -> SKIPPED
an ambiguous step -> SKIPPED

When using executing this scenario with --dry-run the result is:

a passed step -> PASSED
a skipped step -> PASSED
an pending step -> PASSED
an undefined step -> UNDEFINED
an ambiguous step -> AMBIGUOUS

And surprisingly enough this is also wrong. While the skipped and pending states can only be detected by executing an implemented step, the undefined step should be marked as undefined and the ambiguous step that follows it should be skipped.

a passed step -> PASSED
a skipped step -> PASSED
an pending step -> PASSED
an undefined step -> UNDEFINED
an ambiguous step -> SKIPPED

The cause for this confusion lies in the fact that --dry-run was implemented using the skip mechanism rather then its own execution mode that is distinct from both a regular run and skip mode. By implementing these as individual execution modes we can avoid this confusion.

Implementing this however revealed that our formatters were often being tested with completely undefined scenarios. This does not provide a representative test and allowed #2102 to come into existence. Fixing this was rather complicated, the formatters were being tested with an overly complicated mock implementation. Replacing this mock implementation with stubs made the tests more readable and removed a significant chunk of complexity.

Fixes: #2102

coveralls · 2020-09-04T16:00:34Z

Coverage increased (+0.3%) to 86.48% when pulling 16e7a33 on 2102-fix-dry-run into b4ed58f on main.

The `TestHelper` is an overly complex way to run tests. Building a runtime and stubbing the step definitions provides a much simpler and more reliable way to test Cucumbers execution. See the `JUnitFormatterTest` for implementation examples.

Fixes: #2102

Good riddance

aslakhellesoy · 2020-09-05T11:15:52Z

This sounds like nice cleanup!

I still think there is room for improvement.

The UNDEFINED and AMBIGUOUS statuses can be identified without execution. They're just a special kind of SKIPPED with more information about why (0 matches or >=2 matches). Marking a step that is UNDEFINED or AMBIGUOUS as SKIPPED weakens the signal about why a step was skipped.

Here is an example. I've modified your example a little since a skipped step doesn't make sense to me - users can't skip steps.

# Execution
a matched step -> PASSED
a matched throwing step -> FAILED
a matched step -> SKIPPED
a matched pending step -> SKIPPED
an undefined step -> UNDEFINED
an ambiguous step -> AMBIGUOUS

# Dry Run
a matched step -> SKIPPED
a matched throwing step -> SKIPPED
a matched step -> SKIPPED
a matched pending step -> SKIPPED
an undefined step -> UNDEFINED
an ambiguous step -> AMBIGUOUS

WDYT @mpkorstanje?

mpkorstanje · 2020-09-06T20:19:15Z

It would be more accurate to say that a user aborts the scenario. This is a common feature in JUnit and TestNG and Cucumber supports several exceptions. Because Cucumber doesn't have an aborted state these are all converted to the skipped state.

https://github.com/cucumber/cucumber-jvm/blob/main/core/src/main/java/io/cucumber/core/runner/TestStep.java#L28

While it is nice that we can inform the user that there were multiple problems. The primary reason for skipping all steps after the first non-passing step is that it simplifies interpreting Cucumbers results.

I would expect that the result of scenario is the most severe results of the steps in that scenario.
I would also expect that the first non-passing step determines the result of the scenario.

By skipping all steps after the first non-passing step both expectations hold.

Additionally this also makes it much easier to integrate with JUnit and TestNG. Both only support one exception per test. Having additional states may make people wonder why JUnit marked the test as skipped but Cucumber reports undefined steps.

And in practice, Cucumber reports all undefined steps with the first exception because we collect the SnippetSuggested event.

One improvement I would consider though is also emitting ambiguous step events. That way we can report all ambiguous and undefined steps in a single exception and we don't have to throw the AmbiguousStepDefinitionsException from the AmbiguousPickleStepDefinitionsMatch.

mpkorstanje · 2020-09-06T20:20:00Z

If you think this makes sense it might be good to create a new feature request where we can hash out the details. Otherwise lets talk about it in the regular call.

mpkorstanje · 2020-09-06T21:13:19Z

Note to self:

We can avoid throwing UndefinedStepDefinitionException and AmbiguousStepDefinitionsException by making the runStep and dryRunStep return the state rather then ExecutionMode always returning pass. This avoids the use of exceptions as a control flow mechanism. Currently the exceptions are mapped back to state again in TestStep.mapThrowableToStatus which is redundant.

AllureCucumber6Jvm correspondingly updated dry-run steps results changed to PASSED due to cucumber behavior change: cucumber/cucumber-jvm#2109

mpkorstanje force-pushed the 2102-fix-dry-run branch 6 times, most recently from bf00837 to b855ba9 Compare September 4, 2020 15:47

mpkorstanje force-pushed the 2102-fix-dry-run branch from 49a12e9 to 7f692f6 Compare September 4, 2020 16:17

mpkorstanje added 12 commits September 4, 2020 18:19

[Core] Deprecate TestHelper

8822aeb

The `TestHelper` is an overly complex way to run tests. Building a runtime and stubbing the step definitions provides a much simpler and more reliable way to test Cucumbers execution. See the `JUnitFormatterTest` for implementation examples.

[Core] Do not use TestHelper in JUnitFormatterTest

37152ac

[Core] Do not use TestHelper in TestNGFormatterTest

5329858

[Core] Undefined steps should fail during --dry-run

d3bd33b

Fixes: #2102

[Core] Do not use TestHelper in JsonFormatterTest

448f076

[Core] Clean up RuntimeTest

3643d55

[Core] Remove TestHelper from RerunFormatterTest

20efa69

[Core] Compare bytes with isBytesEqualTo

1b329be

[Core] Do not use TestHelper in TeamCityPluginTest

172a065

[Core] Reformat code

3a974ea

[Core] Separate dry-run and skip execution strategies

79cf0ea

[Core] Remove TestHelper from TimelineFormatterTest

dcbe190

mpkorstanje force-pushed the 2102-fix-dry-run branch from 7f692f6 to dcbe190 Compare September 4, 2020 16:19

mpkorstanje added 4 commits September 4, 2020 20:58

[Core] Remove TestHelper from PrettyFormatterTest

6ed942c

[Core] Remove TestHelper from JsonFormatterTest

068d326

[Core] Remove TestHelper

11aeea4

Good riddance

[Core] Remove .toString("UTF-8") from JsonFormatterTest

35b74b2

mpkorstanje changed the title ~~[Core] Fix dry run~~ [Core] Seperate --dry-run and skip execution modes Sep 4, 2020

mpkorstanje changed the title ~~[Core] Seperate --dry-run and skip execution modes~~ [Core] Seperate run, dry-run and skip execution modes Sep 4, 2020

mpkorstanje force-pushed the 2102-fix-dry-run branch from 874a16c to e0d5369 Compare September 4, 2020 20:05

mpkorstanje marked this pull request as ready for review September 4, 2020 20:05

mpkorstanje force-pushed the 2102-fix-dry-run branch from e0d5369 to 3054771 Compare September 4, 2020 20:07

[Core] Verify dry-run implementation

e0f8213

mpkorstanje force-pushed the 2102-fix-dry-run branch from 3054771 to e0f8213 Compare September 4, 2020 20:08

Merge remote-tracking branch 'origin/main' into 2102-fix-dry-run

ad9495e

mpkorstanje changed the title ~~[Core] Seperate run, dry-run and skip execution modes~~ [Core] Separate run, dry-run and skip execution modes Sep 4, 2020

Update CHANGELOG

16e7a33

mpkorstanje force-pushed the 2102-fix-dry-run branch from caeb1e1 to 16e7a33 Compare September 4, 2020 20:20

mpkorstanje merged commit f6de527 into main Sep 4, 2020

mpkorstanje deleted the 2102-fix-dry-run branch September 4, 2020 20:44

MetallFoX mentioned this pull request Nov 24, 2020

Cucumber gherkin version 15.0.2 to support cucumber 6.9.0 allure-framework/allure-java#493

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Core] Separate run, dry-run and skip execution modes #2109

[Core] Separate run, dry-run and skip execution modes #2109

mpkorstanje commented Sep 3, 2020 •

edited

coveralls commented Sep 4, 2020 •

edited

aslakhellesoy commented Sep 5, 2020 •

edited

mpkorstanje commented Sep 6, 2020

mpkorstanje commented Sep 6, 2020

mpkorstanje commented Sep 6, 2020 •

edited

[Core] Separate run, dry-run and skip execution modes #2109

[Core] Separate run, dry-run and skip execution modes #2109

Conversation

mpkorstanje commented Sep 3, 2020 • edited

coveralls commented Sep 4, 2020 • edited

aslakhellesoy commented Sep 5, 2020 • edited

mpkorstanje commented Sep 6, 2020

mpkorstanje commented Sep 6, 2020

mpkorstanje commented Sep 6, 2020 • edited

mpkorstanje commented Sep 3, 2020 •

edited

coveralls commented Sep 4, 2020 •

edited

aslakhellesoy commented Sep 5, 2020 •

edited

mpkorstanje commented Sep 6, 2020 •

edited