Use cram-style integration tests #250
Conversation
This finds the correct directory even when the file is sourced
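Presumably via the usual bash idiom for this (a sketch, not necessarily the exact line from the commit):

```sh
# BASH_SOURCE points at this file even when it is sourced rather than
# executed, whereas $0 would point at the sourcing shell's script.
DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
```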
The script manages paths, runs some sanity checks, and runs a diff on the expected output to verify success or failure.
Also:
- Adds an apalache-jar target that builds the package without tests
- Adds a phony promote target that promotes corrected integration test output back into the source .md files
- Adds a step to remove the target/ directory
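The new targets presumably look something like this (a hypothetical sketch -- the build command and file paths are assumptions, not the actual recipes from the repo):

```make
# Hypothetical sketch; recipe lines must be indented with tabs.

# Build the Apalache package without running the unit tests
apalache-jar:
	mvn package -DskipTests

# Promote mdx's corrected output back into the source test file;
# mdx writes a sibling *.corrected file when the actual output
# diverges from the expected output recorded in the markdown.
.PHONY: promote
promote:
	cp test/tla/cli-integration-tests.md.corrected test/tla/cli-integration-tests.md
```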
I will run this in parallel with the old integration tests until confirming that it provides comparable and consistent coverage.
This isn't supported on the diff command that ships with macOS, apparently.
Checking that both integration suites fail as expected.
This reverts commit ada26b1. This change didn't actually cause the expected failure.
This makes it clearer what parts of the process are eating up time.
In 5d5fdbd I triggered a failure, to confirm that the new integration tests catch the failure in CI in parity with the existing integration tests. See https://github.com/informalsystems/apalache/runs/1188178566?check_suite_focus=true#step:9:11 for the logs reporting the failure, which show how failures appear in CI (as a diff). So ada26b1 bears witness to both integration test harnesses passing under the same conditions, and 5d5fdbd bears witness to both failing under the same conditions.
This reverts commit 5d5fdbd.
This lets us shadow the apalache-mc binary with an invocation of the dockerized version.
Used to shadow apalache-mc in integration tests
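A shadowing wrapper of this sort might look like the following (a sketch; the image name and mount point are illustrative assumptions):

```sh
#!/bin/sh
# Placed on PATH ahead of the real apalache-mc so the integration tests
# exercise the dockerized build instead of the locally built binary.
exec docker run --rm -v "$(pwd)":/var/apalache apalache/mc "$@"
```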
Just ensures that the proper environment is set up before invoking the mdx-tests.
A substantive improvement to the integration tests would be an explanation in each section describing what is being tested, but I don't have the context to provide that yet, and this PR is already longer than would be ideal.
The cram tests are much cleaner than the handwritten ones! My only worry is the opam dependency (see the comment).
```yaml
      with:
        path: ~/.opam
        key: ${{ runner.os }}-opam-4.11.0
    - name: Set up opam
```
I am a bit worried that we add `opam` as a dependency. I had problems with the stability of opam packages a few years ago, though this may have been improved since then. Are there alternatives, in case we experience a problem with `mdx`?
> I had problems with the stability of opam packages a few years ago

Did you encounter problems with opam when pinning to a specific version of a package and a specific version of the compiler? Or was it with updating dependencies?

> though this may have been improved since then

I suspect so! Note that Opam 2 was released in 2018, and was a major overhaul.

> Are there alternatives, in case we experience a problem with mdx?

I don't know of any alternative software that supports these features of `mdx` (see the sketch after this list):
- Extract tests from markdown
- Run single tests in particular sections
- Multiline wildcards
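For example, running just the tests under one markdown section looks roughly like this (a sketch; the exact flag spelling may vary across `mdx` versions, and the section name is illustrative):

```sh
# Run only the tests whose enclosing markdown header matches the regexp
ocaml-mdx test --section 'checking an invariant' test/tla/cli-integration-tests.md
```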
Also, I'm a contributor to `mdx` (just some small patches), and I feel confident I could resolve any issues that might arise.
That said, we could try using cram instead. That adds a Python dependency (which I find more worrisome, but you may find less so ;) ), and it won't support extracting tests from a markdown file. But if we broke all the tests into their own files, we could still trigger them in isolation (well, at least each set that lives in its own file).
My main hesitation with `cram` is that it appears to be abandoned: see aiiie/cram#32, and it has not had a new release since 2016.
I see. Then let's stay with `mdx` and merge!
Feel free to merge.
Fix logging for integration tests. Also adds a flag to enable debugging output to stdout. This is a followup to #250 -- I had misconfigured the logger, and so nothing was being logged to the files or stdout.
Closes #192
This PR uses `mdx` for writing and running cram-style CLI integration tests. The tests are embedded in code blocks in a markdown file. Here's an example of what some tests look like:
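Something along these lines (a hedged sketch -- the spec file, flag, and output shown here are illustrative rather than copied from the suite; `...` is mdx's multiline wildcard, matching any run of output lines):

```sh
$ apalache-mc check --inv=Inv Counter.tla
...
The outcome is: NoError
...
```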
See https://github.com/informalsystems/apalache/blob/4a01124f3bdf692fbc6ba356e1077676c52531dc/test/tla/cli-integration-tests.md for the full suite of cram-style integration tests, and for the documentation describing how to run and write the tests.
Note on impact on CI build times.
tl;dr: Building the `mdx` dependency will add about 7 minutes to the integration tests in CI the first time they run on `unstable`. But it should only take a few dozen seconds to fetch the cache after that, and this saving should be inherited by all branches based on `unstable`.

This is all according to GitHub's documentation on caching in actions. Once the dependency is cached on the default branch -- `unstable` -- the cached artifact should be used by all branches based on the default. This holds for any branch based off of one that has already saved the cache. I tested this in #252, which confirmed that the dependency was fetched from the cache stored from this branch.

If it becomes an issue, we can always dockerize the dependency, prebuild a standalone artifact for our test environments, or use a different tool that achieves the same purpose (though I am unaware of an actively maintained replacement).
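Concretely, the caching step quoted in the review discussion above amounts to something like this (a sketch; the action version and step name are assumptions):

```yaml
# Cache the opam directory so mdx is only built from scratch once
- name: Cache opam packages
  uses: actions/cache@v2
  with:
    path: ~/.opam
    key: ${{ runner.os }}-opam-4.11.0
```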