tests(refactor): Adjust `mail_changedetector` + change detection helpers #2997

polarathene · 2023-01-11T08:08:41Z

Description

Refactored mail_changedetector to the new test conventions. Logic remains roughly the same, just easier to grok.
tls_letsencrypt test had some improvements as a result of this.
setup-cli test used the original setup helper methods, no change to logic there, just fully adapted to helper/setup.bash and adjusted the test case prefix.
Change detection helper was refactored. A generic log watching method is used, with support for waiting on an expected count.
- This was necessary as the mail_changedetector test as two concurrent change events happening, where one may have already processed before you can wait on it. Another test case cannot wait on completion, but instead needs to wait on a change detection starting.
Change detection helpers now live in a separate file. Not sure how useful that is. All tests that use it are migrated to the helper/common.bash methods. This also allowed for dropping the helpers from test_helper/common.bash, as well as the related methods in that file that now exist in helper/setup.bash.
We no longer use the previous change detection helper method (Oct 2020 for tests.bats). Thus I've removed that method and the related test_helper.bats cases for it. Existing methods should have fairly decent coverage in mail_changedetector using it against the actual changedetector service, and not just watching for a checksum change separately (which is not that useful for us).

This refactor removes the need to depend on sleep, shaving off approx 2 mins of time (potentially a bit less until some additional PRs are merged).

I've staged out changes across commits (with associated commit messages) that should make the review of diffs a bit more easier if it helps 😅

The original mail_changedetector test was implemented in Sep 2021. It was added with a focus on testing the locking script with multiple containers sharing a network volume for config (/tmp/docker-mailserver).

I have taken extra care with the changes done here (over several days). I ran repeat test runs, along with some other changes I'm pushing as separate PRs to ensure that the change detection tests remain consistent / reliable.

I was quite thorough with the refactor (easier to follow through the staged commit diffs). I am confident in it and the revised change detection helper.

Type of change

Improvement (non-breaking change that does improve existing functionality)

Checklist:

My code follows the style guidelines of this project
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
If necessary I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

test/helper/change-detection.bash

polarathene · 2023-01-11T08:17:49Z

test/helper/common.bash


-  repeat_until_success_or_timeout 60 __is_changedetector_finished
+  repeat_until_success_or_timeout 20 __has_expected_count


I haven't compared, but casper had suggested a different approach than polling like I'm doing here.

Suggested change

repeat_until_success_or_timeout 20 __has_expected_count

timeout 60 docker exec "${CONTAINER_NAME}" bash -c "tail -F '${MATCH_IN_LOG}' | grep --max-count ${EXPECTED_COUNT} '${MATCH_CONTENT}'"

That will take a stream of the log and exit once the result count is matched. It needed to use bash -c, even if using docker logs --follow for the input as I found it'd hang otherwise (presumably something to do with stdin not closing?)

I cannot reproduce the hanging. I tried this:

# exit as soon as the keyword "spawned" appears docker logs -n0 -f mail | grep -q spawned

Then in another session:

docker exec mail supervisorctl restart update-check

PS: The container must be running with SUPERVISOR_LOGLEVEL=info I think, that this example works.

I was able to reproduce on current master the same issue. I tested with replacing this:

docker-mailserver/test/tests/parallel/set1/spam_virus/spam_junk_folder.bats

Lines 33 to 35 in 1650cdf

# message will be added to a queue with varying delay until amavis receives it

run repeat_until_success_or_timeout 60 bash -c "docker logs ${CONTAINER_NAME} | grep 'Passed SPAM {RelayedTaggedInbound,Quarantined}'"

assert_success

Reproduction

# Passes before timing out: timeout 20 docker exec "${CONTAINER_NAME}" bash -c "tail -F /var/log/mail/mail.log | grep --max-count 1 'Passed SPAM'" # Same result here: docker exec "${CONTAINER_NAME}" bash -c "timeout 20 tail -F /var/log/mail/mail.log | grep --max-count 1 'Passed SPAM'" # Times out with failure, and due to -q has no output: timeout 20 bash -c "docker logs --follow ${CONTAINER_NAME} | grep -q 'Passed SPAM'" # Times out with failure, but correctly shows output of requested max lines (even if there were more present) timeout 20 bash -c "docker logs --follow ${CONTAINER_NAME} | grep --max-count 1 'Passed SPAM'" # Times out but **passes** (docker logs was stopped by timeout, grep was successful) timeout 20 docker logs --follow "${CONTAINER_NAME}" | grep --max-count 1 'Passed SPAM'

function _passed_spam() { # Any method from above here } # Replace as 2nd assert in either test case of `tests/parallel/set1/spam_virus/spam_junk_folder.bats: run _passed_spam assert_success

Perhaps it's something different with our environments or how docker logs --follow works on our machines? For me, given the above observations, it doesn't seem like the stream is closed?

I had also verified by running the same command in another terminal session directly while the test was running. It would find the match and output it, but not exit until the timeout was reached.

I've had a similar experience with -q (not as nice as --max-count), where either option was working at least for decoder as the value to match when the container was starting up (I could not reproduce with Passed SPAM for some reason..). I could repeat the command in the terminal several times and it'll exit quickly with either option successful. However a few seconds after that and it would fail, even though the logs command would clearly show the lines 🤷‍♂️

Clearly something is iffy there, at least on my end :\

UPDATE: My grep command is the problem.

I swapped grep for rg (ripgrep), and now it works as you'd expect...

So something wrong with my grep command I guess? 🤷‍♂️

I'd rather avoid the -q or --max-count approach, at least with a local grep call, as it seems that's where the problem is. The containers grep was working fine with tail, which would be ok as it avoids false positives when I run tests locally.

We could add ripgrep as a dependency for testing like we do with jq I suppose? (but then CI may need to grab/install it each time, unless it already provides that like it does jq)

So something wrong with my grep command I guess? 🤷‍♂️

yes 😄

docker logs ${CONTAINER_NAME} | grep 'Passed SPAM {RelayedTaggedInbound,Quarantined}'"

When you don't pass -q or --max-count 1, grep runs infinitly. Why should it stop?

If you only need to now, if a keyword was found, go with -q. If you need the output to parse further or make assertions, go with --max-count 1.

When you don't pass -q or --max-count 1, grep runs infinitly. Why should it stop?

You're referring to the current version in master I referenced? That is not using --follow, so it does not run infinitely. The attempts I tried can be found in the collapsed "> Reproduction" section.

I collapsed it after responding with an update about my local grep being the problem. I understand the reason for -q or --max-count 1, but only if we're using grep within the container (doesn't make sense for docker logs --follow due to that), as for whatever reason my local grep is misbehaving which would be problematic for me running tests locally 😅 (something I've been doing quite frequently lately)

If we're to use either option with docker logs --follow, then I'd need to know why my grep is broken and how to fix it, or add ripgrep as a dependency for tests.

The attempts I tried can be found in the collapsed "> Reproduction" section.

Thanks, I overlooked that.

BTW: Are you running the tests on debian/ubuntu?

Are you running the tests on debian/ubuntu?

EndeavourOS (ArchLinux based)

Depending on how much more time you want to invest, you may try the "grep" binary from debian:

docker create --name grep debian docker cp grep:/bin/grep . docker rm grep

I am wondering, that such a basic tool behaves differently between linux distros.

PS: If we are ever in doubt, we should stick with debian based distros supported for testing (because CI is also running on ubuntu).

georglauterbach

Looks really good to me! Just some very minor stylistic changes :)

test/helper/change-detection.bash

test/helper/common.bash

test/helper/setup.bash

test/tests/serial/mail_changedetector.bats

georglauterbach · 2023-01-14T18:10:34Z

Got an upcoming PR about changes to test/helper/*.sh which should be merged after this and #3004. The PR will adjust mail_privacy.bats and refactor test/helper/*.sh, i.e. apply our style guides and documentation. Just FYI so we avoid large merge conflicts.

UPDATE: branch pushed, name = tests/more-reqrites-1

polarathene · 2023-01-14T21:15:09Z

UPDATE: branch pushed, name = tests/more-reqrites-1

I see you've got a nice clean separation of commits so that's fine if you want to bundle mail_privacy.bats into that 👍

If you're planning to rework more tests into that branch, maybe use a separate branch for the broader change with renaming; then after that's merged cherry-pick the extra commits into a new branch / PR? 😅

I'm not bothered either way, but it is preferable to separate out diff noise (like the method renaming interfering with a different scope of changes) from the "review changes" view when possible :)

Small heads-up, there will be some conflicts for either you or me to deal with as I'm presently putting together a test file that collects all the process checking and restarts into a single file. That affects some of the tests your function rename touches as it deletes the test cases from those parallel set tests.

It affects the following tests presently:

parallel/set1/spam_virus/clamav.bats
parallel/set1/spam_virus/fail2ban.bats
parallel/set1/spam_virus/disabled_clamav_spamassassin.bats
parallel/set1/spam_virus/postgrey_enabled.bats
serial/mail_fetchmail.bats
serial/mail_smtponly.bats
serial/mail_with_ldap.bats
serial/mail_with_postgrey_disabled_by_default.bats
serial/tests.bats

It may also remove the need for check_if_process_is_running() helper, or I'll be modifying that method.

casperklein · 2023-01-14T21:59:33Z

test/helper/common.bash

+  if [[ -z $EXPECTED_COUNT ]]
+  then
+    # +1 of starting count:
+    EXPECTED_COUNT=$(bc <<< "$(__get_count) + 1")


Suggested change

EXPECTED_COUNT=$(bc <<< "$(__get_count) + 1")

EXPECTED_COUNT=$(( $(__get_count) + 1 ))

Oh weird, I remember having trouble getting that to work at the time and had ended up with the bc <<< version 😅

That works well, thanks! 😁

EDIT: Broke smtp-delivery.bats where an implicit count is used.

No glue why this doesn't work and the tests fail 🤷

Alright, I found the root cause.

Because, set -e is in use, the $(()) operation fails, because the called function __get_count returns exit code 1, when there are 0 matches.

As 0 matches is a valid return value (for us), this should be fixed by appending || true:

docker exec "${CONTAINER_NAME}" grep --count "${MATCH_CONTENT}" "${MATCH_IN_LOG}" || true.

After that, EXPECTED_COUNT=$(( $(__get_count) + 1 )) works fine.

georglauterbach · 2023-01-14T22:23:32Z

I'll be resolving the conflicts, don't worry! I'll be using a train ride tomorrow, so I hope we can merge this until tomorrow evening - that'd be nice :D

polarathene · 2023-01-14T23:48:16Z

There was a failure related to the change detection wait method in parallel set3 smtp-delivery. Not sure if it's related to the last change suggestion. I re-ran the jobs and it seems to have failed that test again.

polarathene · 2023-01-14T23:52:29Z

Confirmed. The mail_changedetector test didn't error as that conditional path in the helper method isn't covered by it. When no explicit expected count is set, which is the case for usage elsewhere, it should set the +1 of current count.

The change @casperklein proposed caused instant failure during setup_file() when called. Reverted to how it was before which works correctly.

`supervisorctl tail` is not the most reliably way to get logs for the latest change detection and has been known to be fragile in the past. We can instead read the full log for the service directly with `tac` and `sed` to extract all log content since the last change detection. Common asserts have also been extracted out into separate methods.

Container 1 is still blocked at this point from an existing lock and change event. Make the lock stale immediately and no extra sleep is required when paired with the helper method to wait until the event is processed (which should remove the stale lock).

- Simplify the test case so it's easier to grok. - 2nd test case (blocking) extracts out initial setup into a separate method and merges the later service restart logic which is redundant. - Additional comments for improved context of what is going on / expected.

- Add explicit counting arg to change detection support. - Extract revised logic into it's own generic helper method. - Utilize this for a separate method that monitors for a change event having started, but not waiting for completion. This allows dropping the 40 sec of remaining `sleep` in `mail_changedetector` test. It was also required due to potentially missing the timing of a change event completing concurrently in a 2nd container that needed to be waited on and then checked.

- Switch to common container setup helpers - Update container name and change usage to variables instead. - Adopt the new convention of prefix variable for test cases (revised test case descriptions).

This test file was already adapted to the original common setup helpers. - `TEST_NAME` replaced with `CONTAINER_NAME`. - Prefix var added, test case descriptions drop explicit prefix. - No other changes.

- New helper file for sharing these helpers to tests. - Includes the helpful log method from changedetector tests. - No longer need to maintain duplicate copies of these methods during the test migration. All tests that use them are now importing the separate helper file. - `tls_letsencrypt.bats` has switched to using the log helper. - Generic log count helper is removed from `test_helper/common.bash` as any test that needs it in future can adapt to `helper/common.bash`.

This helper does not seem useful as moving away from `supervisorctl tail` and no other tests had a need for it.

No other tests depend on this. Future tests will adopt the revised versions from `helper/setup.bash`. Additionally updates `helper/setup.bash` comments that are no longer applicable to `TEST_TMP_CONFIG` and `CONTAINER_NAME`.

Review feedback Co-authored-by: Georg Lauterbach <44545919+georglauterbach@users.noreply.github.com>

Review feedback request

Co-authored-by: Casper <casperklein@users.noreply.github.com>

This reverts commit ed8078b.

georglauterbach · 2023-01-15T11:31:35Z

Can be merged. I'll be placing a new PR then for the refactored helper ;)

casperklein · 2023-01-15T12:20:51Z

There was a failure related to the change detection wait method in parallel set3 smtp-delivery.

How to run only the set3 tests? This doesn't work:

make tests/parallel/set3

parallel: invalid option -- '-'
parallel [OPTIONS] command -- arguments
        for each argument, run command with argument, in parallel
parallel [OPTIONS] -- commands
        run specified commands in parallel
   bats warning: Executed 0 instead of expected 31 tests

31 tests, 0 failures, 31 not run in 1 seconds

make: *** [Makefile:50: tests/parallel/set3] Error 1

Edit: For now, I use test/bats/bin/bats -T test/tests/parallel/set3/smtp-delivery.bats

georglauterbach · 2023-01-15T12:44:46Z

Did you run make generate-accounts before? Running the command make clean generate-accounts tests/parallel/setX works for me.

casperklein · 2023-01-15T12:48:55Z

I did run just "make" to create all necessary stuff. Then tests failed and I wanted to re-run only set3.

Same with your example:

make clean generate-accounts tests/parallel/set3
parallel: invalid option -- '-'
parallel [OPTIONS] command -- arguments
        for each argument, run command with argument, in parallel
parallel [OPTIONS] -- commands
        run specified commands in parallel
   bats warning: Executed 0 instead of expected 31 tests

31 tests, 0 failures, 31 not run in 1 seconds

make: *** [Makefile:50: tests/parallel/set3] Error 1

Better solution found.

georglauterbach · 2023-01-15T12:53:32Z

I will have a look later at this too.

georglauterbach · 2023-01-15T14:02:39Z

Tested it again; runs smoothly on my system (Ubuntu 22.04 LTS). Packages are up-to-date.

EDIT: Does make clean generate-accounts test/smtp-delivery work?

casperklein · 2023-01-15T14:06:27Z

EDIT: Does make clean generate-accounts test/smtp-delivery work?

yes

polarathene · 2023-01-15T23:30:14Z

Ok, missed my opportunity to merge with both approvals it seems 😛

I've added the solution @casperklein requested, hope this is good for merging now 👍

polarathene added area/tests kind/improvement Improve an existing feature, configuration file or the documentation labels Jan 11, 2023

polarathene added this to the v12.0.0 milestone Jan 11, 2023

polarathene requested review from casperklein and georglauterbach January 11, 2023 08:08

polarathene self-assigned this Jan 11, 2023

polarathene commented Jan 11, 2023

View reviewed changes

test/helper/change-detection.bash Outdated Show resolved Hide resolved

polarathene commented Jan 11, 2023

View reviewed changes

georglauterbach requested changes Jan 12, 2023

View reviewed changes

polarathene requested a review from georglauterbach January 13, 2023 22:15

georglauterbach approved these changes Jan 13, 2023

View reviewed changes

georglauterbach previously approved these changes Jan 13, 2023

View reviewed changes

casperklein reviewed Jan 14, 2023

View reviewed changes

polarathene dismissed georglauterbach’s stale review via ed8078b January 14, 2023 22:26

polarathene requested review from georglauterbach and casperklein January 14, 2023 22:27

casperklein previously approved these changes Jan 14, 2023

View reviewed changes

polarathene dismissed casperklein’s stale review via 76aea88 January 14, 2023 23:51

polarathene requested a review from casperklein January 14, 2023 23:52

casperklein previously approved these changes Jan 15, 2023

View reviewed changes

polarathene added 5 commits January 15, 2023 18:34

tests(chore): Migrate to current test conventions

4378031

- Switch to common container setup helpers - Update container name and change usage to variables instead. - Adopt the new convention of prefix variable for test cases (revised test case descriptions).

polarathene and others added 8 commits January 15, 2023 18:34

tests(chore): Convert setup-cli.bats to new test conventions

c9a8d5c

This test file was already adapted to the original common setup helpers. - `TEST_NAME` replaced with `CONTAINER_NAME`. - Prefix var added, test case descriptions drop explicit prefix. - No other changes.

tests(refactor): tls_letsencrypt.bats remove _get_service_logs()

fa4ec9c

This helper does not seem useful as moving away from `supervisorctl tail` and no other tests had a need for it.

chore: Minor style changes

a28d19d

Review feedback Co-authored-by: Georg Lauterbach <44545919+georglauterbach@users.noreply.github.com>

chore: Relocate inline docs

91610dc

Review feedback request

chore: Simplify setting EXPECTED_COUNT

32aea55

Co-authored-by: Casper <casperklein@users.noreply.github.com>

Revert "chore: Simplify setting EXPECTED_COUNT"

655bb50

This reverts commit ed8078b.

polarathene force-pushed the tests/migrate-changedetector branch from 76aea88 to 655bb50 Compare January 15, 2023 05:34

georglauterbach approved these changes Jan 15, 2023

View reviewed changes

georglauterbach previously approved these changes Jan 15, 2023

View reviewed changes

casperklein self-requested a review January 15, 2023 12:40

chore: Use || true to simplify setting EXPECTED_COUNT correctly

0893c98

polarathene dismissed georglauterbach’s stale review via 0893c98 January 15, 2023 23:25

polarathene requested a review from georglauterbach January 15, 2023 23:30

casperklein approved these changes Jan 16, 2023

View reviewed changes

Merge branch 'master' into tests/migrate-changedetector

cbf58ef

georglauterbach approved these changes Jan 16, 2023

View reviewed changes

polarathene merged commit 8d80c63 into docker-mailserver:master Jan 16, 2023

casperklein mentioned this pull request Jan 19, 2023

tests(refactor): Improve consistency and documentation for test helpers #3012

Merged

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tests(refactor): Adjust `mail_changedetector` + change detection helpers #2997

tests(refactor): Adjust `mail_changedetector` + change detection helpers #2997

polarathene commented Jan 11, 2023 •

edited

polarathene Jan 11, 2023 •

edited

casperklein Jan 12, 2023 •

edited

This comment was marked as outdated.

polarathene Jan 12, 2023 •

edited

polarathene Jan 13, 2023

casperklein Jan 13, 2023

polarathene Jan 13, 2023

casperklein Jan 14, 2023

polarathene Jan 14, 2023

casperklein Jan 14, 2023 •

edited

georglauterbach left a comment

georglauterbach commented Jan 14, 2023 •

edited

polarathene commented Jan 14, 2023 •

edited

casperklein Jan 14, 2023

polarathene Jan 14, 2023 •

edited

casperklein Jan 15, 2023

casperklein Jan 15, 2023 •

edited

georglauterbach commented Jan 14, 2023

polarathene commented Jan 14, 2023

polarathene commented Jan 14, 2023

georglauterbach commented Jan 15, 2023

casperklein commented Jan 15, 2023 •

edited

georglauterbach commented Jan 15, 2023

casperklein commented Jan 15, 2023 •

edited

georglauterbach commented Jan 15, 2023

georglauterbach commented Jan 15, 2023 •

edited

casperklein commented Jan 15, 2023

polarathene commented Jan 15, 2023


		repeat_until_success_or_timeout 60 __is_changedetector_finished
		repeat_until_success_or_timeout 20 __has_expected_count

	repeat_until_success_or_timeout 20 __has_expected_count
	timeout 60 docker exec "${CONTAINER_NAME}" bash -c "tail -F '${MATCH_IN_LOG}' \| grep --max-count ${EXPECTED_COUNT} '${MATCH_CONTENT}'"

	# message will be added to a queue with varying delay until amavis receives it
	run repeat_until_success_or_timeout 60 bash -c "docker logs ${CONTAINER_NAME} \| grep 'Passed SPAM {RelayedTaggedInbound,Quarantined}'"
	assert_success

	EXPECTED_COUNT=$(bc <<< "$(__get_count) + 1")
	EXPECTED_COUNT=$(( $(__get_count) + 1 ))

tests(refactor): Adjust mail_changedetector + change detection helpers #2997

tests(refactor): Adjust mail_changedetector + change detection helpers #2997

Conversation

polarathene commented Jan 11, 2023 • edited

Description

Type of change

Checklist:

polarathene Jan 11, 2023 • edited

Choose a reason for hiding this comment

casperklein Jan 12, 2023 • edited

Choose a reason for hiding this comment

This comment was marked as outdated.

polarathene Jan 12, 2023 • edited

Choose a reason for hiding this comment

polarathene Jan 13, 2023

Choose a reason for hiding this comment

casperklein Jan 13, 2023

Choose a reason for hiding this comment

polarathene Jan 13, 2023

Choose a reason for hiding this comment

casperklein Jan 14, 2023

Choose a reason for hiding this comment

polarathene Jan 14, 2023

Choose a reason for hiding this comment

casperklein Jan 14, 2023 • edited

Choose a reason for hiding this comment

georglauterbach left a comment

Choose a reason for hiding this comment

georglauterbach commented Jan 14, 2023 • edited

polarathene commented Jan 14, 2023 • edited

casperklein Jan 14, 2023

Choose a reason for hiding this comment

polarathene Jan 14, 2023 • edited

Choose a reason for hiding this comment

casperklein Jan 15, 2023

Choose a reason for hiding this comment

casperklein Jan 15, 2023 • edited

Choose a reason for hiding this comment

georglauterbach commented Jan 14, 2023

polarathene commented Jan 14, 2023

polarathene commented Jan 14, 2023

georglauterbach commented Jan 15, 2023

casperklein commented Jan 15, 2023 • edited

georglauterbach commented Jan 15, 2023

casperklein commented Jan 15, 2023 • edited

georglauterbach commented Jan 15, 2023

georglauterbach commented Jan 15, 2023 • edited

casperklein commented Jan 15, 2023

polarathene commented Jan 15, 2023

tests(refactor): Adjust `mail_changedetector` + change detection helpers #2997

tests(refactor): Adjust `mail_changedetector` + change detection helpers #2997

polarathene commented Jan 11, 2023 •

edited

polarathene Jan 11, 2023 •

edited

casperklein Jan 12, 2023 •

edited

polarathene Jan 12, 2023 •

edited

casperklein Jan 14, 2023 •

edited

georglauterbach commented Jan 14, 2023 •

edited

polarathene commented Jan 14, 2023 •

edited

polarathene Jan 14, 2023 •

edited

casperklein Jan 15, 2023 •

edited

casperklein commented Jan 15, 2023 •

edited

casperklein commented Jan 15, 2023 •

edited

georglauterbach commented Jan 15, 2023 •

edited