Reference tests and related improvements #3286

matthewrmshin · 2019-08-12T09:32:52Z

These changes are attempts to partially address #2894 and #3046.

User visible changes:

- Removed settings:
  - [cylc]log resolved dependencies
  - [cylc][[reference test]]* except expected task failures.
- Moved [cylc]abort if any task fails to [cylc][[events]]abort if any task fails so it lives with the other abort if/on ... settings.
- Removed the cylc check-triggering command.
- Task triggers are now always logged, at level INFO.
- Fixed the cylc submit command (the job was unable to load job.sh).
- Fixed the cylc run --stop-cycle-point=POINT logic.
- Job poll messages are now ignored when the task is already in a retrying state. This fixes a test that was flaky in busy environments, where messages and polls arriving at widely spaced intervals confused the event manager.
- Fixed: retrying held tasks should no longer be released for submission (broken by hold_swap => is_held #3230).
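
The poll-handling change listed above can be pictured with a minimal sketch. Everything in it (the TaskProxy class, the status strings, the handle_message function) is hypothetical and only illustrates the idea; it is not the code in cylc/flow/task_events_mgr.py.

```python
from dataclasses import dataclass, field

# Hypothetical status names, for illustration only.
STATUS_RUNNING = "running"
STATUS_RETRYING = "retrying"


@dataclass
class TaskProxy:
    """Stand-in for a task known to the scheduler (hypothetical)."""
    name: str
    status: str
    log: list = field(default_factory=list)


def handle_message(task, message, from_poll=False):
    """Apply a job message to a task; return True if it was processed.

    A late poll result is ignored when the task is already retrying, so it
    cannot overwrite the state set by the earlier failure message.
    """
    if from_poll and task.status == STATUS_RETRYING:
        task.log.append(f"ignored poll message: {message}")
        return False
    task.log.append(f"processed: {message}")
    if message == "failed":
        # Assumption for this sketch: the task still has retries left.
        task.status = STATUS_RETRYING
    return True
```

In this sketch, a "failed" message moves the task to the retrying state, and a poll result that arrives afterwards is recorded but otherwise dropped.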

Internal changes:

- Cleaner reference test logic:
  - Detect the reference test option automatically on shutdown.
  - Less logic required to deal with reference test configuration.
  - Removed the need to run an external command.
  - Generate a filtered test log in reference tests, so there is less to load and parse.
  - Print only messages in the reference/test log. (No more unnecessary date/time, level, etc. in future reference logs.)
  - Parse reference/test logs on open file handles instead of loading the full logs into memory.
- Simplified the abort-on-task-failure logic.
- Ensure that all test suite directories are cleaned up on success.
- Removed various suite timeout settings from the tests, which were causing instability in busy environments. Reference tests now get a 3-minute inactivity setting injected automatically. (The timeout setting relies on the suite stalling before it times out; the inactivity setting is better in this respect.)
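
As a rough illustration of parsing reference/test logs on open file handles, here is a minimal sketch. The line shape matched by REC_TRIGGER and the names iter_triggers and compare_triggers are assumptions made for this example, not the implementation in this PR.

```python
import io
import re

# Assumed line shape, for illustration: "<task> -triggered off <list>".
REC_TRIGGER = re.compile(r"(\S+) -triggered off (.*)$")


def iter_triggers(handle):
    """Lazily yield (task, triggers) pairs from an open log file handle."""
    for line in handle:  # reads one line at a time, not the whole file
        match = REC_TRIGGER.search(line)
        if match:
            yield match.group(1), match.group(2).strip()


def compare_triggers(ref_handle, test_handle):
    """Return (missing, unexpected) trigger records, each sorted."""
    ref = set(iter_triggers(ref_handle))
    test = set(iter_triggers(test_handle))
    return sorted(ref - test), sorted(test - ref)


# Usage with in-memory handles; real logs would be opened with open().
ref_log = io.StringIO(
    "[a.1] -triggered off []\n"
    "[b.1] -triggered off ['a.1']\n"
)
test_log = io.StringIO(
    "INFO - [a.1] -triggered off []\n"
    "INFO - some unrelated message\n"
)
missing, unexpected = compare_triggers(ref_log, test_log)
```

Only the matched trigger records are kept in memory, so lines that are irrelevant to the comparison are discarded as they are read.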

To follow on after this PR:

- Look at any battery test files that take too long to run, and find ways to reduce their run time.
- Look at any battery test files that can be converted to unit tests.
- Modify the remaining test files (shell scripts) to pass shellcheck in normal mode.

Requirements check-list

I have read CONTRIBUTING.md and added my name as a Code Contributor.
Contains logically grouped changes (else tidy your branch by rebase).
Does not contain off-topic changes (use other PRs for other changes).

Appropriate tests are included (unit and/or functional).
Already covered by existing tests.

Appropriate change log entry included.

(master branch) I have opened a documentation PR: cylc-doc#55 (Remove redundant reference test related settings).

matthewrmshin · 2019-08-13T21:23:00Z

While forward-porting #3288, I discovered several flaws in various tests. All fixed now.

matthewrmshin · 2019-08-14T22:01:15Z

Sorry for the large number of files changed. I'll squash the branch later, but I want to get the test to pass first. Hopefully, it will now pass on a single attempt on Travis CI. (It has been doing that in my environment in my latest attempts.) 🤞

hjoliver · 2019-08-14T22:38:11Z

One test failed, damn it 😠

matthewrmshin · 2019-08-14T22:49:11Z

Should have moved those tests to flaky, but I haven't seen them fail for quite a while now. Unfortunately, I've just got 2 other failures in the test running in my environment. Usual suspects. I'll take a look at them tomorrow as well.

matthewrmshin · 2019-08-22T15:20:25Z

I have done enough for the PR. Would appreciate a quick review so I can move on to something else.

Travis CI passes, but there is no sign of Codacy and CodeCov checks for some reason. (Perhaps due to the earlier GitHub outage?)

matthewrmshin · 2019-08-23T09:58:42Z

Re-based.

matthewrmshin · 2019-08-23T10:03:48Z

A happy one-try-pass in my environment this morning 🌞 running:

./etc/bin/run-functional-tests.sh -j 12 ./tests/ && ./etc/bin/run-functional-tests.sh -j 1 ./flakytests/

Let's hope Travis CI is as happy.

hjoliver · 2019-08-23T12:11:05Z

> I have done enough for the PR. Would appreciate a quick review so I can move on to something else.
>
> Travis CI passes, but there is no sign of Codacy and CodeCov checks for some reason. (Perhaps due to the earlier GitHub outage?)

I'll get on to this ASAP; I thought the reload bug fix was higher priority today.

hjoliver · 2019-08-25T10:54:27Z

cylc/flow/task_events_mgr.py

@@ -99,7 +99,7 @@ def log_task_job_activity(ctx, suite, point, name, submit_num=None):
        LOG.debug(ctx_str)


-class TaskEventsManager(object):
+class TaskEventsManager():


Python 3 class definition syntax: the official docs actually omit the parentheses here (if no explicit inheritance). Doesn't really matter though.

Still interesting to know, had no idea it worked (actually quickly tested in a terminal here 😬 )
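
For reference, a quick check of the point discussed in this thread. The class names below are made up for the example; the sketch only shows that the three spellings produce the same kind of class in Python 3.

```python
class Implicit:
    """No parentheses: the form used in the official Python 3 docs."""


class EmptyParens():
    """Empty parentheses: valid, and equivalent in Python 3."""


class ExplicitObject(object):
    """Explicit object base: only makes a difference in Python 2."""


for cls in (Implicit, EmptyParens, ExplicitObject):
    # Every Python 3 class without explicit bases inherits from object.
    assert cls.__bases__ == (object,)
```

All three definitions end up with object as their only base, so the choice is purely stylistic in Python 3.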

hjoliver

LGTM 🎉

hjoliver · 2019-08-25T11:26:36Z

(@kinow - as 2nd reviewer, no need to spend too much time going over this; it's mostly "just tests" and pretty clearly a nice improvement).

kinow · 2019-08-25T11:32:41Z

Oh, looking forward to some more stability in the functional tests now. Thanks @matthewrmshin !!!

matthewrmshin added this to the cylc-8.0a2 milestone Aug 12, 2019

matthewrmshin self-assigned this Aug 12, 2019

matthewrmshin force-pushed the reftest branch from 634b597 to a25d57c Compare August 12, 2019 09:44

cylc deleted a comment Aug 12, 2019

matthewrmshin mentioned this pull request Aug 12, 2019

Remove redundant reference test related settings cylc/cylc-doc#55

Merged

matthewrmshin force-pushed the reftest branch from a25d57c to 9902012 Compare August 12, 2019 16:10

cylc deleted a comment Aug 12, 2019

matthewrmshin force-pushed the reftest branch from 9902012 to 875b74e Compare August 12, 2019 17:43

cylc deleted a comment Aug 12, 2019

matthewrmshin force-pushed the reftest branch from 875b74e to 5be60e1 Compare August 12, 2019 21:16

cylc deleted a comment Aug 12, 2019

matthewrmshin mentioned this pull request Aug 13, 2019

Test header: poll abort on timeout. #3288

Merged

6 tasks

cylc deleted a comment Aug 13, 2019

matthewrmshin force-pushed the reftest branch from 665df03 to ae964d2 Compare August 13, 2019 22:31

cylc deleted a comment Aug 13, 2019

hjoliver mentioned this pull request Aug 14, 2019

Fix cmp_ok arg order in a test (master). #3289

Merged

6 tasks

cylc deleted a comment Aug 14, 2019

matthewrmshin force-pushed the reftest branch 2 times, most recently from d9e802a to 1cd52ce Compare August 14, 2019 14:21

cylc deleted a comment Aug 14, 2019

matthewrmshin marked this pull request as ready for review August 14, 2019 21:55

cylc deleted a comment Aug 14, 2019

matthewrmshin requested a review from hjoliver August 22, 2019 13:10

cylc deleted a comment Aug 22, 2019

matthewrmshin force-pushed the reftest branch from e907ae3 to 703b63a Compare August 23, 2019 09:58

cylc deleted a comment Aug 23, 2019

matthewrmshin force-pushed the reftest branch from 703b63a to 75aac20 Compare August 23, 2019 11:00

cylc deleted a comment Aug 23, 2019

matthewrmshin force-pushed the reftest branch 2 times, most recently from 8265269 to 364daf9 Compare August 23, 2019 15:02

matthewrmshin force-pushed the reftest branch from 364daf9 to f95d7e3 Compare August 23, 2019 15:58

kinow mentioned this pull request Aug 24, 2019

Update copyright notices for setup (& other) files #3310

Merged

6 tasks

hjoliver reviewed Aug 25, 2019

cylc deleted a comment Aug 25, 2019

hjoliver approved these changes Aug 25, 2019

kinow approved these changes Aug 25, 2019

kinow merged commit 283bda1 into cylc:master Aug 25, 2019

hjoliver mentioned this pull request Aug 25, 2019

Remove obsolete command category "hook". #3311

Merged

kinow mentioned this pull request Aug 25, 2019

test battery: flaky tests #2894

Closed

matthewrmshin deleted the reftest branch August 27, 2019 08:30

matthewrmshin modified the milestones: cylc-8.0a2, cylc-8.0a1 Aug 27, 2019

matthewrmshin mentioned this pull request Aug 28, 2019

Remove reference test functionality from cylc. #830

Open

matthewrmshin mentioned this pull request Sep 13, 2019

Unify global and local configs. #3348

Closed

dpmatthews mentioned this pull request Aug 13, 2021

Task fails to retry correctly due to polling #3460

Closed
