feat: Lighten end_to_end.yml workflow #1243

maximearmstrong · 2022-08-18T20:38:50Z

Summary:

This PR lightens the end-to-end workflow. It concentrates the end-to-end workflow efforts in a single file and samples GTFS Schedule sources from the Mobility Database catalogs. This way, each time the workflow is run, the snapshot validator is tested using different sources.

Using the latest URLs from the Mobility Database fixes issue #1206, since the SSL error was raised when using agency or transitfeeds URLs.

Changes:

The end_to_end_big.yml and end_to_end_100.yml workflows are removed so that everything about the end-to-end workflow happens in end_to_end.yml.
end_to_end.yml is refactored to mimic the behavior of the acceptance_test.yml workflow. The artifacts are named similarly to those in the acceptance test workflow.
harvest_latest_versions.py is updated to allow sampling. 5% of the GTFS Schedule sources in the database are used for the end-to-end workflow (excluding sources requiring authentication for simplicity).
queue_runner.sh is udpated to add a flag. It the flag is set to true, the queue runner will validate the datasets using both the snapshot and master validators, otherwise only the snapshot one.

Expected behavior:

The end-to-end workflow is run on each commit and samples different GTFS Schedule sources from the Mobility Database each time.

Please make sure these boxes are checked before submitting your pull request - thanks!

Run the unit tests with gradle test to make sure you didn't break anything
Format the title like "feat: [new feature short description]". Title must follow the Conventional Commit Specification(https://www.conventionalcommits.org/en/v1.0.0/).
Linked all relevant issues
~~Include screenshot(s) showing how this pull request works and fixes the issue(s)~~

github-actions · 2022-08-18T21:23:53Z

Thank you for this contribution! 🍰✨🦄

Information about source corruption

1 out of 1347 sources are corrupted.
The following sources are corrupted:

no-unknown-agder-kollektivtrafikk-as-gtfs-1078

Acceptance test details

The changes in this pull request did not trigger any new errors on known GTFS datasets from the MobilityDatabase.
Download the full acceptance test report for commit bdb3361 here (report will disappear after 90 days).

…obilityData/gtfs-validator into issue/1233/lighten-end-to-end-workflow

github-actions · 2022-08-18T22:01:25Z

Thank you for this contribution! 🍰✨🦄

Information about source corruption

1 out of 1346 sources are corrupted.
The following sources are corrupted:

ar-buenos-aires-colectivos-buenos-aires-gtfs-1220

Acceptance test details

The changes in this pull request did not trigger any new errors on known GTFS datasets from the MobilityDatabase.
Download the full acceptance test report for commit 86ec7b9 here (report will disappear after 90 days).

bdferris-v2 · 2022-08-23T00:20:30Z

One high-level comment: am I reading the code correctly that a different sub-set of feeds will be selected with each run? I feel like that could be problematic. For example, if a particular feed causes a PR to fail, you wouldn't necessarily get the same feed on the next run, making it hard to repro. I think I'd be more in favor of a consistent set of feeds for each run.

maximearmstrong · 2022-08-23T22:24:47Z

@bdferris-v2 Our original idea was that we would cover more feeds this way, with different feeds being tested at each commit, but you bring a very good point. I changed the behaviour and added a consistent set of 50 feeds.

@emmambd Do you think the selection makes sense? I've selected feeds from various locations, some with features, some aggregated. Let me know if you think more or other feeds should be added.

github-actions · 2022-08-23T22:39:24Z

Thank you for this contribution! 🍰✨🦄

Information about source corruption

1 out of 1347 sources are corrupted.
The following sources are corrupted:

ar-buenos-aires-colectivos-buenos-aires-gtfs-1220

Acceptance test details

The changes in this pull request did not trigger any new errors on known GTFS datasets from the MobilityDatabase.
Download the full acceptance test report for commit 233a3bb here (report will disappear after 90 days).

isabelle-dr · 2022-08-24T11:13:41Z

@maximearmstrong do we need to update (END_TO_END.md)[https://github.com/MobilityData/gtfs-validator/blob/master/docs/END_TO_END.md]?

emmambd · 2022-08-24T14:01:36Z

Do you think the selection makes sense? I've selected feeds from various locations, some with features, some aggregated. Let me know if you think more or other feeds should be added.

@maximearmstrong I don't see any feed types missing so this list looks good to me! My one thought/consideration might be to look at end-to-end test failures in the past and see which feeds it failed on. That would be a last "check" to see if there are any other feed types that would be valuable to include.

maximearmstrong · 2022-08-24T14:43:31Z

@maximearmstrong do we need to update (END_TO_END.md)[https://github.com/MobilityData/gtfs-validator/blob/master/docs/END_TO_END.md]?

@isabelle-dr Sure, I wasn't aware of that file. Its content is outdated - I suggest we delete it. If you think there's value in having our workflows documented, we can add WORKFLOWS.md in another PR to explain our workflows, including test_package_doc.yml and end_to_end.yml. Would it be a good plan?

scripts/queue_runner.sh

scripts/mobility-database-harvester/harvest_latest_versions.py

github-actions · 2022-08-29T20:46:28Z

✅ Rule acceptance tests passed.
New Errors: 0 out of 1345 datasets (~0%) are invalid due to code change, which is less than the provided threshold of 1%.
Dropped Errors: 0 out of 1345 datasets (~0%) are invalid due to code change, which is less than the provided threshold of 1%.
0 out of 1345 sources (~0 %) are corrupted.
Commit: bd765ab
Download the full acceptance test report here (report will disappear after 90 days).
✅ Rule acceptance tests passed.

isabelle-dr · 2022-08-29T21:08:31Z

@maximearmstrong sounds good, that's a good plan!

…obilityData/gtfs-validator into issue/1233/lighten-end-to-end-workflow

github-actions · 2022-08-29T22:25:41Z

✅ Rule acceptance tests passed.
New Errors: 0 out of 1346 datasets (~0%) are invalid due to code change, which is less than the provided threshold of 1%.
Dropped Errors: 0 out of 1346 datasets (~0%) are invalid due to code change, which is less than the provided threshold of 1%.
0 out of 1346 sources (~0 %) are corrupted.
Commit: 1475ba9
Download the full acceptance test report here (report will disappear after 90 days).
✅ Rule acceptance tests passed.

maximearmstrong added 5 commits August 18, 2022 16:37

Refactors end_to_end workflow

6162a42

Fixes end_to_end workflow

4ccb5ef

Fixes harvest_latest_versions.py

906da27

Fixes harvest_latest_versions.py

38ac351

Fixes harvest_latest_versions.py

8cfd313

maximearmstrong self-assigned this Aug 18, 2022

maximearmstrong marked this pull request as ready for review August 18, 2022 21:26

Merge branch 'master' into issue/1233/lighten-end-to-end-workflow

b472be4

maximearmstrong requested a review from bdferris-v2 August 18, 2022 21:26

maximearmstrong added 3 commits August 18, 2022 17:30

Removes end_to_end_big.yml and end_to_end_100.yml

3c4f348

Merge branch 'master' into issue/1233/lighten-end-to-end-workflow

5c4e25e

Merge branch 'issue/1233/lighten-end-to-end-workflow' of github.com:M…

cbe57ad

…obilityData/gtfs-validator into issue/1233/lighten-end-to-end-workflow

maximearmstrong mentioned this pull request Aug 18, 2022

feat: Refactor the output comparator to consider dropped errors #1238

Merged

3 tasks

maximearmstrong requested a review from isabelle-dr August 22, 2022 17:20

Adds consistent set of feeds as sample

f32dc37

bdferris-v2 reviewed Aug 26, 2022

View reviewed changes

scripts/queue_runner.sh Show resolved Hide resolved

bdferris-v2 reviewed Aug 26, 2022

View reviewed changes

scripts/mobility-database-harvester/harvest_latest_versions.py Outdated Show resolved Hide resolved

maximearmstrong and others added 3 commits August 29, 2022 16:21

Removes random from harvest_latest_versions.py

f9948af

Updates master flag in queue_runner.sh

49c7dda

Merge branch 'master' into issue/1233/lighten-end-to-end-workflow

3207e88

maximearmstrong added 3 commits August 29, 2022 17:28

Removes docs/END_TO_END.md

75a1fa5

Merge branch 'master' into issue/1233/lighten-end-to-end-workflow

750eddf

Merge branch 'issue/1233/lighten-end-to-end-workflow' of github.com:M…

2b1e9f5

…obilityData/gtfs-validator into issue/1233/lighten-end-to-end-workflow

bdferris-v2 approved these changes Aug 29, 2022

View reviewed changes

maximearmstrong merged commit 0a22069 into master Aug 30, 2022

maximearmstrong deleted the issue/1233/lighten-end-to-end-workflow branch August 30, 2022 22:26

maximearmstrong mentioned this pull request Aug 30, 2022

Document workflows in a new WORKFLOW.md file #1247

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Lighten end_to_end.yml workflow #1243

feat: Lighten end_to_end.yml workflow #1243

maximearmstrong commented Aug 18, 2022 •

edited

Loading

github-actions bot commented Aug 18, 2022

github-actions bot commented Aug 18, 2022

bdferris-v2 commented Aug 23, 2022

maximearmstrong commented Aug 23, 2022

github-actions bot commented Aug 23, 2022

isabelle-dr commented Aug 24, 2022

emmambd commented Aug 24, 2022

maximearmstrong commented Aug 24, 2022 •

edited

Loading

github-actions bot commented Aug 29, 2022

isabelle-dr commented Aug 29, 2022

github-actions bot commented Aug 29, 2022

feat: Lighten end_to_end.yml workflow #1243

feat: Lighten end_to_end.yml workflow #1243

Conversation

maximearmstrong commented Aug 18, 2022 • edited Loading

github-actions bot commented Aug 18, 2022

Information about source corruption

Acceptance test details

github-actions bot commented Aug 18, 2022

Information about source corruption

Acceptance test details

bdferris-v2 commented Aug 23, 2022

maximearmstrong commented Aug 23, 2022

github-actions bot commented Aug 23, 2022

Information about source corruption

Acceptance test details

isabelle-dr commented Aug 24, 2022

emmambd commented Aug 24, 2022

maximearmstrong commented Aug 24, 2022 • edited Loading

github-actions bot commented Aug 29, 2022

isabelle-dr commented Aug 29, 2022

github-actions bot commented Aug 29, 2022

maximearmstrong commented Aug 18, 2022 •

edited

Loading

maximearmstrong commented Aug 24, 2022 •

edited

Loading