fix(tests): fix integration test filter with dynamic job determination #589

njbrake · 2025-10-30T14:37:22Z

Description

Claude Code helped me with this refactor and the nice explanation. I've never used the github 'actions' folder before but this seems like a nice way to organize the logic to avoid the workflow file blowing up. I also opted to put it into the action.yml instead of elevating it to a separate shell script because it seemed like logic that was only relevant for github actions, and wouldn't be used by people outside of that env.

Problem

The GitHub Actions workflow filtering logic was causing integration tests to be skipped incorrectly when using the filter input parameter with workflow_dispatch.

Root Cause

The workflow used simple substring matching with contains() to determine whether jobs should run:

if: github.event.inputs.filter == '' || contains(needs.expected-providers.outputs.providers, github.event.inputs.filter)

This approach had a critical issue: test function filters were broken. When passing pytest test filters like "test_completion" or "test_streaming", the jobs would be skipped entirely because these strings don't exist in the provider list. The jobs would skip before pytest ever had a chance to run.

The workflow conflated two different types of filtering:

Job-level: "Should we set up infrastructure for these providers?"
pytest-level: "Which specific test functions should run?"

Example Failure Scenario

# User wants to run all completion tests across all providers
gh workflow run tests-integration.yaml --field filter="test_completion"

Expected: Both test jobs run, pytest filters to only completion tests
Actual: Both jobs skip because contains("anthropic,bedrock,...,minimax", "test_completion") returns false

Solution

Created a reusable composite action (.github/actions/determine-jobs) that intelligently analyzes the filter to determine which test jobs should run:

Provider matching: Checks if the filter contains any provider name from either the integration or local provider lists
Smart fallback: If the filter doesn't match any provider name, assumes it's a test function filter and runs all relevant jobs, letting pytest handle the filtering
- "test_completion" → runs both jobs, pytest filters the tests ✅
- "test_streaming" → runs both jobs, pytest filters the tests ✅
Provider-specific optimization: When a provider is explicitly named, only the relevant job runs
- "minimax" → only runs integration tests job ✅
- "ollama" → only runs local tests job ✅

Testing Scenarios

Filter Input	Integration Job	Local Job	Behavior
(empty)	✅ Runs	✅ Runs	All tests
`minimax`	✅ Runs	❌ Skips	Only minimax tests
`ollama`	❌ Skips	✅ Runs	Only ollama tests
`test_completion`	✅ Runs	✅ Runs	All completion tests
`test_streaming and minimax`	✅ Runs	❌ Skips	Minimax streaming tests
`llama`	✅ Runs	❌ Skips	All llama provider tests

Changes

Created .github/actions/determine-jobs/action.yml - Composite action with smart filter analysis logic
Updated determine-jobs-to-run job to use the new composite action
Updated run-integration-tests job condition (already correctly configured)
Updated run-local-integration-tests job condition (already correctly configured)

Benefits

✅ pytest test function filters now work correctly
✅ Provider name filters work correctly
✅ Jobs only run when they need to, saving CI resources
✅ Logic is extracted into a reusable, testable composite action
✅ Workflow YAML is cleaner and easier to maintain
✅ The workflow is more intuitive and matches user expectations

PR Type

🐛 Bug Fix ## Relevant issues

Checklist

I have added unit tests that prove my fix/feature works
New and existing tests pass locally
Documentation was updated where necessary
I have read and followed the contribution guidelines```

codecov · 2025-10-30T14:38:15Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
see 39 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

njbrake added 2 commits October 30, 2025 10:29

fix(tests): fix integration test filter with dynamic job determination

0264153

reorg

53f5e37

njbrake requested a review from daavoo October 30, 2025 14:37

njbrake and others added 5 commits October 30, 2025 10:43

clean exit

d1c5bfd

allow no tests

04e92c5

more retries for slower init

0647506

lint

041226e

Merge branch 'main' into brake/github_test_logic

34dc2ca

njbrake merged commit 6f563d7 into main Oct 30, 2025
10 checks passed

njbrake deleted the brake/github_test_logic branch October 30, 2025 15:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(tests): fix integration test filter with dynamic job determination #589

fix(tests): fix integration test filter with dynamic job determination #589

Uh oh!

njbrake commented Oct 30, 2025

Uh oh!

codecov bot commented Oct 30, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fix(tests): fix integration test filter with dynamic job determination #589

fix(tests): fix integration test filter with dynamic job determination #589

Uh oh!

Conversation

njbrake commented Oct 30, 2025

Description

Problem

Root Cause

Example Failure Scenario

Solution

Testing Scenarios

Changes

Benefits

PR Type

Checklist

Uh oh!

codecov bot commented Oct 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov bot commented Oct 30, 2025 •

edited

Loading