LEADS-247: add lightspeed evaluation e2e tests #1313
radofuchs merged 1 commit into lightspeed-core:main
Conversation
Walkthrough

A new GitHub Actions workflow is added to run Lightspeed Evaluation end-to-end tests: it checks out the repositories, loads the lightspeed-evaluation config, starts services via Docker Compose, waits for the LSC service, runs the test commands, exposes logs on failure, and performs cleanup.

Changes
Sequence Diagram(s)

```mermaid
sequenceDiagram
    rect rgba(220, 230, 241, 0.5)
        participant GH as GitHub Actions
    end
    rect rgba(200, 240, 200, 0.5)
        participant Repo as Repository Checkout
        participant Eval as lightspeed-evaluation
    end
    rect rgba(250, 230, 200, 0.5)
        participant Docker as Docker Compose
        participant LSC as LSC Service (port 8080)
        participant Tests as Test Runner
    end
    GH->>Repo: checkout repository (handle PR_TARGET)
    GH->>Eval: fetch & load lightspeed-evaluation config
    GH->>Docker: start services (docker-compose-library.yaml)
    Docker->>LSC: bring up LSC service
    GH->>LSC: wait for health / port 8080 readiness
    GH->>Tests: run e2e steps (install, sync, make e2e_tests_lcore)
    Tests-->>GH: report results
    alt failure
        GH->>Docker: collect container logs
    end
    GH->>Docker: cleanup containers and images (local dev step)
```
Estimated code review effort: 🎯 2 (Simple) | ⏱️ ~12 minutes
Pre-merge checks: ✅ 3 passed
0fda973 to 525d625
Actionable comments posted: 3
🧹 Nitpick comments (2)
.github/workflows/e2e_tests_lightspeed_evaluation.yaml (2)
103-117: Consider reducing log verbosity during polling.

Showing 30 lines of logs on every 2-second iteration (line 111) produces very verbose output. Consider only showing logs on the final timeout failure, or reducing the frequency. Also, adding --silent to curl would suppress the progress meter.

♻️ Suggested improvement
```diff
      - name: Wait for the LSC
        run: |
          echo "Waiting for service on port 8080..."
          for i in {1..30}; do
-           if curl --output /dev/null --fail http://localhost:8080/v1/models ; then
+           if curl --silent --output /dev/null --fail http://localhost:8080/v1/models ; then
              echo "Service is up!"
              exit 0
            fi
-           docker compose -f docker-compose-library.yaml logs --tail=30
+           # Show logs less frequently to reduce noise
+           if [ $((i % 5)) -eq 0 ]; then
+             docker compose -f docker-compose-library.yaml logs --tail=10
+           fi
            echo "Still waiting..."
            sleep 2
          done
          echo "Service did not start in time"
+         docker compose -f docker-compose-library.yaml logs --tail=50
          exit 1
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In @.github/workflows/e2e_tests_lightspeed_evaluation.yaml around lines 103 - 117, The polling loop in the "Wait for the LSC" step is overly noisy because it runs curl without --silent and prints `docker compose -f docker-compose-library.yaml logs --tail=30` on every iteration; change the curl call to include `--silent` and move or gate the docker compose logs so they only print on the final timeout (or at a reduced frequency, e.g., every 5th iteration) instead of every 2 seconds, updating the existing loop that checks `curl http://localhost:8080/v1/models` and the `docker compose ... logs --tail=30` invocation accordingly.
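As a standalone illustration of the log-gating idea from this comment, a readiness loop that only prints status on every 5th attempt might look like the sketch below. The `wait_for` function name and its `probe_cmd`/`max`/`delay` parameters are hypothetical, not taken from the workflow; the probe is abstracted into a command so the pattern is easy to try without a running service.

```shell
#!/bin/sh
# Hypothetical sketch of a quieter readiness loop (names are illustrative).
wait_for() {
  probe_cmd="$1"   # command that succeeds once the service is ready
  max="$2"         # maximum number of attempts
  delay="$3"       # seconds to sleep between attempts
  i=1
  while [ "$i" -le "$max" ]; do
    if sh -c "$probe_cmd" >/dev/null 2>&1; then
      echo "up after $i attempt(s)"
      return 0
    fi
    # Gate the noisy status output: print only on every 5th attempt
    if [ $((i % 5)) -eq 0 ]; then
      echo "still waiting (attempt $i)"
    fi
    sleep "$delay"
    i=$((i + 1))
  done
  echo "timed out"
  return 1
}

wait_for "true" 3 0                     # probe succeeds immediately
wait_for "false" 6 0 || echo "gave up"  # probe never succeeds: one gated status line, then timeout
```

In the real workflow the probe would be the `curl --silent --output /dev/null --fail` call against the service URL, with a non-zero delay.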
6-15: Consider adding concurrency control to prevent parallel runs.

Multiple workflow runs on the same PR (e.g., rapid pushes) could interfere with each other. Adding a concurrency group would cancel in-progress runs when a new one starts.
♻️ Suggested improvement
```diff
jobs:
  e2e_tests:
    runs-on: ubuntu-latest
+   concurrency:
+     group: e2e-lightspeed-${{ github.head_ref || github.ref }}
+     cancel-in-progress: true
    strategy:
      fail-fast: false
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In @.github/workflows/e2e_tests_lightspeed_evaluation.yaml around lines 6 - 15, The workflow lacks concurrency control which can cause overlapping runs for the same PR; add a concurrency section at the top-level of the workflow (near jobs or the existing top keys) to define a group that uniquely identifies the PR/run (e.g., use github.ref or github.head_ref) and set cancel-in-progress: true so the e2e_tests job (name: "E2E Tests for Lightspeed Evaluation job", job id: e2e_tests) will cancel any in-flight run when a new run for the same PR starts.
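Since the AI prompt suggests a workflow-level `concurrency` section rather than the job-level placement in the diff above, a top-level variant could look like the fragment below. The `concurrency` and `cancel-in-progress` keys are standard GitHub Actions syntax; the group name is an illustrative choice, not taken from the workflow.

```yaml
# Top level of the workflow file, alongside `on:` and `jobs:`
concurrency:
  group: e2e-lightspeed-${{ github.head_ref || github.ref }}
  cancel-in-progress: true
```

`github.head_ref` is set on pull_request events, so the `|| github.ref` fallback keeps the group well-defined on push events too.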
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In @.github/workflows/e2e_tests_lightspeed_evaluation.yaml:
- Line 1: Update the top-line comment to reference the actual file name: change the commented filename string "# .github/workflows/e2e_tests.yml" to "# .github/workflows/e2e_tests_lightspeed_evaluation.yaml" so the comment matches the file name (ensure the exact string e2e_tests_lightspeed_evaluation.yaml is used).
- Around line 30-32: The workflow uses github.event.pull_request.head.ref (a branch name) when constructing the ref for checkout, which is not a reproducible commit; replace github.event.pull_request.head.ref with github.event.pull_request.head.sha so the ref becomes github.event.pull_request.head.sha || github.sha, guaranteeing the workflow checks out the exact PR commit SHA (apply this where ref: ${{ ... }} is set).
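A hedged sketch of what the suggested checkout step could look like. The step name and `actions/checkout` version are assumptions for illustration; only the `ref` expression comes from the comment above.

```yaml
- name: Checkout repository
  uses: actions/checkout@v4
  with:
    # Pin to the exact PR head commit; fall back to the push SHA
    ref: ${{ github.event.pull_request.head.sha || github.sha }}
```

Pinning to the SHA means a late push to the PR branch cannot change what an already-started run tests.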
- Around line 141-148: The workflow uses an invalid command string "docker compose down lightspeed-stack --rmi \"all\""; fix it by removing the service name (down takes no services) and either call "docker compose down --rmi all" to tear down the entire composition or, if you intended to stop and remove only the lightspeed-stack service, run "docker compose stop lightspeed-stack" followed by "docker compose rm -f lightspeed-stack"; ensure any non-default compose file is passed with -f if needed.
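The full-teardown variant from the comment above could be written as a cleanup step like the fragment below. The step name and `if: always()` guard are assumptions (a common pattern so cleanup runs even after failures), not taken from the workflow.

```yaml
- name: Cleanup
  if: always()
  run: |
    # `down` takes no service arguments; this tears down the whole
    # composition and removes the images it used
    docker compose -f docker-compose-library.yaml down --rmi all
```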
ℹ️ Review info
⚙️ Run configuration
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
Run ID: 424b6ec4-d9a7-4ae9-b414-78e6246a4c79
📒 Files selected for processing (1)
.github/workflows/e2e_tests_lightspeed_evaluation.yaml
525d625 to f6a7c52
Description
add lightspeed evaluation e2e tests
Type of change
Tools used to create PR
Identify any AI code assistants used in this PR (for transparency and review context)
Related Tickets & Documents
Checklist before requesting a review
Summary by CodeRabbit
Chores
Tests