
chore(api): Speed up stackup snapshot tests #18700


Merged · 6 commits · Jun 23, 2025

Conversation

@sfoster1 (Member) commented Jun 20, 2025

@SyntaxColoring and @mjhuff added these extremely sick snapshot tests for labware geometry. The only downside was that they took forever, but they're awesome and a solid basis for confidently modifying and improving that logic in the future. By parallelizing them - which was really easy since they were already doing subprocess stuff, so there was no shared logic to break - we can cut the test execution time down to something reasonable to run in your inner loop.

Also, there's not much point to this being in pytest, since we're not taking advantage of anything pytest can do while everything's in one big test (which it has to be to take advantage of the multiprocessing). So pop it out into a new little integration-testing place and make it a command-line application that can now do filtering and such.

To come out of draft:

  • test that the filtering stuff works as well as it ought to
  • fix the log and stdout/stderr spam from the console
  • do something about running out of file descriptors lol
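The parallel fan-out described above can be sketched roughly like this (a minimal, hypothetical sketch, not the actual test code: `run_case` stands in for whatever invokes one stackup check in a subprocess, and the case names are made up):

```python
from concurrent.futures import ProcessPoolExecutor, as_completed


def run_case(case_name: str) -> tuple[str, bool]:
    # Stand-in for running one stackup snapshot check. The real test
    # exercises labware-geometry logic; here every case trivially passes.
    return case_name, True


def run_all(cases: list[str], max_workers=None) -> dict[str, bool]:
    """Fan the cases out across worker processes and gather the results.

    Because each case is independent (no shared state between checks),
    ProcessPoolExecutor can run them concurrently without extra locking.
    """
    results: dict[str, bool] = {}
    with ProcessPoolExecutor(max_workers=max_workers) as executor:
        futures = {executor.submit(run_case, case): case for case in cases}
        for future in as_completed(futures):
            name, ok = future.result()
            results[name] = ok
    return results


if __name__ == "__main__":
    outcomes = run_all([f"stackup-{i}" for i in range(8)])
    passed = sum(outcomes.values())
    print(f"Successful: {passed}, Errors: {len(outcomes) - passed}")
```

Bounding `max_workers` is also one plausible lever for the "running out of file descriptors" item above, since it caps how many subprocesses (and their pipes) are open at once.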

@mjhuff (Contributor) left a comment

This is so great, thanks a lot for doing this. Kid you not, this will probably save at least 30 hours of running this locally over the next couple of months.

@@ -120,6 +120,8 @@ jobs:
run: make -C api test-cov
- name: Ensure assets build
run: make -C api sdist wheel
- name: Check labware stacking regression tests
Contributor:

Suggested change
- name: Check labware stacking regression tests
- name: Check stacking regression tests

Kind of a future-proofing nit, but since we'll probably add gripper stacking tests shortly, can we rephrase this?

Contributor:

Not that I know better, but is there a reason this testing lives here as opposed to something like api/tests/integration_testing, even if it's not managed by pytest?

Member Author:

It's just nice to have everything in api/tests/ owned by pytest, so you can run py.test tests without having to play weird directory-exclusion games.

Contributor:

I don't think it's worth addressing now (and would probably be worth thinking through more before committing to a change), but since this is so much faster than the current version, I wonder if we can do something like just pass tuples of all the labware/adapters for all their versions.

Member Author:

Yeah, we definitely should, especially with the ability to do filtering. But let's get this in first.

)

print(f"Processing {robot_info}...")
executor = ProcessPoolExecutor()
Contributor:

Ah, this is where the magic happens.

sfoster1 added 5 commits June 23, 2025 09:44
The labware stackup integration tests rule but were also very slow,
because they were running serially and there are 8,000 of them. By
making them run in parallel using
concurrent.futures.ProcessPoolExecutor, we can make it take like a
minute instead of 2 hours.
This was running under pytest but it wasn't really _using_ pytest, and
therefore didn't have the upsides like test collection and
parametrization to balance the downsides like it being annoying to add
command line flags. By making it a standalone application, we get all
that stuff.

While we're at it, refactor so it'll be easier to add new kinds of tests
that take advantage of the test spec generation and so on.
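The standalone-application shape described in the commit message above might look something like the following. This is a hedged sketch, not the actual runner: the flag name, the case names, and `select_cases` are all hypothetical, standing in for however the real tool generates test specs from labware definitions.

```python
import argparse
import fnmatch

# Hypothetical case names; the real tool derives its specs from
# labware definitions rather than a hardcoded list.
ALL_CASES = [
    "flex/tiprack-on-adapter",
    "flex/plate-on-module",
    "ot2/plate-on-plate",
]


def select_cases(pattern: str) -> list[str]:
    """Return only the test cases whose names match the glob pattern."""
    return [case for case in ALL_CASES if fnmatch.fnmatch(case, pattern)]


def main(argv=None) -> None:
    parser = argparse.ArgumentParser(
        description="Run stackup regression tests."
    )
    parser.add_argument(
        "--filter",
        default="*",
        help="glob pattern selecting which cases to run",
    )
    args = parser.parse_args(argv)
    for case in select_cases(args.filter):
        print(f"Processing {case}...")


if __name__ == "__main__":
    main()
```

This is the payoff of leaving pytest mentioned earlier: adding a flag like `--filter` is just another `add_argument` call, instead of fighting pytest's collection machinery from inside one big test.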
@sfoster1 sfoster1 force-pushed the speed-up-stackup-teasts branch from d4b3f06 to c42553f Compare June 23, 2025 13:45
@sfoster1 sfoster1 marked this pull request as ready for review June 23, 2025 13:46
@sfoster1 sfoster1 requested review from a team as code owners June 23, 2025 13:46

codecov bot commented Jun 23, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 24.75%. Comparing base (981511a) to head (c42553f).
Report is 3 commits behind head on edge.


@@           Coverage Diff           @@
##             edge   #18700   +/-   ##
=======================================
  Coverage   24.75%   24.75%           
=======================================
  Files        3284     3284           
  Lines      285464   285416   -48     
  Branches    28663    28655    -8     
=======================================
  Hits        70662    70662           
+ Misses     214781   214733   -48     
  Partials       21       21           
Flag              | Coverage Δ
protocol-designer | 19.04% <ø> (+<0.01%) ⬆️
step-generation   |  5.25% <ø> (+<0.01%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.


@sfoster1 sfoster1 merged commit 1801cde into edge Jun 23, 2025
22 of 25 checks passed
@sfoster1 sfoster1 deleted the speed-up-stackup-teasts branch June 23, 2025 15:26
@ddcc4 (Collaborator) commented Jun 26, 2025

Hey, the stacking regression tests seem to take forever and then time out in CI (which makes it hard for me to get stuff merged into edge when I have to race against other people making changes).

Do you have an example of them ever completing successfully in CI? Even for this PR, they just time out after reaching

Processed 1490/7474 items. Successful: 209, Errors: 1281
Error: The operation was canceled.

https://github.com/Opentrons/opentrons/actions/runs/15826957756/job/44609936201?pr=18700
