chore: improve dal integration test performance #8332

Merged

zacharyhamm merged 2 commits into main from test-improvements

Jan 22, 2026

Contributor

zacharyhamm commented Jan 22, 2026

Two changes here:

First, this adds arguments to the dal test macro that enable or disable
specific "servers" per test. By default, only the rebaser and pinga are
started; you can enable the rest with:

enable_veritech
enable_edda
enable_forklift

Note that I have not enabled edda for any test, and all tests pass, so
it looks like we don't have any real integration testing for edda at the
moment.

The one you'll want most often is enable_veritech; be sure to set it on
any test that has to execute functions.

This took about 10-30 seconds off my test runs.

Second:

In read_wait_for_memory we spin, checking whether the requested object
has landed in the memory cache (via the events stream), and only then
look at disk and durable storage. This adds 2+ seconds of overhead to
many fetches that will not land in the memory cache in time, either
because they are for old objects that have been evicted or because we
don't propagate data across NATS fast enough.

Checking the normal read path before spinning gives a large speedup in
the integration tests, easily 100 seconds faster in CI, and will likely
have a positive impact throughout the stack.
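
To illustrate the reordering described above, here is a minimal, self-contained sketch. It is not the si-layer-cache implementation: the `LayerCache` struct, its fields, the `read` helper, the 5 ms poll interval, and the final fallback read after the spin are all assumptions made for illustration. The real code is async and is fed by an events stream, whereas this sketch uses a `Mutex`-guarded map that another thread could populate.

```rust
use std::collections::HashMap;
use std::sync::Mutex;
use std::thread::sleep;
use std::time::{Duration, Instant};

/// Hypothetical stand-in for a layered cache (names are illustrative only).
struct LayerCache {
    /// Memory tier; in a real system another task would insert entries here.
    memory: Mutex<HashMap<String, String>>,
    /// Disk and durable storage, collapsed into one map for this sketch.
    slow_tiers: HashMap<String, String>,
}

impl LayerCache {
    /// The "normal read path": memory first, then the slower tiers.
    fn read(&self, key: &str) -> Option<String> {
        let hit = self.memory.lock().unwrap().get(key).cloned();
        hit.or_else(|| self.slow_tiers.get(key).cloned())
    }

    /// Try the normal read path first and only spin on a miss.
    fn read_wait_for_memory(&self, key: &str, max_wait: Duration) -> Option<String> {
        // 1. Fast path: evicted or already-persisted objects return immediately.
        if let Some(value) = self.read(key) {
            return Some(value);
        }
        // 2. Only now wait for the object to land in the memory tier.
        let deadline = Instant::now() + max_wait;
        while Instant::now() < deadline {
            let hit = self.memory.lock().unwrap().get(key).cloned();
            if hit.is_some() {
                return hit;
            }
            sleep(Duration::from_millis(5));
        }
        // 3. Assumed final fallback in case the object was persisted meanwhile.
        self.read(key)
    }
}

fn main() {
    let mut slow = HashMap::new();
    slow.insert("old-object".to_string(), "from disk".to_string());
    let cache = LayerCache {
        memory: Mutex::new(HashMap::new()),
        slow_tiers: slow,
    };

    // The object is only on disk, so the fast path returns without spinning.
    let start = Instant::now();
    let value = cache.read_wait_for_memory("old-object", Duration::from_secs(2));
    println!("{value:?} after {:?}", start.elapsed());
}
```

With the old ordering, the lookup for `old-object` would have waited out the full spin before touching the slower tiers; with the fast path first, only true misses pay that cost.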

Contributor Author

zacharyhamm commented Jan 22, 2026

/try

github-actions bot commented Jan 22, 2026

Dependency Review

✅ No vulnerabilities or OpenSSF Scorecard issues found.

Scanned Files

None

github-actions bot added A-dal A-si-test-macros labels

github-actions bot commented Jan 22, 2026

Okay, starting a try! I'll update this comment once it's running...
🚀 Try running here! 🚀

zacharyhamm force-pushed the test-improvements branch from 51b7fa1 to 791eedf Compare

January 22, 2026 21:21

Contributor Author

zacharyhamm commented Jan 22, 2026

/try

github-actions bot commented Jan 22, 2026

Okay, starting a try! I'll update this comment once it's running...
🚀 Try running here! 🚀

github-actions bot added A-dal-test A-si-layer-cache labels

Contributor Author

zacharyhamm commented Jan 22, 2026

/try

github-actions bot commented Jan 22, 2026

Okay, starting a try! I'll update this comment once it's running...
🚀 Try running here! 🚀

zacharyhamm force-pushed the test-improvements branch from 7b1d46b to 58cda4a Compare

January 22, 2026 22:49

Contributor Author

zacharyhamm commented Jan 22, 2026

/try

github-actions bot commented Jan 22, 2026

Okay, starting a try! I'll update this comment once it's running...
🚀 Try running here! 🚀

zacharyhamm added 2 commits

January 22, 2026 16:51


          chore: only start required servers per test

436baa3


          chore: read_wait_for_memory should check disk/durable first

fd02731

zacharyhamm force-pushed the test-improvements branch from 58cda4a to fd02731 Compare

January 22, 2026 22:55

zacharyhamm changed the title ~~chore: only start required servers per test~~ chore: improve dal integration test performance

zacharyhamm marked this pull request as ready for review

January 22, 2026 22:57

Contributor Author

zacharyhamm commented Jan 22, 2026

/try

github-actions bot commented Jan 22, 2026

Okay, starting a try! I'll update this comment once it's running...
🚀 Try running here! 🚀

nickgerace approved these changes


lib/dal/tests/integration_test/action/schema_level.rs

    
    use pretty_assertions_sorted::assert_eq;

    -#[test]
    +#[test(enable_veritech)]

Contributor

nickgerace Jan 22, 2026

Okay I've wanted this idea for like ~3-4 years and I am SO happy to finally see it! Yes yes yes!

lib/si-test-macros/src/dal_test.rs

Comment on lines +48 to +63

    
    // Conditionally start servers based on macro arguments
    if args.should_start_server("forklift") {
        expander.setup_start_forklift_server();
    }
    if args.should_start_server("veritech") {
        expander.setup_start_veritech_server();
    }
    if args.should_start_server("pinga") {
        expander.setup_start_pinga_server();
    }
    if args.should_start_server("edda") {
        expander.setup_start_edda_server();
    }
    if args.should_start_server("rebaser") {
        expander.setup_start_rebaser_server();
    }

Contributor

nickgerace Jan 22, 2026

Ended up being simple! Nice.

lib/si-test-macros/src/lib.rs

Comment on lines +33 to +55

    
    const DEFAULT_SERVERS: &[&str] = &["rebaser", "pinga"];

    impl Args {
        /// Check if a specific server should be disabled
        pub(crate) fn should_skip_server(&self, server_name: &str) -> bool {
            let skip_ident = format!("skip_{server_name}");
            self.vars.iter().any(|v| v == &skip_ident)
        }

        pub(crate) fn should_enable_server(&self, server_name: &str) -> bool {
            let skip_ident = format!("enable_{server_name}");
            self.vars.iter().any(|v| v == &skip_ident)
        }

        /// Check if a specific server should be started
        pub(crate) fn should_start_server(&self, server_name: &str) -> bool {
            if DEFAULT_SERVERS.contains(&server_name) {
                !self.should_skip_server(server_name)
            } else {
                self.should_enable_server(server_name)
            }
        }
    }
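
As a rough way to see how these rules resolve for a given test, the following standalone sketch re-creates the decision outside the macro crate. The `Args::new` constructor, the `has_flag` helper, `ALL_SERVERS`, and `servers_to_start` are hypothetical names introduced here for illustration; the real `Args` is built by the proc macro from the attribute's arguments, and its `vars` may not be plain strings.

```rust
// Servers started unless explicitly skipped (taken from the diff above).
const DEFAULT_SERVERS: &[&str] = &["rebaser", "pinga"];
// Hypothetical list of every server the macro knows about, in setup order.
const ALL_SERVERS: &[&str] = &["forklift", "veritech", "pinga", "edda", "rebaser"];

/// Simplified stand-in for the macro's parsed arguments.
struct Args {
    vars: Vec<String>,
}

impl Args {
    /// Hypothetical constructor; the real macro parses attribute tokens.
    fn new(flags: &[&str]) -> Self {
        Self {
            vars: flags.iter().map(|f| f.to_string()).collect(),
        }
    }

    fn has_flag(&self, flag: &str) -> bool {
        self.vars.iter().any(|v| v == flag)
    }

    /// Default servers start unless skipped; all others must be enabled.
    fn should_start_server(&self, server_name: &str) -> bool {
        if DEFAULT_SERVERS.contains(&server_name) {
            !self.has_flag(&format!("skip_{server_name}"))
        } else {
            self.has_flag(&format!("enable_{server_name}"))
        }
    }
}

/// Collect the servers that would be started for a given set of flags.
fn servers_to_start(args: &Args) -> Vec<&'static str> {
    ALL_SERVERS
        .iter()
        .copied()
        .filter(|server| args.should_start_server(server))
        .collect()
}

fn main() {
    // No flags: only the defaults.
    println!("{:?}", servers_to_start(&Args::new(&[])));
    // A test that executes functions.
    println!("{:?}", servers_to_start(&Args::new(&["enable_veritech"])));
    // Skipping a default while enabling an opt-in server.
    println!("{:?}", servers_to_start(&Args::new(&["skip_pinga", "enable_edda"])));
}
```

Under these assumptions, a bare test attribute starts only pinga and the rebaser, and `enable_veritech` adds veritech on top of them.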

Contributor

nickgerace Jan 22, 2026

Ah this is it. I never took the time to get deep into the idea, but I'm glad it was relatively small in the end.

zacharyhamm added this pull request to the merge queue

Merged via the queue into main with commit 4dfc790

28 checks passed

zacharyhamm deleted the test-improvements branch

January 22, 2026 23:54


Labels

A-dal A-dal-test A-si-layer-cache A-si-test-macros