
Handle key errors gracefully in a static queue #272

Closed
wants to merge 1 commit into from

Conversation

zarifmahfuz
Contributor

The static queue executes tests during bisect, and specifying a non-existent test ID in the test order provided to the bisect command leads to unexpected errors and an incorrect outcome. We want to start bisecting test orders on main to reduce false positive outcomes. As a result, several test IDs in a flaky test order that was captured at an older commit could now be missing due to a changed test file path, a renamed test, or a deleted test. If we don't handle missing keys gracefully, bisect incorrectly thinks that the static queue came across a test failure, which leads to incorrect outcomes.
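To illustrate the idea (this is a minimal hypothetical sketch, not the actual queue implementation; the class and method names are made up), "handling key errors gracefully" means looking up each test ID with a nil-returning access and skipping stale IDs instead of raising a `KeyError` that bisect would misread as a failure:

```ruby
# Hypothetical sketch of a static queue that tolerates stale test IDs.
class StaticQueue
  def initialize(test_order, index)
    @test_order = test_order # Array of test IDs captured at an older commit
    @index = index           # Hash mapping test ID => runnable test
  end

  def each_test
    @test_order.each do |id|
      test = @index[id]      # Hash#[] returns nil for a missing key (no KeyError)
      if test.nil?
        warn "Skipping missing test ID: #{id}"
        next                 # skip instead of surfacing an error to bisect
      end
      yield test
    end
  end
end

queue = StaticQueue.new(["A#test_x", "B#test_gone"], { "A#test_x" => :run_a })
run = []
queue.each_test { |t| run << t }
# run == [:run_a] — the stale ID "B#test_gone" was skipped, not treated as a failure
```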

@zarifmahfuz zarifmahfuz requested a review from a team April 18, 2024 23:30
@casperisfine
Contributor

when we specify a non-existing test ID in the test order provided to the bisect command

But why do we do that?

Also, if there is a legitimate reason to provide non-existing IDs, then it's not the queue's responsibility to ignore these, but the runner's.

@ChrisBr
Contributor

ChrisBr commented Apr 19, 2024

But why do we do that?

Also if there is a legitimate reason to provide non-existing IDs, then it's not the queue's responsibility to ignore these, but the runner's.

I agree with Jean, I don't think we can or should just ignore key errors.

@ChrisBr
Contributor

ChrisBr commented Apr 19, 2024

IMO if a test id doesn't exist the bisect is invalid and can't be trusted.

@zarifmahfuz
Contributor Author

But why do we do that?

@casperisfine So there is usually some delay between when we consume a failing test and when we trigger a leakbot bisect on the test order that failed the test. This delay is 1-3 days because we have a lot of failing test orders to analyze and we cap leakbot bisects at 50 per hour. In those 1-3 days of delay, some test IDs could have changed due to a changed test file path, a renamed test, or a deleted test.
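A concrete illustration of why a rename breaks the captured order (the test names here are made up): Minitest-style IDs combine the runnable class and method name, so changing either one changes the ID.

```ruby
# A test ID is "ClassName#method_name"; a rename on main invalidates
# the ID that was captured in the flaky order days earlier.
old_id = "Checkout::CartTest#test_discount"            # captured at the old commit
loaded = ["Checkout::CartTest#test_discount_applied"]  # after a rename on main
puts loaded.include?(old_id)  # => false: the captured ID no longer resolves
```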

@zarifmahfuz
Contributor Author

IMO if a test id doesn't exist the bisect is invalid and can't be trusted.

Why would it be invalid? Bisect could still isolate a leak as long as the non-existent test ID doesn't raise a missing-test-ID error.

@zarifmahfuz
Contributor Author

zarifmahfuz commented Apr 19, 2024

@casperisfine to further advocate: we've been seeing a lot of false positives from bisects lately, and a large portion of them are due to the fact that we run the bisects on a branch that is older than the commit where a fix was made. Always running bisects on main would get rid of a large portion of the false positives that we've been experiencing.

We could add some alternate logic on the client side that compares the git commit where a bisect was captured against the commit where a potential fix could have been made to main, but that easily gets complicated and hard to get right.

@ChrisBr
Contributor

ChrisBr commented Apr 21, 2024

We could add some alternate logic on the client side that compares the git commit where a bisect was captured against the commit where a potential fix could have been made to main, but that easily gets complicated and hard to get right.

I don't think it needs to be that complicated, you can just alter the test order file.

# load_all_tests

if ENV["REMOVE_NOT_EXISTING_TESTS"]
  loaded_tests = Minitest::Test.runnables.flat_map do |runnable|
    runnable.runnable_methods.map do |method_name|
      "#{runnable}##{method_name}"
    end
  end

  tests_to_bisect = File.read("test_order.log").lines.map(&:strip) # plain Ruby; squish if ActiveSupport is loaded

  test_order = tests_to_bisect & loaded_tests

  puts "Removing #{tests_to_bisect.size - test_order.size} tests that no longer exist"

  File.write("test_artifacts/test_order.log", test_order.join("\n"))
end
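For illustration, here is a dependency-free sketch of the same intersection filtering, without Minitest or the log files (the test IDs are made up):

```ruby
# Array#& keeps the receiver's order and drops IDs absent from the argument,
# so stale test IDs fall out of the captured order.
loaded_tests = ["CartTest#test_total", "UserTest#test_name"]
captured     = ["CartTest#test_total", "OldTest#test_gone", "UserTest#test_name"]

test_order = captured & loaded_tests
puts "Removing #{captured.size - test_order.size} tests that no longer exist"
# prints "Removing 1 tests that no longer exist"
# test_order == ["CartTest#test_total", "UserTest#test_name"]
```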

@casperisfine
Contributor

In those 1-3 days of delay, some test IDs could have changed due to a changed test file path, a renamed test, or a deleted test.

It sounds to me that the solution should be for the leakbot build to use the same app revision as the failure report.

@ChrisBr
Contributor

ChrisBr commented Apr 22, 2024

It sounds to me that the solution should be for the leakbot build to use the same app revision as the failure report.

Yeah, we do that for the most part, but sometimes we want to check whether the same issue still exists on main.
