
Swap order of starting up RELAY/SOCKET processes in multi-master tests #66

Merged: 1 commit into tmm1:master, Dec 13, 2016

Conversation

mistydemeo (Contributor)

The tests should fail here since they're missing the fix from #65. cc @aroben

aroben (Collaborator) commented Dec 8, 2016

Hm, I put the tests in this order in 96b6712 on purpose. Maybe we need a second set of tests that boots the servers in the other order?

mistydemeo (Contributor, Author) commented Dec 8, 2016

After looking at it more in #65 (comment), I was going to say we should make sure we're checking the exit status/stdout of both processes, but sorry, that's exactly what the commit you linked addressed. So, yeah, we might want to add extra tests to cover both cases.

aroben (Collaborator) commented Dec 8, 2016

The only tricky thing with checking the exit code of both processes is that you don't know which master will run the failing test. If the central master runs the failing test, then the remote master will exit 0. If the remote master runs the failing test, the remote master will exit 1.

mistydemeo (Contributor, Author)

Modified to test in both orders.

export TEST_QUEUE_RELAY_TOKEN=$(date | cksum | cut -d' ' -f1)
TEST_QUEUE_RELAY=0.0.0.0:12345 bundle exec minitest-queue ./test/samples/sample_minitest5.rb || true &
TEST_QUEUE_RELAY=0.0.0.0:12345 bundle exec minitest-queue ./test/samples/sample_minitest5.rb 2>&1 | tee /tmp/out.txt || true &

Collaborator

Did you mean to leave this debug output in?

TEST_QUEUE_RELAY=0.0.0.0:12345 run bundle exec minitest-queue ./test/samples/sample_minitest5.rb
wait

assert_status 1

Collaborator

I think this will sometimes fail due to the issue I mentioned. There's no guarantee that the remote master will run the failing test. If it doesn't, its exit code will be 0 (and it won't contain any failure output). We need a way to ensure that the right master fails. Something like this might work inside a test:

def test_foo
  whoami =
    if ENV["TEST_QUEUE_SOCKET"]
      "central"
    elsif ENV["TEST_QUEUE_RELAY"]
      "remote"
    end
  refute ENV["MULTI_MASTER_FAIL"] == whoami
end

I.e., if you set MULTI_MASTER_FAIL=central, the central master will fail. If you set MULTI_MASTER_FAIL=remote, the remote master will fail. If you put this new test inside the MiniTestSleep suites then it will be highly likely to run in both masters and fail or pass appropriately.

mistydemeo (Contributor, Author)

Ben suggested an alternate approach: sleeping one of the masters to induce the desired one to pick up the failing test. Pushed that version.

@mistydemeo force-pushed the fix_relay_in_multi-master branch 2 times, most recently from 6ae459d to 87a836b on December 13, 2016 00:50
sleep 5
elsif ENV['SLEEP_AS_MASTER'] && !relay?
sleep 5
end

Collaborator

I don't think this will guarantee that a particular master's workers will run the failing suite. around_filter gets called inside each worker just after receiving the suite to run, so it seems like workers will be handed an initial suite to run and then will sleep. I.e., imagine the following execution when we're trying to get the remote master to run the failing suite:

  1. SLEEP_AS_MASTER=1
  2. Central master boots and spawns workers
  3. Central master hands the failing suite to one of its own workers
  4. Central master's workers sleep
  5. Remote master boots and connects to central master
  6. Central master hands all the rest of the suites to the remote master's workers
  7. Remote master's workers run those suites and remote master exits with success
  8. Central master's workers wake up and run their suites, including the failing one

For the sleeping to be effective I think it needs to happen before suites are assigned, i.e., before the workers send their first POP messages.

Collaborator

> For the sleeping to be effective I think it needs to happen before suites are assigned, i.e., before the workers send their first POP messages.

Nuts, this makes total sense. My bad.

Do we need to make changes to the runner itself to give us a place to inject this behavior for testing?

Collaborator

Putting the sleep in Runner#after_fork should do the trick I think.
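
For illustration, here is a minimal sketch of that suggestion, assuming test-queue's runner exposes an overridable after_fork(num) hook (run in each worker right after it forks, before it requests its first suite) and the relay? predicate used in the excerpt above. The subclass name is hypothetical, the SLEEP_AS_MASTER knob is reused from the excerpt, and in practice you would subclass the framework-specific runner; this is not the exact change that was merged.

require 'test_queue'

# Hypothetical runner subclass, only to illustrate where the sleep could go.
class SleepyRunner < TestQueue::Runner
  # Runs in each worker after the fork. Sleeping here delays the worker's
  # first POP, so the other master's workers are handed the suites first.
  def after_fork(num)
    sleep 5 if ENV['SLEEP_AS_MASTER'] && !relay?
    # A matching branch for the relay side (as in the excerpt above) would
    # let the harness choose which master holds back.
  end
end

Gating on an environment variable keeps the delay out of normal runs while letting the shell test script pick which master's workers wait.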

mistydemeo (Contributor, Author)

Good point. 😅 Swapped.

aroben (Collaborator) left a comment

This is great. Thank you!

aroben (Collaborator) commented Dec 13, 2016

CI is red, but that is what we want. #65 will fix it.

@aroben merged commit be472d8 into tmm1:master on Dec 13, 2016
@mistydemeo deleted the fix_relay_in_multi-master branch on December 13, 2016 17:53