Make ScmTimestamp's doc test more robust #1005

asomers · 2019-01-06T04:50:10Z

The old test made assumptions about the responsiveness of the test
environment. The new test does not, and it's simpler too.

asomers · 2019-01-06T04:50:29Z

cc @pusateri @Wolvereness

Wolvereness · 2019-01-06T07:54:11Z

You've missed one of the fundamental assumptions for whether or not this API is functional in the target environment. The point of this particular method of time-stamping is that the OS is assigning it, and it's independent of when the caller finally reads the packets.

If the test environment isn't "responsive", then the API probably doesn't work, as it's fundamentally supposed to be a method of time-stamping in that particular environment.

The previous test sends two packets, 250ms apart, and assumes the maximum system discrepancy is less than 5ms. If this treshold is problematic, then I suggest simply increasing it, or if a target system is giving responses with anywhere near the maximum 250ms discrepancy, consider the API to not be functional on that target.

asomers · 2019-01-06T15:28:42Z

What the old test did is it sent two packets separated by a sleep. It basically asserted that the sleep plus the 2nd sendmsg call took no more than 255ms. But there's nothing to ensure that. On a non-real-time OS, a process can be delayed at any time for any reason. If you look at the man page for nanosleep(2), you'll notice that it says "The suspension time may be longer than requested". The test failure doesn't indicate that SO_TIMESTAMP is non-functional on the target; it indicates that the target is busy.

What changed recently is that FreeBSD's CI moved from BuildBot to Cirrus-CI. Perhaps the Cirrus VMs are oversubscribed or something?

The new test is insensitive to arbitrary delays. All it does is ensure that the packet's received timestamp lies between two bounds. That assumes that the clock is monotonic, which isn't 100% true because SO_TIMESTAMP uses the real-time clock instead of the monotonic clock[1]. But I don't see a way around that.

Really, all that the test needs to do is check that some kind of time, any kind of time, is returned. Even if it's off by a year, then Nix is still probably binding the API correctly.

The test failure is because I used a function that wasn't introduced until Rust 1.27.0. I'll fix it.

[1] On FreeBSD 12 this is selectable with the SO_TS_CLOCK sockopt. But that isn't available on other platforms.

Wolvereness · 2019-01-06T15:52:00Z

I don't think tests should ever expect to handle non-monotonic time. The test is also not expecting the sleep to be 250ms; it's actually timing the delay itself, and then seeing if the delay between packets is within 5ms of what was measured. So, if the system time changed, it should be accounted for unless it was between the packet send and the timestamp get.

Are you getting the output for the tests? How off is it?

asomers · 2019-01-06T16:33:10Z

Well, the old test was measuring a delay. But it still didn't account for the time taken up by the sendmsg calls themselves. They aren't instantaneous. Nor does the received timestamp necessarily get recorded during the sendmsg call. That's an implementation detail. It could be recorded when the data is copied from the sending socket's buffer to the receiving socket's buffer, and that could happen at any point before recvmsg returns. The discrepancy, in one failing case, was 13 ms.

Wolvereness · 2019-01-06T16:36:00Z

I suggest just changing it from 5ms to 50ms then, and one extra check for the initial sleep to be at least 200ms.

asomers · 2019-01-06T16:41:53Z

Increasing the time limit wouldn't remove the test's sensitivity to the environment's responsiveness. It would just loosen it a bit. The new test should work even on the most unresponsive systems, even if the process gets paused for an hour. It only requires that the clock be monotonic.

The old test made assumptions about the responsiveness of the test environment. The new test does not, and it's simpler too.

asomers · 2019-01-07T16:02:05Z

bors r+

1005: Make ScmTimestamp's doc test more robust r=asomers a=asomers The old test made assumptions about the responsiveness of the test environment. The new test does not, and it's simpler too. Co-authored-by: Alan Somers <asomers@gmail.com>

bors · 2019-01-07T16:34:59Z

Build succeeded

continuous-integration/travis-ci/push
FreeBSD 11.2

Make ScmTimestamp's doc test more robust

eba2fc2

The old test made assumptions about the responsiveness of the test environment. The new test does not, and it's simpler too.

asomers force-pushed the scm_timestamp_test branch from ac2ac4c to eba2fc2 Compare January 6, 2019 21:11

bors bot merged commit eba2fc2 into nix-rust:master Jan 7, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Make ScmTimestamp's doc test more robust #1005

Make ScmTimestamp's doc test more robust #1005

Uh oh!

asomers commented Jan 6, 2019

Uh oh!

asomers commented Jan 6, 2019

Uh oh!

Wolvereness commented Jan 6, 2019

Uh oh!

asomers commented Jan 6, 2019

Uh oh!

Wolvereness commented Jan 6, 2019

Uh oh!

asomers commented Jan 6, 2019

Uh oh!

Wolvereness commented Jan 6, 2019

Uh oh!

asomers commented Jan 6, 2019

Uh oh!

asomers commented Jan 7, 2019

Uh oh!

bors bot commented Jan 7, 2019

Uh oh!

Uh oh!

Make ScmTimestamp's doc test more robust #1005

Make ScmTimestamp's doc test more robust #1005

Uh oh!

Conversation

asomers commented Jan 6, 2019

Uh oh!

asomers commented Jan 6, 2019

Uh oh!

Wolvereness commented Jan 6, 2019

Uh oh!

asomers commented Jan 6, 2019

Uh oh!

Wolvereness commented Jan 6, 2019

Uh oh!

asomers commented Jan 6, 2019

Uh oh!

Wolvereness commented Jan 6, 2019

Uh oh!

asomers commented Jan 6, 2019

Uh oh!

asomers commented Jan 7, 2019

Uh oh!

bors bot commented Jan 7, 2019

Build succeeded

Uh oh!

Uh oh!