Improve time precision on Windows #80

bmermet · 2017-06-15T21:47:06Z

This switches from time.Now to GetSystemTimePreciseAsFileTime to improve the precision of the timestamps and duration computations from ~1ms to ~1µs. This comes at a performance cost since this call is much slower than the original one. I've run benchmarks in my VM and it makes a ~10x difference:

BenchmarkNormalTimeNow-2        100000000    12.7 ns/op
BenchmarkHighPrecisionTime-2     20000000    101 ns/op

This remains very fast, so I think the precision gain is worth the performance cost.
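For anyone wanting to reproduce this kind of comparison, here is a rough sketch of a per-call timing helper. The name `nsPerCall` and the clock passed in `main` are assumptions for illustration, not code from this PR. On Windows the clock under test would be a wrapper around GetSystemTimePreciseAsFileTime, and Go's `testing` benchmarks remain the better tool for real measurements.

```go
package main

import (
	"fmt"
	"time"
)

// sink keeps the compiler from discarding the clock calls.
var sink int64

// nsPerCall times n consecutive calls to clock (n must be > 0) and
// returns the average cost of one call in nanoseconds.
func nsPerCall(clock func() int64, n int) float64 {
	start := time.Now()
	for i := 0; i < n; i++ {
		sink += clock()
	}
	return float64(time.Since(start).Nanoseconds()) / float64(n)
}

func main() {
	// Baseline clock; swap in another func() int64 to compare.
	stdNow := func() int64 { return time.Now().UnixNano() }
	fmt.Printf("time.Now: %.1f ns/call\n", nsPerCall(stdNow, 1000000))
}
```

The absolute numbers depend heavily on the machine and on virtualization, so only the ratio between two clocks measured on the same host is meaningful.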

Switch from time.Now to GetSystemTimePreciseAsFileTime to improve the precision of the timestamps and duration computations.

palazzem · 2017-06-16T21:19:16Z

tracer/time_windows.go

+
+// This method is more precise than the go1.8 time.Now on Windows
+// See https://msdn.microsoft.com/en-us/library/windows/desktop/hh706895(v=vs.85).aspx
+// It is however ~10x slower and requires Windows 8+.


To me, the overhead is fine only if this is a "real" issue. What I mean is that I can accept slightly slower instrumentation (still on the order of nanoseconds) if reading the time from system shared memory is not good enough for users, especially if it gives bad results. It doesn't affect non-Windows users, so it's something we can try to improve later.

@LotharSee do you agree, or should we tackle this issue in a better way?

As a note, I suggested the same approach for a DNS Check in the Datadog agent: https://github.com/DataDog/integrations-core/blob/2209b0c316378fcde2563b49c9200e0ffef18d91/dns_check/check.py#L15-L26

Even though it's a different problem (a polling check that runs in another process), it's still required there because it gets the wrong time on Windows.

LotharSee · Jun 17, 2017

Yes, this imprecision is currently an issue.
@bmermet did you check that GetSystemTimePreciseAsFileTime returns much more precise results? The initial issue was that a duration computed from two time.Now() calls could easily be 0. Maybe that's something we could easily add a test for?

ufoot · 2017-06-19T09:11:36Z

Agreed, a test would be nice. It may be tricky, as all those timing tests are hard to get right. Maybe only check that if we sleep a microsecond, the time elapsed is non-zero?

bmermet · Jun 19, 2017

The problem is that testing time-related code is indeed tricky. Since time.Sleep() uses the same clock as time.Now(), any use of time.Sleep() will result in a non-zero difference between two calls to time.Now(). I chose to poll the value of the high precision timer until it changes, and to assert that the low precision one didn't change in the meantime.
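A minimal sketch of that polling idea, with hypothetical names (`changesBefore`, `fine`, `coarse`). The two clocks in `main` are portable stand-ins, assumed here so the example runs anywhere. In the scenario described above they would be the high precision timer and the low precision one.

```go
package main

import (
	"fmt"
	"time"
)

// changesBefore polls fast until its value changes, then reports whether
// slow still returns the value it had at the start. It gives up and
// returns false if fast has not changed after maxPolls polls.
func changesBefore(fast, slow func() int64, maxPolls int) bool {
	startSlow := slow()
	startFast := fast()
	for i := 0; i < maxPolls; i++ {
		if fast() != startFast {
			return slow() == startSlow
		}
	}
	return false
}

func main() {
	// Stand-ins (assumptions): a nanosecond clock, and the same clock
	// truncated to milliseconds to mimic a coarse timer.
	fine := func() int64 { return time.Now().UnixNano() }
	coarse := func() int64 { return time.Now().Truncate(time.Millisecond).UnixNano() }
	fmt.Println(changesBefore(fine, coarse, 1000000))
}
```

The check can still fail occasionally if the coarse clock happens to tick between the two reads, which is the flakiness raised later in this review.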

ufoot · 2017-06-19T16:17:27Z

Nice catch. Indeed, that Sleep() problem is annoying.

ufoot · 2017-06-19T09:10:01Z

tracer/time_windows.go

+// precise implementation based on time.Now()
+func init() {
+	if err := windows.LoadGetSystemTimePreciseAsFileTime(); err != nil {
+		now = lowPrecisonNow


I think it would be nice to have some information about this, namely a log message that dumps err along with a note explaining that we are in fallback mode and that timing can be imprecise. This is just to help support (and us) when digging into timing issues.

ufoot

A little nitpick about test flakiness, but fine for me.

ufoot · 2017-06-19T16:21:24Z

tracer/time_windows_test.go

+}
+
+func TestHighPrecisionTimerIsMoreAccurate(t *testing.T) {
+	startLow := lowPrecisionNow()


This is flaky but, as you say above, that's hard to avoid. I think you could reduce the flakiness with something like:

for startLow == lowPrecisionNow() {} // make sure we're just after a "low precision boundary"
startLow = lowPrecisionNow()
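Written out as a standalone helper, the idea could look like the sketch below. `alignToTick` is a hypothetical name, and the millisecond-truncated clock in `main` is only an assumed stand-in for `lowPrecisionNow` so that the example runs on any platform.

```go
package main

import (
	"fmt"
	"time"
)

// alignToTick spins until clock returns a value different from its first
// reading and returns that new value, so the caller starts measuring
// right after a tick of a coarse clock.
func alignToTick(clock func() int64) int64 {
	start := clock()
	for {
		if v := clock(); v != start {
			return v
		}
	}
}

func main() {
	// Stand-in (assumption) for a low precision clock.
	coarse := func() int64 { return time.Now().Truncate(time.Millisecond).UnixNano() }
	startLow := alignToTick(coarse)
	fmt.Println(startLow)
}
```

Starting just after a boundary leaves close to a full tick of the low precision clock for the high precision one to change, which is what makes the assertion less likely to fail.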

Improve time precision on Windows

f37c68d

bmermet requested review from palazzem, derekwbrown and ufoot June 15, 2017 21:48

palazzem reviewed Jun 16, 2017

palazzem added the core label Jun 18, 2017

ufoot added this to the 0.5.0 milestone Jun 19, 2017

ufoot reviewed Jun 19, 2017

Adding test to show improved timer precision

b814f9a

ufoot approved these changes Jun 19, 2017

bmermet merged commit 3b32275 into master Jun 20, 2017

bmermet deleted the bmermet/timerprecision branch June 20, 2017 17:08
