tests/timer failures on FreeBSD #48
Comments
It was succeeding, but we were claiming failure. Reported in #48.
I see these occasional timer interval failures in my fbsd VM and had chalked them up to VM timing issues. I don't know (because I haven't delved in!) whether this lateness is due to timer granularity in the system, or whether the timer is being rolled into the next 100ms window by the
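To make the "rolled into the next 100ms window" hypothesis concrete, here is a minimal sketch of how a wheel that only dispatches on tick boundaries would delay a timer. This is an illustration under assumptions, not libphenom's actual code: the names `TICK_MS`, `fire_time_ms`, and `lateness_ms` are hypothetical, and the 100ms tick is taken from the comment above.

```python
TICK_MS = 100  # assumed wheel tick, taken from the "100ms window" mentioned above


def fire_time_ms(due_ms: int, tick_ms: int = TICK_MS) -> int:
    """Return the first tick boundary at or after due_ms.

    Models a hypothetical wheel that can only dispatch on tick boundaries.
    """
    ticks = (due_ms + tick_ms - 1) // tick_ms  # integer ceiling division
    return ticks * tick_ms


def lateness_ms(due_ms: int, tick_ms: int = TICK_MS) -> int:
    """How long after its due time such a timer would actually fire."""
    return fire_time_ms(due_ms, tick_ms) - due_ms


if __name__ == "__main__":
    for due in (200, 201, 250, 299):
        print(due, fire_time_ms(due), lateness_ms(due))
```

Under this model a timer due exactly on a boundary fires on time, while one due just after a boundary waits almost a full tick. If the real failures show lateness close to one tick, that would point at the wheel rather than at the kernel timer.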
Seeing it on HW too with Westmere timers. It's easy to trigger by making all the cores busy. It does seem to be hitting the code you identified. Any intuition on a fix?
Instrumenting shows this:
looks like both of the late cases are when
I'm not 100% clear on why this isn't behaving. My best guess is that we're hitting a combination of factors:
I tried making the kevent setup for the timer use 1/2, 1/4, and 1/8 granularities to make sure that we're waking up often enough, and they all tend to miss the precise due time, even when
It's also possible that the timerwheel code is subtly wrong, but it is weird that we only see these failures on FreeBSD.
Documenting for others: when running https://gist.github.com/kev009/8106940 I see at least 1 ms precision using kqueue, which seems to match OS X using timerwheel. Looking into the second bullet.
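For anyone who wants a rough, portable way to look at wake-up lateness, a sketch along these lines may help. It is not the gist above and does not use kqueue: it assumes `time.sleep` as a stand-in for the timer, and `measure_lateness` is a hypothetical helper name. Results will depend on load and power management, as discussed in this thread.

```python
import time


def measure_lateness(interval_s: float, samples: int) -> list:
    """Sleep `samples` times and return how late each wake-up was, in seconds.

    Uses time.sleep as a stand-in timer (an assumption, not kqueue) and the
    monotonic clock so wall-clock adjustments do not skew the numbers.
    """
    lateness = []
    for _ in range(samples):
        start = time.monotonic()
        time.sleep(interval_s)
        elapsed = time.monotonic() - start
        lateness.append(elapsed - interval_s)
    return lateness


if __name__ == "__main__":
    results = measure_lateness(0.01, 20)
    print(f"max lateness: {max(results) * 1000:.3f} ms")
    print(f"avg lateness: {sum(results) / len(results) * 1000:.3f} ms")
```

Running it once on an idle machine and once with all cores busy gives a quick feel for how much of the lateness comes from scheduling rather than from the timer code.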
I think this was just fixed in FreeBSD-HEAD in r272528. Is anyone able to re-test it?
@erikarn it looks a lot better in the happy case, within 1ms. @wez in some unhappy case, it seems to be getting chucked into a different schedule (see final run). Any thoughts?
@erikarn I think the other failures are related to load/power management. Not sure if there is a bug report or feature request there? But I think this can be closed out for phenom.
You can file a FreeBSD bug if you can reproduce a timer problem with power -a On 21 October 2014 15:59, Kevin Bowling notifications@github.com wrote:
Thank you for reporting this issue and appreciate your patience. We've notified the core team for an update on this issue. We're looking for a response within the next 30 days or the issue may be closed.
worst bot evar
:-/
I haven't dug into this yet. I'm not getting these consistently; sometimes the run is successful. The "affinity failed" message is always present ("nbio" and "job" in the message were added by me).