sdjournal: fix a race in GetEntry test #181

lucab · 2016-06-20T10:23:43Z

Hard-sleep instead of waiting for a journal event.
This fixes a race due to waiting for any events but enumerating only
matching ones.

Closes #180

Hard-sleep instead of waiting for a journal event. This fixes a race due to waiting for any events but enumerating only matching ones.

lucab · 2016-06-20T10:33:21Z

All together #178, #179, and #181 let the whole testsuite run on travis without failures.

jonboulle · 2016-06-22T15:02:07Z

@lucab how about just putting them up in a single PR?

lucab · 2016-06-22T15:13:29Z

@jonboulle #179 is some "creative" travis usage which I'm not sure if ok. The other two could have been merged yes, but this last one was unplanned at first as I was expecting #180 to be harder to trace down.

jonboulle · 2016-06-22T15:17:27Z

@lucab is that like "creative accounting"? ;-).
What are your concerns with that approach exactly? Longer term, we could consider migrating this to our Jenkins infrastructure for more flexibility with different systemds.

lucab · 2016-06-22T15:31:45Z

@jonboulle exactly 😄

I expanded the discussion in that PR.

I'm not joining here the other two testsuite-fixing PRs now, but I'll avoid wasting resources next time. Relevant to this specific PR, there is a thread ongoing on systemd ML: https://lists.freedesktop.org/archives/systemd-devel/2016-June/036946.html

s-urbaniak · 2016-07-05T09:10:19Z

sdjournal/journal_test.go

-	if r < 0 {
-		t.Fatalf("Error waiting to journal")
-	}
+	time.Sleep(time.Duration(1) * time.Second)


Can you expand on why a hard sleep is better than waiting for the journal? This seems counter-intuitive to me as time.Sleep cries for flakes later on.

From ML thread (quoting myself in lack of a better source):

sd_journal_wait() will trigger on *any* events, while sd_journal_get_data() will apply the filter and find no matching entries.

In our context, Wait() will return as soon as there is a new message available in the log even if it doesn't match the filter, in which case Next() will return EOF (if expected entry is not yet available).

This sleep could be replaced with something like a Wait&Next with a retrial counter, but it will not eliminate the flakiness of the test: it will then depend on the magic number of retrials and additionally also on how many events happen before our expected entry.

We could loop over sd_journal_wait()/sd_journal_get_data() until we eventually get our message matching the field, but I agree this could end up in an endless loop.

Not really happy about the time.Sleep at all, so let's observe this in subsequent CI builds.

Me neither, but I think sdjournal would need to grow a C API for filtered event-triggering to properly address this usecase. May I go on and merge this as-is for now?

sdjournal: fix a race in GetEntry test

ee32dd7

Hard-sleep instead of waiting for a journal event. This fixes a race due to waiting for any events but enumerating only matching ones.

lucab added kind/bug area/dev-tooling needs/review labels Jun 20, 2016

s-urbaniak reviewed Jul 5, 2016
View reviewed changes

lucab merged commit 5f8a27b into coreos:master Jul 12, 2016

lucab mentioned this pull request Jul 12, 2016

sdjournal: add GetDataBytes() and GetDataValueBytes() to Journal #176

Merged

lucab mentioned this pull request Oct 17, 2016

sdjournal: add GetUniqueValues #202

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

sdjournal: fix a race in GetEntry test #181

sdjournal: fix a race in GetEntry test #181

Uh oh!

lucab commented Jun 20, 2016 •

edited

Loading

Uh oh!

lucab commented Jun 20, 2016 •

edited

Loading

Uh oh!

jonboulle commented Jun 22, 2016

Uh oh!

lucab commented Jun 22, 2016

Uh oh!

jonboulle commented Jun 22, 2016

Uh oh!

lucab commented Jun 22, 2016 •

edited

Loading

Uh oh!

s-urbaniak Jul 5, 2016

Uh oh!

lucab Jul 5, 2016 •

edited

Loading

Uh oh!

s-urbaniak Jul 5, 2016

Uh oh!

lucab Jul 6, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sdjournal: fix a race in GetEntry test #181

sdjournal: fix a race in GetEntry test #181

Uh oh!

Conversation

lucab commented Jun 20, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lucab commented Jun 20, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jonboulle commented Jun 22, 2016

Uh oh!

lucab commented Jun 22, 2016

Uh oh!

jonboulle commented Jun 22, 2016

Uh oh!

lucab commented Jun 22, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

s-urbaniak Jul 5, 2016

Choose a reason for hiding this comment

Uh oh!

lucab Jul 5, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

s-urbaniak Jul 5, 2016

Choose a reason for hiding this comment

Uh oh!

lucab Jul 6, 2016

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

lucab commented Jun 20, 2016 •

edited

Loading

lucab commented Jun 20, 2016 •

edited

Loading

lucab commented Jun 22, 2016 •

edited

Loading

lucab Jul 5, 2016 •

edited

Loading