PARQUET-507: Reduce the runtime of rle-test #37

wesm · 2016-02-03T00:08:10Z

I twiddled this a bit to cut the runtime in half. I'd like to reduce it further but looking for feedback -- my preference would be to use system entropy (std::random_device) to seed the PRNG and print the seed on failure. So we could run far fewer tests (e.g. only 50 or 100 or so) and occasionally run into flakiness or failure if we refactor and break something internally. Thoughts?

julienledem · 2016-02-03T01:09:53Z

if we randomly generate input data we can also print the corresponding input on failure.
That way it is easy to add it as a fixed test case for the input that made it fail.

wesm · 2016-02-03T05:43:17Z

I'll see if I can make each iteration a function of a single randomly generated seed, and print that on failure, so in the event of a random failure it will be reproducible. Then we can trim the runtime down to a few hundred ms or less

asandryh · 2016-02-03T14:44:13Z

src/parquet/util/rle-test.cc

Since iters < niters = 500, this condition is always false.

Note: I didn't write this code. Will clean it up more per the comments (and reducing runtime further)

This is only a performance-improving suggestion. ;)

…d on failure

wesm · 2016-02-05T16:04:56Z

@asandryh @julienledem I further shortened the runtime (whole test suite takes < 500ms for me locally) and using device entropy to generate PRNG seeds. On failure the seed is printed. Feel free to merge when build green

asandryh · 2016-02-06T00:36:17Z

src/parquet/util/rle-test.cc

+
+  // prng setup
+  std::random_device rd;
+  std::uniform_int_distribution<int> dist(1, std::numeric_limits<int>::max());


A small optimization: std::uniform_int_distribution<int> dist(1,20);
and change line 366 to int group_size = dist(gen);

asandryh · 2016-02-06T00:37:36Z

lgtm

wesm · 2016-02-06T01:18:21Z

Done, thanks

julienledem · 2016-02-06T20:08:07Z

+1

wesm · 2016-02-06T20:12:33Z

thank you!

Preallocate vector in BitRle.Random and run half as many iterations

a357dd1

asandryh reviewed Feb 3, 2016
View reviewed changes

Further shorten random tests; use device entropy and print random see…

ba97491

…d on failure

Buglet

0ed951a

asandryh reviewed Feb 6, 2016
View reviewed changes

Tidying per comments

d75f2ed

asfgit closed this in a5892f5 Feb 6, 2016

wesm deleted the PARQUET-507 branch February 6, 2016 20:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

PARQUET-507: Reduce the runtime of rle-test #37

PARQUET-507: Reduce the runtime of rle-test #37

Uh oh!

wesm commented Feb 3, 2016

Uh oh!

julienledem commented Feb 3, 2016

Uh oh!

wesm commented Feb 3, 2016

Uh oh!

asandryh Feb 3, 2016

Uh oh!

wesm Feb 3, 2016

Uh oh!

asandryh Feb 3, 2016

Uh oh!

wesm commented Feb 5, 2016

Uh oh!

asandryh Feb 6, 2016

Uh oh!

asandryh commented Feb 6, 2016

Uh oh!

wesm commented Feb 6, 2016

Uh oh!

julienledem commented Feb 6, 2016

Uh oh!

wesm commented Feb 6, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

PARQUET-507: Reduce the runtime of rle-test #37

PARQUET-507: Reduce the runtime of rle-test #37

Uh oh!

Conversation

wesm commented Feb 3, 2016

Uh oh!

julienledem commented Feb 3, 2016

Uh oh!

wesm commented Feb 3, 2016

Uh oh!

asandryh Feb 3, 2016

Choose a reason for hiding this comment

Uh oh!

wesm Feb 3, 2016

Choose a reason for hiding this comment

Uh oh!

asandryh Feb 3, 2016

Choose a reason for hiding this comment

Uh oh!

wesm commented Feb 5, 2016

Uh oh!

asandryh Feb 6, 2016

Choose a reason for hiding this comment

Uh oh!

asandryh commented Feb 6, 2016

Uh oh!

wesm commented Feb 6, 2016

Uh oh!

julienledem commented Feb 6, 2016

Uh oh!

wesm commented Feb 6, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants