Develop #13

jafingerhut · 2012-11-05T11:05:30Z

Hugo:

Please take a look at these modifications to criterium when you get a chance, and let me know if they or some modified version of them would be interesting to you to pull into your version.

The extra return data is useful to me when recording results from many different Clojure versions and JDKs. The changes to execute-expr reduce the overhead per call to the expression being benchmarked. The change in measurement times is quite noticeable for expressions with short run times.

…sion to ret val of runtime-details

…e-details ret val

…hmarked expression.

…on-count

Keeping 8192 previous return values of the benchmarked expression in memory at once is way too many for expressions that return large data structures.

… sequence of expressions The goal is to see whether the results are noticeably different when executed 'round robin' instead of doing the same expression over and over for an extended period of time (e.g. 1 min). Instead try 10 different expressions for 100 millisec each, for example, and then repeat that 60 times if :sample-count is 60. One needs to be cautious in interpreting the results, e.g. if one expression in the sequence tends to leave behind lots of garbage that won't be collected until a later expression is being executed.

hugoduncan · 2012-11-15T21:24:18Z

Andy,

This looks very interesting! Will try to get to it when conj is over.

hugoduncan · 2012-11-19T19:57:08Z

Thanks for this Andy! All merged.

The memory allocation for the results had been bothering me for a while, so great to see it fixed.

The round robin form looks interesting too - maybe you could add something to the README with a motivation/explanation of when to use it.

jafingerhut · 2012-11-19T20:15:56Z

Glad it could be of use.

Note that the memory allocation thing that I fixed in one of my later commits of the group was one that I introduced myself in an earlier commit, I think (storing 8192 earlier results was way too much, reducing it to 4 was better).

I believe that before my changes, your code was not storing any intermediate results from one iteration to the next, but instead was calculating their hash values in each iteration. Avoiding the hash calculation in the timing loop is what I was hoping to avoid.

So my approach is still probably worse than yours in its use of memory, if the expression return value is large. It may or may not be faster in those cases depending upon how long the original code took to calculate hashes in each iteration, versus how long it takes my latest version to do the garbage collection 4 evaluations later.

I will definitely consider adding something to the README with a motivation of the round robin thing. It was motivated by Rich Hickey's recent message to clojure-dev about how to do microbenchmarking in the face of Java's HotSpot JIT, but I'd prefer to have more results showing it actually produces noticeably different results before I believe it is actually useful.

One down side of the round robin idea is that it leaves garbage from the evaluation of one expression around that might be included in GC time while timing a later expression. It would be good to have a clear example of that, too, in the documentation, as a warning for anyone who tries it.

hugoduncan · 2012-11-19T21:18:16Z

The garbage collection of return values happens in either approach, if I understand correctly.

I assume you mean this message from Rich: https://groups.google.com/d/msg/clojure-dev/0c-VNhEKVkI/Z7R1qKqsfN0J

I'm not convinced that round-robin is really a solution to the issue, though I haven't a better solution at the moment, given the need to maintain timing resolution.

One way of preventing GC interaction would be to force a GC between different expressions, but that possibly leads to other issues.

hugoduncan · 2012-11-19T21:55:20Z

@ztellman This code is now on the develop branch - maybe you could comment as to whether it improves the timing resolution, and if not, then open a top level issue for that.

jafingerhut and others added 7 commits November 3, 2012 06:27

Add the values of system properties java.version and java.runtime.ver…

827ebd8

…sion to ret val of runtime-details

Add Clojure version and system property sun.arch.data.model to runtim…

69eaecb

…e-details ret val

Return options used as part of benchmark* return value

0d23bb4

Change execute-expr so it has less overhead per execution of the benc…

db075c7

…hmarked expression.

Fix divide-by-0 and exceeding range of int errors in estimate-executi…

8fdd0ad

…on-count

Enhancement to my earlier change to execute-expr, reducing memory usage

5b6a0fd

Keeping 8192 previous return values of the benchmarked expression in memory at once is way too many for expressions that return large data structures.

hugoduncan closed this Nov 19, 2012

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Develop #13

Develop #13

jafingerhut commented Nov 5, 2012

hugoduncan commented Nov 15, 2012

hugoduncan commented Nov 19, 2012

jafingerhut commented Nov 19, 2012

hugoduncan commented Nov 19, 2012

hugoduncan commented Nov 19, 2012

Develop #13

Develop #13

Conversation

jafingerhut commented Nov 5, 2012

hugoduncan commented Nov 15, 2012

hugoduncan commented Nov 19, 2012

jafingerhut commented Nov 19, 2012

hugoduncan commented Nov 19, 2012

hugoduncan commented Nov 19, 2012