Fix Issue 14368 - rawRead performance #3127

charles-cooper · 2015-03-30T17:40:57Z

reduce performance gap between fread and rawRead
https://issues.dlang.org/show_bug.cgi?id=14368

Currently std.stdio rawRead is much slower than cstdio fread. A benchmark indicates that rawRead is roughly 2.5x slower than calling fread directly in a tight loop (not counting time in syscalls). The overhead comes from three sources: the compiler fails to inline calls to std.exception.enforce, errnoEnforce is called even when fread's return value indicates success, and the buffer is sliced on every call.

This patch fast-paths the ordinary (no error) case, reducing the overhead to almost nil when compiled with gdc and to about 1.35x when compiled with dmd.


schveiguy · 2015-03-30T17:48:49Z

std/stdio.d

@@ -725,7 +725,8 @@ $(D rawRead) always reads in binary mode on Windows.
    {
        import std.exception : enforce, errnoEnforce;

-        enforce(buffer.length, "rawRead must take a non-empty buffer");
+        if (!buffer.length)
+            enforce(false, "rawRead must take a non-empty buffer");


Instead of calling enforce, you can just throw.

Thanks, will do. Any idea why enforce does not get inlined? The body just consists of if (!value) throw new Exception(msg)

dmd does not inline functions with lazy parameters. It's a big problem for enforce, been around forever.

Interesting. Would it make sense for enforce to be implemented as a mixin instead of a function with lazy parameters?

This requires using mixin at call site. e.g.:

mixin(enforce(cond, msg));

which is ugly and awkward.

The pity here is that in 99% of cases, the message passed to enforce does not involve any computation. The whole point of using the lazy parameter is to defer construction of the exception message until you have confirmed the condition is false. If the construction takes 0 time, then there is no point to defer. An inlineEnforce function would be a kludge, but may get the job done for now.
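
The trade-off described here can be illustrated outside D. The following C sketch is only an analogy: C has no lazy parameters, so a function pointer stands in for one. All names (build_message, check_eager, check_deferred, messages_built) are hypothetical and do not come from Phobos. The eager variant pays for building the message on every call, while the deferred variant builds it only when the condition is false.

```c
#include <assert.h>
#include <stddef.h>

/* Counts how often the message is actually built (illustration only). */
static int messages_built = 0;

/* Stands in for a message that is costly to construct. */
static const char *build_message(void)
{
    ++messages_built;
    return "rawRead must take a non-empty buffer";
}

/* Eager variant: the caller has already built msg, used or not.
 * Returns 1 if the check passed, 0 if it failed (error stored in *error_out). */
static int check_eager(int cond, const char *msg, const char **error_out)
{
    if (cond)
        return 1;
    *error_out = msg;
    return 0;
}

/* Deferred variant: the message is built only when the check fails. */
static int check_deferred(int cond, const char *(*make_msg)(void),
                          const char **error_out)
{
    assert(make_msg != NULL);
    if (cond)
        return 1;
    *error_out = make_msg();
    return 0;
}
```

When the message is a string literal, as in rawRead, the eager variant costs nothing extra, which is the point made above: deferral only pays off when building the message is expensive.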

Personally, I find enforce quite useless. It's not difficult to do if (!cond) throw new Exception(msg);

Actually, it's even more awkward than that: msg can't be a runtime string; it has to be a compile-time string that generates a string at runtime.

I see.

Or change enforce to take a strict parameter (and thus get auto-inlined) and have a lazyEnforce which takes a lazy parameter for those cases where the benefit of laziness outweighs the benefit of inlining.

Really though you're right, the compiler should be able to infer that there is no cost to strictly evaluating msg if it is immutable.

cf. https://github.com/D-Programming-Language/phobos/pull/3127/files#r27414148

schveiguy · 2015-03-30T18:29:52Z

Thanks, looks good to me.

andralex · 2015-03-31T05:23:27Z

std/stdio.d


-        enforce(buffer.length, "rawRead must take a non-empty buffer");
+        if (!buffer.length)
+            throw new Exception("rawRead must take a non-empty buffer");


Is this improving anything?

andralex · 2015-03-31T05:33:34Z

Sorry, read the code before the comments. I need to see the code used for benchmarking. This looks really odd.

It seems to me we could get better improvements by eliminating a call to error() after each read.

andralex · 2015-03-31T05:54:36Z

OK, I measured the code in the issue. Though the test code is contrived (hard to find someone who reads so little at a time yet cares about performance), it's worth improving things. Please eliminate fread_success (unnecessary and breaks naming convention) and let's pull this in. Thanks.

schveiguy · 2015-03-31T12:09:03Z

> It seems to me we could get better improvements by eliminating a call to error() after each read.

That is what this improvement does. According to the documentation for fread, an error can only have occurred when the return value does not equal the requested item count. This code short-circuits based on that principle: the only call to error should occur on a real error or at EOF.

You may have already realized this, but it wasn't clear from your last note.
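
For readers following along, the fread contract described above can be sketched in C, the language fread comes from, rather than in D. This is a minimal illustration and not the code from the patch; the helper name read_bytes and its signature are hypothetical. The error indicator is consulted only after a short read, so a full read costs nothing beyond the fread call itself.

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical helper (not from the patch): reads up to count bytes into buf,
 * stores the number actually read in *got, and returns 0 on success (full
 * read or end-of-file) or -1 on a stream error. */
static int read_bytes(FILE *fp, unsigned char *buf, size_t count, size_t *got)
{
    size_t n = fread(buf, 1, count, fp);

    assert(n <= count);     /* fread never returns more than was requested */
    *got = n;

    if (n == count)
        return 0;           /* fast path: full read, no error check needed */

    /* Short read: either end-of-file or an error. Only now call ferror(). */
    return ferror(fp) ? -1 : 0;
}
```

A caller can tell end-of-file apart from a full read by comparing *got with the requested count.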

charles-cooper · 2015-03-31T16:10:05Z

Sure, I can get rid of the once-used variable. But so far the original author of the code, two reviewers, and myself have initially been unclear about the semantics of fread's return value -- that ANY short read could be indicative of error, while buffer.length == freadResult indicates there was no error.

IMO having a one-off variable instead of a comment is self-documenting because a) it causes the maintainer to look twice, possibly checking man fread in the process and b) names the type of the result so it looks more like a crude form of pattern matching than an arbitrary check.

Taking into account the confusion so far, I propose that the below code is clearer because it communicates that slicing the buffer and checking errno makes semantic sense only when fread returns a short item count.

assert (freadResult <= buffer.length); // fread never returns result greater than nmemb
immutable possibleFailure = (freadResult != buffer.length);
if (possibleFailure)
{
    errnoEnforce(!error);
    auto safeSlice = buffer[0 .. freadResult];
    return safeSlice;
}
return buffer;

schveiguy · 2015-03-31T17:57:09Z

> IMO having a one-off variable instead of a comment is self-documenting because a) it causes the maintainer to look twice, possibly checking man fread in the process and b) names the type of the result so it looks more like a crude form of pattern matching than an arbitrary check.

I disagree, the comment is sufficient. I'd rather just see a comment in there than give the compiler an excuse to waste cycles.

charles-cooper · 2015-04-03T16:26:19Z

Hey, what is the status of this pull request? The most recent patch addresses the comments provided earlier.

schveiguy · 2015-04-03T17:07:41Z

LGTM

FYI, updates to the branch do not get sent as a notification to email, so there's no way to know something like that changes unless you add a comment.

schveiguy · 2015-04-03T17:08:44Z

Auto-merge toggled on

Fix Issue 14368 - rawRead performance

charles-cooper · 2015-04-03T17:41:48Z

I see -- Thank you!

andralex · 2015-04-03T17:48:43Z

@charles-cooper thanks for doing and arguing this work properly, and thx @schveiguy for reviewing and following up!

Fix Issue 14368 - rawRead performance

e741c24

reduce performance gap between fread and rawRead cf. https://issues.dlang.org/show_bug.cgi?id=14368

schveiguy reviewed Mar 30, 2015

style -- throw exception instead of calling enforce

962073a

cf. https://github.com/D-Programming-Language/phobos/pull/3127/files#r27414148

andralex reviewed Mar 31, 2015

style suggestions

4493008

schveiguy added a commit that referenced this pull request Apr 3, 2015

Merge pull request #3127 from charles-cooper/issue_14368

4d30c1d

Fix Issue 14368 - rawRead performance

schveiguy merged commit 4d30c1d into dlang:master Apr 3, 2015
