Optimize representation of Error without backtrace #20

cramertj · 2017-11-05T23:58:32Z

This change modifies Error to not store empty backtraces.
This saves space and allows zero-sized errors without
backtraces to be stored without any allocation.

withoutboats · 2017-11-06T20:44:18Z

src/error.rs

+            // Attempt to add a backtrace
+            let backtrace = Backtrace::new();
+            if backtrace.is_none() {
+                Box::new(failure) as Box<FailOrWithBacktrace>


Is this actually guaranteed not to allocate if the failure is a ZST?

Yes, but to be clear, this code doesn't rely on that behavior for correctness.
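A minimal sketch of the property being asked about, assuming a hypothetical zero-sized type `EmptyError` (not part of this PR). It only checks sizes and the data pointer; it does not instrument the allocator, so the "no allocation" part rests on the standard library's documented behavior for `Box::new` with zero-sized values.

```rust
use std::fmt::Debug;
use std::mem::{size_of, size_of_val};

// Hypothetical zero-sized failure type, used only for illustration.
#[derive(Debug)]
struct EmptyError;

// Size of the concrete type: zero, because it has no fields.
fn zst_size() -> usize {
    size_of::<EmptyError>()
}

// Size of the value behind a boxed trait object built from the ZST.
fn boxed_value_size() -> usize {
    let boxed: Box<dyn Debug> = Box::new(EmptyError);
    size_of_val(&*boxed)
}

// Address stored in a Box holding the ZST. It is a dangling but
// non-null, well-aligned address rather than a heap allocation.
fn zst_box_addr() -> usize {
    let boxed = Box::new(EmptyError);
    &*boxed as *const EmptyError as usize
}
```

Calling `zst_size()` and `boxed_value_size()` should both give zero, while `zst_box_addr()` should be non-zero, which matches the point that the box is never null even though nothing is stored behind it.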

withoutboats · 2017-11-06T20:47:33Z

src/error.rs

-                Ok(ret)
-            }
-            _       => Err(self)
+        if self.downcast_ref::<T>().is_some() {


I would like more comments on how this rewrite of the unsafe code works and is correct. To me it looks like this is wrong, but I'm not confident I'm right about that.

In particular, if there is a backtrace, you don't seem to ever actually drop the backtrace here. This is what the ptr::read stuff was about.

Sure, I'll add a comment. The drop is occurring implicitly since Box::from_raw(ptr as *mut WithBacktrace<T>) creates a Box<WithBacktrace<T>>. The failure field is moved out, and the rest of the box (including the remaining backtrace field) is dropped.
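The implicit drop described in this reply can be sketched roughly as follows. This is not the code from the PR: `FakeBacktrace` and `take_failure` are hypothetical names, and the `WithBacktrace<T>` struct here is a stand-in with the two fields the comment mentions.

```rust
use std::cell::Cell;
use std::rc::Rc;

// Hypothetical stand-in for a backtrace that records when it is dropped.
struct FakeBacktrace {
    dropped: Rc<Cell<bool>>,
}

impl Drop for FakeBacktrace {
    fn drop(&mut self) {
        self.dropped.set(true);
    }
}

// Stand-in for the wrapper discussed in the review.
#[allow(dead_code)]
struct WithBacktrace<T> {
    backtrace: FakeBacktrace,
    failure: T,
}

/// Rebuilds the box from a raw pointer and returns only the failure.
///
/// # Safety
/// `ptr` must come from `Box::into_raw` on a `Box<WithBacktrace<T>>`
/// and must not be used again afterwards.
unsafe fn take_failure<T>(ptr: *mut WithBacktrace<T>) -> T {
    // Re-take ownership of the heap allocation.
    let boxed = unsafe { Box::from_raw(ptr) };
    // Move the whole struct out of the box; the heap memory is freed here.
    let unboxed = *boxed;
    // Move the failure out. The remaining `backtrace` field is dropped
    // when `unboxed` goes out of scope at the end of this function.
    unboxed.failure
}
```

After `take_failure` returns, the `dropped` flag shared with the `FakeBacktrace` should be `true`, without any explicit `ptr::read` or manual drop of the backtrace.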

withoutboats · 2017-11-06T20:55:55Z

> This saves space and allows zero-sized errors without backtraces to be stored without any allocation.

I don't think saving a couple words in the heap is a win worth the additional complexity, so this commit is only worth it because (if) it allows ZST errors to not allocate when backtraces are turned off. I'm not certain that this will actually work, though. Do we guarantee that a Box will have a null data pointer if the data would be a ZST?

Another concern we've talked about is how this will interact with #9. If we start double boxing things, we don't go from this being an alloc to no allocs, we go from 2 allocs to 1 (which is much less important).

All in all I'm increasingly unsure about what direction we should be optimizing in here. As @cramertj and I have talked about over IRC, there is an 'optimal' (highly unsafe) representation in which you store the vtable inline with the data, making it one word, and use alignment to mask a vtable vs heap pointer so you also don't alloc in the ZST/no-backtrace case. But that's a lot of complexity.

Finally, there's a question of guarantees that both this and #9 raise. Right now, I'd feel comfortable guaranteeing that Error will never be more than 2 words - adding a test for it and such. But I'm not nearly as comfortable guaranteeing that ZST/no-backtrace errors will never allocate, or that Error is only 1 word, because of how these different optimizations run up against each other.

I know I told you a few days ago that I thought separating this from the other PR would be an easier merge, sorry to backpedal on that 😅.

cramertj · 2017-11-06T22:19:15Z

No worries! There's a lot of things to consider here-- I agree that we wouldn't want to pretend to offer guarantees that we don't actually provide.

> this commit is only worth it because (if) it allows ZST errors to not allocate when backtraces are turned off

Yup, totally agree.

> Do we guarantee that a Box will have a null data pointer if the data would be a ZST?

No, we do not. In fact, we guarantee the opposite-- the data pointer cannot be a null pointer (at the moment, it's usually 0x1, but that's probably going to change soon). However, we do guarantee that no allocation is actually performed.

> Another concern we've talked about is how this will interact with #9. If we start double boxing things, we don't go from this being an alloc to no allocs, we go from 2 allocs to 1 (which is much less important).

Yes, if we go that route I'd advocate for either (a) removing the optimization or (b) using the leftover-from-alignment bits of the top-level Box to store a marker bit indicating whether the rest of the pointer points to either data or a vtable. Personally, I'd prefer option (b), but I agree with you that it comes at the cost of more complicated and unsafe code. However, there are steps we can take to minimize the amount of actual unsafe code, and I'm fairly confident I could put together a PR that made this all fairly simple and easy to validate. If we decide to double-box, I'd like to at least write the code and discuss its costs and benefits concretely before making a decision here.
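To make option (b) concrete, here is a rough sketch of storing a marker bit in the low bit of an aligned pointer. It is an illustration only, not the representation proposed in this thread: `tag_ptr`, `untag_ptr`, and `TAG_MASK` are hypothetical names, and the sketch relies on plain integer casts and assumes the pointee type has an alignment of at least 2.

```rust
use std::mem::align_of;

// Lowest bit of the address, free whenever the pointee alignment is >= 2.
const TAG_MASK: usize = 0b1;

// Packs a pointer and a one-bit marker into a single word.
// The marker could mean "points to a vtable" vs. "points to data".
fn tag_ptr<T>(ptr: *mut T, marker: bool) -> usize {
    debug_assert!(align_of::<T>() >= 2, "need a spare low bit");
    debug_assert_eq!(ptr as usize & TAG_MASK, 0, "pointer must be aligned");
    (ptr as usize) | (marker as usize)
}

// Splits the word back into the original pointer and the marker.
fn untag_ptr<T>(tagged: usize) -> (*mut T, bool) {
    let ptr = (tagged & !TAG_MASK) as *mut T;
    let marker = tagged & TAG_MASK == 1;
    (ptr, marker)
}
```

A round trip through `tag_ptr` and `untag_ptr` should return the same pointer and marker. Any real version would still need unsafe code at the point where the untagged pointer is dereferenced, which is the complexity cost discussed above.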

> I'd feel comfortable guaranteeing that Error will never be more than 2 words... But I'm not nearly as comfortable guaranteeing that ZST/no-backtrace errors will never allocate, or that Error is only 1 word...

As far as future size/optimization guarantees, I'd assumed that you wouldn't provide any concrete performance guarantees until the library had more time to mature and we'd gotten some practical experience/benchmarks. If you want to provide an explicit guarantee that Error is never more than two words on any platform, then I think it's fine to provide documentation and a test for that. I agree with you that it would be unwise to guarantee a single-word representation or a non-allocating representation until we've gotten more hands-on experience using and benchmarking real-world code using failure::Error.
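A test for the two-word guarantee could look roughly like the sketch below. The `Fail` trait and `Error` struct here are simplified stand-ins defined for the example, not the real `failure` types; in the crate itself the check would be written against `failure::Error`.

```rust
use std::fmt::Debug;
use std::mem::size_of;

// Simplified stand-in for the crate's failure trait.
trait Fail: Debug {}

// Simplified stand-in for the error type: a single boxed trait object.
#[allow(dead_code)]
struct Error {
    inner: Box<dyn Fail>,
}

// Returns true when the error type is at most two machine words wide.
fn error_fits_in_two_words() -> bool {
    size_of::<Error>() <= 2 * size_of::<usize>()
}
```

Wrapping `assert!(error_fits_in_two_words())` in a `#[test]` function would make a size regression fail the build on whatever platform the tests run on.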

withoutboats · 2017-11-08T23:01:54Z

I think I'm inclined to do neither this nor #9 prior to the 0.1 release, and try to collect feedback based on use about which direction the performance issues tilt.

cramertj · 2017-11-08T23:08:07Z

That's fine w/ me-- these are all backwards compatible.

cramertj force-pushed the opt-nobacktrace branch from 6f32767 to 16f2756 on November 6, 2017 00:02

Optimize representation of Error without backtrace

fec787c

cramertj force-pushed the opt-nobacktrace branch from 16f2756 to fec787c on November 6, 2017 00:06

withoutboats reviewed Nov 6, 2017

withoutboats mentioned this pull request Nov 8, 2017

Size of failure::Error is too big #9

Open

withoutboats mentioned this pull request Nov 17, 2017

Ideal representation of Error #51

Open

cramertj closed this Jan 31, 2018
