This repository has been archived by the owner on Dec 1, 2023. It is now read-only.

Having the length of a "seq" or "map" be tied to the allocated capacity is dangerous. #115

Closed
TyOverby opened this issue May 31, 2015 · 26 comments

Comments

@TyOverby
Contributor

For many decoders (like JSON), calling read_seq with the number of items in a JSON array or map is fine because the array or map is already in memory, so initializing a Vec or HashMap with with_capacity isn't that bad.

However, in a bug found by @jmesmon, the bincode decoder can be tricked into making Vec preallocate more memory than the system has, causing the program to crash. I put a lot of restrictions into bincode to prevent DoS attacks, with the goal of being able to use bincode on a public port for gamedev, but the automatic "capacity is the same as length" behavior would let an attacker take down any program that uses bincode to decode.

I've implemented a fix here. This would be a breaking change for people who implement their own Decoders because the type signature differs, but types that are Decodable won't need to change. Furthermore, the #[derive(Decodable)] plugin shouldn't need to change because it doesn't emit any read_map or read_seq calls.
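To make the failure mode concrete, here is a minimal sketch of the vulnerable pattern (toy code with assumed names, not bincode's actual reader): a length-prefixed decoder that trusts the claimed element count from the wire.

```rust
use std::io::Read;

// Read a little-endian u64 length prefix from untrusted input.
fn read_len(r: &mut impl Read) -> std::io::Result<usize> {
    let mut buf = [0u8; 8];
    r.read_exact(&mut buf)?;
    Ok(u64::from_le_bytes(buf) as usize)
}

// The vulnerable pattern: the claimed element count goes straight into
// Vec::with_capacity, so eight attacker-controlled bytes can demand a
// near-usize::MAX allocation before a single element has been read.
fn naive_decode(r: &mut impl Read) -> std::io::Result<Vec<u8>> {
    let len = read_len(r)?;
    let v = Vec::with_capacity(len); // aborts/OOMs if `len` is hostile
    // ... `len` elements would be read into `v` here ...
    Ok(v)
}

fn main() {
    // Honest input: a prefix claiming 5 elements (payload elided).
    let msg = 5u64.to_le_bytes();
    let v = naive_decode(&mut &msg[..]).unwrap();
    assert!(v.capacity() >= 5);
}
```

The fix linked above changes the decoder interface so the collection no longer reserves the full claimed length up front.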

@TyOverby
Contributor Author

This affects not only bincode, but cbor as well.

@BurntSushi

I managed to fix this in my slow decoder because it has an intermediate representation. But CBOR does suffer from this bug in the direct decoder for the same reasons that bincode suffers from it.

I can't think of another way to fix this either. A breaking change is unfortunate, but it looks like it might be limited to decoder implementors rather than users of Decodable.

@codyps
Contributor

codyps commented Jun 3, 2015

It might be worth considering (potentially in addition to the changes proposed by @TyOverby) passing an additional Fn to read_map & read_seq that allows estimating the memory that would be allocated for a given number of elements.

Otherwise, making decoders OOM-free will involve either arbitrary caps on seq & map initial capacity or simply disabling preallocation entirely.

Also, a complementary (but more involved) solution to this issue may be a way to specify a per-member limit on the number of elements (looking at many existing protocols, limits on the lengths of things that would otherwise be unbounded are fairly common). This last bit should be able to live somewhere other than the rustc-serialize crate, however.
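A rough sketch of what that estimation hook could look like (hypothetical names and signatures, not the actual rustc-serialize API): the decoder carries a byte budget and only honors a preallocation request when the caller's estimate fits within it.

```rust
// Toy decoder with a byte budget for speculative preallocation.
struct ToyDecoder {
    budget_bytes: usize,
}

impl ToyDecoder {
    // `estimate(len)` reports the bytes `len` elements would occupy; the
    // decoder preallocates only when that estimate fits its budget,
    // otherwise it falls back to growing from empty.
    fn read_seq<T, E, F>(&mut self, len: usize, estimate: E, f: F) -> T
    where
        E: Fn(usize) -> usize,
        F: FnOnce(usize) -> T, // receives the capacity that is safe to reserve
    {
        let cap = if estimate(len) <= self.budget_bytes { len } else { 0 };
        f(cap)
    }
}

fn main() {
    let mut d = ToyDecoder { budget_bytes: 1 << 20 }; // 1 MiB budget
    // An honest length gets its preallocation...
    let small: Vec<u64> = d.read_seq(100, |n| n.saturating_mul(8), Vec::with_capacity);
    assert!(small.capacity() >= 100);
    // ...while an absurd claimed length degrades to no preallocation.
    let huge: Vec<u64> = d.read_seq(usize::MAX, |n| n.saturating_mul(8), Vec::with_capacity);
    assert_eq!(huge.capacity(), 0);
}
```

The appeal of this shape is that the element-size knowledge stays with the caller (who knows the type) while the budget policy stays with the decoder.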

@canndrew

Bump. This makes any decoder with decent performance unusable in network protocols. Could we incorporate @TyOverby's fix and bump the version number to 0.4.0?

@TyOverby
Contributor Author

CC: @alexcrichton

Old discussion here: #121

@alexcrichton
Contributor

This may just end up being a case where serde is recommended over rustc-serialize, unfortunately.

@TyOverby
Contributor Author

Is rustc-serialize de facto deprecated? If so, this is going to be a terrible hit to the usability of bincode. Right now rustc-serialize is the only choice for bincode users on the stable and beta releases of Rust. With #[derive(Serialize, Deserialize)] for serde only available on nightly, the instructions for using bincode go from

Write a single derive, and then you get incredibly fast binary serialization for free

to

Implement custom serialization and deserialization visitors (sometimes in the hundreds of LOC) and pray that you deserialize in the exact same order that you serialized, otherwise you'll get un-debuggable crashes at runtime.

I've already had several people open up bugs on Bincode because they used serde and made mistakes in their implementations. In fact, when @erickt graciously added serde support to bincode, he didn't implement serialize/deserialize for structs in the bincode tests, meaning that we are only able to run our tests on nightly.

Serde is great, and I love that I can support both rustc-serialize and serde at the same time, but forcing users into serde is not fun at all. In fact, if people are only doing serialization / deserialization using bincode, then I would argue that serde is strictly worse (performance & ease of maintainability).

If the core team is going to stop supporting rustc-serialize and fully support serde, I'd understand, but it would be an incredibly disappointing decision, especially for those of us that already depend on it and would like bug fixes like this one.

@aturon

aturon commented Mar 12, 2016

cc me

@aturon

aturon commented Mar 12, 2016

@TyOverby We're in the process of making it more ergonomic to use serde on stable Rust -- basically support for custom derive on stable. I think this addresses your ease of maintainability concern?

It may also make it possible to have an evolving, out-of-tree rustc_serialize setup, but ... we really don't want to further splinter the ecosystem.

Could you elaborate on the performance worries with serde? Are they fundamental?

In terms of the core team stance, the jury is still out; we're trying to make stable derive ergonomic in general, but haven't yet worked closely with @erickt on the guts of serde itself. Meanwhile, the existing rustc_serialize is effectively frozen -- breaking changes to the internal compiler version aren't really an option, as far as I can see.

cc @erickt

@aturon

aturon commented Mar 12, 2016

Hm, I missed that the breakage here is for decoders only. Perhaps there is some wiggle room -- we should think about it. @alexcrichton, maybe we can discuss at the next libs meeting?

@alexcrichton
Contributor

Yes that sounds prudent, I've added an agenda item.

@TyOverby
Contributor Author

Hi @aturon, thanks for looking into this! My main concern about serde is the maintainability aspect. Right now I have roughly 200 lines of struct definitions in a game that I'm working on, and switching from #[derive(.., ..)] to handwritten visitors would make me reconsider using my own library.

I don't think the performance issue is fundamental, but I remember seeing in erickt's blog post that bincode on rustc-serialize was faster than on serde. I haven't done any benchmarks recently.

@canndrew

It would be really great if this could be fixed. Over at @maidsafe we're using rustc_serialize and bincode to serialise all messages between p2p network peers. Currently, anyone can crash another peer by sending them a malicious packet. The options for fixing this are:

  • Abandon rustc_serialize and switch to serde (a major pain in the ass since stuff like macros and custom derive aren't stable yet)
  • Manually implement Encodable, Decodable for all types which contain containers and can be sent over the network. (Also a major pain in the ass. Also error prone; someone will end up sticking a Vec inside a type which is #[derive(Decodable)])
  • Switch to an inefficient serialisation format with a decoder that doesn't preallocate buffers, e.g. cbor with its slow decoder (bad for performance and bandwidth usage).
  • Use this branch of bincode and be very careful about how we use the size limit in bincode's decode_from. (Error-prone; some types we send in containers are much larger than others, so we'd have to walk a tightrope between rejecting Vec<u8>s that we should be able to decode and being vulnerable to DoS when decoding, say, a Vec<Sha512Digest>.)

None of these solutions is very appealing, so I hope it's not considered too much effort to fix the root problem in rustc_serialize. I'd be happy to spend time submitting PRs to any crates.io crates that this breaks.

@erickt
Contributor

erickt commented Mar 16, 2016

@TyOverby / @canndrew: It really is unfortunate how painful it is to use Syntex on stable rust. I'm currently working on an RFC to enhance Rust's support for code generators that mitigates some of the issues we currently have that hopefully will cut down on most of the major pain points.

@TyOverby / @aturon: I haven't re-run performance tests in quite some time. Last time though, serde was slower at serialization (2143MB/s vs 3278MB/s), and slightly faster at deserialization (310MB/s vs 291MB/s). I didn't put too much effort into trying to optimize my bincode implementation, so there might be plenty of performance lying around.

@TyOverby / @canndrew: The problem with bumping rustc_serialize is that the rustc derive macro is tightly tied to the specific function signatures. To keep Rust's backwards-compatibility promise, we'd have to add #[derive(RustcSerialize2, RustcDeserialize2)] with your new signature.

@TyOverby
Contributor Author

@erickt Wouldn't bumping the major version fit with back-compat promises?

@sfackler

@TyOverby #[derive(RustcSerialize)] is implemented in rustc itself, so "bumping the major version" would consist of bumping to Rust 2.0.

@TyOverby
Contributor Author

@sfackler I don't think that #[derive(RustcDeserialize)] spits out any calls to read_seq and read_map. In fact, the only things that my PR would break are implementers of Decoder and custom sequences and maps, of which there are very few.

@alexcrichton
Contributor

Ok, the libs team (cc @rust-lang-nursery/libs) discussed this in triage yesterday, and the conclusion was that we're interested in fixing this but would like to avoid breaking changes here at all costs. Along those lines, it seems like there are two ways to fix this without breaking the API:

  1. Don't call with_capacity at all, i.e. "Remove calls to <Collection>::with_capacity()" (#122). This seems reasonable, as we could basically say "if you want speed, use serde". This library is already known to be slower, so perf-critical applications in theory aren't using it anyway.
  2. Do something with cmp::min to ensure that only a small amount is ever preallocated. For example, we could preallocate at most 1MB from any with_capacity call.

@TyOverby do you have an idea which solution may be the best to move forward with?
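Option 2 can be sketched in a few lines (the 1 MB budget below is the example figure from the comment above, not a decided constant):

```rust
use std::cmp;

// Cap speculative preallocation at a fixed byte budget (1 MB here, per the
// example above). The collection still grows to the real length as elements
// actually arrive, so honest data is unaffected; only the up-front
// reservation driven by the untrusted length is clamped.
const PREALLOC_BUDGET: usize = 1 << 20;

fn capped_capacity(claimed_len: usize, elem_size: usize) -> usize {
    cmp::min(claimed_len, PREALLOC_BUDGET / cmp::max(elem_size, 1))
}

fn main() {
    // Honest claim: preallocate exactly what was asked for.
    assert_eq!(capped_capacity(100, 8), 100);
    // Hostile claim: clamp to the budget (131072 eight-byte elements).
    assert_eq!(capped_capacity(usize::MAX, 8), 131072);
}
```

This keeps the existing Decoder signatures intact, which is what makes it a non-breaking fix.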

@TyOverby
Contributor Author

Thanks for meeting and taking this into consideration! I think that option 2 would be a very good compromise.

@alexcrichton
Contributor

Ok! Would you be up for sending a PR for that?

@alexcrichton
Contributor

or am I forgetting that it already exists...

@TyOverby
Contributor Author

I'll modify #122

@TyOverby
Contributor Author

Just kidding, here it is #150

@alexcrichton
Contributor

superseded by #150

@alexcrichton
Contributor

er, fixed in that PR

@dtolnay
Contributor

dtolnay commented Feb 13, 2017

The discussion seems to imply that Serde doesn't have this problem... but it does: serde-rs/serde#744.
