Change default decoder limits #295

Closed · methane opened this issue Mar 19, 2018 · 15 comments

methane commented Mar 19, 2018

The current default limits seem too large; they can enable a DoS attack.
Change the defaults to safer values.

The current plan is "about 1 MiB on an amd64 system":

  • max_bin_len: 1024*1024
  • max_str_len: 1024*1024
  • max_array_len: 1024*1024/8 (each pointer is 8 bytes)
  • max_map_len: 1024*1024/32 (8-byte key, hash, and value, plus some extra space)
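
For example, these limits can already be pinned explicitly with the existing keyword arguments (a sketch; the values mirror the proposed defaults and should be tuned to what your application actually trusts):

```python
import msgpack

# Proposed "about 1 MiB" caps, spelled out as explicit keyword arguments.
SAFE_LIMITS = dict(
    max_bin_len=1024 * 1024,         # 1 MiB of raw bytes
    max_str_len=1024 * 1024,         # 1 MiB of string data
    max_array_len=1024 * 1024 // 8,  # each list slot is an 8-byte pointer
    max_map_len=1024 * 1024 // 32,   # key, hash, value, plus extra space
)

unpacker = msgpack.Unpacker(raw=False, **SAFE_LIMITS)
obj = msgpack.unpackb(msgpack.packb([1, 2, 3]), raw=False, **SAFE_LIMITS)
```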
@codypiersall

Hmmm, is there any chance this can be reverted? I think this is breaking a lot of code that has legitimate uses for messages larger than 1 MiB. It certainly caused surprising crashes for me, and I guess I'm not alone. I do appreciate the DoS issue, though.

Maybe it would be possible to not allocate the memory for the string/bin/ext until all the data is present? Like store all the chunks of data in a boring old Python list until the lengths of all the components of the list add up to the length that was promised? I haven't looked at the implementation at all to know if that even makes sense or would be possible. Or maybe add a special case for when all the data to pack/unpack is already present?

These ideas are based on the understanding that the attack vector is something like: someone sends you data (maybe over a socket, maybe from a file) that says they are giving you a 2 GiB EXT message, but then they give you nothing else. So they can do minimal work and make you allocate tons of memory. If the data actually is present, that seems like less of an issue.
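
Something like this rough sketch, say (entirely hypothetical; read_exact and read_chunk are made up for illustration and are not part of msgpack):

```python
# Buffer chunks in a plain list and join them only once the promised
# length has actually arrived, so a bogus 2 GiB header alone can't
# force a huge up-front allocation.
def read_exact(read_chunk, promised_len):
    chunks, received = [], 0
    while received < promised_len:
        chunk = read_chunk()  # e.g. lambda: sock.recv(65536)
        if not chunk:
            raise EOFError("stream ended before the promised data arrived")
        chunks.append(chunk)
        received += len(chunk)
    return b"".join(chunks)  # single allocation, after all bytes exist
```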

On a related note, it would be really nice if the docs could be updated to the latest version that incorporates these changes; when reading the docs (which are currently for 0.5 on readthedocs if I'm looking at the right thing) it still shows 2**32 - 1 as the max size. It took me a bit to realize I was looking at the wrong version.

Thanks for the great library!


codypiersall commented Jan 22, 2019

I guess my idea about waiting for all the data doesn't work for array and map types though, since the data needs to be parsed to figure out when it's all present for those types.
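
For example, an array header only promises an element count, not a byte count, so the total size is unknown until the elements themselves are parsed:

```python
import msgpack

packed = msgpack.packb([b"x" * 10, b"y" * 1000])
# The first byte says "array of 2 elements" but nothing about total size;
# only parsing the elements reveals how many bytes the array occupies.
print(packed[:1], len(packed))
```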


jfolz commented Jan 22, 2019

A migration guide would certainly go a long way to help devs update their code for upcoming versions.

@codypiersall

One more idea: we could have a kwarg trusted_source which, if True, ignores the limits on max_bin_len and the rest. It would default to False. This would solve the issue where you have to pass a lot of keyword arguments; currently, if you want to indicate that you trust the other side, you have to set the bin/str/array/map/ext limits separately.
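
For illustration, the wrapper could look something like this (purely hypothetical; trusted_source does not exist in msgpack):

```python
import msgpack

# Hypothetical convenience wrapper around the real msgpack.unpackb.
def unpackb(packed, trusted_source=False, **kwargs):
    if trusted_source:
        # Lift every per-type cap at once instead of five separate kwargs.
        for name in ("max_bin_len", "max_str_len", "max_array_len",
                     "max_map_len", "max_ext_len"):
            kwargs.setdefault(name, 2**31 - 1)
    return msgpack.unpackb(packed, **kwargs)
```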

If you are interested in this, I would be willing to attempt a PR in the next couple weeks.


methane commented Jan 22, 2019

It certainly caused surprising crashes for me, and I guess I'm not alone.

It means many people aren't aware of the max_xxx_len options, even though they are
very important for avoiding DoS attacks.
That's why I think I must change the defaults to safe values.

Of course, I know there are many applications where DoS attacks are not a concern.
But the library can't know that; the application should specify it.

Maybe it would be possible to ...

It is not practical because it would kill performance, or introduce a lot of code (and bugs) I don't want to maintain.

On a related note, it would be really nice if the docs could be updated to the latest version that incorporates these changes;

I'm really sorry. I didn't notice the docs weren't being updated automatically.
I started a build manually and configured the webhook too; the docs should be updated soon.


methane commented Jan 22, 2019

A migration guide would certainly go a long way to help devs update their code for upcoming versions.

Sorry, I can't understand this English.
The docstrings and ChangeLog are already explicit; please don't ask me to write more English.
Writing English is harder for me than you might think, and it leads me toward burnout.


methane commented Jan 22, 2019

One more idea: we could have a kwarg trusted_source which, if True, ignores the limits on max_bin_len and the rest.

I dislike the idea. I don't want to add any more options unless strictly required,
and two knobs for one thing seems ugly too.


methane commented Jan 23, 2019

I came up with an idea to mitigate the pain.

  • Change all max_xxx_len defaults in unpackb to 0, meaning "auto".
  • When a limit is 0, unpackb uses len(packed_data) for max_xxx_len. In the case of max_map_len, it uses len(packed_data)/2.

For Unpacker, max_buffer_size can be used for "auto", but max_buffer_size is currently 0 (unlimited). Changing it would break backward compatibility again. Maybe I can change it in 1.0.
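
In plain Python, the derivation would be something like this sketch (the real implementation lives in the C/Cython code):

```python
def auto_limits(packed):
    # 0 means "auto": derive every cap from the input size itself, since a
    # container can never hold more elements than the input has bytes.
    n = len(packed)
    return dict(
        max_bin_len=n,
        max_str_len=n,
        max_array_len=n,
        max_map_len=n // 2,  # each map entry needs a key and a value
    )
```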

@codypiersall

Thanks for the feedback, @methane.

It is not practical because it would kill performance, or introduce a lot of code (and bugs) I don't want to maintain.

That's fair.

I don't want to add any more options unless strictly required. And two knob for one thing seems ugly too.

This is the same problem I was trying to solve. From my perspective, the various max_xxx_len options are five knobs for one thing: I wanted a way for msgpack to attempt to decode whatever it was given. The solution you came up with just above would work fine for me.

Ping me back if you don't have the time or desire to implement it and I will give it a try.

Side note: does it even make sense to have the max_xxx_len args on unpackb? As I understand it, it's not really susceptible to the same DoS, since you can always verify whether the needed number of bytes is present.


methane commented Jan 23, 2019

Side note: does it even make sense to have the max_xxx_len args on unpackb? As I understand it, it's not really susceptible to the same DoS, since you can always verify whether the needed number of bytes is present.

After "auto" is implemented, the removal can be considered.
But it should be in 1.0, not 0.6.1.

@codypiersall

Makes sense to me.


fake-name commented Jan 28, 2019

Has anyone ever had a DoS attack that this even helps mitigate? Is anyone exposing messagepack interfaces to untrusted inputs? This sounds like a solution in search of a problem.

If you're going to add an interface for handling untrusted inputs, don't just overwrite the current API with one that will no longer work in a lot of cases. Adding a separate unpacker like UnpackUntrusted or similar would seem much clearer to me.

And two knob for one thing seems ugly too.

Right now, you have four knobs I have to set. And I'm in a context where both ends are my code, the transport is SSL-wrapped, and a DoS is not a concern at all. The whole "unpack limit" thing is, in general, silly for my use case (and probably most others).

Honestly, I'd be most in support of just getting rid of it entirely. If people need to worry about DoS attacks, they can just look at the size of the string they're shoving into the unpacker.

@methane
Copy link
Member Author

methane commented Jan 28, 2019

Has anyone ever had a DoS attack that this even helps mitigate? Is anyone exposing messagepack interfaces to untrusted inputs? This sounds like a solution in search of a problem.

RPC is one of the use cases msgpack is designed for (see this article).
And Fluentd is the most successful project with a msgpack API.

Right now, you have four knobs I have to set.

Four knobs for four things; they're orthogonal.
And, again, you don't "have to": I added "auto limit" in #342. You can now use just one option for Unpacker, and zero options for unpackb.
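
For example (a sketch; exact defaults may differ between versions):

```python
import msgpack

packed = msgpack.packb({"key": [1, 2, 3]})

# unpackb: no knobs needed; limits are derived from len(packed).
obj = msgpack.unpackb(packed, raw=False)

# Unpacker: a single knob bounds the internal buffer and, through it,
# how much one message can make the decoder allocate.
unpacker = msgpack.Unpacker(max_buffer_size=1024 * 1024, raw=False)
unpacker.feed(packed)
for item in unpacker:
    print(item)
```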


methane commented Jan 28, 2019

If people need to worry about DoS attacks, they can just look at the size of the string they're shoving into the unpacker.

Only five bytes of input (e.g. b'\xdd\xff\xff\xff\xff') can consume 32 GB of RAM.
Once CPython provides a public API for preallocated dicts, five bytes (e.g. b'\xdf\xff\xff\xff\xff') would consume 96 GB of RAM.
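
To make the arithmetic concrete:

```python
# 0xdd is the msgpack array32 tag; the next four bytes are a big-endian
# element count. A decoder that trusts the header would allocate the
# whole list before a single element has arrived.
payload = b"\xdd\xff\xff\xff\xff"
count = int.from_bytes(payload[1:5], "big")  # 4,294,967,295 elements
print(count * 8 / 2**30)                     # ~32 GiB of 8-byte pointers
```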

@fake-name

Only five bytes of input (e.g. b'\xdd\xff\xff\xff\xff') can consume 32 GB of RAM.

Well, huh. Never mind, then.
