
--big-value-split-threshold Details #26

Closed
jamescarr opened this issue Jan 28, 2015 · 4 comments

Comments

@jamescarr

I was looking for some more details on the --big-value-split-threshold option.

  • What is the value of N? The size of a value in bytes?
  • How can I safely turn the option on? I just tried it in production and we received A LOT of timeouts for every get attempt over the threshold. I got too scared and backed out.
@alikhtarov
Contributor

Every value over the size of N will be split into smaller chunks of size N (the last chunk may be smaller than N). The original key becomes the 'index' key, which stores a random suffix and the total number of chunks; the chunks are stored at modified keys that include the same random suffix and the chunk id.

The random suffix is needed for consistency - you can simply remove the original key and the chunks will be 'deleted' - there's no way to access them without knowing the random suffix. It also takes care of simultaneous sets, since only one key will win the race, and only its random suffix will be valid.
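The two paragraphs above can be sketched in a few lines. This is a hypothetical model for illustration only; the key layout and `INDEX:` prefix are made up and are not mcrouter's actual wire format:

```python
import os

# Toy model of big-value splitting: the index key stores a random suffix
# and the chunk count; chunk keys embed that same suffix.

def big_value_set(store, key, value, threshold):
    """Split value into chunks of at most `threshold` bytes."""
    if len(value) <= threshold:
        store[key] = value
        return
    suffix = os.urandom(4).hex()  # random suffix for consistency
    chunks = [value[i:i + threshold] for i in range(0, len(value), threshold)]
    for i, chunk in enumerate(chunks):
        store[f"{key}:{suffix}:{i}"] = chunk       # chunk keys carry the suffix
    store[key] = f"INDEX:{suffix}:{len(chunks)}"   # original key becomes the index

def big_value_get(store, key):
    v = store.get(key)
    if v is None or not v.startswith("INDEX:"):
        return v
    _, suffix, count = v.split(":")
    parts = [store.get(f"{key}:{suffix}:{i}") for i in range(int(count))]
    if any(p is None for p in parts):
        return None  # a missing chunk invalidates the whole value
    return "".join(parts)

store = {}
big_value_set(store, "blogpost", "hello big value world", threshold=5)
assert big_value_get(store, "blogpost") == "hello big value world"

# Deleting only the index key makes the chunks unreachable, because the
# random suffix needed to find them is gone:
del store["blogpost"]
assert big_value_get(store, "blogpost") is None
```

The same suffix mechanism resolves racing sets: whichever set wins the race on the index key determines the only suffix readers will ever look up.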

The way we deployed it on a live system was in stages. First we deployed reads only: if you set N to some very large value (like 1000000000), the logic is still enabled on the read path but will not actually split any values. This makes sure that all clients can understand split values once we start writing them.
The second stage was lowering N to actually start splitting values. The exact value we use is 524288 (512 KiB).

Note that all chunks are sent to the same memcache box as the original key would be, so you're still transferring the same amount of data from a single memcache box to the client. If you transfer huge values this way, you still have to wait for the individual chunks to arrive serially, which might explain the timeouts you saw. Can you share the size of the values you're setting/fetching and the value of N you tried?
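For a sense of scale, here is the chunk arithmetic using the N = 524288 mentioned above and the 6.5 MB value that comes up later in this thread (assuming 6.5 MB means 6.5 × 1024 × 1024 bytes):

```python
import math

# Every chunk except possibly the last is exactly N bytes, and all
# chunks live on the same memcache box as the original key, so they
# are fetched serially from that one box.

N = 524288                            # 512 KiB threshold from above
value_size = int(6.5 * 1024 * 1024)   # the 6.5 MB value from this thread

chunks = math.ceil(value_size / N)
print(chunks)  # 13
```

Thirteen serial round trips per get, all against one box, is why very large values can still time out even after splitting.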

@jamescarr
Author

Thanks, this clears it all up.

It turned out my timeouts were unrelated.

@jamescarr
Author

If anyone wants some fun: it turned out that the cached value for the view context of our blog posts was weighing in at 6.5 MB. Each. This got rejected, but I think once we turned on big value splitting it overloaded our cache servers. ;-)

@marko-jovicic

The random suffix is needed for consistency - you can simply remove the original key and the chunks will be 'deleted' - there's no way to access them without knowing the random suffix. It also takes care of simultaneous sets, since only one key will win the race, and only its random suffix will be valid.

I have a few questions about the --big-value-split-threshold option:

  1. When a key is deleted, I found that the other related parts/chunks are not deleted. Is this expected behaviour or not?
  2. In the case of simultaneous sets, the last key that is set references its own parts/chunks, which is OK. However, the chunks related to previous sets are not deleted; shouldn't they be?
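Both observations match the design described earlier in the thread: chunks are never explicitly deleted, they just become unreachable once the index key's suffix no longer matches. A hypothetical sketch (again, not mcrouter's actual key format) of why leftover chunks are expected:

```python
import os

# Each set picks a fresh random suffix, writes new chunk keys, then
# overwrites the index key. Chunks from the previous set stay in the
# cache, but the only way to find a chunk is through the suffix stored
# in the index key, so reads never see the stale ones.

def split_set(store, key, value, threshold):
    suffix = os.urandom(4).hex()
    chunks = [value[i:i + threshold] for i in range(0, len(value), threshold)]
    for i, c in enumerate(chunks):
        store[f"{key}:{suffix}:{i}"] = c
    store[key] = f"INDEX:{suffix}:{len(chunks)}"
    return suffix

store = {}
old = split_set(store, "k", "first-value", 5)   # 3 chunks
new = split_set(store, "k", "second-value", 5)  # 3 new chunks

# Old chunk keys are still physically present...
assert f"k:{old}:0" in store
# ...but the index key only names the new suffix, so reads never touch them.
assert store["k"] == f"INDEX:{new}:3"
```

Presumably the orphaned chunks are simply left to expire or be evicted by memcached like any other unused entries, rather than being cleaned up by mcrouter.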

mcrouter is version 1.0 (built using the Dockerfile). Memcached version is 1.4.13.
mcrouter is started with this command:

# 5 bytes just in test purposes
mcrouter --big-value-split-threshold=5  --config-str='{"pools":{"A":{"servers":["127.0.0.1:5001"]}},"route":"PoolRoute|A"}' -p 5000 
