allow unpickling failures to be treated as cache misses #96

slingamn · 2012-08-17T01:48:02Z

Consider the following scenario:

One version of an application stores a Python object in memcached via pylibmc, which automatically pickles it.
A new version of the application which no longer has the object's class is deployed.
The new application tries to retrieve the object, resulting in an exception (for example, an AttributeError).

One natural response to this would be to treat the cache retrieval as a miss. However, implementing this against the current pylibmc behavior is somewhat awkward and difficult. Every call to _pylibmc.Client.get must be wrapped in a try-except that distinguishes _pylibmc.MemcachedError (problems with remote memcached) from other exception types (unpickling failures).

Would it make sense to submit a patch to pylibmc adding a configuration switch such that if it is enabled, _pylibmc.Client.get responds to an unpickling error by suppressing it and returning None, as in the case of a miss?

The text was updated successfully, but these errors were encountered:

bukzor · 2012-08-17T16:57:48Z

This makes sense to me.
+1

lericson · 2012-08-19T14:36:29Z

I'm wondering if a subclass is maybe a better idea?

bukzor · 2012-08-19T18:04:17Z

How would you go about subclassing in order to unambiguously turn unpickling issues into cache misses?

slingamn · 2012-08-19T20:39:23Z

The problem I ran into is that since the unpickling code is inside the C module, it's sort of inaccessible to customization.

If the issue is that another configuration variable would be too crufty, perhaps pylibmc could define a special _pylibmc.SerializationException, then catch exceptions thrown during unpickling and wrap them with SerializationException? I think the core issue here is unambiguously communicating that there was a pickling problem, rather than having to rely on a blanket except Exception:.

bukzor · 2012-09-27T23:26:11Z

bump

lericson · 2012-10-02T00:14:32Z

I'm not going to wrap all exceptions in a parent exception because that sucks.

bukzor · 2012-10-02T06:24:36Z

Could you address my question please?

How would you go about subclassing in order to unambiguously turn unpickling issues into cache misses?

(Disclaimer: This is not a rant, I'm just trying to be very clear, and avoid confusion.)

I believe we could wrap get with a subclass method which catches all exceptions and simulates a cache miss, but I'd be worried about catching unrelated memcache errors in that case. We'd have to catch all exceptions because unpickling errors don't generate a particular exception type. In the general case it's just calling a function with the wrong data, so you'd often get TypeError, KeyError, or AttributeError (which we're currently getting in our test environment).

Even if we disregarded that issue, get_multi presents a separate issue: an unpickling failure for one key causes no value at all (a NULL pointer) to be propagated back. Even if we wrapped it with a bare try: except:, we wouldn't be able to generate the "correct" return value in any reasonable way. For example if we had a get_multi([1,2,3]) and the value for 2 resulted in an unpickling error, we'd still want to return the key/value pairs for 1 and 3.

So I wonder how we'd do this with subclassing.

I think we solve the above issues in a fairly neat way if we simply refactor _PylibMC_parse_memcached_value and/or _PylibMC_Unpickle to be override-able class methods. That way we know exactly which errors we're suppressing, and can get the desired behavior from get_multi in a natural way. Incidentally, it would also give us a good spot to send messages to our python-based logging system when we have this type of cache miss, so that we can know if they get out of hand.

Would you accept this kind of patch? Can you give tips on the best way to do it?

bukzor · 2012-10-02T06:37:07Z

Incidentally, exposing overridable methods solves #75 in a much neater way: people that prefer json can override their client to use json for serialization, assuming that you similarly refactor _PylibMC_SerializeValue to be a Client method.

bukzor mentioned this issue Oct 3, 2012

Allow subclasses to override the deserialization logic #102

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

allow unpickling failures to be treated as cache misses #96

allow unpickling failures to be treated as cache misses #96

slingamn commented Aug 17, 2012

bukzor commented Aug 17, 2012

lericson commented Aug 19, 2012

bukzor commented Aug 19, 2012

slingamn commented Aug 19, 2012

bukzor commented Sep 27, 2012

lericson commented Oct 2, 2012

bukzor commented Oct 2, 2012

bukzor commented Oct 2, 2012

allow unpickling failures to be treated as cache misses #96

allow unpickling failures to be treated as cache misses #96

Comments

slingamn commented Aug 17, 2012

bukzor commented Aug 17, 2012

lericson commented Aug 19, 2012

bukzor commented Aug 19, 2012

slingamn commented Aug 19, 2012

bukzor commented Sep 27, 2012

lericson commented Oct 2, 2012

bukzor commented Oct 2, 2012

bukzor commented Oct 2, 2012