Mark IDC fetched records as read_only #274

dylanahsmith · 2016-06-08T17:39:47Z

This was attempted before in #85, which gives a good explanation for the problem:

IdentityCache is a forever inconsistent datastore. It is a duplication of state across two different storage backends, and unless we somehow confirmed commit in both places before acknowledging, we can't guarantee that the two are in sync. The ruby process could die before it has a chance to issue the memcache DEL, the request could get lost if the connection is interrupted or any number of other things could prevent IDC from reflecting whats truly in the database.

This being the case, it is sketchy to write something you find in IDC back to the database, because you might be writing out of date data to the database. This is already the case with a MySQL insert in Rails cause you had to have read that data a finite amount of time ago, but that window is usually much smaller, and we have locking primitives and transactions to manage this. With IDC, we don't have that, and we don't have any way of knowing if the data is out of date without asking the database, which entirely defeats the purpose of IDC.

So, I suggest we return only readonly records from IDC. If you are gonna write a record, first fetch its values from the DB.

One thing I think we should do differently from the pull request, is that records returned from IDC methods when IdentityCache.should_cache? returns false should not be marked as read-only. The problem with marking those records as read-only is that it prevents us from sharing code to use for reading and writing, e.g. calculating some aggregation for a confirmation page which should be consistent with the calculation for the operation itself.

On the other hand, records fetched from the database on a cache miss when IdentityCache.should_cache? returns true should be marked as read-only. This way code that does the wrong thing will fail deterministically, rather than passing on a cache miss and failing on a cache hit.

We might also want to consider making this behaviour configurable, so that we don't get blocked on Shopify needed to be updated to work with this new behaviour, which was probably the main reason that the previous PR didn't get finished and merged. That way we can at least have identity cache do the right thing by default on the next release with breaking changes.

The text was updated successfully, but these errors were encountered:

airhorns · 2016-06-08T17:50:21Z

All excellent points, I agree! It's tempting to let records from the DB miss path be writable since they are technically up to date but I think determinism and sanity is sooo much more important. Good call.

camilo · 2016-06-09T20:05:57Z

I think @daniellaniyo can do this after she finishes with the unchain-ability of resutls

eugeneius · 2016-06-23T05:53:53Z

One thing I think we should do differently from the pull request, is that records returned from IDC methods when IdentityCache.should_cache? returns false should not be marked as read-only.

In a test suite that uses transactions for isolation, IdentityCache.should_use_cache? is always false, as there's always at least one transaction open when your code is running. If we only mark records as readonly when IdentityCache.should_use_cache? is true, then code that updates a record fetched from IdentityCache could work fine in tests but blow up in production. 💥

dylanahsmith · 2016-06-23T07:51:10Z

Thanks for pointing that out, since we have a IdentityCache.should_use_cache? monkey patch in Shopify which not only checks multiple active database connections for open transactions, but also ignores the transactional fixture transaction. Perhaps we should bring over that check into this repo so tests behave like production by default.

eugeneius · 2016-06-24T10:21:39Z

That would be great! Ping me if you want me to test it first - we also use multiple databases in production.

camilo · 2016-08-31T03:54:41Z

@daniellaniyo I think we have a branch where @eugeneius can test now right?

daniellaniyo · 2016-08-31T19:17:14Z

@camilo Yes! Actually the tests could be carried out on the current master branch and setting IdentityCache.fetch_read_only_records to true, or using the IdentityCache.with_fetch_read_only_records block around specific sections.

Also note that we haven't included the IdentityCache.should_use_cache? monkey patch from Shopify yet.

eugeneius · 2016-09-04T23:21:42Z

Thanks @camilo / @daniellaniyo - but the IdentityCache.should_use_cache? monkey patch is actually the part I was most interested in testing!

I was working on a similar problem recently (preventing background jobs from being enqueued inside a transaction) - I've shared the approach I took there in #293.

eugeneius · 2016-09-05T01:58:44Z

I just ran our application's test suite against master with my patch from #293 applied.

There were a bunch of failures, but it looks like they're all legitimate cases of fetching a record from the cache and updating it. I have some work to do before I can think about testing in production 😬

camilo · 2017-03-05T17:50:12Z

I will close this issue, since this is the default behaviour since a while ago.

dylanahsmith mentioned this issue Jul 7, 2016

Deprecate setting the inverse active record association on cache hit. #279

Merged

daniellaniyo mentioned this issue Jul 28, 2016

Mark IDC fetched records as read_only #282

Merged

eugeneius mentioned this issue Sep 4, 2016

Support multiple databases and transactional tests in IdentityCache.should_use_cache? #293

Merged

camilo closed this as completed Mar 5, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mark IDC fetched records as read_only #274

Mark IDC fetched records as read_only #274

dylanahsmith commented Jun 8, 2016

airhorns commented Jun 8, 2016

camilo commented Jun 9, 2016

eugeneius commented Jun 23, 2016

dylanahsmith commented Jun 23, 2016

eugeneius commented Jun 24, 2016

camilo commented Aug 31, 2016

daniellaniyo commented Aug 31, 2016

eugeneius commented Sep 4, 2016

eugeneius commented Sep 5, 2016

camilo commented Mar 5, 2017

Mark IDC fetched records as read_only #274

Mark IDC fetched records as read_only #274

Comments

dylanahsmith commented Jun 8, 2016

airhorns commented Jun 8, 2016

camilo commented Jun 9, 2016

eugeneius commented Jun 23, 2016

dylanahsmith commented Jun 23, 2016

eugeneius commented Jun 24, 2016

camilo commented Aug 31, 2016

daniellaniyo commented Aug 31, 2016

eugeneius commented Sep 4, 2016

eugeneius commented Sep 5, 2016

camilo commented Mar 5, 2017