
list_regions sends down CMAs that don't have census data #50

Closed
mountainMath opened this issue Aug 15, 2017 · 9 comments

Comments

@mountainMath
Owner

I noticed that the list_regions call sends down data for some CMAs that don't actually have census data attached to them. This is fixed on the server now; it will take a day for the server-side cache to expire, so we should all make sure to refresh the cached regions in 24 hours.

@dshkol
Collaborator

dshkol commented Aug 15, 2017

Does it make sense to add a function to wipe caches? Or is that taken care of by forcing the load function to redownload the data? It's kind of a power-user need.

@atheriel
Collaborator

The existing use_cache = FALSE parameter will do just that. I think I mentioned it in the documentation, too.
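For reference, a minimal sketch of what that looks like from the user's side (the dataset identifier `"CA16"` is illustrative; this assumes list_regions accepts a dataset argument alongside use_cache):

```r
library(cancensus)

# Bypass the local cache and force a fresh download of the region list.
# use_cache = FALSE is the parameter discussed above.
regions <- list_regions("CA16", use_cache = FALSE)
```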

@atheriel
Collaborator

(So I think we can close this.)

@mountainMath
Owner Author

I am starting to think that we need to strike a balance between caching and making sure data is up to date. For example, with a cached list_datasets call the user may never find out about new datasets that we add. Even if that list is slow-changing, it could cause problems. Also, list_vectors will likely undergo changes as I clean up the server-side data.

I see three reasons we do caching:

  1. to avoid unnecessarily spending API points (the reason for the limit on API points is point 2)
  2. to reduce server load
  3. to allow people to run and refine their analysis offline

Point 1 only applies to the load_data calls, and I think letting the user decide when to reload the data is good. For the other calls it would be just fine if we only made calls, say, once a day, so that the user doesn't have to worry about refinements to the vector data. Or we could use the cache-expiry headers from the HTTP call to determine how long cached objects stay fresh; that way we can increase that time server-side at a later stage, once the calls become stable.
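The cache-expiry-header idea could be sketched roughly like this (pure header parsing, no cancensus internals assumed; `parse_max_age` is a hypothetical helper):

```r
# Sketch: read a cache lifetime out of an HTTP Cache-Control header,
# so the server can lengthen the expiry later once the calls stabilize.
parse_max_age <- function(cache_control, default = 86400) {
  m <- regmatches(cache_control, regexpr("max-age=[0-9]+", cache_control))
  if (length(m) == 0) return(default)  # no directive: fall back to one day
  as.numeric(sub("max-age=", "", m))
}

parse_max_age("public, max-age=86400")  # one day, in seconds
```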

@atheriel
Collaborator

One way to do this would be to cache the caching information from the server (e.g. the ETag) along with the object, and then send an If-None-Match request to the server. I believe this is what browsers do, but I'd have to think about how to do it in R.
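In httr terms that revalidation might look roughly like this (a sketch only; `refresh_if_changed`, the cached ETag, and the cache handling are placeholders, not existing package code):

```r
library(httr)

# Sketch: revalidate a cached object against the server using its stored
# ETag. An HTTP 304 response means the cached copy is still current.
refresh_if_changed <- function(url, cached_etag, cached_object) {
  resp <- GET(url, add_headers(`If-None-Match` = cached_etag))
  if (status_code(resp) == 304L) {
    cached_object              # not modified: keep the cached copy
  } else {
    content(resp)              # modified: replace the cache and store
  }                            # the new headers(resp)[["etag"]]
}
```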

@atheriel
Collaborator

Alternatively, we could store the cache timestamp and force an update if it gets too old.
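The timestamp-based version is even simpler. A sketch, assuming cached objects live as files on disk (`cache_is_stale` is hypothetical, not existing package code):

```r
# Sketch: treat a cached file as stale once it is older than max_age seconds.
cache_is_stale <- function(path, max_age = 86400) {
  if (!file.exists(path)) return(TRUE)   # nothing cached yet
  age <- as.numeric(difftime(Sys.time(), file.mtime(path), units = "secs"))
  age > max_age
}
```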

@dshkol
Collaborator

dshkol commented Aug 15, 2017

That could work. You could prompt the user with a note that their cached data is old and give them the option to reload, but that might be overkill.

@atheriel
Collaborator

Actually, it's likely a simple warning would suffice, and be the least intrusive.

@atheriel
Collaborator

This should be closed now that #54 is merged. If we want to have more discussion of cache invalidation, it should be in a separate issue I think.
