Improve performance of compute_ranks country matching #282 #283

jaredlockhart · 2016-05-02T23:02:30Z

Rather than compare distances to every country, attempt to find a country which contains the point and fail fast. Otherwise compute distances and find the nearest one.

I compared performance between doing this in memory and letting the database do it using st_contains, and the performance with the db was every so slightly worse than the in memory case, but only slightly. However, reducing load on the db is a good thing, so I think doing it in memory should be fine. This is roughly 4 times faster than simply sorting by all distances and doing no containment checks, according to my local benchmarking.

jaredlockhart · 2016-05-02T23:02:46Z

@hannosch r?

coveralls · 2016-05-03T00:44:29Z

Coverage remained the same at 100.0% when pulling e6a52b2 on 282 into 5fa2be8 on master.

hannosch · 2016-05-03T09:47:05Z

@jaredkerim My comments are in #285 - code speaks more clearly :)

And yeah, the DB would have the additional R-Tree index, which is missing in the in-memory case. But it does add the extra network latency to talk to the DB each time. If the two are close in runtime, doing it locally is definitely better.

Improve performance of compute_ranks country matching #282

e6a52b2

jaredlockhart merged commit 8edc7f9 into master May 3, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve performance of compute_ranks country matching #282 #283

Improve performance of compute_ranks country matching #282 #283

jaredlockhart commented May 2, 2016

jaredlockhart commented May 2, 2016

coveralls commented May 3, 2016

hannosch commented May 3, 2016

Improve performance of compute_ranks country matching #282 #283

Improve performance of compute_ranks country matching #282 #283

Conversation

jaredlockhart commented May 2, 2016

jaredlockhart commented May 2, 2016

coveralls commented May 3, 2016

hannosch commented May 3, 2016