Euclidean metric is much slower than Angular? #41

ronxin · 2014-12-21T03:32:04Z

I am running an experiment with 1.2 million nodes, each represented by a 500-dimensional vector. On the same machine, building a tree takes about 1 minute with Angular distance but about 1 hour with Euclidean. Is this difference caused by Euclidean distance having to calculate an extra offset value, or by something else?

erikbern · 2014-12-30T08:45:38Z

I never used Euclidean distance much, so the code isn't very optimized. There's some hacky stuff in create_split that's really slow for Euclidean, and it's probably not super hard to optimize if you have some time to spend on it: https://github.com/spotify/annoy/blob/master/src/annoylib.h#L193
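
For readers unfamiliar with the "extra offset value" mentioned in the question, here is a minimal sketch of the two kinds of split. It is not the code in annoylib.h, and the function names (angular_split, euclidean_split, side) are made up for illustration. It assumes the usual geometry: an angular split is a hyperplane through the origin, while a Euclidean split is a hyperplane through the midpoint of two chosen points, which requires one extra offset term.

```python
def angular_split(p, q):
    """Hyperplane through the origin separating the directions of p and q."""
    def unit(v):
        norm = sum(x * x for x in v) ** 0.5
        return [x / norm for x in v]

    pu, qu = unit(p), unit(q)
    normal = [a - b for a, b in zip(pu, qu)]
    return normal, 0.0  # angular splits need no offset


def euclidean_split(p, q):
    """Hyperplane equidistant from p and q (passes through their midpoint)."""
    normal = [a - b for a, b in zip(p, q)]
    midpoint = [(a + b) / 2 for a, b in zip(p, q)]
    # The offset shifts the hyperplane so that it contains the midpoint.
    offset = -sum(n * m for n, m in zip(normal, midpoint))
    return normal, offset


def side(normal, offset, v):
    """Return True if v lies on the positive side of the hyperplane."""
    return sum(n * x for n, x in zip(normal, v)) + offset > 0
```

In this sketch the offset costs one additional dot product per split, which is linear in the vector dimension.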

erikbern · 2015-05-25T22:03:44Z

See #65, which makes Euclidean indexes substantially faster.

erikbern closed this as completed May 25, 2015
