Make Vectors.most_similar super fast by loading from cache #88

ines · 2019-11-21T00:55:27Z

A more reliable solution, and also how we're going to do it for the updated demo 🎉 Using the 06_precompute_cache.py script, nearest-neighbor queries can be pre-computed and saved with the component. This makes the data larger, but makes most_similar super fast. If a cache is available, it's loaded from disk/bytes and used. If not, most_similar falls back to using Vectors.most_similar.

Resolves #86.
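
To illustrate the cache-then-fallback behavior described above, here is a minimal pure-Python sketch. It is not the code from this PR: the function names (`precompute_cache`, `brute_force_most_similar`, `most_similar`), the dict-based cache layout, and the toy vectors are all hypothetical, and the brute-force cosine search merely stands in for the `Vectors.most_similar` fallback. The sketch also assumes the query key is excluded from its own neighbors, which may differ from the actual implementation.

```python
import math
from typing import Dict, List, Optional, Tuple

Vector = List[float]
Neighbors = List[Tuple[str, float]]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def brute_force_most_similar(vectors: Dict[str, Vector], key: str, n: int) -> Neighbors:
    """Slow path: score every other key against the query and keep the top n.

    Stand-in for the real fallback; excluding the query key is an assumption.
    """
    query = vectors[key]
    scored = [(other, cosine(query, vec)) for other, vec in vectors.items() if other != key]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:n]


def precompute_cache(vectors: Dict[str, Vector], n: int) -> Dict[str, Neighbors]:
    """Run the slow query once per key and store the results (hypothetical layout)."""
    return {key: brute_force_most_similar(vectors, key, n) for key in vectors}


def most_similar(
    vectors: Dict[str, Vector],
    key: str,
    n: int,
    cache: Optional[Dict[str, Neighbors]] = None,
) -> Neighbors:
    """Use the cache if it can answer the query, otherwise fall back to the slow path."""
    if cache is not None and key in cache and len(cache[key]) >= n:
        return cache[key][:n]
    return brute_force_most_similar(vectors, key, n)


# Toy data, invented for illustration only.
vectors = {
    "duck|NOUN": [1.0, 0.0],
    "goose|NOUN": [0.9, 0.1],
    "car|NOUN": [0.0, 1.0],
}
cache = precompute_cache(vectors, n=2)
cached_result = most_similar(vectors, "duck|NOUN", 2, cache=cache)
fallback_result = most_similar(vectors, "duck|NOUN", 2)
```

The trade-off matches the description: the cache costs one stored neighbor list per key, in exchange for answering a query with a dictionary lookup instead of a scan over every vector. A request for more neighbors than were cached falls through to the slow path in this sketch.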

ines and others added 15 commits November 20, 2019 16:52

Update script and add docs

27fe3fa

Fix typo

2c9699d

Support cache

d18a80d

Fix option shortcut

4e8d85f

Update caching logic

a2d4238

Fix precompute_cache script

caaf095

Format

b97f0fd

Fix cache

d1a4562

Update 06_precompute_cache.py

4a52c69

Add cache to test model

b80fbb0

Don't zero out set self scores

e3690e1

Add cache tests

2d106eb

Update README.md [ci skip]

ce0b233

Update README.md [ci skip]

a69d262

Increment version [ci skip]

f9a2465

ines marked this pull request as ready for review November 21, 2019 01:52

Fix types [ci skip]

9e9a8d3

honnibal merged commit 0c0965f into master Nov 21, 2019

ines deleted the feature/cache branch November 21, 2019 15:51

svlandeg added the demo (Online demo) and enhancement (Feature requests and improvements) labels Apr 14, 2020
