Skip to content

yagays/wikipedia_es_similarity

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Text Similarity Search by using Elasticsearch

Preparation

$ wget https://dumps.wikimedia.org/other/cirrussearch/20190826/jawiki-20190826-cirrussearch-content.json.gz

$ wget https://github.com/singletongue/WikiEntVec/releases/download/20190520/jawiki.word_vectors.200d.txt.bz2
$ bunzip2 jawiki.word_vectors.200d.txt.bz2
$ docker-compose up
$ python build_index_wikipedia.py

Text Similarity Search

$ python search.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages