Skip to content

xwhan/pylucene-bm25

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

Test the default pylucene bm25 for open-domain QA (Natural Questions)

Only tested for java-8-openjdk-amd64

export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64

https://lucene.apache.org/pylucene/install.html

Construct the corpus index

python indexer.py --corpus-path /data/xwhan/data/nq-dpr/corpus/psgs_w100.tsv --index-path /home/xwhan/data/nq-dpr/corpus/lucene_index 

testing the retrieval

python retriever.py 

Answer Recall

Method Top20 Top 50 Top 100
pylucene 61.9 71.0 76.5
DPR 78.4 - 85.4

About

Lucene open-domain QA retrieval in python

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages