Skip to content
Permalink
Browse files

Add alternative faster script implemented in Java and add replication…

… log for doc2query (#780)
  • Loading branch information...
ronakice authored and lintool committed Aug 13, 2019
1 parent 3a7fc19 commit 44a47a7d0c13c8d7c5d9204bedc9f67e026e15dc
Showing with 9 additions and 0 deletions.
  1. +9 −0 docs/experiments-doc2query.md
@@ -54,6 +54,14 @@ python ./src/main/python/msmarco/retrieve.py --hits 1000 --index msmarco-passage
--qid_queries msmarco-passage/queries.dev.small.tsv --output msmarco-passage/run.dev.small.expanded-topk10.tsv
```

Alternatively, we can run the same script implemented in Java, which is a bit faster:

```
./target/appassembler/bin/SearchMsmarco -hits 1000 -threads 1 \
-index msmarco-passage/lucene-index-msmarco-expanded-topk10 -qid_queries msmarco-passage/queries.dev.small.tsv \
-output msmarco-passage/run.dev.small.expanded-topk10.tsv
```

Finally, to evaluate:

```
@@ -153,3 +161,4 @@ TREC CAR corpus v2.0 in this experiment instead of corpus v1.5 used in the paper
## Replication Log

+ Results replicated by [@justram](https://github.com/justram) on 2019-08-09 (commit [`5f098f`](https://github.com/justram/Anserini/commit/5f098f23527611bca1224149bc2d155adce1e48))
+ Results replicated by [@ronakice](https://github.com/ronakice) on 2019-08-13 (commit [`5b29d16`](https://github.com/castorini/anserini/commit/5b29d1654abc5e8a014c2230da990ab2f91fb340))

0 comments on commit 44a47a7

Please sign in to comment.
You can’t perform that action at this time.