Skip to content

Commit

Permalink
Update experiments-msmarco-passage.md (#38)
Browse files Browse the repository at this point in the history
We use huggingface's model in the pipeline so can model prep stage. Also, standardized naming convention of the model.
  • Loading branch information
ronakice committed May 29, 2020
1 parent 591e7ff commit 9b5eca6
Showing 1 changed file with 1 addition and 12 deletions.
13 changes: 1 addition & 12 deletions docs/experiments-msmarco-passage.md
Original file line number Diff line number Diff line change
Expand Up @@ -57,17 +57,6 @@ wget https://git.uwaterloo.ca/jimmylin/anserini-indexes/raw/master/index-msmarco
tar xvfz indexes/index-msmarco-passage-20191117-0ed488.tar.gz -C indexes
```

## Model Prep

Let's download and extract monoBERT into `models`:

```
wget https://www.dropbox.com/s/jr0hpksboh7pa48/monobert_msmarco_large.zip -P models
unzip models/monobert_msmarco_large.zip -d models
```

While running the re-ranking script with the monoT5 model, it is automatically downloaded from Google Cloud Storage.

Now, we can begin with re-ranking the set.

## Re-Ranking with monoBERT
Expand All @@ -77,7 +66,7 @@ First, lets evaluate using monoBERT!
```
python -um pygaggle.run.evaluate_passage_ranker --split dev \
--method seq_class_transformer \
--model-name-or-path castorini/monobert_large_msmarco \
--model-name-or-path castorini/monobert-large-msmarco \
--dataset data/msmarco_ans_small/ \
--index-dir indexes/index-msmarco-passage-20191117-0ed488 \
--task msmarco \
Expand Down

0 comments on commit 9b5eca6

Please sign in to comment.