This is the code for the Two-Step SPLADE paper (https://www.springerprofessional.de/en/two-step-splade-simple-efficient-and-effective-approximation-of-/26866450). There is two ways of using this code: Either you can replicate our results using our precomputed indexes or you can reproduce it by generating the indexes.
- Download all index and query files from:
- Install pisa (from our folder)
- Run the desired table line (method #) for each dataset (example run_line)
- Aggregate results
- Install all submodules (including our pisa)
- Generate anserini files with run_splade.sh
- Count tokens with token_count.py
- Index files with index.sh
- Convert to pisa with convert_pisa.sh
- Generate reorder file with reorder.py
- Generate reorder index with reorder.sh
- Run the desired table line (method #) for each dataset (example run_line)
- Aggregate results
Feel free to create a new issue or to send me an email directly (cadurosar@gmail.com)
@inproceedings{lassance2024two,
title={Two-Step SPLADE: Simple, Efficient and Effective Approximation of SPLADE},
author={Lassance, Carlos and Dejean, Herv{\'e} and Clinchant, St{\'e}phane and Tonellotto, Nicola},
booktitle={European Conference on Information Retrieval},
pages={349--363},
year={2024},
organization={Springer}
}