Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Splade v1 #144

Merged
merged 23 commits into from
Mar 13, 2024
Merged

Add Splade v1 #144

merged 23 commits into from
Mar 13, 2024

Conversation

NirantK
Copy link
Contributor

@NirantK NirantK commented Mar 12, 2024

This adds our first Sparse Vector model to FastEmbed!

We add one of the more modern implementations of SPLADE: https://huggingface.co/prithivida/Splade_PP_en_v1

This has the main advantage of being permissively licensed

@NirantK NirantK marked this pull request as draft March 12, 2024 08:52
fastembed/text/text_embedding.py Show resolved Hide resolved
fastembed/text/text_embedding.py Outdated Show resolved Hide resolved
tests/test_sparse_embeddings.py Show resolved Hide resolved
experiments/02_SPLADE_to_ONNX.ipynb Show resolved Hide resolved
@NirantK NirantK marked this pull request as ready for review March 12, 2024 09:09
fastembed/text/sparse.py Outdated Show resolved Hide resolved
fastembed/text/sparse.py Outdated Show resolved Hide resolved
fastembed/text/sparse.py Outdated Show resolved Hide resolved
@generall
Copy link
Member

generall commented Mar 12, 2024

Made a small refactoring to make sure we will use appropriate types for sparse embeddings. Had to move to stuff around to avoid code duplication. One last thing for this PR is to move model into qdrant HF, otherwise LGTM

fastembed/sparse/splade_pp.py Show resolved Hide resolved
fastembed/sparse/splade_pp.py Show resolved Hide resolved
tests/test_sparse_embeddings.py Outdated Show resolved Hide resolved
@NirantK NirantK requested a review from generall March 13, 2024 03:54
@NirantK NirantK merged commit d817da2 into main Mar 13, 2024
17 checks passed
@NirantK NirantK deleted the splade-v1 branch March 13, 2024 12:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants