Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

possibly use ctranslate2 and/or infinity embeddings #143

Closed
BBC-Esq opened this issue Mar 1, 2024 · 1 comment
Closed

possibly use ctranslate2 and/or infinity embeddings #143

BBC-Esq opened this issue Mar 1, 2024 · 1 comment
Labels
enhancement request denied repository owner use only enhancement request under review repository owner use only

Comments

@BBC-Esq
Copy link
Owner

BBC-Esq commented Mar 1, 2024

Faster embeddings via Infinity from well-known ctranslate2 expert:

https://github.com/michaelfeil/infinity

@BBC-Esq BBC-Esq added the enhancement request under review repository owner use only label Mar 1, 2024
@BBC-Esq BBC-Esq changed the title possibly use infinity embeddings possibly use ctranslate2 and/or infinity embeddings Mar 17, 2024
@BBC-Esq
Copy link
Owner Author

BBC-Esq commented Mar 17, 2024

Another option is to use ctranslate2 directly to create the embeddings, and if I switch to faiss, they can be directly input into the vector database. Would simply need to encode a query and make sure the faiss database gets it correctly along with any necessary parameters in order to conduct a search...basically looking for ways to speed up database creation and/or search.

base script:

https://github.com/michaelfeil/infinity/blob/main/libs/infinity_emb/infinity_emb/transformer/embedder/ct2.py

@BBC-Esq BBC-Esq added the enhancement request denied repository owner use only label Mar 22, 2024
@BBC-Esq BBC-Esq closed this as completed Mar 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement request denied repository owner use only enhancement request under review repository owner use only
Projects
None yet
Development

No branches or pull requests

1 participant