Cool work! Considering using a biomedical embedding model? #220

Open
Andy-jqa opened this issue Dec 11, 2023 · 0 comments


Thank you for your wonderful work. It seems that the current system uses OpenAI's embedding models for dense retrieval, which might be sub-optimal (and costly). We have a biomedical embedding model trained on 255M real PubMed user logs. We would appreciate it if you could check it out:

Paper: https://academic.oup.com/bioinformatics/article/39/11/btad651/7335842
Query encoder: https://huggingface.co/ncbi/MedCPT-Query-Encoder
Article encoder: https://huggingface.co/ncbi/MedCPT-Article-Encoder
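
For reference, here is a minimal sketch of how the two encoders could be used for dense retrieval in place of an API-based embedding model. It assumes the standard Hugging Face `transformers` loading pattern, the `[CLS]` hidden state as the embedding, inner-product scoring, and articles passed as `[title, abstract]` pairs. The `max_length` values (64 for queries, 512 for articles) and the helper names `embed`, `rank`, and `search` are illustrative assumptions, not part of this project or of an official API, so please check them against the model cards.

```python
from typing import List, Sequence, Tuple

QUERY_ENCODER = "ncbi/MedCPT-Query-Encoder"
ARTICLE_ENCODER = "ncbi/MedCPT-Article-Encoder"


def embed(model_name: str, inputs: list, max_length: int) -> List[List[float]]:
    """Return one embedding per input (downloads the model on first use)."""
    # Imported lazily so that rank() below works without torch installed.
    import torch
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name)
    model.eval()
    with torch.no_grad():
        encoded = tokenizer(
            inputs,
            truncation=True,
            padding=True,
            return_tensors="pt",
            max_length=max_length,  # assumed limit, see the model cards
        )
        # Assumption: the [CLS] token's final hidden state is the embedding.
        return model(**encoded).last_hidden_state[:, 0, :].tolist()


def rank(
    query_vec: Sequence[float], article_vecs: Sequence[Sequence[float]]
) -> List[Tuple[int, float]]:
    """Score articles by inner product and return (index, score), best first."""
    scores = [sum(q * a for q, a in zip(query_vec, vec)) for vec in article_vecs]
    return sorted(enumerate(scores), key=lambda pair: pair[1], reverse=True)


def search(query: str, articles: List[List[str]]) -> List[Tuple[int, float]]:
    """Rank [title, abstract] pairs against a free-text query."""
    query_vec = embed(QUERY_ENCODER, [query], max_length=64)[0]
    article_vecs = embed(ARTICLE_ENCODER, articles, max_length=512)
    return rank(query_vec, article_vecs)
```

Calling `search("diabetes treatment", [["Title A", "Abstract A"], ["Title B", "Abstract B"]])` would return the article indices ordered by score. In a real pipeline the article embeddings would be computed once and stored in a vector index rather than re-encoded per query.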
