Skip to content

Conversation

jordanstephens
Copy link
Member

@jordanstephens jordanstephens commented Jul 21, 2025

This is the branch that I just successfully deployed to production.

This PR builds on @commit111 's work in #70, #75, and #76

Key changes:

  • factored out embeddings.py just so that there is one place where we load the model and we can reference it in both the app rag_system.py and in the Dockerfile.
  • Set the SENTENCE_TRANSFORMERS_HOME to /root/.cache/sentence_transformers so that it is out of the way when copying app source code
  • revert the uwsgi mode from --lazy-apps to the default (load before fork)

@jordanstephens jordanstephens requested a review from commit111 July 21, 2025 18:40
@jordanstephens jordanstephens changed the title Jordan/sentence transformers location Caching Sentence Transformers in Docker image Jul 21, 2025
@jordanstephens jordanstephens force-pushed the jordan/sentence-transformers-location branch from 7ffb1bb to cc22d15 Compare July 21, 2025 18:43
@jordanstephens jordanstephens merged commit 62e33cb into main Jul 21, 2025
5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants