Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Llama2 chat deployment in kserve #5

Open
wants to merge 4 commits into
base: main
Choose a base branch
from
Open

Conversation

hsteude
Copy link
Contributor

@hsteude hsteude commented Oct 5, 2023

This Pull Request introduces the code necessary for deploying llama2 models utilizing kserve. Given that there is no existing kserve implementation for Hugging Face transformers, we've employed custom kserve predictors to achieve this functionality.

How to Test:

See if you're able to deploy it on your own by following the detailed steps provided in the README. Feedback and commits are very welcome :).

@hsteude hsteude mentioned this pull request Oct 5, 2023
@hsteude
Copy link
Contributor Author

hsteude commented Oct 5, 2023

Just noticed that it probably makes sense to put all the image related stuff inside the image directory...

@hsteude hsteude requested review from geier and removed request for samoturk, mcreutz and geier January 26, 2024 10:59
@hsteude
Copy link
Contributor Author

hsteude commented Jan 26, 2024

@geier : Let's try to get this one on a demo deployment.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant