install-embeddings

Prerequisites

[eksctl](https://eksctl.io/installation/)
[aws cli](https://docs.aws.amazon.com/cli/latest/userguide/getting-started-install.html)
[helm cli](https://helm.sh/docs/intro/install/#helm)
[kubectl](https://kubernetes.io/docs/tasks/tools/#kubectl)

Check AWS quota

Ensure your AWS account has quota for:

${gpu_count} * 4 vCPUs for Running On-Demand G and VT instances in your chosen region
At least one load balancer per model you want to serve (not per running server)
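
The quota arithmetic above can be sketched in shell. This is a minimal illustration, not part of the repository's scripts: `required_vcpus` is a hypothetical helper name, and the quota code and region in the commented-out AWS CLI call are assumptions you should verify in the Service Quotas console before relying on them.

```shell
# Hypothetical helper: vCPU quota needed for a given number of GPUs,
# using the gpu_count * 4 rule from this README.
required_vcpus() {
  local gpu_count="$1"
  echo $(( gpu_count * 4 ))
}

# Example: quota needed for 3 GPUs.
required_vcpus 3

# To compare against your current quota (requires a configured AWS CLI).
# Assumptions: L-DB2E81BA is the quota code for
# "Running On-Demand G and VT instances", and us-east-1 is your region.
# aws service-quotas get-service-quota \
#   --service-code ec2 --quota-code L-DB2E81BA \
#   --region us-east-1 --query 'Quota.Value' --output text
```

If the reported quota is lower than the computed value, request an increase before creating the cluster.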

Create EKS cluster and install needed plugins

Modify lines 7-12 of create_cluster.sh (linked below).

To get your account ID, run:

aws sts get-caller-identity
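
The command above prints a JSON document with several fields. If you only need the account ID, a sketch like the following may help. `account_id` is a hypothetical helper name, not something defined in this repository; `--query` and `--output` are standard AWS CLI options.

```shell
# Hypothetical helper: print only the 12-digit account ID.
# Requires a configured AWS CLI.
account_id() {
  aws sts get-caller-identity --query Account --output text
}

# Usage (uncomment once your AWS credentials are set up):
# ACCOUNT_ID="$(account_id)"
# echo "Using account ${ACCOUNT_ID}"
```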

https://github.com/devflowinc/install-embeddings/blob/d55047e2992a03ae0c98478a4a80d7dc8dcda6f7/create_cluster.sh#L7-L12

Run ./create_cluster.sh to create the cluster

Specify your embedding models

Edit embedding_models.yaml to list the models that you want to use

Install the helm chart

helm upgrade -i embedding-release oci://registry-1.docker.io/trieve/embeddings-helm -f embedding_models.yaml

Get your model endpoints

kubectl get ing
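
To print just each ingress name next to its address, something like the sketch below could be used. `list_endpoints` is a hypothetical helper name. It assumes the load balancer publishes a hostname in the ingress status (typical for AWS load balancers), and the field stays empty until the load balancer has been provisioned.

```shell
# Hypothetical helper: print "<ingress name><TAB><load balancer hostname>"
# for every ingress in the current namespace.
list_endpoints() {
  kubectl get ingress -o jsonpath='{range .items[*]}{.metadata.name}{"\t"}{.status.loadBalancer.ingress[0].hostname}{"\n"}{end}'
}

# Usage (requires access to the cluster):
# list_endpoints
```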

Cleanup

helm uninstall embedding-release
./delete_cluster.sh
