install-embeddings

Prerequisites

Check AWS quota

Ensure your AWS account has quota for:

  • ${gpu_count}*4 vCPUs of "Running On-Demand G and VT instances" in your region of choice
  • At least one load balancer per model you want to serve (per model, not per running server)
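
The quota arithmetic above can be checked from the command line. The following is a minimal sketch, not part of this repository: the `GPU_COUNT` and `AWS_REGION` variables and both helper functions are hypothetical names, and `L-DB2E81BA` is assumed to be the EC2 quota code for "Running On-Demand G and VT instances" (confirm it with `aws service-quotas list-service-quotas --service-code ec2` before relying on it).

```shell
# Assumptions (not from this README): GPU_COUNT, AWS_REGION, the helper names,
# and L-DB2E81BA as the quota code for "Running On-Demand G and VT instances".
GPU_COUNT="${GPU_COUNT:-1}"
AWS_REGION="${AWS_REGION:-us-east-1}"

# Quota the README asks for: gpu_count * 4.
required_quota() {
  echo $(( $1 * 4 ))
}

# Succeeds when the current quota ($1, possibly a float such as 8.0)
# is at least the needed amount ($2, an integer).
quota_is_enough() {
  local current="${1%.*}"
  [ "$current" -ge "$2" ]
}

needed="$(required_quota "$GPU_COUNT")"
if command -v aws >/dev/null 2>&1 &&
   current="$(aws service-quotas get-service-quota \
     --service-code ec2 --quota-code L-DB2E81BA \
     --region "$AWS_REGION" --query 'Quota.Value' --output text 2>/dev/null)"; then
  if quota_is_enough "$current" "$needed"; then
    echo "OK: quota $current covers the $needed needed"
  else
    echo "Request an increase: quota is $current, need $needed"
  fi
else
  echo "Could not read the quota; you need at least $needed"
fi
```

Run it as, for example, `GPU_COUNT=2 AWS_REGION=us-west-2 sh check_quota.sh` (the file name is illustrative). The load balancer quota is not covered by this sketch.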

Create the EKS cluster and install the needed plugins

Modify the lines of create_cluster.sh linked below (lines 7-12) to match your setup.

To get your account ID, run:

aws sts get-caller-identity

https://github.com/devflowinc/install-embeddings/blob/d55047e2992a03ae0c98478a4a80d7dc8dcda6f7/create_cluster.sh#L7-L12
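
The account ID lookup above prints a JSON document. A small sketch of pulling out only the ID, which is handy when pasting it into create_cluster.sh, could look like this. The `is_account_id` helper is a hypothetical name introduced here; `--query` and `--output text` are standard AWS CLI options.

```shell
# Succeeds when $1 looks like an AWS account ID (exactly 12 digits).
# is_account_id is an illustrative helper, not part of this repository.
is_account_id() {
  case "$1" in
    *[!0-9]*|'') return 1 ;;
  esac
  [ "${#1}" -eq 12 ]
}

# Ask the AWS CLI for just the Account field as plain text.
if command -v aws >/dev/null 2>&1 &&
   account_id="$(aws sts get-caller-identity --query Account --output text 2>/dev/null)" &&
   is_account_id "$account_id"; then
  echo "Account ID: $account_id"
else
  echo "Could not determine the account ID; check your AWS credentials"
fi
```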

Run ./create_cluster.sh to generate the cluster
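
Before running the script, it may help to confirm that the CLIs this README invokes (`aws`, `kubectl`, `helm`) are on your PATH. The sketch below assumes nothing about what create_cluster.sh itself calls, so the script may need further tools that are not checked here. `missing_tools` is a hypothetical helper name.

```shell
# Print every name in "$@" that is not an executable on PATH.
# missing_tools is an illustrative helper, not part of this repository.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || echo "$tool"
  done
}

# Only the tools named in this README are checked.
missing="$(missing_tools aws kubectl helm)"
if [ -n "$missing" ]; then
  echo "Install these before continuing:" $missing
else
  echo "All CLIs used in this README were found"
fi
```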

  1. Specify your embedding models

Modify embedding_models.yaml for the models that you want to use

  2. Install the helm chart
helm upgrade -i embedding-release oci://registry-1.docker.io/trieve/embeddings-helm -f embedding_models.yaml
  3. Get your model endpoints
kubectl get ing
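
Load balancers can take a few minutes to receive an address after the chart is installed, so the endpoint list may show empty addresses at first. As a rough sketch, the following prints only the ingresses that already have a hostname. The function names are hypothetical, and it assumes the load balancer is reported under `.status.loadBalancer.ingress[*].hostname`, which is typical on AWS but not confirmed by this README.

```shell
# Print "<ingress name><TAB><load balancer hostname>" for every ingress
# in the current namespace. Assumes hostnames (not IPs) are reported.
list_endpoints() {
  kubectl get ing -o jsonpath='{range .items[*]}{.metadata.name}{"\t"}{.status.loadBalancer.ingress[*].hostname}{"\n"}{end}'
}

# Keep only rows whose hostname column is populated.
ready_endpoints() {
  awk -F'\t' '$2 != "" { print $1 " " $2 }'
}

if command -v kubectl >/dev/null 2>&1; then
  list_endpoints | ready_endpoints
fi
```

If nothing is printed, wait a little and run it again, or fall back to `kubectl get ing` to see the raw status.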

Cleanup

helm uninstall embedding-release
./delete_cluster.sh
