Skip to content

Commit c25063f

Browse files
authored
Minor fixes for CodeGen Xeon and Gaudi Kubernetes codegen.yaml and doc updates (#613)
* Minor fixes for CodeGen Xeon and Gaudi Kubernetes codegen.yaml and doc updates Signed-off-by: dmsuehir <dina.s.jones@intel.com>
1 parent 4f3be23 commit c25063f

File tree

2 files changed

+8
-2
lines changed

2 files changed

+8
-2
lines changed

CodeGen/kubernetes/manifests/README.md

Lines changed: 6 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,8 @@
66
77
> You can also customize the "MODEL_ID" if needed.
88
9-
> You need to make sure you have created the directory `/mnt/opea-models` to save the cached model on the node where the CodeGEn workload is running. Otherwise, you need to modify the `codegen.yaml` file to change the `model-volume` to a directory that exists on the node.
9+
> You need to make sure you have created the directory `/mnt/opea-models` to save the cached model on the node where the CodeGen workload is running. Otherwise, you need to modify the `codegen.yaml` file to change the `model-volume` to a directory that exists on the node.
10+
> Alternatively, you can change the `codegen.yaml` to use a different type of volume, such as a persistent volume claim.
1011
1112
## Deploy On Xeon
1213

@@ -30,10 +31,13 @@ kubectl apply -f codegen.yaml
3031

3132
To verify the installation, run the command `kubectl get pod` to make sure all pods are running.
3233

33-
Then run the command `kubectl port-forward svc/codegen 7778:7778` to expose the CodeGEn service for access.
34+
Then run the command `kubectl port-forward svc/codegen 7778:7778` to expose the CodeGen service for access.
3435

3536
Open another terminal and run the following command to verify the service if working:
3637

38+
> Note that it may take a couple of minutes for the service to be ready. If the `curl` command below fails, you
39+
> can check the logs of the codegen-tgi pod to see its status or check for errors.
40+
3741
```
3842
kubectl get pods
3943
curl http://localhost:7778/v1/codegen -H "Content-Type: application/json" -d '{

CodeGen/kubernetes/manifests/gaudi/codegen.yaml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -271,6 +271,8 @@ spec:
271271
resources:
272272
limits:
273273
habana.ai/gaudi: 1
274+
memory: 64Gi
275+
hugepages-2Mi: 500Mi
274276
volumes:
275277
- name: model-volume
276278
hostPath:

0 commit comments

Comments
 (0)