File tree Expand file tree Collapse file tree 3 files changed +4
-3
lines changed
microservices-connector/config Expand file tree Collapse file tree 3 files changed +4
-3
lines changed Original file line number Diff line number Diff line change 26
26
repository : ghcr.io/huggingface/text-generation-inference
27
27
pullPolicy : IfNotPresent
28
28
# Overrides the image tag whose default is the chart appVersion.
29
- tag : " 2.2.0"
29
+ # `sha-e4201f4-intel-cpu` is the image tag for intel cpu optimized tgi image
30
+ tag : " sha-e4201f4-intel-cpu"
30
31
31
32
# empty for CPU
32
33
accelDevice : " "
Original file line number Diff line number Diff line change 87
87
optional : true
88
88
securityContext :
89
89
{}
90
- image : " ghcr.io/huggingface/text-generation-inference:2.2.0 "
90
+ image : " ghcr.io/huggingface/text-generation-inference:sha-e4201f4-intel-cpu "
91
91
imagePullPolicy : IfNotPresent
92
92
volumeMounts :
93
93
- mountPath : /data
Original file line number Diff line number Diff line change @@ -19,7 +19,7 @@ The ChatQnA uses the below prebuilt images if you choose a Xeon deployment
19
19
- dataprep-redis: opea/dataprep-redis: latest
20
20
- tei_xeon_service: ghcr.io/huggingface/text-embeddings-inference: cpu-1 .5
21
21
- tei_embedding_service: ghcr.io/huggingface/text-embeddings-inference: cpu-1 .5
22
- - tgi-service: ghcr.io/huggingface/text-generation-inference:2.2.0
22
+ - tgi-service: ghcr.io/huggingface/text-generation-inference: sha-e4201f4-intel-cpu
23
23
- redis-vector-db: redis/redis-stack:7.2.0-v9
24
24
25
25
Should you desire to use the Gaudi accelerator, two alternate images are used for the embedding and llm services.
You can’t perform that action at this time.
0 commit comments