Skip to content

Commit e5affb9

Browse files
update V1.0 benchmark manifest (#822)
Co-authored-by: Zhenzhong1 <zhenzhong.xu@intel.com>
1 parent e2a74f7 commit e5affb9

22 files changed

+38
-70
lines changed

ChatQnA/benchmark/oob_no_wrapper/with_rerank/eight_gaudi/no_wrapper_oob_eight_gaudi_with_rerank.yaml

Lines changed: 1 addition & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -134,7 +134,7 @@ metadata:
134134
name: embedding-dependency-deploy
135135
namespace: default
136136
spec:
137-
replicas: 1
137+
replicas: 8
138138
selector:
139139
matchLabels:
140140
app: embedding-dependency-deploy
@@ -223,10 +223,6 @@ spec:
223223
- '2048'
224224
- --max-total-tokens
225225
- '4096'
226-
- --max-batch-total-tokens
227-
- '65536'
228-
- --max-batch-prefill-tokens
229-
- '4096'
230226
env:
231227
- name: OMPI_MCA_btl_vader_single_copy_mechanism
232228
value: none

ChatQnA/benchmark/oob_no_wrapper/with_rerank/four_gaudi/no_wrapper_oob_four_gaudi_with_rerank.yaml

Lines changed: 1 addition & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -134,7 +134,7 @@ metadata:
134134
name: embedding-dependency-deploy
135135
namespace: default
136136
spec:
137-
replicas: 1
137+
replicas: 4
138138
selector:
139139
matchLabels:
140140
app: embedding-dependency-deploy
@@ -223,10 +223,6 @@ spec:
223223
- '2048'
224224
- --max-total-tokens
225225
- '4096'
226-
- --max-batch-total-tokens
227-
- '65536'
228-
- --max-batch-prefill-tokens
229-
- '4096'
230226
env:
231227
- name: OMPI_MCA_btl_vader_single_copy_mechanism
232228
value: none

ChatQnA/benchmark/oob_no_wrapper/with_rerank/single_gaudi/no_wrapper_oob_single_gaudi_with_rerank.yaml

Lines changed: 0 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -223,10 +223,6 @@ spec:
223223
- '2048'
224224
- --max-total-tokens
225225
- '4096'
226-
- --max-batch-total-tokens
227-
- '65536'
228-
- --max-batch-prefill-tokens
229-
- '4096'
230226
env:
231227
- name: OMPI_MCA_btl_vader_single_copy_mechanism
232228
value: none

ChatQnA/benchmark/oob_no_wrapper/with_rerank/two_gaudi/no_wrapper_oob_two_gaudi_with_rerank.yaml

Lines changed: 1 addition & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -134,7 +134,7 @@ metadata:
134134
name: embedding-dependency-deploy
135135
namespace: default
136136
spec:
137-
replicas: 1
137+
replicas: 2
138138
selector:
139139
matchLabels:
140140
app: embedding-dependency-deploy
@@ -223,10 +223,6 @@ spec:
223223
- '2048'
224224
- --max-total-tokens
225225
- '4096'
226-
- --max-batch-total-tokens
227-
- '65536'
228-
- --max-batch-prefill-tokens
229-
- '4096'
230226
env:
231227
- name: OMPI_MCA_btl_vader_single_copy_mechanism
232228
value: none

ChatQnA/benchmark/oob_no_wrapper/without_rerank/eight_gaudi/no_wrapper_oob_eight_gaudi_without_rerank.yaml

Lines changed: 1 addition & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -134,7 +134,7 @@ metadata:
134134
name: embedding-dependency-deploy
135135
namespace: default
136136
spec:
137-
replicas: 1
137+
replicas: 8
138138
selector:
139139
matchLabels:
140140
app: embedding-dependency-deploy
@@ -223,10 +223,6 @@ spec:
223223
- '2048'
224224
- --max-total-tokens
225225
- '4096'
226-
- --max-batch-total-tokens
227-
- '65536'
228-
- --max-batch-prefill-tokens
229-
- '4096'
230226
env:
231227
- name: OMPI_MCA_btl_vader_single_copy_mechanism
232228
value: none

ChatQnA/benchmark/oob_no_wrapper/without_rerank/four_gaudi/no_wrapper_oob_four_gaudi_without_rerank.yaml

Lines changed: 1 addition & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -134,7 +134,7 @@ metadata:
134134
name: embedding-dependency-deploy
135135
namespace: default
136136
spec:
137-
replicas: 1
137+
replicas: 4
138138
selector:
139139
matchLabels:
140140
app: embedding-dependency-deploy
@@ -223,10 +223,6 @@ spec:
223223
- '2048'
224224
- --max-total-tokens
225225
- '4096'
226-
- --max-batch-total-tokens
227-
- '65536'
228-
- --max-batch-prefill-tokens
229-
- '4096'
230226
env:
231227
- name: OMPI_MCA_btl_vader_single_copy_mechanism
232228
value: none

ChatQnA/benchmark/oob_no_wrapper/without_rerank/single_gaudi/no_wrapper_oob_single_gaudi_without_rerank.yaml

Lines changed: 0 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -223,10 +223,6 @@ spec:
223223
- '2048'
224224
- --max-total-tokens
225225
- '4096'
226-
- --max-batch-total-tokens
227-
- '65536'
228-
- --max-batch-prefill-tokens
229-
- '4096'
230226
env:
231227
- name: OMPI_MCA_btl_vader_single_copy_mechanism
232228
value: none

ChatQnA/benchmark/oob_no_wrapper/without_rerank/two_gaudi/no_wrapper_oob_two_gaudi_without_rerank.yaml

Lines changed: 1 addition & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -134,7 +134,7 @@ metadata:
134134
name: embedding-dependency-deploy
135135
namespace: default
136136
spec:
137-
replicas: 1
137+
replicas: 2
138138
selector:
139139
matchLabels:
140140
app: embedding-dependency-deploy
@@ -223,10 +223,6 @@ spec:
223223
- '2048'
224224
- --max-total-tokens
225225
- '4096'
226-
- --max-batch-total-tokens
227-
- '65536'
228-
- --max-batch-prefill-tokens
229-
- '4096'
230226
env:
231227
- name: OMPI_MCA_btl_vader_single_copy_mechanism
232228
value: none

ChatQnA/benchmark/tuned/with_rerank/four_gaudi/tuned_four_gaudi_with_rerank.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -167,10 +167,10 @@ spec:
167167
- containerPort: 80
168168
resources:
169169
limits:
170-
cpu: 80
170+
cpu: 76
171171
memory: 20000Mi
172172
requests:
173-
cpu: 80
173+
cpu: 76
174174
memory: 20000Mi
175175
volumeMounts:
176176
- mountPath: /data

ChatQnA/benchmark/tuned/with_rerank/single_gaudi/tuned_single_gaudi_with_rerank.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -167,10 +167,10 @@ spec:
167167
- containerPort: 80
168168
resources:
169169
limits:
170-
cpu: 80
170+
cpu: 76
171171
memory: 20000Mi
172172
requests:
173-
cpu: 80
173+
cpu: 76
174174
memory: 20000Mi
175175
volumeMounts:
176176
- mountPath: /data

0 commit comments

Comments
 (0)