
How to define infer/eval configs to accelerate inference when nginx exposes multiple API URLs for model inference calls? #138

Answered by Leymore
hanjr92 asked this question in Q&A

Yes, max_num_workers can be used for parallel inference. However, I would suggest doing a round-robin over the URLs inside the Model class, which is easier to implement and more intuitive in concept.
You may find this document helpful
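For illustration, here is a minimal sketch of that round-robin idea in Python. The class name RoundRobinAPIModel, the generate signature, and the response schema are all assumptions made up for this example, not the actual OpenCompass Model API; the point is only that each request picks the next URL in turn, and that the selection stays thread-safe when max_num_workers runs several workers in parallel.

```python
import itertools
import threading

import requests


class RoundRobinAPIModel:
    """Hypothetical model wrapper that cycles over several inference URLs.

    The constructor arguments and the generate() signature are illustrative
    assumptions, not the real OpenCompass Model interface.
    """

    def __init__(self, urls):
        # itertools.cycle yields urls[0], urls[1], ..., then wraps around.
        self._urls = itertools.cycle(urls)
        # A lock keeps the cycle consistent when max_num_workers > 1 causes
        # several threads to call generate() concurrently.
        self._lock = threading.Lock()

    def _next_url(self):
        with self._lock:
            return next(self._urls)

    def generate(self, prompt: str) -> str:
        # Each call is dispatched to the next backend behind nginx.
        url = self._next_url()
        resp = requests.post(url, json={"prompt": prompt}, timeout=60)
        resp.raise_for_status()
        return resp.json()["text"]  # response schema is an assumption


if __name__ == "__main__":
    model = RoundRobinAPIModel([
        "http://127.0.0.1:8001/v1/generate",
        "http://127.0.0.1:8002/v1/generate",
    ])
    print(model.generate("Hello"))
```

Compared with pointing every worker at a single nginx endpoint, rotating URLs inside the Model class lets the client spread load deterministically across backends without any extra nginx configuration.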

Answer selected by hanjr92