This repository was archived by the owner on Aug 7, 2025. It is now read-only.

Issue when sending parallel requests #3361

@lschaupp

Description

Hello,

I am getting the following error when I send multiple requests in parallel to the inference endpoint:

ERROR: 503
{
  "code": 503,
  "type": "ServiceUnavailableException",
  "message": "Model \"restorer\" has no worker to serve inference request. Please use scale workers API to add workers. If this is a sequence inference, please check if it is closed, or expired; or exceeds maxSequenceJobQueueSize"
}

I have two separate processes that can access the inference API.
Any ideas?
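For reference, the error message points at the scale workers API. Assuming this is a TorchServe deployment (the ServiceUnavailableException format matches TorchServe) listening on the default management port 8081, a minimal sketch of checking and scaling the workers for the restorer model could look like the following; the host, port, and worker count are assumptions to adapt to your setup:

```python
# Sketch: query and scale workers for the "restorer" model via the
# TorchServe management API. Assumes the default management address
# http://localhost:8081; adjust for your deployment.
import requests

MANAGEMENT_URL = "http://localhost:8081"  # assumption: default management port

# Check how many workers are currently assigned to the model.
status = requests.get(f"{MANAGEMENT_URL}/models/restorer")
print(status.json())

# Request at least 2 workers so two parallel requests can be served.
resp = requests.put(
    f"{MANAGEMENT_URL}/models/restorer",
    params={"min_worker": 2, "synchronous": "true"},
)
print(resp.status_code, resp.text)
```

With at least as many workers as concurrent callers, both processes should be able to obtain a worker instead of hitting the 503.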
