[BUG]: Invocation of model ID deepseek.r1-v1:0 with on-demand throughput isn’t supported. Retry your request with the ID or ARN of an inference profile that contains this model. #3441

NameMeLeo · 2025-03-11T03:38:59Z

How are you running AnythingLLM?

Docker (local)

What happened?

I got the output below when using Bedrock with DeepSeek. It seems that Bedrock requires an inferenceConfig, as in the sample below.

Error msg: Invocation of model ID deepseek.r1-v1:0 with on-demand throughput isn’t supported. Retry your request with the ID or ARN of an inference profile that contains this model.

AWS sample API query:

{
  "modelId": "deepseek.r1-v1:0",
  "contentType": "application/json",
  "accept": "application/json",
  "body": {
    "inferenceConfig": {
      "max_tokens": 512
    },
    "messages": [
      {
        "role": "user",
        "content": "this is where you place your input text"
      }
    ]
  }
}
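
The error message above asks for the ID or ARN of an inference profile rather than the bare model ID. A minimal sketch of deriving such an ID, assuming the profile ID is the model ID with a geography prefix (this matches the `us.deepseek.r1-v1:0` ID tried later in this thread, but the general convention is an assumption here, and `to_inference_profile_id` is a hypothetical helper, not part of AnythingLLM or the AWS SDK):

```python
def to_inference_profile_id(model_id: str, geo: str = "us") -> str:
    """Return model_id prefixed with a geography code, e.g. 'us.'.

    Assumption: the inference profile ID is '<geo>.<model_id>'.
    Check the profile list in your own AWS account before relying on it.
    """
    prefix = f"{geo}."
    # Leave IDs that already carry the prefix untouched.
    if model_id.startswith(prefix):
        return model_id
    return prefix + model_id


print(to_inference_profile_id("deepseek.r1-v1:0"))  # us.deepseek.r1-v1:0
```

The resulting string would go in the same model ID field on the LLM provider page.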

Are there known steps to reproduce?

Set the model ID to deepseek.r1-v1:0 on the LLM provider page.



NameMeLeo · 2025-03-12T04:14:48Z

Another error from Bedrock when using R1 with the model ID us.deepseek.r1-v1:0:

AWSBedrock:streaming - could not stream chat. Unsupported content block type(s): { "reasoningContent": { "text": "Okay" } }
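
One way a streaming handler could tolerate this block type is to separate reasoning text from answer text instead of rejecting it. A rough sketch, not AnythingLLM's actual code: the `reasoningContent` shape is copied from the error above, while the plain `text` key for answer chunks and the helper names `split_delta` and `collect` are assumptions made for illustration.

```python
def split_delta(delta: dict) -> tuple[str, str]:
    """Return (answer_text, reasoning_text) for one streamed content block."""
    # Assumed shape for ordinary answer chunks: {"text": "..."}
    answer = delta.get("text", "")
    # Shape taken from the error message: {"reasoningContent": {"text": "..."}}
    reasoning = delta.get("reasoningContent", {}).get("text", "")
    return answer, reasoning


def collect(deltas: list[dict]) -> tuple[str, str]:
    """Concatenate answer and reasoning text across a list of blocks."""
    answer_parts, reasoning_parts = [], []
    for delta in deltas:
        answer, reasoning = split_delta(delta)
        answer_parts.append(answer)
        reasoning_parts.append(reasoning)
    return "".join(answer_parts), "".join(reasoning_parts)


blocks = [{"reasoningContent": {"text": "Okay"}}, {"text": "Hello"}]
print(collect(blocks))  # ('Hello', 'Okay')
```

Whether the reasoning text is shown to the user or dropped is a design choice left to the caller.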

NameMeLeo added the possible bug label Mar 11, 2025
