[Microsoft.Extensions.AI] The Cohere Command R+ always returns "Bad Request" #5708

Closed
yamanaiyuki opened this issue Nov 30, 2024 · 1 comment · Fixed by #5714
Labels
area-AI · bug · work in progress 🚧

Comments

@yamanaiyuki

Description

I am writing a program that uses AzureAIInferenceChatClient and ran into an issue.
Mistral-large-2407 works normally, but Cohere-command-r-plus-08-2024 throws an exception.
I prepared a short sample for verification: when CompleteAsync() is executed, "400 (Bad Request)" is returned.

Sample Code:

using Azure;
using Azure.AI.Inference;
using Microsoft.Extensions.AI;

string? AZURE_COHERE_ENDPOINT = Environment.GetEnvironmentVariable("AZURE_COHERE_ENDPOINT");
string? AZURE_COHERE_API_KEY = Environment.GetEnvironmentVariable("AZURE_COHERE_API_KEY");
string? modelId = Environment.GetEnvironmentVariable("AZURE_COHERE_MODELID");

ArgumentNullException.ThrowIfNull(AZURE_COHERE_ENDPOINT);
ArgumentNullException.ThrowIfNull(AZURE_COHERE_API_KEY);
ArgumentNullException.ThrowIfNull(modelId);

Uri endpoint = new(AZURE_COHERE_ENDPOINT);
AzureKeyCredential credential = new(AZURE_COHERE_API_KEY);

ChatCompletionsClient chatCompletionsClient = new(endpoint, credential);

Response<ModelInfo> modelInfo = chatCompletionsClient.GetModelInfo();
Console.WriteLine($"Model name: {modelInfo.Value.ModelName}");
Console.WriteLine($"Model type: {modelInfo.Value.ModelType}");
Console.WriteLine($"Model provider name: {modelInfo.Value.ModelProviderName}");

try
{
    AzureAIInferenceChatClient client = new(chatCompletionsClient, modelId);

    ChatCompletion result = await client.CompleteAsync("What is AI?");
    Console.WriteLine(result);
}
catch (Exception ex)
{
    Console.WriteLine(ex);
}

Message printed to the console:

Model name: Cohere Command R+
Model type: chat-completion
Model provider name: Cohere
Azure.RequestFailedException: {"message":"invalid type: parameter messages.content is of type array but should be of type string"}
Status: 400 (Bad Request)
ErrorCode: Bad Request

Content:
{"error":{"code":"Bad Request","message":"{\"message\":\"invalid type: parameter messages.content is of type array but should be of type string\"}","status":400}}

Headers:
Cache-Control: no-cache, no-store, no-transform, must-revalidate, private, max-age=0
Pragma: no-cache
Vary: REDACTED
x-accel-expires: REDACTED
x-ms-rai-invoked: REDACTED
X-Request-ID: REDACTED
ms-azureml-model-error-reason: REDACTED
ms-azureml-model-error-statuscode: REDACTED
x-ms-client-request-id: 4053b640-fc4a-4be5-ab2d-bb183756abd2
Request-Context: REDACTED
azureml-model-session: REDACTED
Date: Tue, 26 Nov 2024 23:49:27 GMT
Content-Length: 162
Content-Type: application/json
Expires: Thu, 01 Jan 1970 00:00:00 UTC

   at Azure.Core.HttpPipelineExtensions.ProcessMessageAsync(HttpPipeline pipeline, HttpMessage message, RequestContext requestContext, CancellationToken cancellationToken)
   at Azure.AI.Inference.ChatCompletionsClient.CompleteAsync(RequestContent content, String extraParams, RequestContext context)
   at Azure.AI.Inference.ChatCompletionsClient.CompleteAsync(ChatCompletionsOptions chatCompletionsOptions, CancellationToken cancellationToken)
   at Microsoft.Extensions.AI.AzureAIInferenceChatClient.CompleteAsync(IList`1 chatMessages, ChatOptions options, CancellationToken cancellationToken)
   at Program.<Main>$(String[] args) in C:\teacup\source\repos\AzureAIInferenceExample\Program.cs:line 27

Is Command R+ itself functioning correctly?
I confirmed that it is by building the request JSON by hand, as shown below.
However, that approach forfeits the benefits of Microsoft.Extensions.AI.

Sample Code used for validation:

using Newtonsoft.Json;
using System.Net.Http.Headers;
using System.Text;

string? AZURE_COHERE_ENDPOINT = Environment.GetEnvironmentVariable("AZURE_COHERE_ENDPOINT");
string? AZURE_COHERE_API_KEY = Environment.GetEnvironmentVariable("AZURE_COHERE_API_KEY");
string? modelId = Environment.GetEnvironmentVariable("AZURE_COHERE_MODELID");

ArgumentNullException.ThrowIfNull(AZURE_COHERE_ENDPOINT);
ArgumentNullException.ThrowIfNull(AZURE_COHERE_API_KEY);
ArgumentNullException.ThrowIfNull(modelId);

Uri endpoint = new(AZURE_COHERE_ENDPOINT + "/chat/completions");

using HttpClient client = new();
client.DefaultRequestHeaders.Authorization = new AuthenticationHeaderValue("Bearer", AZURE_COHERE_API_KEY);
client.DefaultRequestHeaders.Accept.Add(new MediaTypeWithQualityHeaderValue("application/json"));

var requestContent = new
{
    model = modelId,
    messages = new[]
    {
        new
        {
            role = "user",
            content = "What is AI?"
        }
    }
};

string jsonContent = JsonConvert.SerializeObject(requestContent);
StringContent content = new(jsonContent, Encoding.UTF8, "application/json");

try
{
    HttpResponseMessage response = await client.PostAsync(endpoint, content);
    response.EnsureSuccessStatusCode();

    string responseString = await response.Content.ReadAsStringAsync();
    Console.WriteLine(responseString);
}
catch (Exception ex)
{
    Console.WriteLine(ex);
}
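For reference, the manual sample above serializes the request body with content as a plain string, which is the shape Command R+ accepts. Judging from the error message, the Microsoft.Extensions.AI path instead sends content as an array; the exact array-element shape below is an assumption based on the common chat-completions format, not taken from a captured request:

```
Accepted by Command R+ (content serialized as a string):
{"messages":[{"role":"user","content":"What is AI?"}]}

Rejected with 400 (content serialized as an array; element shape assumed):
{"messages":[{"role":"user","content":[{"type":"text","text":"What is AI?"}]}]}
```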

Assumption
Unfortunately, I don't have an environment at hand to build and verify, but reading the source code makes the following issue apparent:
Microsoft.Extensions.AI.AzureAIInferenceChatClient does not account for the serialization differences in Azure.AI.Inference.ChatRequestUserMessage.

Microsoft.Extensions.AI.AzureAIInferenceChatClient:
Message content is handled uniformly as a list.

  • When there are multiple content items:
    • Uses the provided IList of contents as-is.
  • When there is a single text:
    • Creates a List with capacity 1 containing new TextContent(content).

Azure.AI.Inference.ChatRequestUserMessage:
Internally it has two mutually exclusive fields, string Content and IList MultimodalContentItems, and only one of them is used.
There are two constructors, each with a different initialization path.

  • When MultimodalContentItems is set (multiple content items):
    • Serializes it as a JSON array via Utf8JsonWriter.WriteStartArray() and Utf8JsonWriter.WriteEndArray().
  • When only Content is set (a single text, so MultimodalContentItems == null):
    • Serializes it as a plain string via Utf8JsonWriter.WriteStringValue().
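Putting the two ChatRequestUserMessage paths side by side, here is a minimal sketch; the ChatMessageTextContentItem name and exact constructor signatures are inferred from the serialization behavior described above and should be verified against the Azure.AI.Inference source:

```csharp
using Azure.AI.Inference;

// String constructor: sets Content, leaves MultimodalContentItems null,
// so serialization uses Utf8JsonWriter.WriteStringValue():
//   {"role":"user","content":"What is AI?"}
var asString = new ChatRequestUserMessage("What is AI?");

// Content-item constructor: sets MultimodalContentItems, so serialization
// uses WriteStartArray()/WriteEndArray() -- the array shape Command R+ rejects:
//   {"role":"user","content":[{"type":"text","text":"What is AI?"}]}
var asArray = new ChatRequestUserMessage(new ChatMessageTextContentItem("What is AI?"));
```

If AzureAIInferenceChatClient always takes the list path, even for a single text, that would explain the 400 from Command R+, while models that accept both shapes (such as Mistral Large) keep working.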

Reproduction Steps

  1. Create a console application with any name.
  2. Install Azure.AI.Inference, Microsoft.Extensions.AI, and Microsoft.Extensions.AI.AzureAIInference from NuGet.
    At the time of writing, the latest versions are Azure.AI.Inference 1.0.0-beta.2 and Microsoft.Extensions.AI 9.0.1-preview.1.24570.5.
  3. Copy and paste the source code, and appropriately modify AZURE_COHERE_ENDPOINT, AZURE_COHERE_API_KEY, and modelId.
  4. Run it.
  5. Check the console screen.
using Azure;
using Azure.AI.Inference;
using Microsoft.Extensions.AI;

string? AZURE_COHERE_ENDPOINT = Environment.GetEnvironmentVariable("AZURE_COHERE_ENDPOINT");
string? AZURE_COHERE_API_KEY = Environment.GetEnvironmentVariable("AZURE_COHERE_API_KEY");
string? modelId = Environment.GetEnvironmentVariable("AZURE_COHERE_MODELID");

ArgumentNullException.ThrowIfNull(AZURE_COHERE_ENDPOINT);
ArgumentNullException.ThrowIfNull(AZURE_COHERE_API_KEY);
ArgumentNullException.ThrowIfNull(modelId);

Uri endpoint = new(AZURE_COHERE_ENDPOINT);
AzureKeyCredential credential = new(AZURE_COHERE_API_KEY);

ChatCompletionsClient chatCompletionsClient = new(endpoint, credential);

Response<ModelInfo> modelInfo = chatCompletionsClient.GetModelInfo();
Console.WriteLine($"Model name: {modelInfo.Value.ModelName}");
Console.WriteLine($"Model type: {modelInfo.Value.ModelType}");
Console.WriteLine($"Model provider name: {modelInfo.Value.ModelProviderName}");

try
{
    AzureAIInferenceChatClient client = new(chatCompletionsClient, modelId);

    ChatCompletion result = await client.CompleteAsync("What is AI?");
    Console.WriteLine(result);
}
catch (Exception ex)
{
    Console.WriteLine(ex);
}

Expected behavior

Below is the normal output from Mistral Large; Command R+ is expected to produce similar output.

Model name: mistral-large-2407
Model type: chat-completion
Model provider name: Mistral
AI, or Artificial Intelligence, refers to the simulation of human intelligence processes by machines, especially computer systems. These processes include learning (acquiring information and rules for using the information), reasoning (using the rules to reach approximate or definite conclusions), and self-correction. Here are some key aspects of AI:

1. **Machine Learning (ML)**: A subset of AI that involves training algorithms to learn from data, make predictions, or decisions without being explicitly programmed.

2. **Deep Learning (DL)**: A subset of machine learning that uses neural networks with many layers to learn and make decisions on data.

3. **Natural Language Processing (NLP)**: A branch of AI that deals with the interaction between computers and humans through natural language.

4. **Computer Vision**: A field of AI that focuses on enabling computers to interpret and understand the visual world, e.g., recognizing objects in images or videos.

5. **Robotics**: A domain that involves designing, building, and operating robots, often incorporating AI for control, perception, and cognition.

AI applications are vast and include search algorithms, recommendation systems, voice assistants, fraud detection, autonomous vehicles, and much more. The goal of AI is to create systems that can function intelligently and independently, mimicking human-like intelligence.

Actual behavior

Model name: Cohere Command R+
Model type: chat-completion
Model provider name: Cohere
Azure.RequestFailedException: {"message":"invalid type: parameter messages.content is of type array but should be of type string"}
Status: 400 (Bad Request)
ErrorCode: Bad Request

(The full response content, headers, and stack trace are identical to the output shown in the Description above.)

Regression?

No response

Known Workarounds

No response

Configuration

No response

Other information

No response

@stephentoub (Member)

Thanks for the bug report. Addressed in #5714.
