Closed
Description
Describe the bug
When using the gpt-4o-audio-preview model with Semantic Kernel, the audio response isn't returned. The code runs without errors, but the AI response contains only text and no audio.
To Reproduce
Steps to reproduce the behavior:
- Create a new .NET 9 project
- Install Microsoft.SemanticKernel v1.47.0
- Configure SDK with Azure OpenAI and the gpt-4o-audio-preview model
- Set up audio input from microphone using NAudio
- Request both text and audio responses with ChatResponseModalities.Text | ChatResponseModalities.Audio
- Observe that the response contains only text and no audio.
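For reference, the steps above can be sketched roughly as follows. This is a minimal sketch and not my exact code: the environment variable names, the `input.wav` file (standing in for the NAudio microphone capture), and the `reply.mp3` output path are placeholders, and it assumes the audio settings are passed the same way as in the blog post linked under Additional context.

```csharp
// Sketch only: suppress the experimental-API diagnostics these types may raise.
#pragma warning disable SKEXP0001, SKEXP0010, OPENAI001

using System;
using System.IO;
using System.Linq;
using Microsoft.SemanticKernel;
using Microsoft.SemanticKernel.ChatCompletion;
using Microsoft.SemanticKernel.Connectors.OpenAI;
using OpenAI.Chat;

// Placeholder configuration: these environment variable names are assumptions.
string endpoint = Environment.GetEnvironmentVariable("AZURE_OPENAI_ENDPOINT")
    ?? throw new InvalidOperationException("AZURE_OPENAI_ENDPOINT is not set.");
string apiKey = Environment.GetEnvironmentVariable("AZURE_OPENAI_API_KEY")
    ?? throw new InvalidOperationException("AZURE_OPENAI_API_KEY is not set.");

// Assumes the Azure deployment is named after the model.
var kernel = Kernel.CreateBuilder()
    .AddAzureOpenAIChatCompletion(
        deploymentName: "gpt-4o-audio-preview",
        endpoint: endpoint,
        apiKey: apiKey)
    .Build();

var chatService = kernel.GetRequiredService<IChatCompletionService>();

// Request both text and audio in the response.
var settings = new OpenAIPromptExecutionSettings
{
    Audio = new ChatAudioOptions(ChatOutputAudioVoice.Shimmer, ChatOutputAudioFormat.Mp3),
    Modalities = ChatResponseModalities.Text | ChatResponseModalities.Audio
};

// Placeholder input: a WAV file instead of live microphone capture.
byte[] inputAudio = await File.ReadAllBytesAsync("input.wav");

var chatHistory = new ChatHistory("You are a helpful assistant.");
chatHistory.AddUserMessage(new ChatMessageContentItemCollection
{
    new AudioContent(inputAudio, "audio/wav")
});

var result = await chatService.GetChatMessageContentAsync(chatHistory, settings);

// The text part comes back as expected.
Console.WriteLine($"Text: {result.Content}");

// The audio part is expected as an AudioContent item in result.Items.
var audioReply = result.Items.OfType<AudioContent>().FirstOrDefault();
if (audioReply?.Data is { } audioData)
{
    await File.WriteAllBytesAsync("reply.mp3", audioData.ToArray());
    Console.WriteLine($"Audio: {audioData.Length} bytes written to reply.mp3");
}
else
{
    Console.WriteLine("Audio: no AudioContent item in the response.");
}
```

With the behavior described in this issue, the last branch is the one that runs: the text is printed, but no `AudioContent` item is found.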
Expected behavior
The AI should respond with both text (which works) and audio (which is not returned).
Screenshots
Not applicable.
Platform
- Language: C#
- Source: NuGet package Microsoft.SemanticKernel version 1.47.0
- AI model: Azure OpenAI gpt-4o-audio-preview
- IDE: Visual Studio
- OS: Windows
Additional context
I followed the example from https://devblogs.microsoft.com/semantic-kernel/using-openais-audio-preview-model-with-semantic-kernel/ and implemented microphone input using NAudio. The text response works correctly.
Metadata
Status
Backlog: Planned