Skip to content

Support audio input#22679

Open
fjord-oai wants to merge 1 commit into
mainfrom
fjord/audio_input
Open

Support audio input#22679
fjord-oai wants to merge 1 commit into
mainfrom
fjord/audio_input

Conversation

@fjord-oai
Copy link
Copy Markdown
Contributor

Summary

Adds audio as a structured tool-output content item, gated by model input modalities.

  • Adds InputModality::Audio and input_audio function-call output items.
  • Converts MCP audio content blocks to Responses-compatible structured output for audio-capable models.
  • Fails MCP/code-mode audio output clearly when the selected model does not advertise audio input.
  • Adds a strict code-mode audio(...) helper.
  • Regenerates app-server protocol schemas so clients can round-trip the new content item shape.

@fjord-oai fjord-oai force-pushed the fjord/audio_input branch 4 times, most recently from 77f5348 to ccd6084 Compare May 15, 2026 21:50
Copy link
Copy Markdown

@damian-openai damian-openai left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks great! I think on the encoder side aside from mp3, wav we also support ogg, do we verify that it is one of these formats?

3:53 PM

@fjord-oai
Copy link
Copy Markdown
Contributor Author

Looks great! I think on the encoder side aside from mp3, wav we also support ogg, do we verify that it is one of these formats?

chatted offline, we'll keep it open for now while we're figuring out exactly what is supported

Copy link
Copy Markdown
Contributor

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 519d780064

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Comment thread codex-rs/utils/output-truncation/src/lib.rs Outdated
@fjord-oai fjord-oai force-pushed the fjord/audio_input branch from 519d780 to b42b2ff Compare May 16, 2026 05:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants