v1.4.0

@digitallysavvy digitallysavvy released this 13 May 16:43
72bb423

Added

  • Added NewDeepgramTTS AgentKit vendor support for Deepgram TTS beta.
  • Added AgentSession.Think() and ThinkWithOptions() for sending custom instructions to a
    running agent through the Agent Management API.
  • Added Agent.WithInterruption() and WithInterruptionConfig() for the new top-level
    interruption configuration.
  • Added MLLM turn detection support through TurnDetection on NewOpenAIRealtime,
    NewGeminiLive, and NewVertexAI.
  • Added Agent.WithTools() and the WithTools() option for enabling MCP tool invocation.
  • Added AudioScenario AgentKit support for parameters.audio_scenario.
  • Added a Headers field to all LLM vendors: NewOpenAI, NewAzureOpenAI, NewAnthropic, and
    NewGemini.

Changed

  • Agent.WithMllm() now sets mllm.enable = true and no longer emits deprecated
    advanced_features.enable_mllm.
  • MLLM vendors now match the v2.6 API shape and no longer emit removed fields such as style.
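
As a rough illustration of the Agent.WithMllm() change above, the sketch below marshals a payload fragment in which mllm.enable is set and advanced_features.enable_mllm is absent. The struct names, the buildMllmFragment helper, and the surrounding layout are assumptions made for this example; they are not the SDK's types, and the real request body contains many more fields.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// mllmConfig and agentFragment are hypothetical types for illustration only.
// Only the field path mllm.enable comes from the release notes.
type mllmConfig struct {
	Enable bool `json:"enable"`
}

type agentFragment struct {
	MLLM *mllmConfig `json:"mllm,omitempty"`
	// Note: there is deliberately no advanced_features.enable_mllm field here,
	// mirroring the note that the deprecated flag is no longer emitted.
}

// buildMllmFragment returns the JSON for a fragment with mllm.enable = true.
func buildMllmFragment() ([]byte, error) {
	return json.Marshal(agentFragment{MLLM: &mllmConfig{Enable: true}})
}

func main() {
	body, err := buildMllmFragment()
	if err != nil {
		panic(err)
	}
	fmt.Println(string(body)) // prints {"mllm":{"enable":true}}
}
```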

Fixed

  • Fixed MiniMax TTS preset stripping so inferred reseller presets remove key, model,
    group_id, and url before sending requests.
  • Fixed preset-backed NewOpenAITTS so APIKey is optional when credentials are supplied
    server-side by a reseller preset.
  • Expanded AgentKit parity coverage for interruption, MLLM turn detection, Deepgram TTS, LLM
    headers, MLLM enable behavior, and preset-backed OpenAI TTS.
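
The MiniMax preset fix above can be pictured with a minimal sketch, assuming the TTS parameters are held in a plain map. The stripPresetFields helper and the voice_id key are hypothetical and exist only for this example; the four stripped field names are the ones listed in the note. This is not the SDK's implementation.

```go
package main

import "fmt"

// presetManagedFields are the fields the release note says are removed
// when a reseller preset supplies them server-side.
var presetManagedFields = []string{"key", "model", "group_id", "url"}

// stripPresetFields is a hypothetical helper. It returns a copy of params
// without the preset-managed fields and leaves the input map untouched.
func stripPresetFields(params map[string]any) map[string]any {
	out := make(map[string]any, len(params))
	for k, v := range params {
		out[k] = v
	}
	for _, f := range presetManagedFields {
		delete(out, f)
	}
	return out
}

func main() {
	params := map[string]any{
		"key":      "example-secret",
		"model":    "example-model",
		"group_id": "example-group",
		"url":      "wss://example.invalid/tts",
		"voice_id": "example-voice", // assumed extra field that should survive
	}
	fmt.Println(stripPresetFields(params)) // prints map[voice_id:example-voice]
}
```

Copying before deleting keeps the caller's map intact, which matters if the same options are reused for a request that is not preset-backed.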

Full Changelog: v1.3.4...v1.4.0