Release v2.3.0 - Bring Your Own LLM Infrastructure · rapidaai/voice-ai

What's Changed

This release ships three major updates to Rapida.

Custom LLM

You can now bring your own LLM into Rapida with the new Custom LLM provider.

Supported API compatibility:

OpenAI Chat Completions (/v1/chat/completions)
OpenAI Responses (/v1/responses)
Anthropic Messages (/v1/messages)
Google Gemini (generateContent)
OpenAI Compatible (Ollama, vLLM, LM Studio, TGI)

You can point Rapida at your own base URL, pass optional headers, and keep the same workflow inside the product while changing the model layer underneath.

Related PR: #108

Ambient Audio

We also added ambient audio so calls do not feel silent or broken in production.

Adds background presence to live calls
Useful for receptionist, support, concierge, and outbound flows
Helps phone and web deployments feel more natural

Related PR: #113

Assistant Authentication

You can now authenticate a session before the agent starts.

Configure an HTTP authentication endpoint for inbound or outbound sessions
Pass headers, request body, timeout, and condition rules
Control fail behavior with Block or Do nothing
Verify sessions before initialization instead of letting every call go straight to the agent

This makes it easier to add your own policy, routing, verification, or access control step before Rapida starts the conversation.

Related PR: #116

Why This Matters

Bring your own model infrastructure into Rapida
Improve the live call experience without extra media plumbing
Add a verification layer before the assistant starts a session

Breaking Changes

None.

Upgrade Guide

Self-hosted

git pull origin main
docker compose up -d --build

Rapida Cloud

No action required.

New Contributors

@eschmidbauer made their first contribution in #108

Full Changelog: v2.2.0...v2.3.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v2.3.0 - Bring Your Own LLM Infrastructure

Choose a tag to compare

Sorry, something went wrong.