Skip to content

v2.3.0 - Bring Your Own LLM Infrastructure

Latest

Choose a tag to compare

@iamprashant iamprashant released this 13 May 05:20
· 141 commits to main since this release
Immutable release. Only release title and notes can be modified.
5d14f77

What's Changed

This release ships three major updates to Rapida.

Custom LLM

You can now bring your own LLM into Rapida with the new Custom LLM provider.

Supported API compatibility:

  • OpenAI Chat Completions (/v1/chat/completions)
  • OpenAI Responses (/v1/responses)
  • Anthropic Messages (/v1/messages)
  • Google Gemini (generateContent)
  • OpenAI Compatible (Ollama, vLLM, LM Studio, TGI)

You can point Rapida at your own base URL, pass optional headers, and keep the same workflow inside the product while changing the model layer underneath.

Related PR: #108

Ambient Audio

We also added ambient audio so calls do not feel silent or broken in production.

  • Adds background presence to live calls
  • Useful for receptionist, support, concierge, and outbound flows
  • Helps phone and web deployments feel more natural

Related PR: #113

Assistant Authentication

You can now authenticate a session before the agent starts.

  • Configure an HTTP authentication endpoint for inbound or outbound sessions
  • Pass headers, request body, timeout, and condition rules
  • Control fail behavior with Block or Do nothing
  • Verify sessions before initialization instead of letting every call go straight to the agent

This makes it easier to add your own policy, routing, verification, or access control step before Rapida starts the conversation.

Related PR: #116

Why This Matters

  • Bring your own model infrastructure into Rapida
  • Improve the live call experience without extra media plumbing
  • Add a verification layer before the assistant starts a session

Breaking Changes

None.

Upgrade Guide

Self-hosted

git pull origin main
docker compose up -d --build

Rapida Cloud

No action required.

New Contributors

Full Changelog: v2.2.0...v2.3.0