Real-time RAG Voice Agent, powered by Cartesia

This project implements a voice RAG (retrieval-augmented generation) agent powered by Cartesia.

Installation

Ensure you have Python 3.11 or later installed and run:

pip install -r requirements.txt
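Because the project expects Python 3.11 or later, a quick interpreter check before installing dependencies can avoid confusing failures further on. The following is a minimal sketch; the helper name `meets_minimum` is hypothetical and not part of this repository.

```python
import sys

MIN_VERSION = (3, 11)  # minimum version stated in this README


def meets_minimum(version=None, minimum=MIN_VERSION):
    """Return True if `version` (default: the running interpreter) is at least `minimum`."""
    if version is None:
        version = sys.version_info
    # Compare only (major, minor); patch level does not matter here.
    return tuple(version[:2]) >= tuple(minimum)


print("Python version OK" if meets_minimum() else "Python 3.11+ required")
```

Run it with the same interpreter you plan to use for `pip install`, so the check reflects the environment the agent will run in.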

Implementation 1: voice_agent_openai.py

This implementation uses OpenAI's services for speech-to-text and Cartesia for speech synthesis. It is the simpler setup if you already have an OpenAI API key.

Requirements

Setup

Copy .env.example to .env
Configure the following environment variables:

OPENAI_API_KEY=your_openai_api_key
CARTESIA_API_KEY=your_cartesia_api_key
LIVEKIT_URL=your_livekit_url
LIVEKIT_API_KEY=your_livekit_api_key
LIVEKIT_API_SECRET=your_livekit_api_secret
ASSEMBLYAI_API_KEY=your_assemblyai_api_key
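A missing key usually shows up only as an authentication error once the agent is already running, so it can be worth checking the variables up front. The sketch below is an assumption-laden helper, not code from this repository: `missing_vars` is a hypothetical name, and it reads the process environment, so it assumes the values from .env have already been exported or loaded.

```python
import os

# Variable names taken from the list above for voice_agent_openai.py.
REQUIRED_VARS = [
    "OPENAI_API_KEY",
    "CARTESIA_API_KEY",
    "LIVEKIT_URL",
    "LIVEKIT_API_KEY",
    "LIVEKIT_API_SECRET",
    "ASSEMBLYAI_API_KEY",
]


def missing_vars(required=REQUIRED_VARS, env=None):
    """Return the names in `required` that are unset or blank in `env`."""
    if env is None:
        env = os.environ
    return [name for name in required if not env.get(name, "").strip()]


missing = missing_vars()
if missing:
    print("Missing environment variables:", ", ".join(missing))
else:
    print("All required environment variables are set.")
```

Passing a different `required` list lets the same helper cover the second implementation, which does not need `OPENAI_API_KEY`.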

Running

python voice_agent_openai.py start

Connecting to Agent Playground

Once the agent is running, connect to it through the LiveKit Agents Playground.

Implementation 2: voice_agent.py

This implementation uses AssemblyAI for speech processing and Ollama (with Gemma) for language tasks.

Setup

Install Ollama

# For macOS
brew install ollama

# For Linux
curl -fsSL https://ollama.com/install.sh | sh

Pull Gemma Model
```
ollama pull gemma3
```

Configure Environment

Copy .env.example to .env and set:

CARTESIA_API_KEY=your_cartesia_api_key
LIVEKIT_URL=your_livekit_url
LIVEKIT_API_KEY=your_livekit_api_key
LIVEKIT_API_SECRET=your_livekit_api_secret
ASSEMBLYAI_API_KEY=your_assemblyai_api_key

Running

Start Ollama server:
```
ollama serve
```
In a new terminal, run the voice agent:
```
python voice_agent.py start
```
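If the agent starts but the language model never responds, a common cause is that the Ollama server is not reachable or the model has not been pulled. The sketch below checks both through Ollama's `/api/tags` endpoint. It assumes the default address `http://localhost:11434`; the helper names `has_model` and `fetch_tags` are hypothetical and not part of this repository.

```python
import json
import urllib.error
import urllib.request

OLLAMA_URL = "http://localhost:11434"  # Ollama's default address; adjust if yours differs


def has_model(tags, name):
    """Return True if the /api/tags payload lists `name`, with or without a tag suffix."""
    for model in tags.get("models", []):
        listed = model.get("name", "")
        # "gemma3" should match an entry such as "gemma3:latest".
        if listed == name or listed.split(":", 1)[0] == name:
            return True
    return False


def fetch_tags(base_url=OLLAMA_URL, timeout=2.0):
    """Return the parsed /api/tags response, or None if the server is unreachable."""
    try:
        with urllib.request.urlopen(f"{base_url}/api/tags", timeout=timeout) as resp:
            return json.load(resp)
    except (urllib.error.URLError, OSError, ValueError):
        return None


tags = fetch_tags()
if tags is None:
    print("Ollama server not reachable; is `ollama serve` running?")
elif not has_model(tags, "gemma3"):
    print("gemma3 not found; try `ollama pull gemma3`.")
else:
    print("Ollama is running and gemma3 is available.")
```

The parsing is kept separate from the network call so that the model lookup can be exercised without a running server.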

✍️ About the Author

Built by Adityeah

I build AI agents that solve real human problems, not just productivity ones.

📰 Read the full breakdown in my newsletter: Adityeah's Newsletter

🤝 Connect on LinkedIn: Aditya Chaudhari

Contribution

Contributions are welcome! Please fork the repository and submit a pull request with your improvements.
