Sidekick 🤖

Real-time AI video call assistant with perfectly synchronized lip movements. Talk to historical figures, fictional characters, or create your own.

What is this?

Sidekick lets you have face-to-face video conversations with AI characters. Unlike typical voice assistants, these characters have visual presence with realistic lip-sync, making conversations feel natural and engaging.

Features

Live Video Conversations - WebRTC-powered real-time video calls with AI characters
Perfect Lip Sync - Powered by Decart's cutting-edge lipsync technology
Customizable Characters - Define personality, voice, and appearance via simple YAML configs
Low Latency - Optimized pipeline for natural conversation flow
Smart Interruptions - Handles conversation turns naturally with VAD and smart turn detection

Requirements

Python 3.10+
API keys for Groq, ElevenLabs, and Decart
A character video file (static face video works best)
Decent internet connection for real-time streaming

Quick Start

Clone and install

git clone https://github.com/DecartAI/sidekick.git
cd sidekick
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

Set up your API keys

cp .env.example .env
# Add your API keys:
# - GROQ_API_KEY (LLM)
# - ELEVENLABS_API_KEY (Voice)
# - DECART_API_KEY (Lipsync)

Run with a character

# Talk to Cleopatra
python sidekick.py --character cleopatra.yaml

# Or meet V1X3N, the sassy AI
python sidekick.py --character v1x3n.yaml

Open client.html in your browser and hit Connect

Creating Your Own Characters

Characters are defined in YAML files. Here's the structure:

name: YourCharacter
voice_id: elevenlabs_voice_id
video_path: videos/YourCharacter.mp4
greeting: "Your character's opening line"
system_prompt: |
  Detailed personality and behavior instructions
  for the LLM to roleplay as your character

See cleopatra.yaml and v1x3n.yaml for examples.

Architecture

Built on Pipecat for pipeline orchestration:

STT: Whisper (MLX optimized on Mac)
LLM: Groq (Llama 3.3 70B)
TTS: ElevenLabs
Lipsync: Decart
Transport: WebRTC via aiortc

Command Line Options

python sidekick.py [options]

Options:
  --character PATH     Character config file (required)
  --host HOST         Server host (default: 0.0.0.0)
  --port PORT         Server port (default: 8080)
  --mlx               Use MLX for Whisper STT (Mac only)
  --audio-sample-rate Audio sample rate (default: 16000)

License

MIT

Contributing

PRs welcome! Please check existing issues first.

Built with ❤️ for more natural AI interactions

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

Sidekick 🤖

What is this?

Features

Requirements

Quick Start

Creating Your Own Characters

Architecture

Command Line Options

License

Contributing

About

Uh oh!

Releases

Packages

Contributors 2

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
lipsync		lipsync
processors		processors
videos		videos
.env.example		.env.example
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
cleopatra.yaml		cleopatra.yaml
client.html		client.html
requirements.txt		requirements.txt
sidekick.py		sidekick.py
v1x3n.yaml		v1x3n.yaml

Uh oh!

License

Uh oh!

DecartAI/sidekick

Folders and files

Latest commit

History

Repository files navigation

Sidekick 🤖

What is this?

Features

Requirements

Quick Start

Creating Your Own Characters

Architecture

Command Line Options

License

Contributing

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages