Voice Assistant

Personal voice assistant using Twilio Voice and OpenAI's Realtime API for phone-based AI conversations.

Architecture

sequenceDiagram
    participant Caller
    participant Twilio
    participant TwilioFunction as Twilio Function<br/>(Allowlist)
    participant Assistant as your-tunnel-name.trycloudflare.com
    participant OpenAI as OpenAI Realtime API

    Caller->>Twilio: Dials phone number
    Twilio->>TwilioFunction: Incoming call webhook

    alt Number in allowlist
        TwilioFunction->>Assistant: POST /incoming-call<br/>(with Twilio signature)
        Assistant->>Assistant: Validate signature
        Assistant->>Assistant: Generate WebSocket token
        Assistant-->>TwilioFunction: Return TwiML with<br/>WebSocket URL + token
        TwilioFunction-->>Twilio: TwiML response
        Twilio->>Assistant: Connect to /media-stream<br/>(with token)
        Assistant->>Assistant: Validate token
        Assistant->>OpenAI: Establish WebSocket

        loop Audio streaming
            Twilio->>Assistant: Audio from caller
            Assistant->>OpenAI: Forward audio
            OpenAI->>Assistant: AI response audio
            Assistant->>Twilio: Forward to caller
        end
    else Number not in allowlist
        TwilioFunction-->>Twilio: Reject call
        Twilio-->>Caller: Call rejected
    end
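
The "Return TwiML with WebSocket URL + token" step in the diagram could look roughly like the sketch below. It is not taken from main.py: the function name build_stream_twiml is hypothetical, and passing the token as a custom Stream Parameter is an assumption about how the token reaches /media-stream.

```python
import xml.etree.ElementTree as ET


def build_stream_twiml(host: str, token: str) -> str:
    """Return TwiML that connects the call to a WebSocket media stream."""
    response = ET.Element("Response")
    connect = ET.SubElement(response, "Connect")
    # Twilio opens a WebSocket to this URL and streams the call audio over it.
    stream = ET.SubElement(connect, "Stream", url=f"wss://{host}/media-stream")
    # Assumption: the token travels as a custom <Parameter>, which Twilio
    # includes in the stream's "start" message.
    ET.SubElement(stream, "Parameter", name="token", value=token)
    return ET.tostring(response, encoding="unicode")


if __name__ == "__main__":
    print(build_stream_twiml("your-tunnel-name.trycloudflare.com", "example-token"))
```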

Security

This application uses three layers of security to protect against unauthorized access:

Layer 1: Phone Number Allowlist (Twilio Function)

Runs on Twilio's infrastructure before reaching your server
Only approved phone numbers can proceed
See twilio/allowlist-function.js for implementation
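
The real check lives in twilio/allowlist-function.js and runs as JavaScript on Twilio. Purely as an illustration of the logic, here is a minimal Python sketch that parses a comma-separated ALLOWED_NUMBERS value and tests a caller against it. The helper names parse_allowlist and is_allowed are hypothetical.

```python
def parse_allowlist(raw: str) -> set[str]:
    """Split a comma-separated ALLOWED_NUMBERS value into a set of numbers."""
    return {number.strip() for number in raw.split(",") if number.strip()}


def is_allowed(caller: str, allowlist: set[str]) -> bool:
    """Return True only if the caller's number is in the allowlist."""
    return caller.strip() in allowlist


if __name__ == "__main__":
    allowed = parse_allowlist("+15551234567,+15559876543")
    print(is_allowed("+15551234567", allowed))
```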

Layer 2: Twilio Signature Validation

Validates all webhook requests from Twilio
Ensures requests are authentic and haven't been tampered with
Uses HMAC-SHA1 with your TWILIO_AUTH_TOKEN
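
For reference, Twilio's signature scheme can be reproduced with the standard library alone. The sketch below is not the code in main.py (which may instead use the RequestValidator class from the twilio package); the function names are hypothetical, and it covers form-encoded POST webhooks only.

```python
import base64
import hashlib
import hmac


def compute_twilio_signature(auth_token: str, url: str, params: dict[str, str]) -> str:
    """Compute the value Twilio sends in the X-Twilio-Signature header."""
    # Twilio signs the full request URL followed by each POST field's
    # name and value, with the fields sorted by name.
    payload = url + "".join(key + params[key] for key in sorted(params))
    digest = hmac.new(
        auth_token.encode("utf-8"), payload.encode("utf-8"), hashlib.sha1
    ).digest()
    return base64.b64encode(digest).decode("ascii")


def is_valid_twilio_request(
    auth_token: str, url: str, params: dict[str, str], signature: str
) -> bool:
    """Compare the expected and received signatures in constant time."""
    expected = compute_twilio_signature(auth_token, url, params)
    return hmac.compare_digest(expected.encode("utf-8"), signature.encode("utf-8"))
```

The URL passed in must be the exact public URL Twilio called, including scheme and path, or the signatures will not match.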

Layer 3: WebSocket Token Authentication

Single-use tokens generated for each call
60-second expiration window
Prevents unauthorized WebSocket connections
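
A single-use token with a 60-second lifetime can be sketched as an in-memory store. This is an illustration under assumptions, not the project's implementation: the TokenStore class and its method names are hypothetical, and an in-memory dict only works when one process handles both the webhook and the WebSocket.

```python
import secrets
import time
from collections.abc import Callable

TOKEN_TTL_SECONDS = 60


class TokenStore:
    """Issue random tokens that can be redeemed once, within a time limit."""

    def __init__(
        self,
        ttl: float = TOKEN_TTL_SECONDS,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._ttl = ttl
        self._clock = clock
        self._expiry: dict[str, float] = {}

    def issue(self) -> str:
        """Create a new token and remember when it expires."""
        token = secrets.token_urlsafe(32)
        self._expiry[token] = self._clock() + self._ttl
        return token

    def consume(self, token: str) -> bool:
        """Return True if the token is known and unexpired, then forget it."""
        # pop() removes the token, so a second attempt with it always fails.
        expires_at = self._expiry.pop(token, None)
        return expires_at is not None and self._clock() <= expires_at
```

In this sketch, issue() would be called while handling /incoming-call and consume() when Twilio connects to /media-stream.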

Prerequisites

Python 3.13+
uv for dependency management
Twilio account with a voice-capable phone number
OpenAI API key with Realtime API access
cloudflared for local development tunneling
Docker (optional, for containerized deployment)
Fly.io account (optional, for production deployment)

Setup

1. Configure environment

cp .env.example .env

Edit .env and configure:

OPENAI_API_KEY - Your OpenAI API key
TWILIO_AUTH_TOKEN - Your Twilio Auth Token (found in Twilio Console)
ZAPIER_MCP_URL - Zapier MCP server URL (default: https://mcp.zapier.com/api/mcp/mcp)
ZAPIER_MCP_PASSWORD - Zapier API key in base64 format (get from Zapier MCP Developer)
ASSISTANT_INSTRUCTIONS - AI assistant personality, behavior, and tool usage instructions
VOICE - OpenAI Realtime voice name (e.g., alloy, shimmer, coral)
PORT - Server port (default: 5050)
TEMPERATURE - AI temperature (default: 0.8)

Note: WEBHOOK_URL and ALLOWED_NUMBERS are used only by the Twilio Function (Layer 1 in the Security section above, deployed in step 3), not by your local application.
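
One way to read these variables with the documented defaults is sketched below. The Settings dataclass and load_settings function are hypothetical, only a subset of the variables is shown, and the fallback voice "alloy" is an assumption because no default for VOICE is documented.

```python
import os
from collections.abc import Mapping
from dataclasses import dataclass


@dataclass(frozen=True)
class Settings:
    openai_api_key: str
    twilio_auth_token: str
    voice: str
    port: int
    temperature: float


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Read configuration from the environment, failing fast on missing secrets."""
    return Settings(
        # Required values: a missing key raises KeyError at startup.
        openai_api_key=env["OPENAI_API_KEY"],
        twilio_auth_token=env["TWILIO_AUTH_TOKEN"],
        # Assumption: "alloy" as fallback; the README states no default voice.
        voice=env.get("VOICE", "alloy"),
        # Documented defaults: PORT 5050 and TEMPERATURE 0.8.
        port=int(env.get("PORT", "5050")),
        temperature=float(env.get("TEMPERATURE", "0.8")),
    )
```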

2. Start cloudflared tunnel

Option A: Quick temporary tunnel (random URL):

make tunnel-quick

Copy the forwarding URL (e.g., https://xyz.trycloudflare.com).

Option B: Named tunnel with stable domain (one-time setup):

# 1. Authenticate with Cloudflare
cloudflared tunnel login

# 2. Create a named tunnel
cloudflared tunnel create assistant

# 3. Route a DNS hostname (replace with your domain)
cloudflared tunnel route dns assistant assistant.yourdomain.com

# 4. Create ~/.cloudflared/assistant.yml with your tunnel ID and domain

# 5. Run the tunnel
make tunnel

3. Deploy Twilio Allowlist Function

Create the Function:

In Twilio Console, go to Functions & Assets > Services
Create a new Service (e.g., "voice-assistant-auth")
Add a new Function with path /incoming-call
Copy the code from twilio/allowlist-function.js
In Environment Variables, add:
- ALLOWED_NUMBERS - Comma-separated phone numbers (e.g., +15551234567,+15559876543)
- WEBHOOK_URL - Your assistant URL (e.g., https://your-tunnel-name.trycloudflare.com/incoming-call)
Deploy the service

Configure Your Phone Number:

Navigate to Phone Numbers > Manage > Active Numbers
Select your number
Set "A call comes in" to Function, then select your deployed function
Save

Run

Local Development

uv run python main.py

Call your Twilio number to talk with the assistant.

Deployment

Docker

Build and run locally:

docker compose up

Run with cloudflared tunnel (dev profile):

docker compose --profile dev up

Fly.io

Initial setup:

# Install flyctl
brew install flyctl

# Authenticate
fly auth login

# Launch app (generates fly.toml and creates app, but doesn't deploy)
fly launch --no-deploy

Configure environment:

Set secrets:

fly secrets set OPENAI_API_KEY=your_key_here
fly secrets set TWILIO_AUTH_TOKEN=your_token_here
fly secrets set ZAPIER_MCP_PASSWORD=your_zapier_api_key_base64

Non-secret environment variables (VOICE, TEMPERATURE, ASSISTANT_INSTRUCTIONS, ZAPIER_MCP_URL) are configured in fly.toml.

Deploy:

fly deploy

Get your app URL:

fly status

Your webhook URL will be: https://[your-app-name].fly.dev/incoming-call

Use this URL as your WEBHOOK_URL in the Twilio Function.

Scale to a single machine (optional):

fly scale count 1 -y

Custom domain setup (optional):

Get your Fly IP addresses:

fly ips list

In your DNS provider (e.g., Cloudflare), add DNS records for your custom domain:
- A record: Point to the IPv4 address shown in fly ips list
- AAAA record: Point to the IPv6 address shown in fly ips list
Add the custom domain to Fly (triggers Let's Encrypt certificate):

fly certs add your-domain.com

Check certificate status:

fly certs show your-domain.com

Once issued, update your Twilio Function's WEBHOOK_URL to use your custom domain:
- Example: https://your-domain.com/incoming-call

View logs:

fly logs

MCP Integration (Zapier)

This assistant integrates with Zapier's MCP server, which connects to multiple services:

Todoist: Task management and reminders
Gmail: Email search

Setup

Get Zapier MCP API Key:
- Go to Zapier MCP Developer
- Generate an API key
- Store the key as ZAPIER_MCP_PASSWORD (in .env locally, or with fly secrets set on Fly.io)

Configure in .env:

ZAPIER_MCP_URL=https://mcp.zapier.com/api/mcp/mcp
ZAPIER_MCP_PASSWORD=your_zapier_api_key_base64

The MCP configuration is set in main.py:

{
    "type": "mcp",
    "server_label": "zapier",
    "server_url": ZAPIER_MCP_URL,
    "headers": {
        "Authorization": f"Bearer {ZAPIER_MCP_PASSWORD}"
    },
    "require_approval": "never"
}
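
As a sketch of how that dictionary might be assembled from the environment values, the hypothetical helper below builds the same structure and rejects empty settings early. The function name build_zapier_mcp_tool is an assumption and does not appear in main.py.

```python
def build_zapier_mcp_tool(server_url: str, password: str) -> dict:
    """Build the MCP tool entry shown above from configuration values."""
    if not server_url or not password:
        # Fail early rather than sending an empty Bearer token to Zapier.
        raise ValueError("ZAPIER_MCP_URL and ZAPIER_MCP_PASSWORD must both be set")
    return {
        "type": "mcp",
        "server_label": "zapier",
        "server_url": server_url,
        "headers": {"Authorization": f"Bearer {password}"},
        # "never" lets the assistant call Zapier tools without asking first.
        "require_approval": "never",
    }
```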

Available Voice Commands

Once configured, you can use natural language for:

Todoist Tasks:

"Add buy milk to my todo list"
"What tasks do I have today?"
"Mark task as complete"
"What's due tomorrow?"

Gmail Search:

"Search my email for messages about project updates"
"Find emails from John sent this week"

Todoist API Tips

When fetching today's tasks, the assistant uses the Todoist API with the filter=today parameter:

GET https://api.todoist.com/rest/v2/tasks?filter=today

This ensures accurate results when users ask "what do I have to do today?" or similar queries.
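
To show how the filter parameter is encoded, here is a small sketch that builds the request URL without sending it. The helper name tasks_url is hypothetical; in this project the call is made through Zapier rather than by code like this.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

TODOIST_TASKS_URL = "https://api.todoist.com/rest/v2/tasks"


def tasks_url(filter_query: str) -> str:
    """Return the tasks endpoint URL with a URL-encoded filter parameter."""
    return f"{TODOIST_TASKS_URL}?{urlencode({'filter': filter_query})}"


if __name__ == "__main__":
    print(tasks_url("today"))
```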

Features

Real-time voice conversation with OpenAI
Natural interrupt handling and AI preemption
Bidirectional audio streaming between Twilio and OpenAI
Task management via Zapier MCP (Todoist integration)
Email search via Zapier MCP (Gmail integration)
