OkayJerry/realtime-sts

Realtime Speech-to-Speech

[Image: realtime-sts_diagram]

Quick Setup

  1. Websocket Server

| Step | Terminal | Purpose | Quick Reference |
| --- | --- | --- | --- |
| 1 | | Install dependencies. | `pip install -r requirements.txt` |
| 2 | 1 | Runs the server. | `python websocket-server/main.py` |
| 3 | 2 | Exposes server to Twilio. | `ngrok http 8081` |
| 4 | | Runs the server. | `python websocket-server/main.py` |
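
Before starting ngrok in Terminal 2, it can help to confirm that something is listening locally. The following is a minimal sketch, not part of the repository. It assumes the server listens on port 8081 (inferred from the `ngrok http 8081` command), and `port_is_open` is a hypothetical helper name.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to (host, port) succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Connection refused, timed out, or host unreachable.
        return False


if __name__ == "__main__":
    # Assumption: the server listens on 8081, the port passed to ngrok.
    if port_is_open("127.0.0.1", 8081):
        print("Local server is reachable on port 8081.")
    else:
        print("Nothing is listening on port 8081 yet.")
```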
  2. Environment (.env) Variables

| Name | Purpose | How to Find |
| --- | --- | --- |
| OPENAI_API_KEY | For connecting to OpenAI's Realtime API. | Sign In > + Create a new secret key > Create secret key |
| PUBLIC_URL | To establish a live, bidirectional communication stream. | In Terminal 2, ngrok lists this under "Session Status" as the forwarding URL. |
| FIREBASE_SERVICE_ACCOUNT_KEY_PATH | For storing call data (e.g. transcripts, events). | Create a Google Firebase project and follow these instructions. |
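
A quick way to catch a missing variable before placing a call is to check all three names up front. This is a minimal sketch, not the repository's own loading code. It assumes the variables are available as a plain mapping (for example, after the `.env` file has been loaded into the process environment), and `missing_settings` is a hypothetical helper name.

```python
# The three variable names listed in the table above.
REQUIRED_SETTINGS = (
    "OPENAI_API_KEY",
    "PUBLIC_URL",
    "FIREBASE_SERVICE_ACCOUNT_KEY_PATH",
)


def missing_settings(env):
    """Return the required variable names that are unset or blank in env."""
    return [name for name in REQUIRED_SETTINGS if not env.get(name, "").strip()]


# Example with placeholder values: one variable set, one blank, one absent.
example = {"OPENAI_API_KEY": "sk-placeholder", "PUBLIC_URL": ""}
print(missing_settings(example))
# prints ['PUBLIC_URL', 'FIREBASE_SERVICE_ACCOUNT_KEY_PATH']
```

Passing `os.environ` instead of the example dictionary would check the real environment.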
  3. Twilio

| Step | Instruction |
| --- | --- |
| 1 | Create a Twilio account. |
| 2 | Purchase and activate a phone number. |
| 3 | Set the "A call comes in" webhook to https://forwarding-url-goes-here.ngrok.app/twiml. |
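
For context on what the `/twiml` webhook is expected to return: Twilio opens a bidirectional media stream when the TwiML response contains a `<Connect><Stream>` element pointing at a `wss://` URL. The sketch below illustrates that shape only and is not the repository's handler. The `/media-stream` path and the `build_twiml` name are hypothetical, and `PUBLIC_URL` is assumed to include its `https://` scheme.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlsplit, urlunsplit


def build_twiml(public_url: str, stream_path: str = "/media-stream") -> str:
    """Build a TwiML document that connects the call to a WebSocket stream.

    stream_path is a hypothetical endpoint; the real server may use another.
    """
    parts = urlsplit(public_url)
    # Keep the ngrok host but switch the scheme to secure WebSocket.
    ws_url = urlunsplit(("wss", parts.netloc, stream_path, "", ""))

    response = ET.Element("Response")
    connect = ET.SubElement(response, "Connect")
    ET.SubElement(connect, "Stream", {"url": ws_url})
    return ET.tostring(response, encoding="unicode")


print(build_twiml("https://forwarding-url-goes-here.ngrok.app"))
```

Deriving the stream URL from `PUBLIC_URL` keeps the webhook and the stream on the same ngrok host, so only one value changes when ngrok restarts with a new forwarding URL.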

Additional Plans

About

AI Speech-to-Speech Server. Built with Twilio and the OpenAI Realtime API.
