A multimodal agent that uses livekit, deepgram and openai to chat with users in real-time.
git clone https://github.com/simandebvu/multimodal
cd multimodal
pip install -r requirements.txtDescribe how to use your project. For example:
python agent.py- Multimodal agent that uses livekit, deepgram and openai to chat with users in real-time
- Uses the
livekit-agentslibrary to create a multimodal agent - Uses the
livekit-plugins-deepgramandlivekit-plugins-openailibraries to use deepgram and openai respectively - Uses the
dotenvlibrary to manage environment variables - Uses the
requestslibrary to send requests to the livekit server - Uses the
websocketslibrary to connect to the livekit server - Uses the
asynciolibrary to handle asynchronous tasks - Uses the
jsonlibrary to handle json data
This project requires:
- Python 3.8+
- OpenAI API
- Deepgram API
- dotenv
- Create a
.envfile in the root directory. - Add your API keys and organization ID:
OPENAI_API_KEY=your_openai_api_key_here
OPENAI_ORGANIZATION_ID=your_organization_id_here
DEEPGRAM_API_KEY=your_deepgram_api_key_here