Low-latency AI companion voice chat in 60 lines of code, using faster_whisper and ElevenLabs input streaming
Updated Jun 8, 2024 - Python
Provides unlimited ElevenLabs API calls.
Takes a YouTube video, clones the voice, and re-creates the video in a different language
Text to Speech by ElevenLabs
A script that automates the creation of YouTube Shorts videos on a given topic. It uses OpenAI's GPT-4 for script generation, ElevenLabs for voiceovers, and Pexels for video footage.
A smart AI voice assistant with multi-language support and long-term memory. Currently best for Swedish and English. Compatible with Windows and Raspberry Pi. The assistant can use various functions and tools to answer questions (Google, Wolfram Alpha, etc.). Based on OpenAI's GPT-models, Google STT and TTS, and ElevenLabs TTS.
Speak with AI - Run locally using ollama or use OpenAI - XTTS or OpenAI Speech or ElevenLabs
The Podcast Generator project combines OpenAI's GPT-3.5-turbo for script generation and Eleven Labs AI Text-to-Speech (TTS) for realistic audio. It automates podcast creation by aggregating content from RSS feeds, allowing GPT-3.5-turbo to craft engaging scripts converted into lifelike audio using Eleven Labs' TTS.
Text-to-speech Python scripts for podcasting
Kent State Hackathon 2024 - (2nd Place) Y.A.L.L.O.
This simple project aims to make long-form TTS synthesis easier using the ElevenLabs API.
TEXT📲 +1 (877) 274-1880 a problem to solve➡️get a ☎️ call from Thrawn 🔥
Quick test of ElevenLabs API and client libraries.
A FastAPI wrapper of babca/python-gsmmodem for a Waveshare SIM7600X. Not an exact copy of 'python-gsmmodem', so be sure to uninstall that lib or venv to run | Open-source Twilio with LLM batteries
Translates speech from its original language into a different language in real time.
A small sample that scrapes a traffic RSS feed from England and runs it through ElevenLabs for natural-sounding speech.
A voice changer made using Google's speech-to-text and ElevenLabs
babelSUM is a .txt, .epub, and .pdf autoreader. It chunks text and reads it in segments. The user can search the text or request an AI summary.
This Python script is a voice-interface chatbot named Jervis. It uses OpenAI's GPT-3.5-turbo-instruct model to respond to user input and speaks its replies with ElevenLabs voices. Conversations are saved to MongoDB and to a local MP3 file, and can be emailed if needed.
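Several of the projects above chunk long text before sending it to a TTS service. A minimal sketch of that step follows. It is not taken from any listed repository: the function name `chunk_text` and the 2500-character default are illustrative assumptions, not a documented ElevenLabs limit, and no API call is made.

```python
import re


def chunk_text(text: str, max_chars: int = 2500) -> list[str]:
    """Split text into chunks of at most max_chars characters.

    Breaks at sentence boundaries where possible. The default limit
    is an assumed value; check your TTS provider's actual limit.
    """
    # Split after ., ! or ? followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        # A single sentence longer than the limit is hard-split.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= max_chars:
            current = candidate
        else:
            # Current chunk is full; start a new one with this sentence.
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Each returned chunk could then be passed to whichever TTS client a project uses, and the resulting audio segments joined in order.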