🎙️ Speak with AI - Run locally using ollama or OpenAI - XTTS or OpenAI Speech or ElevenLabs
Updated Jul 24, 2024 - Python
Translates speech into a language different from the original in real time.
Provides unlimited ElevenLabs API calls.
Low-latency AI companion voice chat in 60 lines of code using faster_whisper and ElevenLabs input streaming
Kent State Hackathon 2024 - (2nd Place) Y.A.L.L.O.
Text-to-speech Python scripts for podcasting
This Python script is for a voice-interface chatbot named Jervis. It uses OpenAI's GPT-3.5-turbo-instruct model to respond to user input. The chatbot responds using ElevenLabs voices. Conversations are saved to MongoDB and to a local MP3 file, and can be emailed if needed.
This script automates the creation of YouTube Shorts videos on a given topic. It uses OpenAI's GPT-4 for script generation, ElevenLabs for voiceovers, and Pexels for video footage.
Takes a YouTube video, clones the voice, and re-creates that video in a different language
A FastAPI wrapper of babca / python-gsmmodem for a Waveshare SIM7600X. Not an exact copy of 'python-gsmmodem', so be sure to uninstall that lib or venv to run | Open-source Twilio with LLM batteries
Quick test of ElevenLabs API and client libraries.
The Podcast Generator project combines OpenAI's GPT-3.5-turbo for script generation and Eleven Labs AI Text-to-Speech (TTS) for realistic audio. It automates podcast creation by aggregating content from RSS feeds, allowing GPT-3.5-turbo to craft engaging scripts converted into lifelike audio using Eleven Labs' TTS.
A smart AI voice assistant with multi-language support and long-term memory. Currently works best for Swedish and English. Compatible with Windows and Raspberry Pi. The assistant can use various functions and tools to answer questions (Google, Wolfram Alpha, etc.). Based on OpenAI's GPT models, Google STT and TTS, and ElevenLabs TTS.
TEXT📲 +1 (877) 274-1880 a problem to solve➡️get a ☎️ call from Thrawn 🔥
This simple project aims to make long-form TTS synthesis easier using the ElevenLabs API.
babelSUM is a .txt, .epub, and .pdf autoreader. It chunks text and reads it in segments. The user can search the text or have AI summarise it.
A small sample that scrapes a traffic RSS feed from England and passes it through ElevenLabs for natural speech processing.
Text to Speech by ElevenLabs