Fun and interactive command-line program that listens to your questions, provides intelligent responses, and speaks the answers back to you.
-
Updated
Jun 3, 2024 - Python
Fun and interactive command-line program that listens to your questions, provides intelligent responses, and speaks the answers back to you.
💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
A web UI Project In order to learn the large language model. This project includes features such as chat, quantization, fine-tuning, prompt engineering templates, and multimodality.
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并添加配音
🧠 Leon is your open-source personal assistant.
A Python library for solving reCAPTCHA v2 and v3 with Playwright
Its equipped with a variety of features to enhance your productivity and convenience. It can open and close any apps, search anything on Google and Wikipedia, check the temperature, facilitate message passing, transcribe spoken words into text, play games, utilize AI features , perform keyboard shortcuts, control volume, play songs, etc...
Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
Anaouder mouezh e Brezhoneg gant Vosk
A centralized, real-time analysis of multimedia data with Apache Spark
Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-text dictation.
Synchronized Translation for Videos. Video dubbing
WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly to your clipboard. With just a click of a button, you can effortlessly convert spoken words into written text, ready to be pasted wherever you need it. This application harnesses the power of OpenAI’s Whisper for free.
Speech Recognition for Ukrainian
Whisper command line client compatible with original OpenAI client based on CTranslate2.
This AI Smart Speaker uses speech recognition and text-to-speech to enable voice-driven conversations with OpenAI. The user speaks a prompt into the microphone, and the program sends the prompt to OpenAI to generate a response. The response is then converted to an audio file and played back to the user.
Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Add a description, image, and links to the speech-to-text topic page so that developers can more easily learn about it.
To associate your repository with the speech-to-text topic, visit your repo's landing page and select "manage topics."