speech-to-text

Here are 1,029 public repositories matching this topic...

mezbaul-h / kinkajou

Fun and interactive command-line program that listens to your questions, provides intelligent responses, and speaks the answers back to you.

python text-to-speech tts cli-app speech-to-text command-line-tool large-language-models llm

Updated Jun 3, 2024
Python

savbell / whisper-writer

Star

💬📝 A small dictation app using OpenAI's Whisper speech recognition model.

speech-recognition openai speech-to-text dictation whisper typing-assistant openai-api openai-whisper faster-whisper

Updated Jun 3, 2024
Python

smalltong02 / keras-llm-robot

Star

A web UI Project In order to learn the large language model. This project includes features such as chat, quantization, fine-tuning, prompt engineering templates, and multimodality.

text-to-speech chatbot gemini knowledgebase speech-to-text vectorization multimodal faiss rag milvus streamlit llm code-interpreter chatgpt pgvector fastchat

Updated Jun 3, 2024
Python

jianchang512 / pyvideotrans

Star

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，并添加配音

text-to-speech speech-to-text video-transition

Updated Jun 3, 2024
Python

leon-ai / leon

Star

🧠 Leon is your open-source personal assistant.

Updated Jun 3, 2024
Python

Xewdy444 / Playwright-reCAPTCHA

Star

A Python library for solving reCAPTCHA v2 and v3 with Playwright

library recaptcha solver asyncio speech-to-text playwright

Updated Jun 3, 2024
Python

Its equipped with a variety of features to enhance your productivity and convenience. It can open and close any apps, search anything on Google and Wikipedia, check the temperature, facilitate message passing, transcribe spoken words into text, play games, utilize AI features , perform keyboard shortcuts, control volume, play songs, etc...

Updated Jun 3, 2024
Python

edenai / edenai-apis

Star

Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines

python nlp api natural-language-processing text-to-speech ocr ai computer-vision aggregator machine-translation image-processing speech-recognition speech-to-text optical-character-recognition ai-as-a-service video-recognition pre-trained-model document-parsing

Updated Jun 3, 2024
Python

gweltou / vosk-br

Star

Anaouder mouezh e Brezhoneg gant Vosk

speech-to-text stt breton-language breton vosk vosk-models

Updated Jun 3, 2024
Python

dimastatz / deep-signal

Star

A centralized, real-time analysis of multimedia data with Apache Spark

apache-spark speech-to-text openai-whisper

Updated Jun 3, 2024
Python

Mohamad-Hussein / speech-assistant

Star

Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-text dictation.

desktop-app translation offline speech speech-to-text transcription dictation whisper huggingface openai-whisper whisper-ai distil-whisper

Updated Jun 3, 2024
Python

R3gm / SoniTranslate

Star

Synchronized Translation for Videos. Video dubbing

text-to-speech translation tts speech-to-text stt audio-processing asr document-translator dubbing diarization automatic-dubbing subtitle-to-speech translate-audio translate-video video-dubbing

Updated Jun 2, 2024
Python

gustavostz / whisper-clip

Star

WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly to your clipboard. With just a click of a button, you can effortlessly convert spoken words into written text, ready to be pasted wherever you need it. This application harnesses the power of OpenAI’s Whisper for free.

python productivity clipboard speech-recognition openai speech-to-text whisper audio-processing productivity-tools audio-transcription whisper-ai

Updated Jun 2, 2024
Python

egorsmkv / speech-recognition-uk

Star

Speech Recognition for Ukrainian

text-to-speech speech tts speech-synthesis speech-recognition speech-to-text ukrainian ukrainian-language

Updated Jun 2, 2024
Python

Softcatala / whisper-ctranslate2

Star

Whisper command line client compatible with original OpenAI client based on CTranslate2.

speech-recognition speech-to-text whisper openai- openai-whisper

Updated Jun 2, 2024
Python

Olney1 / ChatGPT-OpenAI-Smart-Speaker

Star

This AI Smart Speaker uses speech recognition and text-to-speech to enable voice-driven conversations with OpenAI. The user speaks a prompt into the microphone, and the program sends the prompt to OpenAI to generate a response. The response is then converted to an audio file and played back to the user.

text-to-speech ai smarthome artificial-intelligence speech-recognition openai speech-to-text smartspeaker gpt-4 chatgpt