Converts speech to text from any audio/video file using OpenAI API
-
Updated
Jun 8, 2024 - Python
Converts speech to text from any audio/video file using OpenAI API
Uses the powerful WhisperS2T and Ctranslate2 libraries to batch transcribe multiple files
WhisperAudioTranscriber is an asynchronous audio recording and transcription tool built using Python. It utilizes the Hugging Face API, specifically leveraging the powerful capabilities of OpenAI's Whisper model
Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.
A simple Django project to demonstrate Google Speech Recognition.
A versatile CLI and Python wrapper for Google's Gemini Pro large language models. Streamline the creation of chatbots, generate dynamic text, analyze images and transcribe audio with ease.
Add a description, image, and links to the audio-transcribing topic page so that developers can more easily learn about it.
To associate your repository with the audio-transcribing topic, visit your repo's landing page and select "manage topics."