speech-transcription

Here are 14 public repositories matching this topic...

Dadangdut33 / Speech-Translate

A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.

python translate whisper tkinter-python speech-translation speech-transcription

Updated Jan 18, 2024
Python

Appen / UHV-OTS-Speech

Star

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

speech-recognition speech-processing audio-segmentation gender-classification speaker-diarization synthetic-speech-detection topic-detection speech-seperation speaker-identification accent-detection speech-transcription speech-annotation

Updated Mar 25, 2023
Forth

jhauret / vibravox

Star

Speech to Phoneme, Bandwidth Extension and Speaker Verification using the Vibravox dataset.

pytorch hydra datasets speaker-verification speech-enhancement pytorch-lightning speech-transcription bandwidth-extension

Updated Nov 6, 2024
Python

srinivr / kaldi-long-audio-alignment

Star

Long audio alignment using Kaldi

speech-recognition automatic-speech-recognition speech-to-text kaldi transcription asr speechrecognition split-audio longaudio-alignment audio-segments speech-transcription

Updated Apr 22, 2021
Shell

KevKibe / African-Whisper

Sponsor

Star

🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.

speech speech-recognition speech-to-text whisper asr speech-translation speech-transcription

Updated Nov 4, 2024
Python

PranavPutsa1006 / Speaker-Diarization

Star

Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python

deep-learning neural-networks speech-to-text mfcc speaker-diarization spectral-clustering voice-activity-detection speech-segmentation speech-detection speech-transcription embeddings-extraction

Updated Jun 18, 2023
Jupyter Notebook

capjamesg / awsnap.js

Star

Navigate websites by clicking your fingers and saying the link you want to visit.

webaudio-api audio-classification tensorflow-js speech-transcription

Updated Oct 1, 2023
HTML

otonomee / mic2transcript

Star

CLI tool that continuously transcribes audio from the device's built-in microphone to a text file. Runs in the background, providing an ongoing log of ambient audio as text.

audio cli speech openai transcription whisper cli-tool speech-transcription