A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
-
Updated
Aug 3, 2024 - Python
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
视频音频生成字幕,生成srt文件。无需申请第三方API,本地实现音频转文本。基于Transformer的视频字幕生成框架。A GUI tool for generating subtitle from videos and generating srt files.
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
A python package to build AI-powered real-time audio applications
On-device streaming speech-to-text engine powered by deep learning
Effortlessly add AI-generated transcription subtitles to your videos
Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
Easily take an entire YouTube playlist and turn it into high quality transcripts using Whisper.
🙊 software for creating speech recognition models.
turnkey self-hosted offline transcription and diarization service with llm summary
Korean Alphabet Transcription
Crowdsourcing platform for full text transcription and tagging. https://crowd.loc.gov
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
On-device speech-to-text engine powered by deep learning
Python implementation of pre-processing for End-to-End speech recognition
Streamlining SLAM-seq analysis with ultra-high sensitivity
Translates standard alphabet based text to Grade 2 Braille and back.
ASR with PyTorch
Add a description, image, and links to the transcription topic page so that developers can more easily learn about it.
To associate your repository with the transcription topic, visit your repo's landing page and select "manage topics."