Creates text from the audio of an A/V input file using Docker and Sphinx. Extracts keywords and NLP entities. Leverages OpenNews, Stanford, Oxford, CMU, and more.
Updated Mar 4, 2017 - Shell
OLAMI API Quickstart cURL Samples (in bash)
This folder contains a solution for speech recognition and synthesis using the Microsoft Server Speech Platform Runtime (Version 11)
Adnabod lleferydd Cymraeg gyda Kaldi ASR | Welsh language speech recognition using Kaldi ASR
Video summarization: summarizes a video lecture and converts it to a slideshow using speech-to-text, keyword extraction, and OpenCV shot detection.
Top level code to transcribe English audio/video files into text/subtitles
Realtime internet radio stream speech recognition with Julius & ffmpeg
Dockerfile for compiling Kaldi for Android.
This is a Kaldi tutorial for beginners
Recognizes base phones (/a/, /u/, /i/) in a given speech sample and indicates the indices of all phones.
PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
Automatic Speech Recognition (ASR) - German
OpenSIPS + RTPEngine Recording + Speech Recognition in HEP
This repository contains my attempt to use two well-known speech recognition frameworks (Kaldi, CMU Sphinx4) for the Arabic language, using the publicly available dataset "Arabic Corpus of Isolated Words"
A Bash script designed to make training sphinx4 and pocketsphinx acoustic libraries faster and easier
Hemera - Intelligent System
Iban-based Kaldi recipe for an Indonesian speech corpus, presented at ASJ Spring 2019.
Scripts for training acoustic models