Hide content and notifications from this user.
Contact Support about this user's behavior.
Speech Recognition using DeepSpeech2 and the CTC activation function. Edit
A Real-time Speaker Recognition System with GUI
Generate text and audio splits from strings successfully aligned by Gentle
PyTorch CTC Decoder bindings
Find audio/video links in a webpage
A MXNet implementation of Baidu's DeepSpeech architecture
Language Modeling with RNN using TensorFlow
Top level code to transcribe English audio/video files into text/subtitles
The official repository of the Eesen project
Clean podcast data
A TensorFlow implementation of Baidu's DeepSpeech architecture
Convolutional Neural Network for Text Classification in Tensorflow
Deep learning models trained to correct input errors in short, message-like text
Various scripts used while playing around with Google Brain's billion word language model
MLP based Voice Activity Detection
MMSE STSA Speech enhancement
LogMMSE speech enhancement/noise reduction
This library provides common speech features for ASR including MFCCs and filterbank energies.
Noise reduction using Log_MMSE method implement by C language
A waveform image generator
A library for cross-browser normalization of keyboard events