Stars
《Machine Learning Systems: Design and Implementation》- Chinese Version
Multilingual G2P in 100 languages
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
Grapheme to phoneme conversion with deep learning.
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation
Embedded segmental K-means (ES-KMeans) in Python.
A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)
A PyPI package for fast word/character error rate (WER/CER) calculation
alfred-py: A deep learning utility library for **human**, more detail about the usage of lib to: https://zhuanlan.zhihu.com/p/341446046
Tracking the progress in end-to-end speech translation
Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
A curated list of awesome self-supervised methods
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.
Dynamic Meta-Embeddings for Improved Sentence Representations
🎆Interactive Online Platform that Visualizes Algorithms from Code
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
A very simple framework for state-of-the-art Natural Language Processing (NLP)
A fast, efficient universal vector embedding utility package.
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive lea…