《Machine Learning Systems: Design and Implementation》- Chinese Version

TeX 4,339 450 Updated Apr 13, 2024

Multilingual G2P in 100 languages

Jupyter Notebook 315 25 Updated May 26, 2023

A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset

Python 346 70 Updated Dec 24, 2021

Neural end-to-end Speech Translation Toolkit

Python 308 42 Updated Jun 28, 2022

Grapheme to phoneme conversion with deep learning.

Python 382 45 Updated Dec 8, 2023

A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation

Python 528 57 Updated Apr 2, 2023

Embedded segmental K-means (ES-KMeans) in Python.

Python 14 6 Updated Apr 22, 2024

A Neural Framework for MT Evaluation

Python 562 88 Updated Mar 26, 2025

Phonetisaurus G2P

Shell 467 121 Updated Jun 1, 2024

A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)

Python 698 143 Updated Mar 25, 2025

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

1,881 238 Updated Jun 6, 2024

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1,318 142 Updated Jun 6, 2024

CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)

Python 372 43 Updated Sep 14, 2021

A PyPI package for fast word/character error rate (WER/CER) calculation

Python 71 15 Updated Jul 1, 2023

alfred-py: A deep learning utility library for **humans**. For more detail on using the library, see: https://zhuanlan.zhihu.com/p/341446046

Python 912 137 Updated Sep 3, 2024

Tracking the progress in end-to-end speech translation

260 25 Updated Oct 25, 2023

Tracking the progress in non-autoregressive generation (translation, transcription, etc.)

307 28 Updated Mar 15, 2023

A curated list of awesome self-supervised methods

6,249 836 Updated Jul 3, 2024

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Python 2,615 245 Updated Jun 22, 2024

An attempt at tracking the state of the art and recent results (bibliography) on speech recognition.

1,863 226 Updated Jun 27, 2022

Dynamic Meta-Embeddings for Improved Sentence Representations

Python 331 49 Updated Sep 25, 2020

🎆Interactive Online Platform that Visualizes Algorithms from Code

JavaScript 47,260 7,345 Updated Jun 9, 2024

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 15,993 3,568 Updated Jun 2, 2023

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 14,123 2,112 Updated Mar 31, 2025

A fast, efficient universal vector embedding utility package.

Python 1,644 119 Updated Aug 3, 2023

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive lea…

C++ 8,543 1,928 Updated Oct 17, 2024