Changhan Wang kahne

GenAI/Llama @ Meta

235 followers · 3 following

Stars

openmlsys / openmlsys-zh

《Machine Learning Systems: Design and Implementation》- Chinese Version

TeX 4,339 450 Updated Apr 13, 2024

lingjzhu / CharsiuG2P

Multilingual G2P in 100 languages

Jupyter Notebook 315 25 Updated May 26, 2023

kakaobrain / g2pm

A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset

Python 346 70 Updated Dec 24, 2021

bytedance / neurst

Neural end-to-end Speech Translation Toolkit

Python 308 42 Updated Jun 28, 2022

spring-media / DeepPhonemizer

Grapheme to phoneme conversion with deep learning.

Python 382 45 Updated Dec 8, 2023

facebookresearch / voxpopuli

A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation

Python 528 57 Updated Apr 2, 2023

kamperh / eskmeans

Embedded segmental K-means (ES-KMeans) in Python.

Python 14 6 Updated Apr 22, 2024

Unbabel / COMET

A Neural Framework for MT Evaluation

Python 562 88 Updated Mar 26, 2025

AdolfVonKleist / Phonetisaurus

Phonetisaurus G2P

Shell 467 121 Updated Jun 1, 2024

dmort27 / epitran

A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)

Python 698 143 Updated Mar 25, 2025

jim-schwoebel / voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

1,881 238 Updated Jun 6, 2024

coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1,318 142 Updated Jun 6, 2024

facebookresearch / covost

CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)

Python 372 43 Updated Sep 14, 2021

kahne / fastwer

A PyPI package for fast word/character error rate (WER/CER) calculation

Python 71 15 Updated Jul 1, 2023

lucasjinreal / alfred

alfred-py: A deep learning utility library for **humans**. More detail on using the library: https://zhuanlan.zhihu.com/p/341446046

Python 912 137 Updated Sep 3, 2024

kahne / SpeechTransProgress

Tracking the progress in end-to-end speech translation

260 25 Updated Oct 25, 2023

kahne / NonAutoregGenProgress

Tracking the progress in non-autoregressive generation (translation, transcription, etc.)

307 28 Updated Mar 15, 2023

jason718 / awesome-self-supervised-learning

A curated list of awesome self-supervised methods

6,249 836 Updated Jul 3, 2024

readbeyond / aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Python 2,615 245 Updated Jun 22, 2024

syhw / wer_are_we

Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.

1,863 226 Updated Jun 27, 2022

facebookresearch / DME

Dynamic Meta-Embeddings for Improved Sentence Representations

Python 331 49 Updated Sep 25, 2020

algorithm-visualizer / algorithm-visualizer

🎆 Interactive Online Platform that Visualizes Algorithms from Code

JavaScript 47,260 7,345 Updated Jun 9, 2024

tensorflow / tensor2tensor

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 15,993 3,568 Updated Jun 2, 2023

flairNLP / flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 14,123 2,112 Updated Mar 31, 2025

plasticityai / magnitude

A fast, efficient universal vector embedding utility package.

Python 1,644 119 Updated Aug 3, 2023

VowpalWabbit / vowpal_wabbit

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive lea…

C++ 8,543 1,928 Updated Oct 17, 2024