Mesolitica
We develop Multimodality Artificial Intelligence for South East Asia.
Pinned Loading
Repositories
Showing 10 of 51 repositories
- WavTokenizer-package Public Forked from jishengpeng/WavTokenizer
[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling
- UniCodec-fix Public Forked from Jiang-Yidi/UniCodec
[ACL 2025 Main] UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and sound
- vllm-llmaudio Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
- dia-fix-compile Public Forked from nari-labs/dia
A TTS model capable of generating ultra-realistic dialogue in one pass.
- trl-fix Public Forked from huggingface/trl
Train transformer language models with reinforcement learning.
Top languages
Loading…