Mesolitica
We develop Multimodality Artificial Intelligence for South East Asia.
Pinned Loading
Repositories
Showing 10 of 51 repositories
- WavTokenizer-package Public Forked from jishengpeng/WavTokenizer
[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling
mesolitica/WavTokenizer-package’s past year of commit activity - UniCodec-fix Public Forked from Jiang-Yidi/UniCodec
[ACL 2025 Main] UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and sound
mesolitica/UniCodec-fix’s past year of commit activity - vllm-llmaudio Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
mesolitica/vllm-llmaudio’s past year of commit activity - dia-fix-compile Public Forked from nari-labs/dia
A TTS model capable of generating ultra-realistic dialogue in one pass.
mesolitica/dia-fix-compile’s past year of commit activity - trl-fix Public Forked from huggingface/trl
Train transformer language models with reinforcement learning.
mesolitica/trl-fix’s past year of commit activity
Most used topics
Loading…