A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.
-
Updated
Oct 22, 2024 - Python
A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.
Official repository for our NeurIPS 2024 paper: DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation
Add a description, image, and links to the non-autoregressive-transformers topic page so that developers can more easily learn about it.
To associate your repository with the non-autoregressive-transformers topic, visit your repo's landing page and select "manage topics."