ICTNLP
- 245 followers
- Beijing, China
- http://nlp.ict.ac.cn
- ict_nlp@ict.ac.cn
Pinned Loading
Repositories
- FastLongSpeech Public
FastLongSpeech is a novel framework designed to extend the capabilities of Large Speech-Language Models for efficient long-speech processing without necessitating dedicated long-speech training data.
- StreamUni Public
StreamUni is a framework that efficiently enables unified Large Speech-Language Models to accomplish streaming speech translation in a cohesive manner.
- LLaVA-Mini Public
LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.
- StreamSpeech Public
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
- Stream-Omni Public
Stream-Omni is a GPT-4o-like language-vision-speech chatbot that simultaneously supports interaction across various modality combinations.
- SLED-TTS Public
Streamable Text-to-Speech model using a language modeling approach, without vector quantization
- MonoAttn-Transducer Public
Code for ICML25 Paper "Overcoming Non-monotonicity in Transducer-based Streaming Generation"
-
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…