Change the repository type filter
All
Repositories list
69 repositories
CMOT
Public- A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.
Auto-RAG
PublicLLaMA-Omni
PublicLLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.TACS
PublicStreamSpeech
PublicStreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.SemLing-MNMT
PublicDASpeech
PublicCode for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".ComSpeech
PublicCode for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".StreamSpeech-site
PublicCTC-S2UT
PublicTruthX-site
PublicDST
PublicTruthX
PublicCode for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"SiLLM
PublicSiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a traditional SiMT model for policy-decision to achieve SiMT through collaboration.TA-AT
PublicLengthBiasDNMT
PublicPCFG-NAT
PublicSAMMT
PublicHMT
PublicSource code for ICLR 2023 spotlight paper "Hidden Markov Transformer for Simultaneous Machine Translation"DiSeg
PublicSource code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"BayLing
Public“百聆”是一个基于LLaMA的语言对齐增强的英语/中文大语言模型,具有优越的英语/中文能力,在多语言和通用任务等多项测试中取得ChatGPT 90%的性能。BayLing is an English/Chinese LLM equipped with advanced language alignment, showing superior capability in English/Chinese generation, instruction following and multi-turn interaction.Convex-Learning
PublicBT4ST
PublicCode for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".CRESS
PublicCode for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".PLUVR
Public