ICTNLP

All

69 repositories

MoCE
Public
code for paper: "MoCE: Adaptive Mixture of Contextualization Experts for Byte-based Neural Machine Translation"
Python
•2•2•0•0•Updated Nov 7, 2024Nov 7, 2024
CMOT
Public
Code for ACL 2023 main conference paper "CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation"
Python
•1•14•2•0•Updated Oct 29, 2024Oct 29, 2024
NAST-S2x
Public
A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.
non-autoregressive simultaneous-translation speech-generation speech-to-speech-translation non-autoregressive-transformers
Python
•4•60•1•1•Updated Oct 22, 2024Oct 22, 2024
Auto-RAG
Public
Python
•
Apache License 2.0
•0•4•0•0•Updated Oct 22, 2024Oct 22, 2024
LLaMA-Omni
Public
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
speech-to-text speech-to-speech large-language-models multimodal-large-language-models speech-language-model speech-interaction
Python
•
Apache License 2.0
•169•2.5k•37•2•Updated Sep 24, 2024Sep 24, 2024
TACS
Public
Source code for Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts
Python
•2•14•2•0•Updated Sep 2, 2024Sep 2, 2024
StreamSpeech
Public
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
text-to-speech translation machine-translation voice speech tts speech-synthesis speech-recognition speech-to-text all-in-one
Python
•
MIT License
•71•945•11•2•Updated Aug 24, 2024Aug 24, 2024
Multiscale-Contextualization
Public
ACL2024 Integrating Multi-scale Contextualized Information for Byte-based Neural Machine Translation
Python
•1•7•0•0•Updated Aug 9, 2024Aug 9, 2024
SemLing-MNMT
Public
Code for ACL 2024 paper "Improving Multilingual Neural Machine Translation by Utilizing Semantic and Linguistic Features".
Python
•0•2•0•0•Updated Jul 31, 2024Jul 31, 2024
DASpeech
Public
Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".
machine-translation speech-translation speech-to-speech speech-to-speech-translation
Python
•5•60•1•0•Updated Jul 22, 2024Jul 22, 2024
ComSpeech
Public
Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".
text-to-speech machine-translation speech-translation non-autoregressive-translation speech-to-speech-translation zero-shot-speech-translation
Python
•6•23•2•0•Updated Jul 2, 2024Jul 2, 2024
SU4MT
Public
Code for EMNLP 2023 paper "Enhancing Neural Machine Translation with Semantic Units"
Python
•0•8•0•0•Updated Jun 25, 2024Jun 25, 2024
StreamSpeech-site
Public
JavaScript
•1•2•0•0•Updated Jun 17, 2024Jun 17, 2024
ComSpeech-Site
Public
JavaScript
•1•2•0•0•Updated Jun 12, 2024Jun 12, 2024
CTC-S2UT
Public
Code for ACL 2024 findings paper "CTC-based Non-autoregressive Textless Speech-to-Speech Translation"
0•8•0•0•Updated Jun 11, 2024Jun 11, 2024
TruthX-site
Public
HTML
•0•1•0•0•Updated Jun 7, 2024Jun 7, 2024
DST
Public
DST is a Decoder-only simultaneous machine translation model, which can conduct policy decision and translation concurrently
Python
•
MIT License
•1•7•4•0•Updated Jun 6, 2024Jun 6, 2024
TruthX
Public
Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"
safety llama representation language-model mistral explainable-ai hallucination baichuan hallucinations gpt-4
Python
•
GNU General Public License v3.0
•5•125•4•0•Updated Mar 26, 2024Mar 26, 2024
SiLLM
Public
SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a traditional SiMT model for policy-decision to achieve SiMT through collaboration.
translation machine-translation large large-language-models simultaneous-machine-translation chatgpt llama2
Python
•2•15•0•0•Updated Feb 22, 2024Feb 22, 2024
TA-AT
Public
Official code for AAAI24 paper "TA&AT: Enhancing Task-Oriented Dialog with Turn-Level Auxiliary Tasks and Action-Tree Based Scheduled Sampling"
0•0•0•0•Updated Jan 5, 2024Jan 5, 2024
LengthBiasDNMT
Public
0•0•0•0•Updated Jan 5, 2024Jan 5, 2024
PCFG-NAT
Public
Code for NeurIPS 2023 paper "Non-autoregressive Machine Translation with Probabilistic Context-free Grammar".
Cuda
•
MIT License
•0•10•0•0•Updated Jan 4, 2024Jan 4, 2024
SAMMT
Public
Code for EMNLP 2023 paper "Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation"
Python
•
Other
•2•3•0•0•Updated Dec 11, 2023Dec 11, 2023
HMT
Public
Source code for ICLR 2023 spotlight paper "Hidden Markov Transformer for Simultaneous Machine Translation"
machine-translation simultaneous-translation simultaneous-machine-translation
Python
•
MIT License
•2•21•3•0•Updated Dec 11, 2023Dec 11, 2023
DiSeg
Public
Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"
segment streaming machine-translation speech segmentation sequence-segmentation speech-translation simultaneous-translation simultaneous-machine-translation streaming-speech-to-text
Python
•
MIT License
•2•33•2•0•Updated Dec 6, 2023Dec 6, 2023
BayLing
Public
“百聆”是一个基于LLaMA的语言对齐增强的英语/中文大语言模型，具有优越的英语/中文能力，在多语言和通用任务等多项测试中取得ChatGPT 90%的性能。BayLing is an English/Chinese LLM equipped with advanced language alignment, showing superior capability in English/Chinese generation, instruction following and multi-turn interaction.
translation interactive machine-translation chinese llama human-performance cross-lingual multilingual-translation general-language-model gpt4
Python
•
GNU General Public License v3.0
•19•296•12•0•Updated Dec 3, 2023Dec 3, 2023
Convex-Learning
Public
Code for NeurIPS 2023 paper "Beyond MLE: Convex Learning for Text Generation"
Python
•0•12•1•0•Updated Oct 25, 2023Oct 25, 2023
BT4ST
Public
Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".
machine-translation speech-to-text speech-translation
Python
•2•13•3•0•Updated Oct 25, 2023Oct 25, 2023
CRESS
Public
Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".
machine-translation speech-to-text speech-translation
Python
•2•14•0•0•Updated Oct 25, 2023Oct 25, 2023
PLUVR
Public
Code for ACL 2022 main conference paper "Neural Machine Translation with Phrase-Level Universal Visual Representations".
Python
•6•21•0•0•Updated Oct 25, 2023Oct 25, 2023