- [24/04/2024] Our code and dataset for TasTe are released!
- [15/05/2024] Our paper is accepted to the ACL 2024 main conference!
- [13/06/2024] Our paper is published on arXiv: arXiv:2406.08434
The TasTe framework, short for Teaching Large Language Models to Translate through Self-Reflection, is designed as a two-stage inference process that enhances the translation quality of MT-LLMs. It consists of the following two stages (a rough sketch of the loop follows the list):
- Stage 1: Generate preliminary translations (i.e., drafts) and, at the same time, conduct a self-assessment of their quality.
- Stage 2: Refine the preliminary translations according to the predicted quality levels to obtain the final outputs.
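To make the flow concrete, here is a minimal, hypothetical sketch of the two-stage loop driven with Hugging Face transformers. The prompt wording, model path, and output format are placeholders, not the exact templates used in this repo (see the scripts in infer/ for the real ones).

```python
# Hypothetical sketch of the two-stage self-reflective inference loop.
# Prompt wording and the model path are placeholders; the real templates live in infer/.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "/path/to/finetuned/taste/checkpoint"  # placeholder path
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")

def generate(prompt: str, max_new_tokens: int = 256) -> str:
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    outputs = model.generate(**inputs, max_new_tokens=max_new_tokens, do_sample=False)
    # Decode only the newly generated tokens.
    return tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)

src = "让我们把这句话翻译成英文。"

# Stage 1: produce a draft translation together with a self-assessed quality level.
stage1_prompt = (
    "Translate the following Chinese text into English and rate the translation quality.\n"
    f"Chinese: {src}\nEnglish:"
)
stage1_output = generate(stage1_prompt)  # e.g. a draft followed by a quality label

# Stage 2: refine the draft conditioned on the predicted quality level.
stage2_prompt = (
    f"{stage1_prompt} {stage1_output}\n"
    "Based on the predicted quality, improve the translation.\nImproved English:"
)
final_translation = generate(stage2_prompt)
print(final_translation)
```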
To equip the LLMs with sufficient capability for the entire self-reflective translation process, they are fine-tuned on a multi-task training set. The dataset consists of three parts:
- Basic Translation: A common parallel corpus that provides the LLMs with correct multilingual knowledge.
- Quality Prediction: Source sentences and translation candidates paired with their evaluated COMET scores, which equip the LLMs with knowledge about translation quality and the ability to assess translations (a rough scoring sketch follows this list).
- Draft Refinement: Preliminary translations with their COMET scores and enhanced translations, which teach the LLMs to refine drafts according to their quality scores.
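As a reference for the Quality Prediction and Draft Refinement parts, segment-level COMET scores can be computed with the unbabel-comet package roughly as follows. The checkpoint name below is an assumption and may differ from the one used to build the released data.

```python
# Hypothetical sketch of scoring translation candidates with COMET.
# The checkpoint name is an assumption; the released data may use a different COMET model.
from comet import download_model, load_from_checkpoint

model_path = download_model("Unbabel/wmt22-comet-da")
comet_model = load_from_checkpoint(model_path)

samples = [
    {"src": "这是一个例子。", "mt": "This is an example.", "ref": "This is an example."},
]
output = comet_model.predict(samples, batch_size=8, gpus=1)  # set gpus=0 to run on CPU
print(output.scores)        # per-segment scores, usable as quality labels
print(output.system_score)  # corpus-level average
```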
Directory | Contents
---|---
checkpoints/ | Fine-tuned model checkpoints
data/ | Experimental data
infer/ | Testing scripts
results/ | Testing outputs
train/ | Fine-tuning scripts
TasTe is developed with Hugging Face's transformers and DeepSpeed-Chat.
- Python 3.7.9
- PyTorch 1.13.1+cu111
- Transformers==4.28
- accelerate==0.19.0
- numpy==1.21.6
- deepspeed==0.9.0
- scikit-learn
git clone https://github.com/YutongWang1216/ReflectionLLMMT.git
cd ReflectionLLMMT
pip install -r requirments.txt
(1) FixEmb: Tuning with Embedding Layers Fixed (a minimal illustration follows the list)
(2) Full: Tuning with Full Parameters
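For clarity, here is a minimal sketch of what the FixEmb setting amounts to with transformers, assuming a causal LM checkpoint; the actual fine-tuning logic lives in the scripts under train/.

```python
# Hypothetical illustration of the FixEmb setting: freeze the input embedding weights
# before fine-tuning so that only the remaining parameters are updated.
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("/path/to/original/checkpoint")  # placeholder path
for param in model.get_input_embeddings().parameters():
    param.requires_grad = False
# Note: for models with tied input/output embeddings, this also freezes the output projection.
```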
Make sure to fill in the following parameters before running:
work_dir=/path/to/ReflectionLLMMT # path to the ReflectionLLMMT root directory
model_name=name_of_your_model # name your model, e.g. bloom_fixemb
settings=tc # training settings, choices=[tc, qe, mt]
premodel=/path/to/original/checkpoint # path to the pretrained model checkpoint directory
GPU_NUM=8 # number of available GPUs
GPU=0,1,2,3,4,5,6,7 # GPU ids
There are three choices of training settings, corresponding to three different training sets (a quick way to inspect these files is sketched after the list):
- tc - Fine-tune with the data in data/train_tc.json to get a TasTe model in Text Classification style.
- qe - Fine-tune with the data in data/train_qe.json to get a TasTe model in Quality Estimation style.
- mt - Fine-tune with the data in data/train_mt.json to get an MT-baseline model.
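If you want to see what the fine-tuning prompts look like before launching a run, the training files can be inspected directly. The snippet below assumes each file is a single JSON array (if it is JSON Lines, read it line by line instead) and makes no assumption about the field names.

```python
# Peek at the first record of a training file to see the prompt/response format.
import json

with open("data/train_tc.json", "r", encoding="utf-8") as f:
    data = json.load(f)  # assumes a single JSON array

print(len(data))  # number of training examples
print(data[0])    # first example; keys depend on the actual file
```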
Make sure to fill in the following parameters before running:
work_dir=/path/to/ReflectionLLMMT # path to the ReflectionLLMMT root directory
lang=zh-en # language pair to be tested in, choices=['zh-en', 'en-zh', 'de-en', 'en-de']
test_model=name_of_model # name you gave to the fine-tuned model
settings=tc # model settings, choices=[tc, qe, mt]
GPU_NUM=8 # number of available GPUs
There are also three choices of testing settings, corresponding to the three training settings:
- tc - Test a TasTe model in Text Classification style.
- qe - Test a TasTe model in Quality Estimation style.
- mt - Test an MT-baseline model.
If you find this repo useful, please cite our paper as:
@misc{wang2024taste,
title={TasTe: Teaching Large Language Models to Translate through Self-Reflection},
author={Yutong Wang and Jiali Zeng and Xuebo Liu and Fandong Meng and Jie Zhou and Min Zhang},
year={2024},
eprint={2406.08434},
archivePrefix={arXiv},
primaryClass={cs.CL}
}