GitHub - amazon-science/prism-finetuned

Code for the research paper "Trained MT Metrics Learn to Cope with Machine-translated References"

Installation

Install PyTorch
cd .. && pip install -r requirements.txt
pip install git+https://github.com/google-research/mt-metrics-eval

mt-metrics-eval

python -m mt_metrics_eval.mtme --download (Puts ~1G of data into $HOME/.mt-metrics-eval)

Downloading the pre-trained model

wget http://data.statmt.org/prism/m39v1.tar
tar xf m39v1.tar
mkdir models
mv m39v1 models

Preparing the data

Downloading MQM data

See scripts/download_data.sh

Extracting relative rankings

mkdir data/wmt_rr
python scripts/convert_mqm_to_relative_ranking_data.py

Concatenating the language pairs and creating a trainâ€“valid split

See scripts/create_data_split.sh

Preprocessing data for Prism fine-tuning with fairseq

mkdir data/prism_finetuning_data
python scripts/prepare_prism_finetuning_data.py (might take a while)

Fine-tuning

bash scripts/finetune_main.sh

Metric usage

Please refer to the reference implementation of Prism (https://github.com/thompsonb/prism) for instructions on using the metric

Meta-evaluation

pip install -r requirements-eval.txt
python scripts/run_meta_evaluation.py

Post-editese experiments

python post_editese/scripts/run_.py

Citation

Please cite this work as:

@misc{vamvasetal2023trainedmetrics,
      title={Trained MT Metrics Learn to Cope with Machine-translated References},
      author={Vamvas, Jannis and Domhan, Tobias and Trenous, Sony and Sennrich, Rico and Hasler, Eva},
      booktitle={Proceedings of the Eighth Conference on Machine Translation (WMT)},
      year={2023}
}

Security

See CONTRIBUTING for more information.

License

This library is licensed under the CC-BY-NC-4.0 License.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
meta_evaluation		meta_evaluation
mt_metrics_eval_custom		mt_metrics_eval_custom
post_editese		post_editese
predictions		predictions
prism_finetuning		prism_finetuning
scripts		scripts
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
THIRD-PARTY-LICENSES		THIRD-PARTY-LICENSES
prism.py		prism.py
requirements-eval.txt		requirements-eval.txt
requirements.txt		requirements.txt

License

amazon-science/prism-finetuned

Folders and files

Latest commit

History

Repository files navigation

Installation

mt-metrics-eval

Downloading the pre-trained model

Preparing the data

Downloading MQM data

Extracting relative rankings

Concatenating the language pairs and creating a trainâ€“valid split

Preprocessing data for Prism fine-tuning with fairseq

Fine-tuning

Metric usage

Meta-evaluation

Post-editese experiments

Citation

Security

License

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Languages