Releases: amazon-science/contrastive-controlled-mt

Formality Classifier

17 Feb 21:57
58db8b7

A multilingual classifier trained to predict the formality of text for the language pairs EN-RU, EN-KO, EN-VI, and EN-PT. The xlm-roberta-base model was fine-tuned on human-written formal and informal text, following the setup from Briakou et al., EMNLP 2021. Please note that the models come with the following licensing.

EN-HI

11 Mar 18:18

Updated EN-HI pre-trained model that uses the updated sacremoses tokenizer, addressing #3. The model was trained on open datasets using the Sockeye 3 PyTorch NMT toolkit. For further details, see Pre-trained models. Please note that the model comes with the following licensing.

EN-IT,RU

16 Feb 17:19

Pre-trained baseline models

Baseline models for the EN-IT and EN-RU language arcs. Each model is trained on open datasets using the Sockeye 3 PyTorch NMT toolkit. For further details, see Pre-trained models. Please note that the models come with the following licensing.

EN-DE,ES,JA,HI

01 Feb 20:16

Pre-trained baseline models

Each model is trained on open datasets using the Sockeye 3 PyTorch NMT toolkit. For further details see Pre-trained models. Please note that the models come with the following licensing.