No description, website, or topics provided.
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
ApplyToPoFile
PoToOpenNMT
translations
README.md

README.md

Introduction

This repository contains Neural Machine Translation proof of concepts done at Softcatalà

Test have been done with https://github.com/OpenNMT/OpenNMT-tf

Runing the models

Models

We have created two models:

Samples

Examples of how new files (not previous part of the training corpus) how they look like with different models:

https://github.com/jordimas/nmt-softcatala/tree/master/translations

Translating a new PO file

Code is at ApplyToPoFile subdirectory

  • Install TensorFlow serving
  • Run run-model-sever.sh (server)
  • Run python3 ApplyToPoFile PO_file

By default all strings are marked as fuzzy

Building the models

Building a OpenNMT corpus from PO files

This produce 6 files:

  • src-test.txt - Source file used to test the model
  • tgt-test.txt - Target file used to test the model
  • src-train.txt - Source file used to train the model
  • tgt-train.txt - Target file used to train the model
  • src-val.txt - Source file used to validate the model
  • tgt-val.txt - Target file used to validate the model

These files should be copied

Build model

onmt-main train_and_eval --model_type NMTSmall --config config/opennmt-defaults.yml config/data/toy-ende.yml

Contact

Email address: Jordi Mas: jmas@softcatala.org