GitHub - wenkokke/UoE-MT1-PBT: Answers to Machine Translation coursework 1: Decoding.

There are two python programs here (-h for usage):

-decode translates input sentences from French to English. -compute-model-score computes p(e|f) for a translated corpus.

These commands work in a pipeline. For example:

> python decode | python compute-model-score

There is also a module:

-model.py implements very simple interfaces for language models and translation models, so you don't have to.

You can finish the assignment without modifying this file at all. You should look at it if you need to understand the interface to the translation and language model.

The data directory contains files derived from the Canadian Hansards, originally aligned by Ulrich Germann:

-input: French sentences to translate.

-tm: a phrase-based translation model. Each line is in the form:

French phrase ||| English phrase ||| log_10(translation_prob)

-lm: a trigram language model file in ARPA format.

log_10(ngram_prob)   ngram   log_10(backoff_prob)

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
data		data
doc		doc
.gitignore		.gitignore
README.md		README.md
compute-model-score		compute-model-score
decode		decode
default.selected		default.selected
models.py		models.py
part2.out		part2.out
part2.py		part2.py
part3.out		part3.out
part3.py		part3.py
plots.py		plots.py
try-with		try-with

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Contributors 2

Languages

wenkokke/UoE-MT1-PBT

Folders and files

Latest commit

History

Repository files navigation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages