GitHub - antot/neural_vs_-phrasebased_smt_eacl17: Files related to the publication "A Multifaceted Evaluation of Neural versus Phrase-Based Machine Translation for 9 Language Directions" at EACL 2017

This repository contains files related to the publication:

Antonio Toral and Víctor M. Sánchez-Cartagena. A Multifaceted Evaluation of Neural versus Phrase-Based Machine Translation for 9 Language Directions. 15th EACL Conference. 2017.


-------------------
Contents (folders):
-------------------

code/		Code developed (Python):
			- Stemmer
			- Tokenizer
			- Scores by length

data*/		URLs to download the monolingual and parallel training data used

third/		Third party code used
			- chrF evaluation metric
			- Czech stemmer
			- hjerson
			- Moses v3 scripts

references/	References of WMT16 in their original format (sgm) and processed:
			- tokenised (.tok)
			- truecased (.true)
			- stemmed (.base). Czech stemming has 2 variants (-aggresive, -light) 

systems/	MT outputs of the best submissions at WMT16 in their original format (sgm)
		and processed:
			- tokenised (.tok)
			- truecased (.true)
			- stemmed (.base). Czech stemming has 2 variants (-aggresive, -light)

overlaps/	Output of the output similarity experiment (Section 3)

scores_*length/	Outputs of the experiment regarding sentence length (Section 6)

hjerson/	Outputs of the error categories experiment (Section 7) in different formats:
			- .cats
			- .errs
			- .html
			- .out (used to report results in the paper, Tables 7 and 8)
			- .sents
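
The sentence-length experiment (scores_*length/, Section 6) groups sentences
by length and reports a score per group. The sketch below illustrates that
idea only; it is not the code shipped in code/. The function names, the
bucket width of 5, the cut-off of 50 tokens, the use of reference length and
the averaging of per-sentence scores are all assumptions made for the example.

```python
from collections import defaultdict


def bucket_of(length, width=5, max_length=50):
    """Map a token count to a bucket label such as '1-5' or '>50'.

    width and max_length are illustrative defaults, not the paper's settings.
    """
    if length > max_length:
        return f">{max_length}"
    # Lengths 1..width fall in the first bucket, width+1..2*width in the next.
    start = ((max(length, 1) - 1) // width) * width + 1
    return f"{start}-{start + width - 1}"


def scores_by_length(references, scores, width=5, max_length=50):
    """Average per-sentence scores inside each reference-length bucket."""
    grouped = defaultdict(list)
    for reference, score in zip(references, scores):
        n_tokens = len(reference.split())  # assumes tokenised, space-separated text
        grouped[bucket_of(n_tokens, width, max_length)].append(score)
    return {label: sum(values) / len(values) for label, values in grouped.items()}


if __name__ == "__main__":
    refs = ["a b c", "a b c d e f", "one two"]
    sentence_scores = [0.5, 1.0, 0.25]  # made-up scores, for illustration only
    print(scores_by_length(refs, sentence_scores))
    # {'1-5': 0.375, '6-10': 1.0}
```

A corpus-level metric computed separately on each bucket would be an equally
plausible reading of the experiment; the per-sentence average is used here
only because it keeps the example short.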





-----------------
Contents (files):
-----------------

preprocessing.sh	Data preprocessing:
			- desgmise (extract the plain text from the sgm files)
			- train monolingual data
			- train parallel data
			- reference translations and MT outputs
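
As an illustration of the desgmise step, the sketch below pulls the text of
each <seg> element out of an sgm document. It is not the command used in
preprocessing.sh. The function name desgm and the regular expression are
assumptions, and the sample document is invented for the example.

```python
import html
import re

# One <seg ...>text</seg> element; DOTALL lets a segment span several lines.
SEG_RE = re.compile(r"<seg[^>]*>(.*?)</seg>", re.DOTALL | re.IGNORECASE)


def desgm(sgm_text):
    """Return the segment texts of an sgm document, one string per <seg>."""
    segments = []
    for match in SEG_RE.finditer(sgm_text):
        text = html.unescape(match.group(1))      # e.g. &amp; becomes &
        segments.append(" ".join(text.split()))   # collapse runs of whitespace
    return segments


# Invented sample in the general shape of a WMT sgm file.
sample = """<refset setid="demo" srclang="any" trglang="en">
<doc docid="d1">
<seg id="1">Hello &amp; welcome.</seg>
<seg id="2">Second   sentence.</seg>
</doc>
</refset>"""

if __name__ == "__main__":
    print(desgm(sample))
    # ['Hello & welcome.', 'Second sentence.']
```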

experiments.sh		Experiments:
			- Output similarity (Section 3). Function overlap_metric
			- Fluency (Section 4). VSC TODO
			- Reordering (Section 5). VSC TODO
			- Sentence length (Section 6). Function scores_by_length
			- Error categories (Section 7). Function hjerson
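
The output similarity experiment compares the outputs of different systems
with each other. The sketch below shows one way such an overlap could be
computed, assuming it is measured with a character n-gram F-score between two
aligned lists of outputs. This is a simplified, chrF-like score written for
illustration; it is neither the chrF implementation in third/ nor the
overlap_metric function in experiments.sh, and the names ngram_f1 and
output_overlap are hypothetical.

```python
from collections import Counter


def char_ngrams(text, n):
    """Count character n-grams of a sentence, ignoring spaces."""
    chars = text.replace(" ", "")
    return Counter(chars[i:i + n] for i in range(len(chars) - n + 1))


def ngram_f1(hyp, ref, max_n=6):
    """Average character n-gram F1 over n = 1..max_n (simplified, chrF-like)."""
    scores = []
    for n in range(1, max_n + 1):
        hyp_counts, ref_counts = char_ngrams(hyp, n), char_ngrams(ref, n)
        if not hyp_counts or not ref_counts:
            continue  # sentence shorter than n characters
        matches = sum((hyp_counts & ref_counts).values())  # clipped matches
        if matches == 0:
            scores.append(0.0)
            continue
        precision = matches / sum(hyp_counts.values())
        recall = matches / sum(ref_counts.values())
        scores.append(2 * precision * recall / (precision + recall))
    return sum(scores) / len(scores) if scores else 0.0


def output_overlap(system_a, system_b):
    """Mean sentence-level overlap between two aligned lists of MT outputs."""
    if len(system_a) != len(system_b):
        raise ValueError("both systems must translate the same sentences")
    if not system_a:
        return 0.0
    pairs = zip(system_a, system_b)
    return sum(ngram_f1(a, b) for a, b in pairs) / len(system_a)


if __name__ == "__main__":
    nmt = ["the cat sat on the mat", "it is raining"]
    pbmt = ["the cat sat on a mat", "it rains"]
    print(round(output_overlap(nmt, pbmt), 3))
```

Treating one system's output as the "reference" for the other makes the score
symmetric only because F1 with equal weights is symmetric; a recall-weighted
F-score would not be.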

systems_list.txt	List and description of the machine translation systems used
			in the experiments

russian_stem_fix.sh	Instructions to fix the output of the stemmer used for Russian