Meteorson

Machine translation (MT) error analysis with Meteor and Hjerson.

Usage

To run Meteorson, you need source, reference, and hypothesis files for the data you want to evaluate. The main pipeline script, errors-pipeline.sh, expects all three files to share a base name (e.g., news.tr-en) and to carry the appropriate suffix (src, ref, hyp). Pass the base name to the script; it runs the full pipeline, storing intermediate files in meteorson/work, and writes two output files: an inline annotated text file (e.g., news.tr-en.cats.final) and a web page view (e.g., news.tr-en.cats.html).
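As a rough sketch of the naming convention described above, the helper below checks that all three input files exist before the pipeline is started. The function name check_inputs is hypothetical and not part of Meteorson, and the dot-separated file names (e.g., news.tr-en.src) are an assumption inferred from the output file names.

```shell
#!/bin/sh
# Hypothetical helper, not part of Meteorson: verify that BASE.src,
# BASE.ref and BASE.hyp all exist. The dot-separated naming
# (e.g. news.tr-en.src) is an assumption.
check_inputs() {
    base=$1
    missing=0
    for suffix in src ref hyp; do
        if [ ! -f "$base.$suffix" ]; then
            echo "missing input file: $base.$suffix" >&2
            missing=1
        fi
    done
    # Returns 0 when all three files are present, 1 otherwise.
    return "$missing"
}

# Example use (assumes errors-pipeline.sh is in the current directory):
# check_inputs news.tr-en && ./errors-pipeline.sh news.tr-en
```

Checking the inputs first gives a clear message about which file is missing instead of a failure partway through the pipeline.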

Installation

Meteorson is available on GitHub. Clone the repository and make sure the dependencies listed below are installed and configured. You will then need to compile the Meteor error classifier add-on:

javac -cp $METEOR/meteor-1.5.jar src/ErrorCategorizer.java

Dependencies

Meteorson relies on three external packages: the Perl interface to Stanford's CoreNLP (Lingua::StanfordCoreNLP), Meteor, and Hjerson. Meteor and Hjerson can be installed anywhere on the system as long as the environment variables $METEOR and $HJERSON are set. As packaged, tokenization and lemmatization are performed by CoreNLP via Perl scripts, but another tokenizer and/or lemmatizer can be substituted by editing the pipeline script.
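A minimal sketch of the environment setup might look like the following. The install paths are placeholders, the check_env function is hypothetical, and treating $HJERSON as the Hjerson install directory is an assumption; $METEOR is taken to be the directory containing meteor-1.5.jar, as the compile command suggests.

```shell
#!/bin/sh
# Placeholder install locations; adjust to wherever Meteor and Hjerson live.
# $METEOR is assumed to be the directory that contains meteor-1.5.jar.
export METEOR="$HOME/tools/meteor-1.5"
export HJERSON="$HOME/tools/hjerson"

# Hypothetical helper, not part of Meteorson: report any required
# environment variable that is unset or empty.
check_env() {
    status=0
    for name in METEOR HJERSON; do
        eval "value=\${$name:-}"
        if [ -z "$value" ]; then
            echo "$name is not set" >&2
            status=1
        fi
    done
    # Returns 0 when both variables are set, 1 otherwise.
    return "$status"
}
```

Adding the two export lines to a shell profile would make the variables available in every session.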
