forked from sfu-natlang/Kriya
-
Notifications
You must be signed in to change notification settings - Fork 0
Hierarchical phrase-based machine translation system
License
ChenxiCui/Kriya
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Kriya - A Hierarchical Phrase-based Statistical Machine Translation System ========================================================================== Kriya is an implementation of hierarchical phrase-based (hiero) SMT system. See Chiang, 2007 for description of Hiero model. Kriya is entirely implemented in Python and includes both grammar extractor and decoder modules. Please see the PBML paper for technical details specific to this implementation. For usage information, see src/README and doc/ directory. The example/ directory includes a toy Fr-En corpus for training along with a Kriya config file for training Hiero grammar. Please read example/README for details first. If you use this for your research consider citing: Baskaran Sankaran, Majid Razmara and Anoop Sarkar. 2012. Kriya – An end-to-end Hierarchical Phrase-based MT System. The Prague Bulletin of Mathematical Linguistics (PBML), (97), 83--98 Required Software: ------------------ The following is a list of software that might be required depending on what you are trying to do. * Python - 2.6.2 or later (but not Python 3.0+) * Giza++ - For word alignments * MOSES - for extracting phrase alignments from Giza++ output and (optionally) for MERT * SRILM - For building language model (LM) * KenLM - Library for querying the language model (a Python wrapper is included with the decoder and the source for the wrapper will be released soon) * MegaM - Mega model optimization package (download from: http://www.cs.utah.edu/~hal/megam/) * SVMRank - SVM for Ranking optimization (download from: http://www.cs.cornell.edu/people/tj/svm_light/svm_rank.html) At least one of the last two optimization tools are required for Kriya parameter optimization with PRO. License: -------- For license information, see the LICENCE file. For Contacts: ------------- Baskaran Sankaran, Majid Razmara and Anoop Sarkar {baskaran,mra44,anoop}@cs.sfu.ca NatLang Lab, School of Computing Science Simon Fraser University, Burnaby, BC V5A 1S6. Canada References: ----------- * Baskaran Sankaran, Majid Razmara and Anoop Sarkar. 2012. Kriya – An end-to-end Hierarchical Phrase-based MT System. The Prague Bulletin of Mathematical Linguistics (PBML), (97), 83--98 * David Chiang. 2007. Hierarchical phrase-based translation. Computational Linguistics, 32(2) 201-228
About
Hierarchical phrase-based machine translation system
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published
Languages
- Python 50.9%
- Perl 47.6%
- Shell 1.4%
- TeX 0.1%