Skip to content
master
Go to file
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
doc
 
 
 
 
src
 
 
 
 
 
 
 
 

README

Kriya - A Hierarchical Phrase-based Statistical Machine Translation System
==========================================================================

Kriya is an implementation of hierarchical phrase-based (hiero) SMT system. See Chiang, 2007 
for description of Hiero model.

Kriya is entirely implemented in Python and includes both grammar extractor and decoder modules.
Please see the PBML paper for technical details specific to this implementation.

For usage information, see src/README and doc/ directory. The example/ directory includes a toy
Fr-En corpus for training along with a Kriya config file for training Hiero grammar. Please read
example/README for details first.

If you use this for your research consider citing:
Baskaran Sankaran, Majid Razmara and Anoop Sarkar. 2012. Kriya – An end-to-end Hierarchical 
Phrase-based MT System. The Prague Bulletin of Mathematical Linguistics (PBML), (97), 83--98


Required Software:
------------------
The following is a list of software that might be required depending on what you are trying to do.

* Python  - 2.6.2 or later (but not Python 3.0+)

* Giza++  - For word alignments
* MOSES   - for extracting phrase alignments from Giza++ output and (optionally) for MERT
* SRILM   - For building language model (LM)
* KenLM   - Library for querying the language model (a Python wrapper is included with the decoder
            and the source for the wrapper will be released soon)
* MegaM   - Mega model optimization package (download from: http://www.cs.utah.edu/~hal/megam/)
* SVMRank - SVM for Ranking optimization (download from: http://www.cs.cornell.edu/people/tj/svm_light/svm_rank.html)

At least one of the last two optimization tools are required for Kriya parameter optimization with PRO.


License:
--------
For license information, see the LICENCE file.


For Contacts:
-------------
Baskaran Sankaran, Majid Razmara and Anoop Sarkar
{baskaran,mra44,anoop}@cs.sfu.ca

NatLang Lab, School of Computing Science
Simon Fraser University, 
Burnaby, BC V5A 1S6. Canada


References:
-----------
*  Baskaran Sankaran, Majid Razmara and Anoop Sarkar. 2012. Kriya – An end-to-end Hierarchical
        Phrase-based MT System. The Prague Bulletin of Mathematical Linguistics (PBML), (97), 83--98
* David Chiang. 2007. Hierarchical phrase-based translation. Computational Linguistics, 32(2) 201-228

About

Hierarchical phrase-based machine translation system

Resources

License

Releases

No releases published
You can’t perform that action at this time.