Runnable Morphological Analysis Tools from the UniMorph Project
This software is at an alpha stage.
Due to the size of DirecTL+ models, we do not include them here; please contact
gnicola2 AT jhu DOT edu for pre-trained models.
Uncompress DTL models into models/DTL directory. See the Releases tab above to download.
tar -xvzf DTLModel.tgz
Set environment variables to point to required binaries.
export DTL=<location of DTL binary> export CTRANSLATE=<location of ctranslate binary>
python src/analyze.py -i input.wordlist -o output.analyses -l language
python analyze.py -i Welsh.toAnalyze -o Welsh.out -l welsh
adyghe, albanian, amharic, arabic, armenian, asturian, azeri bashkir, basque, belarusian, bengali, breton, bulgarian, catalan, cornish, crimean-tatar, czech, danish, dutch, english, estonian, faroese, finnish, french, friulian, galician, georgian, german, greek, greenlandic, haida, hebrew, hindi, hungarian, icelandic, indonesian, ingrian, irish, italian, kabardian, kannada, karelian, kashubian, kazakh, khakas, khaling, kurmanji, ladin, latin, latvian, lithuanian, livonian, lower-sorbian, macedonian, maltese, mapudungun, middle-high-german, middle-low-german, murrinhpatha, navajo, neapolitan, norman, northern-sami, norwegian-bokmaal, norwegian-nynorsk, occitan, old-armenian, old-french, old-irish, old-saxon, pastho, persian, polish, portuguese, quechua, romanian, russian, sanskrit, scottish-gaelic, serbo-croatian, slovak, slovene, sorani, spanish, swahili, swedish, tagalog, tamil, tatar, telugu, tibetan, turkish, turkmen, ukrainian, urdu, uzbek, venetian, welsh, west-frisian, yiddish, zulu, ...