Project Ekler

Problem Definition

Since Turkish is an agglutinative language, memorizing all the suffixes can be problematic for new Turkish learners, especially with vowel harmony and ordering of the suffixes. So, we wanted to build a tool that can analyze the morphological structure of Turkish verbs, parse their suffixes, check for vowel harmony and suffix ordering, and correct the verbs if necessary.

Methods & Processes

Helsinki Finite State Technology library is used for this project. Verbs are categorized into 8 groups according to their voicing features and vowel harmony, and these groups are used for writing the verb lexicons. 2 different transducers are generated: Good HFST and Bad HFST. Bad HFST accepts all entries including inacurrate verbs & suffixes. It takes a possibly badly written input, parses root and suffixes, then returns it as an output. The output of Bad HFST is then used as an input into Good HFST, which has the correct phonological rules in place. Good HFST returns the correct form of the verb as the final output.

To exemplify, it can take a word such as yapmışdı and return its correct form yapmıştı.

Streamlit App

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
streamlit		streamlit
README.md		README.md
bad_hfst.lexc		bad_hfst.lexc
bad_hfst.txt		bad_hfst.txt
ekler.py		ekler.py
ekler_notebook.ipynb		ekler_notebook.ipynb
good_hfst.lexc		good_hfst.lexc
good_hfst.txt		good_hfst.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project Ekler

Problem Definition

Methods & Processes

About

Releases

Packages

Contributors 4

Languages

eklerproject/ekler

Folders and files

Latest commit

History

Repository files navigation

Project Ekler

Problem Definition

Methods & Processes

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages