These could probably be used to distinguish between different spellings, and not only in English.
The way these work is you mark some words with
These indicators are specified manually. They hand-implement a basic part-of-speech detection algorithm.
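As a rough illustration of the idea, here is a minimal Python sketch of hand-specified indicators. It is not eSpeak's actual implementation or dictionary format: the table names, the hint values, and the respelled pronunciations are all made up for the example. It assumes that a marked word sets a hint about the part of speech of the next word, and that a word with several pronunciations picks one based on that hint.

```python
# Hypothetical hand-written indicators: each marked word hints at the
# part of speech of the word that immediately follows it.
NEXT_WORD_HINT = {
    "to": "verb",
    "will": "verb",
    "the": "noun",
    "a": "noun",
    "my": "noun",
}

# Hypothetical entries for words whose pronunciation depends on part of speech.
HOMOGRAPHS = {
    "record": {"noun": "REK-ord", "verb": "ri-KORD"},
    "present": {"noun": "PREZ-ent", "verb": "pri-ZENT"},
}

# Assumption: fall back to the noun form when no hint is available.
DEFAULT_POS = "noun"


def pronounce(sentence: str) -> list[str]:
    """Return one pronunciation (or the plain word) per word in the sentence."""
    hint = None
    out = []
    for word in sentence.lower().split():
        variants = HOMOGRAPHS.get(word)
        if variants is not None:
            out.append(variants[hint or DEFAULT_POS])
        else:
            out.append(word)
        # The hint only applies to the immediately following word.
        hint = NEXT_WORD_HINT.get(word)
    return out


print(pronounce("to record the record"))
```

In this sketch an intervening word clears the hint, so "to quickly record" falls back to the default noun form. That is the kind of mistake a basic hand-written scheme like this can make.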
NOTE: Using a different part-of-speech algorithm is complex, as you need accurate data. The current ones have issues with archaic English (Shakespeare, Edgar Allan Poe, etc.) and even get confused by simple English phrases. They are also trained specifically on English, so they would need training for other languages, which is hard to maintain across the 70-80 languages that eSpeak supports.