Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

-o-n vs -on- (e.g., dekonaĵon) #21

Open
ProfessorO opened this issue Aug 28, 2016 · 6 comments
Open

-o-n vs -on- (e.g., dekonaĵon) #21

ProfessorO opened this issue Aug 28, 2016 · 6 comments

Comments

@ProfessorO
Copy link
Owner

The former is a noun + accusative (direct object) ending, while the latter means "fraction" (e.g., dek = ten, dekono = tenTH). So "dekonaĵon" (Genesis 14:20, tithe) is a challenge. The first -on- is a single root, while the one at the end is actually two roots, -o + -n.

In theory, we could add entries that have -on after all the numbers (duon, trion, kvaron, k.t.p.). However, those aren't roots—they're two parts. Looks like there's some interesting stuff about where roots can appear, as I think -on- can only appear in the middle of a word, while -o and -n can appear at the end.

Do we want to mark certain roots with some restrictions as to where they can appear? Only after numbers, only in the middle of a word, etc.

@ProfessorO
Copy link
Owner Author

For now I'm leaving "on" out as a root, as it's FAR less common than the -o and -n endings.

ProfessorO added a commit that referenced this issue Sep 1, 2016
… first (LPF?). Fixes #19, but doesn't address #21 or #22, and actually creates #23.
@stephenwade
Copy link
Collaborator

-on- can only appear in the middle of a word.

Can -o and -n also appear in the middle of a word? If so, then we still have no way of knowing whether "on" should be parsed as -on- or -o-n-.

@ProfessorO
Copy link
Owner Author

-o can definitely appear in the middle of compound words. For example, akvofonto, akv-o-font-o.

-n really should only appear at the very end of a word, as it's the direct-object marker for adjectives and nouns.

@stephenwade
Copy link
Collaborator

Hm, okay. Could -on appear at the end of a word? If not, we could make rules saying "n" should always be parsed at the end of a word and never parsed otherwise.

@ProfessorO
Copy link
Owner Author

ProfessorO commented Sep 5, 2016

So the root should be n$ instead of just n (using $ in the PCRE sense). Hmmm. That could work. Could also address the issue with iĝis parsing as i-ĝis instead of iĝ-is (#23), if the root were ^ĝis instead of just ĝis.

@stephenwade
Copy link
Collaborator

Yeah, I like that idea. I think we might also find priorities useful, and parse longest-first in each priority (set "n$" to be a higher priority than others, so that it's always parsed when it matches instead of being overridden by longer roots).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants