Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parsing inseparable prepositions with the definite article #4

Closed
dowens76 opened this issue Jul 13, 2013 · 6 comments
Closed

Parsing inseparable prepositions with the definite article #4

dowens76 opened this issue Jul 13, 2013 · 6 comments

Comments

@dowens76
Copy link
Member

Currently the schema parses inseparable prepositions with the article in a strange way that could cause problems. As an example, consider:
לַ/חֹ֣דֶשׁ
The lemma is l/2320. This term includes two segments, the preposition and the noun. It's parsing, however, would involve three: HR/Tp/Ncmsa.
Perhaps it would be better to parse the definite article as an attribute of the preposition, so: HRd/Ncmsa, the Rd representing "preposition with the definite article."

@DavidTroidl
Copy link
Member

Something has to be done, one way or the other. The site won't allow
HR/Td/Ncmsa. Would allowing HRd/Ncmsa cover the b, k and l
prepositions? I'd have to redo the morphology codes and the code parser
to make it work. It shouldn't be a problem. Otherwise it would mess up
the segmentation. Let me know.

On 7/13/2013 1:48 AM, Daniel Owens wrote:

Currently the schema parses inseparable prepositions with the article
in a strange way that could cause problems. As an example, consider:
לַ/חֹ֣דֶשׁ
The lemma is l/2320. This term includes two segments, the preposition
and the noun. It's parsing, however, would involve three: HR/Tp/Ncmsa.
Perhaps it would be better to parse the definite article as an
attribute of the preposition, so: HRd/Ncmsa, the Rd representing
"preposition with the definite article."


Reply to this email directly or view it on GitHub
#4.

@dowens76
Copy link
Member Author

I believe it would work for all three of the inseparable prepositions. They behave the same. I think we should do this.

@dowens76
Copy link
Member Author

I am assuming that this is the consensus now, using "HRd" for prepositions with the definite article. The default is without and needs no further code. I am closing this issue and will modify the parsing schema. The necessary changes will need to be made in the website.

@DavidTroidl
Copy link
Member

I agree that this is the right solution. I'll have to look at the
mechanics of the morph parser, to see how to fit this in.

David

On 7/16/2013 10:24 PM, Daniel Owens wrote:

I am assuming that this is the consensus now, using "HRd" for
prepositions with the definite article. The default is without and
needs no further code. I am closing this issue and will modify the
parsing schema. The necessary changes will need to be made in the website.


Reply to this email directly or view it on GitHub
#4 (comment).

@dowens76 dowens76 reopened this Jul 19, 2013
@dowens76
Copy link
Member Author

In light of private email conversation, we have not resolved this yet. In the interest of simplicity, we do not want to add an attribute of the preposition part of speech. Currently we have a particle type:

*p = definite article with inseparable preposition

If we want to use that, we should bring all prepositions under particles and just use two types. For the preposition we cannot use "Tr" because that currently signifies a relative. I think it would be easier to make a regular preposition "p" because it is memorable. What about these two particle types:

*p = preposition
*q = inseparable preposition with the definite article

That keeps the two close together, and "q" is a little bit like "p+." At least, it makes sense to me.

@DavidTroidl
Copy link
Member

This has been resolved in the Hebrew Morphology Codes.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants