-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WordNet part-of-speech vs Ontolex part-of-speech #58
Comments
We chose this because there were some values, e.g., However, adding some |
Ok, then I will make an attempt at that for 1.2. |
Having taken a look at it, there doesn't seem to be a way to make it compatible in a satisfactory way. Defining Lexinfo unfortunately has a definite range (the |
Not sure I get what the issue is here. I would assume that |
I am not OWL expert, that is probably the main issue ;-) What you're saying is true. I guess |
If you add a subclass axiom between That is if we add wn:partOfSpeech owl:subPropertyOf lexinfo:partOfSpeech .
wn:PartOfSpeech owl:subClassOf lexinfo:PartOfSpeech .
wn:noun owl:sameAs lexinfo:noun . # etc. Then if we have X wn:partOfSpeech wn:noun We then infer
|
Ah, that makes a lot of sense. I was stuck thinking that of course the subProperty can't extend the range of it's parent property, but yeah... if we define everything as subclasses it will technically not be doing that. Thanks a lot for explaining in this detail. I do wonder what to do about the ... As for the actual inference, I think I will also have to take another look at the Prolog-like logic rule DSL used in Apache Jena to make sure it applies the same kind of reasoning you just described in practice. The default level of OWL inference was a bit too comprehensive (and therefore slow) on the DanNet data, so to make it snappier I basically removed all statements that didn't infer inverse relationships ;-) |
It would not be compatible with this schema to use a value of part-of-speech other than ones specified. We need this to ensure interoperability. (Of course we are open to proposals for new values) |
I'm sorry @jmccrae, but I have couple of remaining questions for things that are still not quite clear to me...
|
Essentially capitalised names are for classes and lower case for values. So
Exactly |
Scanning through the Turtle file, I noticed that you define your own POS relations and classes rather than use the
lexinfo:partOfSpeech
relation which is heavily used in the Ontolex specification, which I understand that @jmccrae helped bring to life. I'm unsure why this is the case?In the Ontolex specification it is specifically stated that
I think that this is an excellent ideal as it makes integration of existing datasets mostly a matter of merging sets of triples. The second best option would be having some kind of derived
lexinfo
relation triple which can be inferred via equivalent/subclass relations.Unfortunately, the GWA schema's
partOfSpeech
relation andPartOfSpeech
class are proprietary and not linked to any external definitions. I have used Ontolex as the basis for the new version of DanNet, so my part-of-speech tags are all defined usinglexinfo:partOfSpeech
relation rather thanwn:partOfSpeech
.How do you suggest we bridge this gap? The way I see it, either version 1.2 of the schema removes this bit and datasets use
lexinfo:partOfSpeech
directly -OR- a direct equivalency tolexinfo:partOfSpeech
is established in the schema -- preferably the first as it simplifies things.I could also add both
wn
andlexinfo
relations for allLexicalEntry
classes in the new DanNet, but that's both confusing and a messy fix IMO. Better to fix the schema than work around its flaws. Having competing standards for this is not a great situation.The relevant part of the schema:
The text was updated successfully, but these errors were encountered: