Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The very base of the vocabulary (44 items) has a duplication #94

Open
kornai opened this issue Jun 14, 2016 · 1 comment
Open

The very base of the vocabulary (44 items) has a duplication #94

kornai opened this issue Jun 14, 2016 · 1 comment

Comments

@kornai
Copy link
Owner

kornai commented Jun 14, 2016

=agt, =at, =dat, =for, =from, =obl, =pat, =poss, =rel, =to, all, also, an, angry, before, can/1246, cause, characteristic, color, country, er, female, food, for, has, human, identity, inherent, is_a, lack, male, monk, next_to, not, number, or, other, part_of, person, real, round, speak, target, want

The duplication is between =for and for, both of which appear in several places.

Also suspicious: human/person (they have the same definition, just like permit/allow,
but the current algorithm doesn't prune these well).

@makrai
Copy link
Collaborator

makrai commented Jun 27, 2016

  • =FOR and FOR: I will send a pull request replacing =FOR with =TO
  • human and person are both defined as man/659, this is OK in 4lang. This is a bad property of this 44-element DV, but such sub-optimalities may appear as the problem is NP-hard.
  • permit is defined as allow, which in turn is defined with more basic elements (=AGT SAY =DAT[=PAT[can/1246]], say CAUSE =DAT[=PAT[can/1246]]) this is also OK

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants