New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Language-specific PronTypes in Italian #353
Comments
Hi Dan, we discussed (with Simonetta, Cristina and the rest of the team) your proposal for UD v2 concerning Italian PronTypes and here are the conclusions.
Maria & c.
|
Great, thanks. As for the pronominal ordinals, I did not know that you were actually distinguishing specific usage. I am still slightly in favor of keeping them as |
http://universaldependencies.org/it/feat/PronType.html
http://universaldependencies.org/v2/features.html
When looking for possible extensions of the feature set for UD v2, I came across several Italian-specific values of the
PronType
feature:Clit
,Predet
andOrd
. I was wondering whether there are possibly better ways of annotating these words.The examples in the documentation of
PronType=Clit
seem similar to what is labeled as personal pronoun (PronType=Prs
) in other languages (including e.g. Spanish). Many languages have two possible forms of personal pronouns, short/clitic, and full/nonclitic. The usual solution is that both forms arePronType=Prs
, and another, language-specific feature distinguishes between the forms.Variant=Short
is used in some treebanks but maybe we could define something likeClitic=Yes
. Or are there reasons in Italian to say that mi, lo, si etc. are notPrs
? @msimi @SimonettaMontemagni @alessandrolenciAnother specific value is
PronType=Predet
. Used for (pre)determiners like tutti “all”, entrambi “both”. Again, I think that being placed before another determiner is a property orthogonal to our pronoun types. These two instances should be simplyPronType=Tot
.And finally,
PronType=Ord
. Used for ordinal numerals like primo “first”, secondo “second”, terzo “third”. In UD, these are not pronouns or determiners but adjectives (ADJ
). And their ordinal status should be marked byNumType=Ord
, which is a universal feature. So unless I am missing something, I believe thatPronType=Ord
should be removed from Italian.The text was updated successfully, but these errors were encountered: