Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

splitting of the pronoun "feara makkárge" #38

Open
lynnda-hill opened this issue Jan 18, 2022 · 4 comments
Open

splitting of the pronoun "feara makkárge" #38

lynnda-hill opened this issue Jan 18, 2022 · 4 comments
Assignees
Labels
bug Something isn't working

Comments

@lynnda-hill
Copy link
Contributor

lynnda-hill commented Jan 18, 2022

The attributive multi-word pronoun "feara makkárge" is automatically tokenized with a separate particle "ge" before any Constraint Grammar analysis.
For grammar rules it is a disadvantage to have a particle between an attributive pronoun and its nominal head.
It is also strange when two separate elements are analyzed as a MWE just to split the element in two.

Example sentence:

Olbmuid dieđut eará máilmmeosiin bođii mátkegirjjálašvuođas, man čállin ledje feara makkárge johttit dego mišunárat, jesuihtat, sisafárrejeaddjit, šlávagávppašeaddjit ja soalddáhat.

Analysis:

"<feara makkár>"
"feara makkár" Pron Indef Attr <W:0.0> @OBJ> SELECT:14209:r1569 MAP:23953:r484 #11->11 SUBSTITUTE:10139
; "feara makkár" MWE Pron Indef Sg Nom <W:0.0> SELECT:14209:r1569
"<ge>"
"ge" Pcle <W:0.0> @PCLE MAP:22078:r16 #12->12
:
"<johttit>"
"johtit" Ex/V IV Der/NomAg N Pl Nom <W:0.0> @<SPRED MAP:23586:r3349 &real-ImprtPl2-Inf #13->13 ADD:6065:real-ImprtPl2-Inf
real-ImprtPl2-Inf
"johtit" <W:0.0> @<SPRED MAP:23586:r3349 V IV Inf &SUGGEST #13->13 ADD:6065:real-ImprtPl2-Inf COPY:6211:real-ImprtPl2-Inf
johtit+V+IV+Inf johtit
"johtti" N NomAg Sem/Hum Pl Nom <W:0.0> @<SPRED MAP:23586:r3349 #13->13
; "johtit" V <ala-V> <eret> <rasta> <birra> <IN-Com-Veh> <XT-Acc-Measure> <SO-luhtte-Ani> <DE-Ill-Plc> <DE-sisa-Build> <DE-lusa-Ani> <PT-Gen-Plc><DE-Ill-Any> <PT-Gen-Plc> <PT-rastá-Plc> <PT-meaddel-Plc> <PT-čađa-Plc> <PT-bokte-Plc> <SO-Loc-Ani><DE-Ill-Ani> <SO-Loc-*Ani> <CO-mielde-Ani> <LO-luhtte-Any> <LO-Loc-Plc> IV Imprt Pl2 <W:0.0> SUBSTITUTE:3141 SUBSTITUTE:3174 SUBSTITUTE:3725 SUBSTITUTE:3806 SUBSTITUTE:3810 SUBSTITUTE:3879 SUBSTITUTE:3881 SUBSTITUTE:3886 SUBSTITUTE:3891 SUBSTITUTE:3893 SUBSTITUTE:3895 SUBSTITUTE:3978 SUBSTITUTE:3980 SUBSTITUTE:3987 SUBSTITUTE:3989 SUBSTITUTE:4017 SUBSTITUTE:4098 SUBSTITUTE:4673 SUBSTITUTE:4688 SUBSTITUTE:4714 @+FMAINV SUBSTITUTE:9160 MAP:16650:r406 REMOVE:6102:r948
@lynnda-hill lynnda-hill added the bug Something isn't working label Jan 18, 2022
@snomos
Copy link
Member

snomos commented Jan 18, 2022

So what do you want the output / analysis to be?

@lynnda-hill
Copy link
Contributor Author

lynnda-hill commented Jan 18, 2022

I would like to get:

"<feara makkárge>"

instead of:

"<feara makkár>"

@snomos
Copy link
Member

snomos commented Nov 16, 2023

After the latest changes in the clitics.lexc file, the analysis is now:

"<feara makkár>"
	"feara makkár" Pron Indef Attr <W:0.0> @OBJ> #11->11
"<ge>"
	"ge" Pcle Foc/Neg-ge <W:0.0> @PCLE #12->12
	"ge" Pcle Foc/Pos-ge <W:0.0> @PCLE #12->12

To avoid this split, we need to lexicalise the pronoun including the clitic. That will give us two analyses that needs to be disambiguated in the mwe-dis.cg3 file. If that is still not good, we need to remove feara makkár from the clitics altogether.

Any preferences, @lynnda-hill and @duomdaamaendra ?

@snomos
Copy link
Member

snomos commented Nov 16, 2023

According to the lexc code, feara makkár can take any clitic. Is this true? What is the status of ge in feara makkárge?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

4 participants