Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

duplicate flag declarations in ES-ANY #86

Closed
ghost opened this issue May 23, 2016 · 13 comments
Closed

duplicate flag declarations in ES-ANY #86

ghost opened this issue May 23, 2016 · 13 comments
Assignees
Labels
Milestone

Comments

@ghost
Copy link

ghost commented May 23, 2016

There are multiple issues with the ES-ANY variant:
error: line 2552: multiple definitions of an affix flag
error: line 2560: multiple definitions of an affix flag
error: line 2658: multiple definitions of an affix flag
error: line 2684: multiple definitions of an affix flag
error: line 2822: multiple definitions of an affix flag
error: line 2922: multiple definitions of an affix flag
error: line 3452: multiple definitions of an affix flag
error: line 3704: multiple definitions of an affix flag
error: line 4078: multiple definitions of an affix flag
error: line 4128: multiple definitions of an affix flag
error: line 4310: multiple definitions of an affix flag
error: line 4356: multiple definitions of an affix flag
error: line 5106: multiple definitions of an affix flag
error: line 5436: multiple definitions of an affix flag
error: line 5822: multiple definitions of an affix flag
error: line 5884: multiple definitions of an affix flag
error: line 6086: multiple definitions of an affix flag
error: line 6136: multiple definitions of an affix flag
error: line 6214: multiple definitions of an affix flag
error: line 6304: multiple definitions of an affix flag
error: line 6475: multiple definitions of an affix flag
error: line 6748: multiple definitions of an affix flag

Probably it is better tu change to FLAG LONG

@Almorca Almorca self-assigned this May 31, 2016
@Almorca Almorca added this to the Versión 2.2 milestone May 31, 2016
@Almorca Almorca added the error label May 31, 2016
@Almorca
Copy link
Collaborator

Almorca commented Jun 7, 2016

Arreglo el error con el commit ebb5353

El problema es que para que hunspell considere que un fichero de afijos está en formato UTF-8 hay que añadir el flag FLAG UTF-8 ya que el flag SET UTF-8 sólo indica el formato del fichero .dic

Para comprobar el error se usa el comando
hunspell -d es_ES
para ver que el diccionario funciona correctamente y no devuelve ningún error.

@Almorca Almorca closed this as completed Jun 7, 2016
@ghost
Copy link
Author

ghost commented Jun 8, 2016

What you are saying is not exactly true.

When SET UTF-8 is used, both file sshould be in utf8.
But then, not all chars are valid in a flag.

To make things more manageble, one could consider to use FLAG long,
making plain readable two letter flags (without diacriticals) in the
affix file as well as the .dics.

When I look at the ES_ANY, there are multiple sections for the same
flag; that is really invalid.

Ruud

Almorca schreef op 2016-06-07 21:28:

Arreglo el error con el commit ebb5353 [1]

El problema es que para que hunspell considere que un fichero de
afijos está en formato UTF-8 hay que añadir el flag FLAG UTF-8 ya
que el flag SET UTF-8 sólo indica el formato del fichero .dic

Para comprobar el error se usa el comando
hunspell -d es_ES
para ver que el diccionario funciona correctamente y no devuelve
ningún error.

You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub [2], or mute the
thread [3].

Links:

[1]
ebb5353
[2] #86 (comment)
[3]
https://github.com/notifications/unsubscribe/AIFk_jHcDg8J8vpDcO8Ny4oCfBiOiUNDks5qJcZsgaJpZM4IkOly

@Almorca
Copy link
Collaborator

Almorca commented Jun 9, 2016

@baarsrj Sorry but I don't understand why we can't use all UTF-8 chars in a flag and because with FLAG long yes. Why do you thing that we should use FLAG long?

What is the problem with es_ANY? Where do you see multiple sections?

@ghost
Copy link
Author

ghost commented Jun 10, 2016

Just use es_any frim the commanline, like in hunspell -d es_any -L test.txt, and you will see errors reported.

Verzonden van mijn mobiele telefoon.

----- Bericht beantwoorden -----
Van: "Almorca" notifications@github.com
Aan: "sbosio/rla-es" rla-es@noreply.github.com
CC: "R. Baars" baarsrj@xs4all.nl, "Mention" mention@noreply.github.com
Onderwerp: [sbosio/rla-es] duplicate flag declarations in ES-ANY (#86)
Datum: do, jun. 9, 2016 20:49

@baarsrj Sorry but I don't understand why we can't use all UTF-8 chars in a flag and because with FLAG long yes. Why do you thing that we should use FLAG long?

What is the problem with es_ANY? Where do you see multiple sections?

—You are receiving this because you were mentioned.Reply to this email directly, view it on GitHub, or mute the thread.

@Almorca
Copy link
Collaborator

Almorca commented Jul 6, 2016

@baarsrj I have tested the file es_ANY and I can not see any error with the git version of dictionary. Could you post a image of your error?

@pgajdos
Copy link

pgajdos commented Dec 9, 2016

This depends on version of hunspell, it seems. I get the warning with 1.3.2, but not with 1.4.1.

@roperi
Copy link

roperi commented Mar 12, 2019

@Almorca

I am getting the same error as @baarsrj when I run a python script that calls hunspell. I do not have any version of hunspell installed but I do have libhunspell-dev installed AND no errors show when I run the same script in another server.

The dictionary version I use can be found here:
http://recursospython.com/wp-content/uploads/2016/02/hunspell-es.zip

Where I can get the git version of the dictionary? Or how do I solve the problem?

Thanks.

@roperi
Copy link

roperi commented Mar 12, 2019

Ok, people. I did a little more research and found the following comment from Caolan McNamara in this bug report at hunspell:

To fix this add "FLAG UTF-8" to your aff file to indicate that the flags are encoded UTF-8. The default mode is to assume the flags are single bytes, and in this case È and Ì are encoded as UTF-8 and are multi-byte codes and the lead byte of each multi-byte code is the same, hence the warning. The SET UTF-8 only states that the .dic is in UTF-8. Its a little odd I accept, but give "FLAG UTF-8" a go.

So I edited the es_ANY.aff and changed the line SET UTF-8 at the top of the file to FLAG UTF-8 and the error went away!

@cosmoscalibur
Copy link
Collaborator

@h-2-0 last version fixes that issue. Uncompress the OXT file and use DIC and AFF files.

@roperi
Copy link

roperi commented Mar 14, 2019

@cosmoscalibur

Thanks for letting me know. But I don't know which OXT file you talk about. I was always used the DIC and AFF files found in the zip file I linked above.

@cosmoscalibur
Copy link
Collaborator

@h-2-0 follow the link of the -> last version <-. Download OXT file of the localization that you need. Uncompress OXT, this contains DIC and AFF files.

@roperi
Copy link

roperi commented Mar 15, 2019

Thanks so much, @cosmocalibur ! I downloaded them already.

@RickieES
Copy link
Collaborator

@h-2-0, did it work? Can we mark this issue as fixed?

Thanks in advance.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

5 participants