New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Latest release breaks spell checking for Korean #903
Comments
bisected to: commit 05e44e0 (HEAD)
and the issue is a word of byte len 519 김수한무거북이와두루미삼천갑자동방삭치치카포사리사리센타워리워리세브리캉무드셀라구름위허리케인에담벼락서생원에고양이고양이는바둑이바둑이는돌돌이들 |
blen is an unsigned char, word is longer than that (in UTF8), so is newly correctly detected as not insertable so errors out and entire dict is discarded. Options are to leave it as is, and hunspell-ko has to remove the long entries to work, silently drop it instead of flagging an error, or make blen a bigger type |
hunspell#903 A problem since the sanity check added in: commit 05e44e0 Author: Caolán McNamara <caolanm@redhat.com> Date: Thu Sep 1 13:46:40 2022 +0100 Check word limit (hunspell#813) * check against hentry blen max
lets try making blen (and clen) unsigned short as the first port of call |
hunspell/hunspell#903 A problem since the sanity check added in: commit 05e44e069e4cfaa9ce1264bf13f23fc9abd7ed05 Author: Caolán McNamara <caolanm@redhat.com> Date: Thu Sep 1 13:46:40 2022 +0100 Check word limit (#813) * check against hentry blen max Change-Id: Iab2c062584da076260c3262537690435eae7f396 Reviewed-on: https://gerrit.libreoffice.org/c/core/+/145154 Tested-by: Jenkins Reviewed-by: Caolán McNamara <caolanm@redhat.com>
hunspell/hunspell#903 A problem since the sanity check added in: commit 05e44e069e4cfaa9ce1264bf13f23fc9abd7ed05 Author: Caolán McNamara <caolanm@redhat.com> Date: Thu Sep 1 13:46:40 2022 +0100 Check word limit (#813) * check against hentry blen max Change-Id: Iab2c062584da076260c3262537690435eae7f396 Reviewed-on: https://gerrit.libreoffice.org/c/core/+/145121 Tested-by: Jenkins Reviewed-by: Adolfo Jayme Barrientos <fitojb@ubuntu.com>
I think this warrants a 1.7.3 :) |
Even with this patch applied some parts of hunspell-kos test fail. See https://ci.debian.net/data/autopkgtest/unstable/amd64/h/hunspell-dict-ko/30142014/log.gz
|
That long string References: edit: add some more references. |
See also: https://bugzilla.redhat.com/show_bug.cgi?id=2158548
Test file:
korean.txt
That is wrong.
Downgrading to hunspell-1.7.1-1.fc38.x86_64 fixes the problem.
The text was updated successfully, but these errors were encountered: