Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

export get_wordchars #282

Closed
phajdan opened this issue Jun 4, 2012 · 3 comments
Closed

export get_wordchars #282

phajdan opened this issue Jun 4, 2012 · 3 comments
Milestone

Comments

@phajdan
Copy link
Contributor

phajdan commented Jun 4, 2012

Please add get_wordchars to exported library functions. Without it, the tokenization has to rely on manual parsing of the affix file.

Original comment by: milek_pl

Original Ticket: hunspell/feature-requests/41

@rrthomas
Copy link
Contributor

rrthomas commented Dec 19, 2016

It would be helpful for users of the hunspell binary (such as Emacs) if this information were also available from that program. See https://debbugs.gnu.org/cgi/bugreport.cgi?bug=17742

@rrthomas
Copy link
Contributor

I just noticed that this issue can be closed, as the get_wordchars function has been exported since at least 1.2.10 (the earliest release mentioned in git).

@dimztimz
Copy link
Contributor

Indeed. I should just note that get_wordchars() should not be used at all by client applications, it is not very reliable information. More sophisticated tokenization should be applied. Eg. the apostrophe is a character that sometimes should be taken as part of the word (when used as genetive 's) and sometimes not (when used as quotation mark).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants