Use utf8 throughout... #16

Open
lyda opened this Issue · 0 comments

@lyda
Owner

One entry from the misspellings list is currently left out, gardai -> gardaí, because UTF-8 is not supported.

1) Change the code so that UTF-8 is supported.
2) Add a test case that includes "gardaí".
3) Revisit commit 475fe97 and _WORD_REGEX. Change it to use findall, and have it list the characters it allows in words rather than the characters that are not in words. It could even generate that list dynamically from the wordlist it is using.
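
A rough sketch of what item 3 might look like, assuming Python 3 and a wordlist held as a dict of misspelling -> correction. The names SAMPLE_WORDLIST, build_word_regex and find_words are made up for illustration and are not taken from the project's code.

```python
import re

# Hypothetical sample of wordlist entries (misspelling -> correction).
SAMPLE_WORDLIST = {
    "gardai": "gardaí",
    "teh": "the",
    "dont": "don't",
}


def build_word_regex(wordlist):
    """Compile a regex matching runs of characters that occur in the wordlist.

    The character class is generated from both the misspellings and their
    corrections, so "í" is allowed as soon as "gardaí" is in the list.
    """
    chars = set()
    for misspelling, correction in wordlist.items():
        chars.update(misspelling)
        chars.update(correction)
    # Escape each character so "-", "]" and "^" are safe inside the class.
    char_class = "".join(re.escape(ch) for ch in sorted(chars))
    return re.compile("[" + char_class + "]+", re.IGNORECASE)


def find_words(text, word_regex):
    """Return every run of allowed characters found in text."""
    return word_regex.findall(text)


if __name__ == "__main__":
    regex = build_word_regex(SAMPLE_WORDLIST)
    for word in find_words("Teh gardai dont agree.", regex):
        if word.lower() in SAMPLE_WORDLIST:
            print(word, "->", SAMPLE_WORDLIST[word.lower()])
```

One caveat with generating the list dynamically: any character that never appears in the wordlist acts as a separator, so a small wordlist will chop up ordinary words that contain other letters.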

@myint myint referenced this issue from a commit in myint/misspellings
@myint myint Add Unicode support
This addresses items 1 and 2 of issue #16.
4412b7b
@myint myint referenced this issue from a commit in myint/misspellings
@myint myint Split on all non-words
Previously, there were some special cases (like "<"). This change takes
care of all non-words instead of just special cases. This resolves item
3 of issue #16 in an alternate way.
d2bc0b2
@myint myint referenced this issue from a commit in myint/misspellings
@myint myint Add Unicode support
This addresses items 1 and 2 of issue #16.
a5894e1
@myint myint referenced this issue from a commit in myint/misspellings
@myint myint Split on all non-words
Previously, there were some special cases (like "<"). This change takes
care of all non-words instead of just special cases. This resolves item
3 of issue #16 in an alternate way.
a32d0fc
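
For comparison, the "split on all non-words" alternative described in the commit messages above could look roughly like this. This is only a sketch assuming Python 3, where \W is Unicode-aware for str patterns; split_words and _NON_WORD are hypothetical names, and the referenced commits may do it differently.

```python
import re

# One or more characters that are not letters, digits or underscore.
# For str patterns in Python 3 this is Unicode-aware, so "í" counts as a
# word character and "<", ">", "/" and whitespace all act as separators.
_NON_WORD = re.compile(r"\W+")


def split_words(line):
    """Split a line on every run of non-word characters, dropping empties."""
    return [token for token in _NON_WORD.split(line) if token]


if __name__ == "__main__":
    print(split_words("<b>gardaí</b> said"))
```

Note that an apostrophe is also a non-word character here, so "don't" comes back as "don" and "t". Whether that matters depends on the entries in the wordlist.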