-
Notifications
You must be signed in to change notification settings - Fork 133
Words with accents show as misspelled #241
Comments
More likely than not, it's a problem with your dictionary. Atom doesn't deal well with dictionaries that are not UTF-8 encoded. |
Probably related to #212 ? @lierdakil : How to obtain UTF-8 encoded dictionaries? |
@dvictori: I think #212 is definitely going to cause you problems even with a UTF-8 dictionary. There is a defect on node-spellchecker that is trying to fix that. Until that is resolved, I don't know if we can do much more. |
@dvictori Just find some? e.g. https://github.com/wooorm/dictionaries |
@dmoonfire, I don't have any issues described in #212. Gentoo Linux, Atom 1.28.0, I probably would if my de-DE dictionary was, say, cp1252-encoded. |
@lierdakil: I stand corrected. Does it show it spelled correctly if you have |
Additionally, I've tried converting my dictionary from UTF8 to ISO8859-1 (as is common with extended latin hunspell dictionaries), and here's what I've got: |
Oh, I know why you are behaving. I found that the I suspect if you just had |
If I just had I was under the impression that modern Linux distributions prefer UTF-8 locales. Pretty sure at least Arch and Gentoo do. |
@lierdakil Bingo! I used the dictionaries from wooorm and now atom spell check is working. Just hope it won't break any other program. So far, libreoffice and firefox spell check looks fine. It would be nice though, for users less technically inclined, to be able to use their native dictionary, that comes with the operating system, without having to change the file. |
@dmoonfire, FWIW, running Atom with |
So i installed wooorm's spanish UTF-8 dictionary with |
@ventolinmono, you can point Atom to the directory where you installed the dictionary. Check spell-check settings. |
I just copied the files from wooorm repository to |
I never know about wooorm's dictionaries. They have a MIT license, so that is reasonable. If the UTF-8 is the only thing needed, I'll try creating a couple Atom packages to install specific language dictionaries and see if that behaves; the plugin system for spell-check is designed for that. |
@dmoonfire They do not have an MIT license. Every dictionary comes with a different license! |
@edusantana this worked for me! |
On archlinux, I solved it by doing: |
I would really like to avoid converting my dictionaries into UTF-8 encoding. I'm using original dictionaries from LibreOffice, sharing them between multiple applications and I'm not sure they'll be still working after the conversion. Sure, I can try it but I would like to avoid the conversion every time I update the dictionaries anyway. The |
@ferenczy Definitely. I found these issues: LibreOffice/dictionaries#7 in the libreoffice repo. And atom/node-spellchecker#89 in atom itself |
Ideally, a conversion shouldn't be needed because most dictionary files tell you their encoding. I'm trying to get back on this to look at it, I think the underlying problem is at the C++ layer which is no longer my strength, but I have a few obligations that are getting in the way. I want to fix this, mainly because it is driving me nuts too. :) |
@dmoonfire any luck with that? Any work around? |
@edusantana: Over the last week, I worked on a PR for It just took me a while to figure out text encoding on C++ on four different platforms. |
converting latin1 files to utf8 and changing the format tag did not work for me, as it somehow gets only a subset of the dictionary so it still shows correct words as misspelled. Is there any way for me to configure a path for the dictionary in a way that this extension will get it? I don't want to risk losing other spellcheck tools as they are working properly |
Atom 1.37 has a fix for passing accented characters for spell-checking. It handles dictionaries files that aren't UTF-8 encoded. Could you please check with the beta and see if it solves the problem? Thank you. |
@dmoonfire I will try it... Thanks!!! It works now!!! Look! |
It sounds like this is resolved, so I'm going to close this issue. Feel free to open a new one. |
Prerequisites
Description
On .md and .txt files spanish words with accents showed as misspelled but they are correct. Using aspell es-ES locales.
Steps to Reproduce
Expected behavior: Spell-check should recognize correct words with accents.
Actual behavior: Atom underlines all words with accents, although they are correct.
Reproduces how often: Always.
Versions
Atom : 1.23.3
Electron: 1.6.15
Chrome : 56.0.2924.87
Node : 7.4.0
apm 1.18.12
npm 3.10.10
node 6.9.5 x64
atom 1.23.3
python 2.7.13
git 2.11.0
Debian 9.
Additional Information
Tried checking the same file with aspell on command line and works fine. It recognizes words with accents as correct. Also tried different encodings.
The text was updated successfully, but these errors were encountered: