Issue with --ignore-words? #2451

jeeftor · 2022-08-15T16:08:06Z

I feel like things aren't working as they should

peternewman · 2022-08-16T11:44:12Z

Did you read our usage doc:
https://github.com/codespell-project/codespell#usage

Specifically:

Important note: The list passed to -I is case-sensitive based on how it is listed in the codespell dictionaries.

jeeftor · 2022-08-16T12:14:33Z

Um. They are all the same case???

peternewman · 2022-08-16T12:52:31Z

I previously left #2451 which was closed however I feel this was done incorrectly.

My ignore list was:

ignore.txt
FRO
THA
ND
SUR
My file I was spell checking was:

in.txt
FRO FRUM FROM
ND NORTH SOUTH
And the output of CodeSpell is:

codespell --ignore-words=ignore.txt in.txt
in.txt:1: FRO ==> FOR, FROM
in.txt:2: ND ==> AND, 2ND
As per the docs the spelling IS case sensitive... so am I missing something here?

FRO is in the ignore list - and as such shouldn't FRO in the input file not be flagged as an issue?

Grep dictionary.txt and you'll find fro is in lower case in our dictionary. Some clever magic makes it show as upper case because your input string is upper case.

Specifically:

case-sensitive based on how it is listed in the codespell dictionaries

jeeftor · 2022-08-17T14:25:41Z

If I understand correctly:

I put FRO in my ignore list... and I put FRO in my text but the built in fro (lowercase) is being flagged?

peternewman · 2022-08-17T14:36:12Z

Yes, put fro in your ignore list (as per the docs) and it should work as you intended.

andyholmes · 2022-08-18T00:13:37Z

Seems like a good way to close this would be a README.md update including such an example.

I think it's not obvious that "case-sensitive based on how it is listed in the codespell dictionaries", means that your ignore match is case-sensitive, while the dictionary match itself is case-insensitive.

peternewman · 2022-08-18T09:47:27Z

Seems like a good way to close this would be a README.md update including such an example.

Do you mean "so if you want to ignore FRO, you'll need to add fro to your ignore list, because the codespell dictionaries feature the correction fro->for?

I think it's not obvious that "case-sensitive based on how it is listed in the codespell dictionaries", means that your ignore match is case-sensitive, while the dictionary match itself is case-insensitive.

Pull requests or comments welcome for how to make that more obvious.

I must admit it's been a while since I've personally looked at that bit of code for why we don't just lower-case the ignore list but I suspect it might be to allow potential future features such as correcting javascript to JavaScript based on a case specific dictionary.

Would a warning/info notification that "you've ignored FRO; it doesn't exist in the dictionary but fro does" be useful?

Update the README to elaborate on case-sensitivity in ignored words and dictionaries. close codespell-project#2451

andyholmes · 2022-08-19T00:17:05Z

Took a stab at updating the README in #2466.

Would a warning/info notification that "you've ignored FRO; it doesn't exist in the dictionary but fro does" be useful?

I think a warning might be a good idea, I expect that's the place people will look first anyways.

mreidel-godaddy · 2022-08-23T07:41:06Z

@peternewman I just fell for the same issue: flagged was hasTable->hashable, I put hasTable in the ignore list, but it should have been hastable.

Would a warning/info notification that "you've ignored FRO; it doesn't exist in the dictionary but fro does" be useful?

That would have been very helpful indeed!

giswqs · 2022-10-06T04:59:03Z

The --ignore-words-list option seems to have no effect. The two words included in the ignored words list are still being flagged as typos. Anyone knows why?

Edit: changing the words in the ignored list to all lower case solves the issue. I think it is a bug.

nasa/Transform-to-Open-Science#275

luckman212 · 2022-11-20T20:07:40Z

What if I want aCount to be allowed/ignored but I want acount to be flagged as a misspelling? Impossible?

peternewman · 2022-11-21T01:15:43Z

What if I want aCount to be allowed/ignored but I want acount to be flagged as a misspelling? Impossible?

The ignore words are about skipping entries that are normally in our dictionary, so you can't skip a typo from the dictionary (ACOUNT) and expect some variants of it to be flagged and others to be ignored.

While I can see the benefit of this use case, I think it would be confusing for the majority of people where they want to skip a word as it's a special term in their particular domain for example.

I think two things would fix your use case, in the short term, using the ignore regex option to essentially vanish aCount from the source text codespell is checking, alternatively when we add camel-case support, that will fix it properly, with the benefit of also flagging aCont as a typo too (potentially)!

Sopor · 2022-11-23T21:19:39Z

I'm new to codespell, so maybe i do something wrong here, so please bear with me 😊

I wanted to ignore the word clientA
chain\client.lua:12: clientA ==> client

I created a config file and added
ignore-words-list = clientA
to ignore clientA, but it didn't worked.

Readme file:
Important note: The list passed to -I is case-sensitive based on how it is listed in the codespell dictionaries.

When i changed the clientA to clienta it worked.

Shouldn't i use the same case as the word i want to ignore?

andyholmes · 2022-11-23T22:06:32Z

No, that's what this issue is about clarifying. This is my current proposed change to the README:

Ignoring Words

When ignoring false positives, note that spelling errors are case-insensitive but words to ignore are case-sensitive. For example, the dictionary entry wrod will also match the typo Wrod, but to ignore it you must pass wrod.

The words to ignore can be passed in two ways:

-I: A file with a word per line to ignore::
```
 codespell -I FILE, --ignore-words=FILE
```
-L: A comma separated list of words to ignore on the command line::
```
 codespell -L word1,word2,word3,word4
```

prestoncarman · 2023-04-28T16:19:54Z

Ignoring words has been a confusing part of codespell for me. @andyholmes I like your explanation on how the ignore feature works.

After reading this threaded, I realize that the ignore list is not for the text or word in my source code, but for the codespell dictionary. This makes more sense as to why it must be lowercase.

I think adding the warning and clearing up the description in the README (using @andyholmes description above) would be helpful to new (and current) users of codespell.

peternewman added the question label Aug 16, 2022

peternewman closed this as completed Aug 16, 2022

jeeftor mentioned this issue Aug 16, 2022

Issue with --ignore-words #2452

Closed

peternewman reopened this Aug 16, 2022

peternewman mentioned this issue Aug 17, 2022

codespell 2.2.0 will not ignore the word 'Jupyter' #2458

Closed

andyholmes added a commit to andyholmes/codespell that referenced this issue Aug 18, 2022

Update README.rst

4622970

Update the README to elaborate on case-sensitivity in ignored words and dictionaries. close codespell-project#2451

andyholmes mentioned this issue Aug 18, 2022

Update README.rst #2466

Merged

CecileRobertMichon mentioned this issue Oct 14, 2022

add shouldnot to codespellignore kubernetes-sigs/cluster-api-provider-azure#2728

Merged

3 tasks

DimitriPapadopoulos closed this as completed in #2466 Apr 29, 2023

kfessel mentioned this issue Apr 19, 2024

driver/mtd_spi_nor, pkg/littlefs: improve reliability with corrupted flash (new PR) RIOT-OS/RIOT#20589

Open

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue with --ignore-words? #2451

Issue with --ignore-words? #2451

jeeftor commented Aug 15, 2022

peternewman commented Aug 16, 2022

jeeftor commented Aug 16, 2022

peternewman commented Aug 16, 2022 •

edited

ignore.txt

in.txt

jeeftor commented Aug 17, 2022

peternewman commented Aug 17, 2022

andyholmes commented Aug 18, 2022

peternewman commented Aug 18, 2022

andyholmes commented Aug 19, 2022

mreidel-godaddy commented Aug 23, 2022

giswqs commented Oct 6, 2022 •

edited

luckman212 commented Nov 20, 2022

peternewman commented Nov 21, 2022

Sopor commented Nov 23, 2022

andyholmes commented Nov 23, 2022

prestoncarman commented Apr 28, 2023

Issue with --ignore-words? #2451

Issue with --ignore-words? #2451

Comments

jeeftor commented Aug 15, 2022

peternewman commented Aug 16, 2022

jeeftor commented Aug 16, 2022

peternewman commented Aug 16, 2022 • edited

ignore.txt

in.txt

jeeftor commented Aug 17, 2022

peternewman commented Aug 17, 2022

andyholmes commented Aug 18, 2022

peternewman commented Aug 18, 2022

andyholmes commented Aug 19, 2022

mreidel-godaddy commented Aug 23, 2022

giswqs commented Oct 6, 2022 • edited

luckman212 commented Nov 20, 2022

peternewman commented Nov 21, 2022

Sopor commented Nov 23, 2022

andyholmes commented Nov 23, 2022

Ignoring Words

prestoncarman commented Apr 28, 2023

peternewman commented Aug 16, 2022 •

edited

giswqs commented Oct 6, 2022 •

edited