Some default words not censored #21

jcbrockschmidt · 2020-12-01T03:06:28Z

As of 0.7.0, the words "shi+" and "sh!+" have been added to the default wordlist. But they are not censored. Should we...

1. Remove them from the word list.
2. Add "+" to ALLOWED_CHARACTERS (and optionally add "+" to CHARS_MAPPING for "t").

Note that if we go with option 2, profanity separated by "+" (e.g. "fuck+fuck") will no longer be censored.
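
To make the trade-off concrete, here is a minimal sketch of a tokenizer-based censor. It is not the library's implementation: `BASE_ALLOWED`, `WORDLIST`, and `censor` are hypothetical stand-ins, and "darn" is a placeholder swear word. It assumes only that text is split into maximal runs of allowed characters and each run is looked up in the word list.

```python
import string

# Hypothetical stand-ins, not the library's real constants or word list.
BASE_ALLOWED = set(string.ascii_letters + string.digits + "@$*\"'")
WORDLIST = {"darn", "shi+"}


def censor(text, allowed, wordlist=WORDLIST):
    """Replace each maximal run of allowed characters that is in wordlist."""
    out, token = [], ""
    for ch in text + "\0":  # the sentinel flushes the final token
        if ch in allowed:
            token += ch
            continue
        out.append("****" if token.lower() in wordlist else token)
        token = ""
        if ch != "\0":
            out.append(ch)  # keep the separator as-is
    return "".join(out)


with_plus = BASE_ALLOWED | {"+"}
print(censor("shi+", BASE_ALLOWED))       # shi+       ("+" splits the token)
print(censor("shi+", with_plus))          # ****
print(censor("darn+darn", BASE_ALLOWED))  # ****+****
print(censor("darn+darn", with_plus))     # darn+darn  (one unknown token)
```

Under these assumptions, allowing "+" fixes the word-list entry but lets "+"-joined profanity through, which is the regression noted above.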

snguyenthanh · 2020-12-04T17:12:48Z

What do you think about adding ! to char i and + to char t in the CHAR_MAPPING variable: https://github.com/snguyenthanh/better_profanity/blob/master/better_profanity/better_profanity.py#L33-L42
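
A rough sketch of what that mapping change would produce, assuming variants are generated as the Cartesian product of per-character substitutions. The mapping contents and the `variants` helper below are hypothetical; the real table is at the link above. The sketch covers variant generation only, not how input text is tokenized.

```python
from itertools import product

# Hypothetical mapping for illustration only.
CHAR_MAPPING = {
    "i": ("i", "1", "!"),
    "t": ("t", "7", "+"),
}


def variants(word, mapping=CHAR_MAPPING):
    """Return every spelling of word obtainable through the mapping."""
    choices = [mapping.get(ch, (ch,)) for ch in word]
    return {"".join(combo) for combo in product(*choices)}


forms = variants("shit")
print(len(forms))                        # 9
print("shi+" in forms, "sh!+" in forms)  # True True
```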

jcbrockschmidt · 2020-12-07T03:07:33Z

@snguyenthanh I think we should, yeah. That will fix the issue for these particular words.

We should also display a warning to let the user know when a word or phrase in the word list is invalid and won't be censored.

jcbrockschmidt · 2020-12-07T03:23:16Z

I tried adding "!" to ALLOWED_CHARACTERS and it caused test_unicode_vietnamese_2 to fail:

FAIL: test_unicode_vietnamese_2 (__main__.ProfanityUnicodeTestVietnamese)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "tests.py", line 176, in test_unicode_vietnamese_2
    self.assertEqual(profanity.censor(bad_text), censored_text)
AssertionError: 'Con chó sủa **** gâu!' != 'Con chó sủa **** ****!'
- Con chó sủa **** gâu!
?                  ^^^
+ Con chó sủa **** ****!
?                  ^^^^

When "!" is an allowed character, a swear word immediately followed by "!" is no longer censored, because the "!" is read as part of the word.

Here's a unit test we can use to test punctuation:

    def test_punctuation(self):
        bad_text = "Holy shit! Oh fuck, damn. What the hell? Shut up, asshole..."
        censored_text = "Holy ****! Oh ****, ****. What the ****? Shut up, ****..."
        self.assertEqual(profanity.censor(bad_text), censored_text)
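
The failure mode can be reproduced in isolation. This is a sketch under one assumption, namely that words are matched as whole tokens made of allowed characters; `tokenize` and `WORDLIST` are hypothetical names, not part of the library.

```python
import re

WORDLIST = {"shit"}  # hypothetical one-word list


def tokenize(text, extra_allowed=""):
    """Split text into runs of word characters plus any extra allowed ones."""
    pattern = "[\\w" + re.escape(extra_allowed) + "]+"
    return re.findall(pattern, text)


print(tokenize("Holy shit!"))       # ['Holy', 'shit']
print(tokenize("Holy shit!", "!"))  # ['Holy', 'shit!']
# 'shit!' is not in WORDLIST, so the word would go uncensored.
```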

jcbrockschmidt mentioned this issue Dec 1, 2020

Add unit test for checking all default swear words #24

Merged

jcbrockschmidt added the bug and discuss labels Dec 7, 2020

jcbrockschmidt mentioned this issue Dec 7, 2020

Add benchmarking code and unit tests for full paragraphs #26

Merged
