Bridgy source should rename blacklist and similar offensive terms #908

Closed
tantek opened this issue Jan 9, 2020 · 1 comment

Comments

@tantek
Contributor

tantek commented Jan 9, 2020

Bridgy had a file called domain_blacklist.txt which was renamed to domain_blocklist.txt in a recent commit. This issue is for searching / tracking any other similar terms as identified by Microsoft in Chrome Issue 981129.

While we don’t have access to Microsoft’s PoliCheck tool mentioned therein, we can at least use the results they came up with in their searches of Chrome source as a first level approximation of terms to search for. E.g.:

  • change "master"/"slave" to "writer"/"reader"

Other terms may require manual inspection for context to evaluate whether they are offensive or not. E.g.:

  • cracker
  • ho
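As a rough illustration of that kind of search, here is a minimal Python sketch that scans text for a term list and returns each hit with its line, so a person can judge the context. It is not PoliCheck and not anything from the Bridgy source. The names `TERMS`, `build_pattern`, `scan_text`, and `scan_tree`, and the default file suffixes, are all hypothetical. The term list is just the examples from this issue. Matching treats a term as a hit only when it is not surrounded by other letters, so `domain_blacklist.txt` matches `blacklist` but `who` does not match `ho`.

```python
import re
from pathlib import Path

# Hypothetical term list, taken only from the examples named in this issue.
TERMS = ["blacklist", "master", "slave", "cracker", "ho"]


def build_pattern(terms):
    """Compile one case-insensitive regex matching any term that is not
    directly preceded or followed by another ASCII letter."""
    alternation = "|".join(re.escape(t) for t in terms)
    return re.compile(
        r"(?<![A-Za-z])(?:" + alternation + r")(?![A-Za-z])",
        re.IGNORECASE,
    )


def scan_text(text, pattern):
    """Return (line_number, matched_term, stripped_line) for every hit."""
    hits = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        for match in pattern.finditer(line):
            hits.append((lineno, match.group(0).lower(), line.strip()))
    return hits


def scan_tree(root, pattern, suffixes=(".py", ".txt", ".html", ".md")):
    """Scan files under root with one of the given suffixes (an assumed
    default set) and return {path: hits} for files with at least one hit."""
    results = {}
    for path in Path(root).rglob("*"):
        if path.is_file() and path.suffix in suffixes:
            text = path.read_text(encoding="utf-8", errors="replace")
            hits = scan_text(text, pattern)
            if hits:
                results[str(path)] = hits
    return results
```

Calling `scan_tree(".", build_pattern(TERMS))` from a checkout would list candidate lines per file. The letter-boundary rule is a deliberate tradeoff: it cuts down false positives for short terms, but it also misses plurals and camelCase identifiers such as `domainBlacklist`, so it is only a first pass before manual review.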

Feel free to note other text profanity, geopolitical, or diversity issues in comments on this issue as discovered by PoliCheck analysis or other tools. (Please refrain from general offensive brainstorming, though; instead, link to existing work and lists by others. Thanks.)

(Originally published at: https://tantek.com/2020/008/b1/bridgy-source-rename-offensive-terms)

@snarfed
Owner

snarfed commented Jan 13, 2020

thanks again for the nudge!

i've confirmed that none of the words mentioned here appear in bridgy, granary, oauth-dropins, or webutil. i've also checked for the standard english curse words, and only found a couple instances of hell in test data in granary:

https://github.com/snarfed/granary/blob/2f62e7ff6ccfed5d65c2db09241a77698dfa2277/granary/tests/test_facebook.py#L1157
https://github.com/snarfed/granary/blob/2f62e7ff6ccfed5d65c2db09241a77698dfa2277/granary/tests/testdata/facebook.m.post.html#L226

as mentioned, PoliCheck is private, and similar public tools i've looked at focus more on profanity, as opposed to offensive words. so, i'm tentatively closing, but i'm happy to reopen if anyone wants to suggest a specific tool and usage details. (or even better, feel free to run it and send pull requests!)

@snarfed snarfed closed this as completed Jan 13, 2020