Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bad words #5

Closed
Jivings opened this issue Nov 20, 2020 · 3 comments
Closed

Bad words #5

Jivings opened this issue Nov 20, 2020 · 3 comments

Comments

@Jivings
Copy link

Jivings commented Nov 20, 2020

I'm not sure why there is a need for curse words in the list (small.json).

image

Can we remove these or add an option to not return them?

@Jivings Jivings changed the title Swear words Bad words Nov 20, 2020
@MrXyfir
Copy link
Member

MrXyfir commented Nov 20, 2020

I would definitely like to remove these and any others that could potentially be offensive. There are word lists available we could use to strip them out automatically but I don't have the time to do that. Is this something you might be able to do and submit a PR?

@MrXyfir
Copy link
Member

MrXyfir commented Nov 26, 2020

I removed a bunch of bad/offensive words in v3.2.0

  • big.json: 370,099 -> 359,742
  • small.json: 128,660 -> 123,567

I took a very lazy approach to doing this, so surely there are some words I missed and others that were removed that shouldn't have been. Unfortunately most of the lists I've found are very incomplete so I simply combined them all together and then removed any words in big/small that started with one of the bad/offensive words.

If anyone wants to clean this up further I'd be happy to accept a pull request.

@Jivings
Copy link
Author

Jivings commented Nov 26, 2020

Excellent, thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants