Chinese domains #50

Closed
egorsmkv opened this issue Feb 15, 2016 · 5 comments

Comments

@egorsmkv

egorsmkv commented Feb 15, 2016

Why are Chinese domains in the list? These are not temporary addresses.

  • qq.com
  • sohu.com
  • 21cn.com
  • yeah.net
  • naver.com

And others.

@egorsmkv changed the title from "China domains" to "Chinese domains" on Feb 15, 2016
@FGRibreau
Owner

Good question. It seems the community considers them spam domains. I will wait for a few "+1"s before removing some :)

@trisix
Contributor

trisix commented Apr 8, 2016

+1 for this.

IMHO, although some Chinese domains do have a higher chance of being used by Chinese spammers, they are also used by lots of normal users from China, and even by some people who just work there. I think for those users in China, the domain "qq.com" is just like "gmail.com" or "yahoo.com" for a U.S. resident or for people who work in the U.S.

On the other hand, even though Google can be trusted most of the time, you still can't stop spammers from creating lots of fake Gmail accounts and using them to spam your website.

As a developer from Taiwan, I can say our service gets some users who keep posting spam from time to time. They use "qq.com", "hotmail.com", "gmail.com", etc. as their email domains, and some even use fake Facebook accounts. But we know that we cannot ban those email domains or Facebook completely, because there are more regular users who also use them. So we have to use other ways to deal with those problems. (FYI, we extract features from their content and use something simple, like Naive Bayes spam filtering, to deal with it. Since the spam content mostly comes from some specific domain, it does have some patterns or keywords.)
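
For readers unfamiliar with the content-based approach mentioned above, here is a minimal sketch of Naive Bayes spam filtering in Python. It is not the implementation used by that service: the `NaiveBayesSpamFilter` class, the `tokenize` helper, and the toy training messages are all hypothetical and exist only to illustrate classifying by content keywords instead of by email domain.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens (hypothetical helper)."""
    return re.findall(r"[a-z0-9']+", text.lower())


class NaiveBayesSpamFilter:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing.

    Illustrative sketch only; both labels must be trained at least once
    before scoring, otherwise the class prior would be log(0).
    """

    LABELS = ("spam", "ham")

    def __init__(self):
        self.word_counts = {label: Counter() for label in self.LABELS}
        self.doc_counts = {label: 0 for label in self.LABELS}

    def train(self, text, label):
        """Record one labeled message."""
        self.doc_counts[label] += 1
        self.word_counts[label].update(tokenize(text))

    def spam_log_odds(self, text):
        """Return log P(spam | text) - log P(ham | text), up to a shared constant."""
        vocab = set(self.word_counts["spam"]) | set(self.word_counts["ham"])
        total_docs = sum(self.doc_counts.values())
        scores = {}
        for label in self.LABELS:
            total_words = sum(self.word_counts[label].values())
            # Class prior: fraction of training messages with this label.
            log_prob = math.log(self.doc_counts[label] / total_docs)
            for word in tokenize(text):
                if word not in vocab:
                    continue  # ignore words never seen during training
                count = self.word_counts[label][word]
                log_prob += math.log((count + 1) / (total_words + len(vocab)))
            scores[label] = log_prob
        return scores["spam"] - scores["ham"]

    def is_spam(self, text):
        return self.spam_log_odds(text) > 0


# Toy training data, invented for this example.
spam_filter = NaiveBayesSpamFilter()
spam_filter.train("cheap watches buy now", "spam")
spam_filter.train("buy cheap pills now", "spam")
spam_filter.train("thanks for the release", "ham")
spam_filter.train("how do I install the library", "ham")

print(spam_filter.is_spam("buy cheap watches"))
print(spam_filter.is_spam("thanks for the library"))
```

The point of the design is that the decision depends only on what a message says, so a legitimate user with a "qq.com" address is not penalized for the domain itself.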

BTW, I think "naver.com" is a Korean domain. "Line", the popular IM app in Asia, is their best-known product.

: )

@dlackty
Contributor

dlackty commented Jun 13, 2016

@FGRibreau Some domains on this list have Wikipedia pages and in general they're quite common and popular in China:

@egorsmkv
Author

Thx!

@FGRibreau
Owner

👍 released in v3.0.8


4 participants