Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is there any principle for official filters? #1988

Closed
Mosney opened this issue Aug 3, 2020 · 35 comments
Closed

Is there any principle for official filters? #1988

Mosney opened this issue Aug 3, 2020 · 35 comments
Labels

Comments

@Mosney
Copy link

Mosney commented Aug 3, 2020

I noticed anti-ad has been included in the AdGuardHome official filters 304a51f.

I suppose I should notice you this list includes some irrelevant item in it, for example, it includes many domain about FaLunGong and its news/media site which banned from China authority but actually not AD or privacy-protection-purpose. I opened an issue (see this screenshot) to the origin maintainer gentlyxu but has been deleted , he says "Yes, I extremely hate those idiot so I will never unblock it and stop foolish and don't bother me again!".

So whether AdGuardHome official recommend filters have some rules/principle? Even it contain subjective item, totally advertising irrelevant. just its maintainer's like or dislike. I think chose this list as official filter is irresponsible, not only to users but also to your reputation.

Therefore I think this list can't be trusted and collect some proof then create a new repo to let more people know what he did.
See https://github.com/Mosney/anti-anti-AD

@ameshkov
Copy link
Member

ameshkov commented Aug 3, 2020

First of all, thank you for notifying us about this!

Please give me some time, I'll look into it. If this is true, the problem with this is not just that the rules are subjective, but that people don't expect this behavior. Nowhere in the list's repo I see a warning that it'll block more than just the ads.

Anyways, what alternative Chinese-focused lists do you suggest?

@Mosney
Copy link
Author

Mosney commented Aug 3, 2020

Nowhere in the list's repo I see a warning that it'll block more than just the ads.

Yep, that is the option I suggested before, but the maintainer ignore it and delete issue.

Anyways, what alternative Chinese-focused lists do you suggest?

I list at https://github.com/Mosney/anti-anti-AD 's last, select some Chinese-focused list here.

@ameshkov
Copy link
Member

ameshkov commented Aug 3, 2020

Yep, that is the option I suggested before, but the maintainer ignore it and delete issue.

Could you please point out these non-ad domains in the list?

I list at https://github.com/Mosney/anti-anti-AD 's last, select some Chinese-focused list here.

neoHosts seems to be abandoned?

@hezhijie0327
Copy link

Xnip2020-08-03_21-08-58

@DandelionSprout
Copy link
Contributor

DandelionSprout commented Aug 3, 2020

Although I haven't looked much into this breaking-news situation, there's two things about this that took me by surprise.

  1. Like what hezhijie confirms in the comment above, I know that neoHosts used to actively disclose that it's heavily anti-Falun Gong. Looking into the lists, those entries seem to be limited to neoHosts Full, and are not in neoHosts Basic... at least not at the time of writing. Who knows what the future may bring for those lists.

  2. Since anti-ad makes official list versions for Surge and SmartDNS, tools commonly used to evade blockings in PR-China, it seems confusing (although believable) to me that it'd add entries against one of the firewalls' most attacked targets.

@ameshkov
Copy link
Member

ameshkov commented Aug 3, 2020

@DandelionSprout well, this is explained in NeoHosts readme.

image

@DandelionSprout
Copy link
Contributor

I guess that confirms my worries, which previously had been based on a much older README version that was immortalised in FilterListsCOM's database.

@ameshkov
Copy link
Member

ameshkov commented Aug 3, 2020

So it seems that at least two popular Chinese lists prefer to take some stance on political/religious issues in addition to their main purpose.

I see two options here:

  1. Add a disclaimer in the UI that these lists may be blocking domains based on their authors' political/religious, or other preferences.
  2. Don't add them to the list of "vetted" lists and keep only single-purpose lists there.

Any opinion is welcome, but since we're talking about Chinese lists, I'd like to understand what Chinese users think about it.

亲爱的中国用户们,对于这件事情你们什么看法?

@DandelionSprout
Copy link
Contributor

DandelionSprout commented Aug 3, 2020

Option 3 could've been to add an "Exclusions" section for each list, known from the FiltersRegistry repo. But in this case that'd be a shortterm solution at best.

@Mosney
Copy link
Author

Mosney commented Aug 4, 2020

Could you please point out these non-ad domains in the list?

You can search Falun gong relevant keyword below in that list,

minghui
falun
ntdtv
dongtaiwang

And another user also found some media domain blocked, still has been deleted by maintainer. See those domain Screenshot before deleted.

neoHosts seems to be abandoned?

I don't think so, just not update frequently, he adds some AdGuardHome compatible in January after I ask.

@Mosney
Copy link
Author

Mosney commented Aug 4, 2020

Since anti-ad makes official list versions for Surge and SmartDNS, tools commonly used to evade blockings in PR-China, it seems confusing (although believable) to me that it'd add entries against one of the firewalls' most attacked targets.

I guess it's PRC particular circumstance that creates a lot of paradoxical psychology, or this maintainer just want spread subjective stance quietly. I also dislike those Falun Gong's website and its believer, but the question is you shouldn't use your position to make choices for others. Even don't mention these tricks to users who trust this list. neohost also block extra domain but he make a declaration clearly and still provide non-extra list. I think that's the biggest difference between them.

@Mosney
Copy link
Author

Mosney commented Aug 4, 2020

亲爱的中国用户们,对于这件事情你们什么看法?

In my view, because integrating multiple other upstream sources and maintain frequently, anti-AD still has itself's value as an AD list. That's the reason I tried to let maintainer make some change, failed obviously.

I have two options here:

  1. Like @ameshkov option.1, add some disclaimer to explain what those lists will do, for this instance, reference repo's link enough I think.

  2. Totally abandon it, turn to collect its upstream which only have single-purpose lists, It's almost equally effective. Just lose a little part which is these maintainer efforts.

@fire-dotcom
Copy link

Anyway they may follow something righteous for people and its not a fun. i have seen this behaviour in projects under restrictions placed saudi arabia, iran and few other countries.

So it seems that at least two popular Chinese lists prefer to take some stance on political/religious issues in addition to their main purpose.

I see two options here:
Add a disclaimer in the UI that these lists may be blocking domains based on their authors' political/religious, or other preferences.

Add a disclaimer: This may includes domains considered as inappropriate by regional law and blocklist author.

@DandelionSprout
Copy link
Contributor

I've stumbled across https://raw.githubusercontent.com/Licolnlee/AdBlockList/master/AdBlock, which seems promising at a surface glance.

@liamengland1
Copy link

These are mostly not hosts lists, but here are some Chinese adblock lists:

https://wsgzao.github.io/post/adguard/
https://kbsml.com/2020/06/21/adguard%e8%bf%87%e6%bb%a4%e8%a7%84%e5%88%99/
https://github.com/otobtc/ADhosts

@hezhijie0327
Copy link

I combined an AdFilter (Subscription) suitable for China, and extracted only the rules suitable for AGH (except Easylist, Easyprivacy, Adblock-nocoin-List and CoinBlockerLists, all of which are collected for Chinese websites; also remove the dead domain or host from it).

@Mosney
Copy link
Author

Mosney commented Aug 5, 2020

I've stumbled across https://raw.githubusercontent.com/Licolnlee/AdBlockList/master/AdBlock, which seems promising at a surface glance.

I follow this and find maintainer says "Project has been included in the NeoDevHost Project. https://github.com/neodevpro/neodevhost" Looks we have many alternative lists for chosen.

@Mosney
Copy link
Author

Mosney commented Aug 5, 2020

I've stumbled across https://raw.githubusercontent.com/Licolnlee/AdBlockList/master/AdBlock, which seems promising at a surface glance.

I follow this and find maintainer says "Project has been included in the NeoDevHost Project. https://github.com/neodevpro/neodevhost" Looks we have many alternative lists for chosen.

Actually not many, same issue on Licolnlee/AdBlockList and neodevpro/neodevhost, they may all subject-pollute by same upstream. But @hezhijie0327 's repo looks good, have no extra irrelevant domain.

@ameshkov
Copy link
Member

ameshkov commented Aug 5, 2020

Please note that it's crucial for a list to be actively maintained so that there was a place where people could report issues, and they would be solved. Simple compilations of other existing lists do not fit this description.

Among all the lists, Anti-Ad still seems to be the only one that's really actively developed. NeoHosts is not as active, there's a bunch of unaswered issues in their repo.

@Mosney
Copy link
Author

Mosney commented Aug 5, 2020

Another issue none about this topic, when I discussing anti-AD in some community's group, many people mentioned this rule is too radical, sometimes functional domains are blocked as well. Although current maintainer are diligent enough to make up for it, most people will not (or don't know how) to report to maintainer, in my observe. Anyway, list only one recommend regional list not a good idea, especially this only list not good enough, you may think about add some more option for users.

@hplee0120
Copy link

Please note that it's crucial for a list to be actively maintained so that there was a place where people could report issues, and they would be solved. Simple compilations of other existing lists do not fit this description.

Among all the lists, Anti-Ad still seems to be the only one that's really actively developed. NeoHosts is not as active, there's a bunch of unaswered issues in their repo.

Yes, I used the anti-AD anti-advertising list, and then stopped using it, because the list is often incorrectly blocked and overly aggressive. Sometimes it will affect daily use. This list is more like a list tailored by the author for himself . During this period of time, I have also tried other more suitable lists, but there is no progress for the time being. Most Chinese lists seem to be incompatible with ADH.

DandelionSprout added a commit to DandelionSprout/adfilt that referenced this issue Aug 10, 2020
I think this will be of interest to AdguardTeam/AdGuardHome#1988
@ameshkov
Copy link
Member

Should we add NeoHosts Basic or yHosts or both of them?

@Mosney
Copy link
Author

Mosney commented Aug 11, 2020

Should we add NeoHosts Basic or yHosts or both of them?

I prefer to add both, at least give users choice.

@ameshkov ameshkov added this to the v0.103 milestone Aug 11, 2020
@DandelionSprout
Copy link
Contributor

DandelionSprout commented Aug 11, 2020

Based on research I did this morning, which I forgot to tell about because I've felt tired today: (+ = good; • = thing to keep in mind)

yHosts (VeleSila's list version)

+ Very large coverage.
+ Also has support for Taiwanese sites.
+ Contains entries that attempt to make PC trial periods never end, which I approve of... provided the entries actually work.
+ No entries that'd break international sites that I could see.
• Has various entries that are used to block things on non-Chinese sites, e.g. CNN and deviantART.
• Seems to block ieonline.microsoft.com for slightly unclear reasons. I approve of its blocking of toolbar.msn.com, however.

NeoHosts Basic

+ Fairly good coverage
+ No entries that'd break international sites that I could see.
+ Focuses entirely on PR-Chinese sites, with no chance of overlapping with e.g. English and Japanese lists.
• The list contains IPv4 and IPv6 entries for each domain, when only one of them would really be needed in AdGuard Home.

@simpleandstupid
Copy link

simpleandstupid commented Aug 12, 2020

I think Chinese users should be given the right to choose. Not everyone is interested in politics and religion. Most people are not associated with blocked domains.

@ameshkov ameshkov modified the milestones: v0.103, v0.104 Aug 28, 2020
@ZeroClover
Copy link

neoHosts lacks the necessary maintenance, has not been updated for a long time, and has a high number of false positives, which I find unacceptable.

Besides, yHosts maintainers seem to use QQ (a mainland Chinese IM software) more than GitHub to receive feedback, which doesn't seem to be conducive to community feedback and problem-solving.

I agree with @ameshkov, and the point is that the maintainers of the list should actively maintain these rules, even if there are some unnecessary entries added because of their personal opinions, it is effortless to override them in an ADH deployment.

@Mosney
Copy link
Author

Mosney commented Aug 31, 2020

neoHosts lacks the necessary maintenance, has not been updated for a long time, and has a high number of false positives, which I find unacceptable.

True, I created a pull request but no any action, so I prefer don't add neohost in recommend list.

Besides, yHosts maintainers seem to use QQ (a mainland Chinese IM software) more than GitHub to receive feedback, which doesn't seem to be conducive to community feedback and problem-solving.

Not exactly, yHosts' new maintainer still work on GitHub issue, list QQ group just provide various feedback channel, as same as anti-AD

@ZeroClover
Copy link

ZeroClover commented Aug 31, 2020

@ameshkov According to the license agreement indicated by the maintainer of yHosts on README.md, it is necessary to contact the maintainer to obtain the authorization before adding yHosts to ADH.

此项目仅限于个人研究。
未经作者或维护小组同意,不得擅自用于商业活动或用于其他公共项目(包括但不限于嵌入APP,浏览器等)。
不得违反国家相关法律法规。
请于测试后24小时内删除。

@Mosney Based on the topic of this issue, I think yHosts is even more out of line with your needs, as the wiki for the project explicitly states that it will block content that is not an ad or tracker.

https://github.com/VeleSila/yhosts/wiki/%E9%83%A8%E5%88%86%E9%97%AE%E9%A2%98%E8%AF%B4%E6%98%8E

I agree with @guanhemeng.

Not everyone is interested in politics and religion. Most people are not associated with blocked domains.

But the problem with yHosts is apparently much more serious than the anti-AD problem mentioned in this issue. yHosts blocks some shopping sites, the websites of mobile phone manufacturers (rather than some sort of API domains that could be used to steal personal privacy or push ads), and the entire domains of some appliance manufacturers.

I tend not to add either of the neoHosts or the yHosts to ADH.

@Mosney
Copy link
Author

Mosney commented Aug 31, 2020

@Mosney Based on the topic of this issue, I think yHosts is even more out of line with your needs, as the wiki for the project explicitly states that it will block content that is not an ad or tracker.

https://github.com/VeleSila/yhosts/wiki/%E9%83%A8%E5%88%86%E9%97%AE%E9%A2%98%E8%AF%B4%E6%98%8E

@ZeroClover Note what he mentions at the beginning,

新版数据已经将下面这部分数据放置在了moot文件夹,你们可以每次check以后不要合并这部分数据就可以了
Means: The new version of the data has placed the following part of the data in the moot folder, you can check each time without merging this part of the data.

In fact, yHosts final lists do not contain what is stated domain anymore. You can check those domains in the final list.

But the problem with yHosts is apparently much more serious than the anti-AD problem mentioned in this issue.

So that is not a problem. As license and agreement, just open an issue to request authorization, anti-AD's upstream may include yHosts cause https://github.com/privacy-protection-tools/anti-AD#special-thanks-to . But if you decide to keep anti-AD, yHosts will become unnecessary, the reason to add it only "give user choice right" left.

@ZeroClover
Copy link

@Mosney Are you sure that this repository contains the so-called "moot" directory?

https://raw.githubusercontent.com/VeleSila/yhosts/master/hosts.txt

https://github.com/VeleSila/yhosts/wiki/%E6%95%B0%E6%8D%AE%E5%90%8D%E7%A7%B0%E5%8F%8A%E5%90%AB%E4%B9%89

I'm not aware of any other lists that don't contain these unnecessary entries.

@Mosney
Copy link
Author

Mosney commented Aug 31, 2020

@Mosney Are you sure that this repository contains the so-called "moot" directory?

Sure, it contained in the original repo, the new repo just clone that statement but change the old file/directory. See https://github.com/vokins/yhosts/tree/master/data/moot

@ZeroClover
Copy link

ZeroClover commented Aug 31, 2020

@Mosney

I thought we were discussing the repository provided by the new maintainer.

But in any case, we cannot perform such complex operations in ADH.

Since the project only provides hosts files, not ADH syntax format, this also makes it difficult to simply remove unnecessary entries via the @@ qualifier.

In fact, yHosts final lists do not contain what is stated domain anymore. You can check those domains in the final list.

This makes no sense, vokins no longer maintain the project, and the new maintainer does not provide such files.

@Mosney
Copy link
Author

Mosney commented Aug 31, 2020

@ZeroClover You are right, I re-checked the new yHosts list, found moot directory domain still show in final host.txt, I thought yHosts as same as anti-AD, both of them contain subjective content. I agree not add yHosts, yHosts as same as anti-AD and anti-AD include yHosts, so yHosts is no need to be added.

@Mosney
Copy link
Author

Mosney commented Aug 31, 2020

Given the realities about Chinese regional filter, no subjective list lack maintain and good maintain list almost all contain subjective entries. I suppose this issue hard to resolve, maybe Chinese people born to deserve it.
AdGuardHome has to keep that list if they need a regional list. I think this issue may be closed if there is no more way to resolve those conflict.

@ameshkov ameshkov removed this from the v0.104 milestone Sep 1, 2020
@ameshkov
Copy link
Member

ameshkov commented Sep 1, 2020

Please reopen this issue or post to #1913 if there emerges any other notable and well-maintained Chinese filter list. So far it seems you're right, there's no Chinese filter list that does not try to block just the ads and trackers.

@ameshkov ameshkov closed this as completed Sep 1, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

9 participants