Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

New source: Anti popads.net #283

Closed
betterwebleon opened this issue Apr 29, 2017 · 16 comments
Closed

New source: Anti popads.net #283

betterwebleon opened this issue Apr 29, 2017 · 16 comments

Comments

@betterwebleon
Copy link

Hello,
I've found a new data source on the website FilterLists, where it scores highly.

It is called "Anti popads.net". Raw file is here. Currently it has more than 970 entries and is updated frequently. Description: "Blocks shady, annoying pop-under ads from the infamous PopAds ad network."

Could you add it among sources for your unified hosts file?

@StevenBlack
Copy link
Owner

@betterwebleon sorry for the delay. Hey thanks for pointing me to FilterLists; that's cool! I like how they present the sources and their contact facets.

I'm Currently traveling. I plan to revisit this very soon. Thanks for the suggestion.

@betterwebleon
Copy link
Author

No problem, there's no hurry. Thank you for considering it!

FilterLists is comprehensive and well structured website indeed. I found it just recently, thanks to @gorhill from his wiki page about custom filter lists for uBlock Origin (his content blocker).

@Veticia
Copy link

Veticia commented May 7, 2017

Hi, Yhonay here. I've added a list in a form of a hosts file to my git repo. I don't know how usefull it will be seeing how often popads.net adds new domains to their repertoire, but if you'll find a use for it then here it is.

@StevenBlack
Copy link
Owner

Hi @Yhonay! How are you sourcing the domains in antipopads? What's your process to find these?

@Veticia
Copy link

Veticia commented May 7, 2017

Someone linked me to a list of a few thousands sites using popads.net. I pick a few dozen at random, unobfuscate everything between <script></script> and try to download any found .js file. Then I check if it matches a few keywords found in a standard popads.net .js file. If it matches then a domain that .js was downloaded from is added to the list.

I know it's a little naive but so far it works.

@FadeMind
Copy link
Contributor

@dnmTX
Copy link
Contributor

dnmTX commented Mar 29, 2018

Finally something for them popunders. BIG THANKS FROM ME TOO !!!!

@XhmikosR
Copy link
Contributor

Has there been any progress about this? I see the original source is still active and going 🙂

Repository owner deleted a comment May 10, 2021
@MrRawes
Copy link
Contributor

MrRawes commented Apr 11, 2022

it seems like "Anti popads.net" is a dead list

@bigdargon
Copy link
Contributor

New repo anti-popads https://github.com/AdroitAdorKhan/antipopads-re

FadeMind added a commit to FadeMind/hosts.extras that referenced this issue Apr 11, 2022
@FadeMind
Copy link
Contributor

@StevenBlack PR awaits.

CC @AdroitAdorKhan

@StevenBlack
Copy link
Owner

StevenBlack commented Apr 11, 2022

Using ghosts to evaluate: this would add (27,306 - 794) = 26,512) domains to our base list, a relative increase of 26,512 / 110,297 = 24%.

ghosts -c https://raw.githubusercontent.com/AdroitAdorKhan/antipopads-re/master/formats/hosts.txt     11:21:51
----------------------------------------
Base hosts file summary:
----------------------------------------
Location: https://raw.githubusercontent.com/StevenBlack/hosts/master/hosts
Domains: 110,297
Bytes: 3.5 MB
----------------------------------------
Compared hosts file summary:
----------------------------------------
Location: https://raw.githubusercontent.com/AdroitAdorKhan/antipopads-re/master/formats/hosts.txt
Domains: 27,306
Bytes: 690 kB
Intersection: 794 domains

Looking at TLD:

ghosts -tld -m https://raw.githubusercontent.com/AdroitAdorKhan/antipopads-re/master/formats/hosts.txt
----------------------------------------
Base hosts file summary:
----------------------------------------
Location: https://raw.githubusercontent.com/AdroitAdorKhan/antipopads-re/master/formats/hosts.txt
Domains: 27,306
Bytes: 690 kB
TLD tally:  (130 unique TLD)
   com: 18,160
   top: 2,937
   xyz: 2,528
   net: 1,072
   club: 303
   info: 271
   pro: 261
   biz: 161
   site: 140
   cam: 133
   fun: 116
   me: 107
   casa: 101
   co: 96
   online: 86
   ru: 83
   life: 76
   org: 72
   pw: 36
   am: 34
   icu: 33
   one: 31
   io: 30
   space: 30
   website: 26
   name: 22
   work: 21
   gq: 16
   de: 16
   uk: 15
   link: 14
   click: 12
   tech: 11
   media: 11
   mobi: 11
   bid: 11
   cc: 11
   in: 9
   live: 9
   eu: 9
   network: 8
   tk: 7
   ml: 7
   us: 6
   nl: 6
   ga: 5
   rocks: 5
   cf: 5
   tv: 5
   best: 4
   app: 4
   ai: 4
   su: 3
   fr: 3
   to: 3
   win: 3
   cyou: 3
   sh: 3
   world: 3
   today: 3
   pl: 3
   digital: 2
   lv: 2
   ad: 2
   dev: 2
   bar: 2
   delivery: 2
   vip: 2
   house: 2
   vn: 2
   ph: 2
   sbs: 2
   au: 2
   dk: 2
   cloud: 2
   codes: 2
   by: 2
   trade: 2
   pk: 2
   za: 2
   jp: 2
   webcam: 2
   ca: 2
   wtf: 2
   my: 1
   nz: 1
   news: 1
   center: 1
   ag: 1
   agency: 1
   loan: 1
   ooo: 1
   st: 1
   ae: 1
   fm: 1
   gg: 1
   review: 1
   cfd: 1
   ws: 1
   zone: 1
   stream: 1
   zw: 1
   buzz: 1
   gt: 1
   supply: 1
   kr: 1
   se: 1
   services: 1
   download: 1
   support: 1
   il: 1
   systems: 1
   email: 1
   gs: 1
   ovh: 1
   land: 1
   global: 1
   at: 1
   uno: 1
   red: 1
   cn: 1
   glass: 1
   it: 1
   racing: 1
   is: 1
   men: 1
   marketing: 1
   re: 1
   ie: 1
   mx: 1
----------------------------------------

@StevenBlack
Copy link
Owner

StevenBlack commented Apr 11, 2022

Here is the main reason I'm tending towards declining this.

Who can explain this? 🤷🏻

We have several curated sources that watch this stuff. Many eyes watching. Over a long period of time. Some of our source lists are extremely diligent.

Now we see this proposed source where only 794 domains among 27,306 in its collection that overlap with our existing collection. Only (794 / 27,306) = 3% of the 27,306 domains overlap with all our other curated sources combined.

I would expect, what, 20% to 100% overlap? Something in that range.

Very suspicious.

Can someone make the case why bloating our list by 24% makes it a better list, nevermind a 24% better list?

@bigdargon
Copy link
Contributor

I don't know how the repo managed to collect the popup domains pretty well. Instead of me going to a website, a popup ad pops up, I review the domain log and block it.

Meaning I only collected a few domains, but here they collect so many with automatic updates. I often use it in piholes and filters in the browser. Wonderful!

@StevenBlack
Copy link
Owner

StevenBlack commented Apr 12, 2022

Hi @bigdargon I understand.

I suppose I should clarify: I'm not hearing about popup issues from people using our base list.

Now you understand, if there was a lot of popup spam happening, I would hear about it. I'm not hearing about it.

So this 24% jack in our base list size feels like a solution in search of a problem? Does that make sense?

I'm not saying there's no popup problem, I'm saying I'm not hearing about popup problems. I'm not hearing demand from userland, nor from anywhere else.

@StevenBlack
Copy link
Owner

I've decided to decline adding this list.

Closing.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

8 participants