Merge pull request #410 from sunnydavis/parole-amazon-cloudfront
Remove Amazon CloudFront from the crawlers
JayBizzle committed Sep 2, 2020
2 parents 0ffea34 + 8581fd0 commit 3c8104b
Showing 4 changed files with 13 additions and 16 deletions.
2 changes: 1 addition & 1 deletion raw/Crawlers.json

Large diffs are not rendered by default.

23 changes: 11 additions & 12 deletions raw/Crawlers.txt
@@ -1,13 +1,13 @@
YLT
^b0t$
-^bluefish
+^bluefish
^Calypso v\/
^Corax
^COMODO DCV
^DangDang
^DavClnt
^DHSH
-^FDM
+^FDM
^git\/
^Goose\/
^Grabber
@@ -24,7 +24,7 @@
^PHP\/[0-9]
^RMA\/
^Ruby|Ruby\/[0-9]
-^Swurl
+^Swurl
^VSE\/[0-9]
^WordPress\.com
^XRL\/[0-9]
@@ -75,7 +75,6 @@ allloadin
AllSubmitter
alyze\.info
amagit
-^Amazon CloudFront$
^Amazon Simple Notification Service Agent$
Anarchie
AndroidDownloadManager
@@ -516,7 +515,7 @@ httpunit
HttpUrlConnection
httrack
huaweisymantec
-HubSpot
+HubSpot
Humanlinks
i2kconnect\/
Iblog
@@ -685,7 +684,7 @@ MetaURI
MFC_Tear_Sample
Microsearch
Microsoft\.Data\.Mashup
-Microsoft Office
+Microsoft Office
Microsoft Outlook
Microsoft Windows Network Diagnostics
Microsoft-WebDAV-MiniRedir
@@ -882,7 +881,7 @@ ProWebWalker
proximic
PRTG Network Monitor
pshtt, https scanning
-PTST
+PTST
PTST\/[0-9]+
Pump
python-httpx
@@ -1045,7 +1044,7 @@ speedy
SPEng
Spinn3r
spray-can
-Sprinklr
+Sprinklr
spyonweb
sqlmap
Sqlworm
@@ -1067,7 +1066,7 @@ summify
SuperHTTP
Surphace Scout
Suzuran
-swcd
+swcd
Symfony BrowserKit
Symfony2 BrowserKit
SynHttpClient-Built
@@ -1215,7 +1214,7 @@ WebIndex
webkit2png
WebLeacher
webmastercoffee
-webmon
+webmon
WebPix
WebReaper
WebSauger
@@ -1311,12 +1310,12 @@ Zemanta Aggregator
Zend_Http_Client
Zend\\Http\\Client
Zermelo
-Zeus
+Zeus
zgrab
ZnajdzFoto
ZnHTTP
Zombie\.js
Zoom\.Mac
ZoteroTranslationServer
ZyBorg
-[a-z0-9\-_]*(bot|crawl|archiver|transcoder|spider|uptime|validator|fetcher|cron|checker|reader|extractor|monitoring|analyzer|scraper)
+[a-z0-9\-_]*(bot|crawl|archiver|transcoder|spider|uptime|validator|fetcher|cron|checker|reader|extractor|monitoring|analyzer|scraper)
1 change: 0 additions & 1 deletion src/Fixtures/Crawlers.php
@@ -96,7 +96,6 @@ class Crawlers extends AbstractProvider
'AllSubmitter',
'alyze\.info',
'amagit',
-'^Amazon CloudFront$',
'^Amazon Simple Notification Service Agent$',
'Anarchie',
'AndroidDownloadManager',
3 changes: 1 addition & 2 deletions tests/crawlers.txt
@@ -3523,7 +3523,6 @@ php-requests/1.7
nghttp2/1.40.0
WinHTTP/1.1
Mozilla/5.0 (compatible; Javelin; +https://about.javelin.io/)
-Amazon CloudFront
swcd (unknown version) CFNetwork/1125.2 Darwin/19.4.0
YahooMailProxy; https://help.yahoo.com/kb/yahoo-mail-proxy-SLN28749.html
Mozilla/5.0 (Java) outbrain
@@ -3560,4 +3559,4 @@ postplanner.com/site-scraping
Nuclei (@pdiscoveryio)
myseosnapshot/1.0
Contextual Code Sites Explorer
-Corax - support@coraxcyber.com
+Corax - support@coraxcyber.com