Qwant blocked #5

Closed
fl02 opened this issue Jan 7, 2017 · 3 comments

Comments

@fl02
Contributor

fl02 commented Jan 7, 2017

Why did you decide to block the Qwant-bot? It's a legitimate search engine which reads and obeys the robots.txt.

@mitchellkrogza
Owner

mitchellkrogza commented Jan 7, 2017

I assume you are referring to Qwantify? It must have lingered in there since my original lists. It has been removed in the latest commit, 20dbd00.

Thanks for alerting me to this. This one must have slipped through from one of my very early bad bot lists. Will go through bot lists again for a good double check. Please notify me if you spot anything else.

@eurobank

eurobank commented Jun 8, 2018

Qwant/Qwantify doesn't read/respect robots.txt.

@chrisastley

I'm having big issues with a group of user agents/bots that don't respect robots.txt and are crawling the same URLs over and over.

Qwantify
VelenPublicWebCrawler
CCBot
ZoominfoBot
Seekport

They appear to be linked: they all crawl the same handful of pages at the same rate, they all ignore robots.txt, and I often find 3-4 of the above bots hitting the same page in the same second, continually.
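
One way to check the "same page in the same second" pattern is to group access-log entries by timestamp and path. The following is only a minimal sketch: it assumes the server writes the common "combined" log format, and the names `SUSPECT_BOTS`, `LOG_LINE`, `bot_name` and `simultaneous_hits` are hypothetical helpers, not part of any blocker project.

```python
import re
from collections import defaultdict

# User-agent substrings taken from the list above.
SUSPECT_BOTS = ["Qwantify", "VelenPublicWebCrawler", "CCBot", "ZoominfoBot", "Seekport"]

# ASSUMPTION: "combined" access-log layout; adjust the pattern if your server logs differently.
LOG_LINE = re.compile(
    r'\S+ \S+ \S+ \[(?P<ts>[^\]]+)\] "(?:\S+) (?P<path>\S+)[^"]*" '
    r'\d{3} \S+ "[^"]*" "(?P<ua>[^"]*)"'
)


def bot_name(user_agent):
    """Return the first suspect bot whose name appears in the user agent, else None."""
    lowered = user_agent.lower()
    for name in SUSPECT_BOTS:
        if name.lower() in lowered:
            return name
    return None


def simultaneous_hits(lines, min_bots=2):
    """Map (timestamp, path) to the set of suspect bots seen on that path in that second."""
    seen = defaultdict(set)
    for line in lines:
        match = LOG_LINE.match(line)
        if not match:
            continue  # skip lines that do not fit the assumed format
        name = bot_name(match.group("ua"))
        if name:
            seen[(match.group("ts"), match.group("path"))].add(name)
    # Keep only the seconds where at least min_bots distinct bots hit the same path.
    return {key: bots for key, bots in seen.items() if len(bots) >= min_bots}
```

Passing an open log file to `simultaneous_hits` returns only the (timestamp, path) pairs where two or more of the listed bots overlapped; raising `min_bots` to 3 narrows it to the 3-4 bot collisions described above. The timestamp is compared as logged, so the grouping is per second.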
