Obeying the robots.txt file #1989

@TechnologyClassroom

Description

I recently found this project while reading server logs. Someone is scraping one of the sites I help administer, apparently using AHC/2.1, and they are not obeying the robots.txt file. There should be several seconds of delay between requests, but the client appears to be making about 1 request per second. Is this normal behavior for AHC, or is this a user misconfiguration of some kind? If it is normal, could support for robots.txt Crawl-delay values be added by default?
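For context, here is a minimal sketch of what caller-side Crawl-delay handling could look like, assuming the HTTP client itself does not read robots.txt. The `parseCrawlDelay` helper below is hypothetical, not part of the AHC API; a crawler would fetch the site's robots.txt once, parse the directive, and sleep that long between requests:

```java
import java.time.Duration;

public class CrawlDelay {

    // Parse the first Crawl-delay directive (in seconds) from robots.txt text.
    // Returns the fallback when no valid directive is present.
    static Duration parseCrawlDelay(String robotsTxt, Duration fallback) {
        for (String line : robotsTxt.split("\\R")) {
            String[] parts = line.split(":", 2);
            if (parts.length == 2
                    && parts[0].trim().equalsIgnoreCase("crawl-delay")) {
                try {
                    double seconds = Double.parseDouble(parts[1].trim());
                    return Duration.ofMillis((long) (seconds * 1000));
                } catch (NumberFormatException ignored) {
                    // Malformed value: fall through and keep scanning.
                }
            }
        }
        return fallback;
    }

    public static void main(String[] args) throws InterruptedException {
        String robots = "User-agent: *\nCrawl-delay: 5\nDisallow: /private/";
        Duration delay = parseCrawlDelay(robots, Duration.ofSeconds(1));
        // A polite crawl loop would then do:
        //   makeRequest(url);               // hypothetical request call
        //   Thread.sleep(delay.toMillis()); // honor the Crawl-delay
        System.out.println(delay.getSeconds());
    }
}
```

Note that Crawl-delay is a de facto convention, not part of the robots.txt standard (RFC 9309), so even crawlers that parse robots.txt may ignore it unless the operator opts in.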
