Skip to content

Web: Check a site can be indexed before completing submission process #14

@m-i-l

Description

@m-i-l

A surprising number of sites (including verified sites) have been submitted with a robots.txt containing:

User-agent: *
Disallow: /

Which means searchmysite doesn't index them. It would be good to check robots.txt at the point of Quick Add or Verified Add so feedback can be given immediately if it isn't possible to index the site.

Note also that there are cases where robots.txt initially allows indexing, but is subsequently changed - see #11 for details of the handling of these.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions