Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve bot detection regex #21

Merged
merged 7 commits into from
Sep 24, 2012
Merged

Commits on Aug 5, 2012

  1. Improve bot detection regex

    This commit improves the bot detection regex by:
    
      - adding a number of missing strings, such as alexa, facebookexternalhit, feedburner, nagios, postrank, pingdom, slurp, and yahoo!
      - removing redundant strings such as googlebot and robot (redundant because of the inclusion of a general "bot" string)
      - consolidate similar strings ("crawler" and "crawling" became "crawl(er|ing)")
      - alphabetize the list of bots
    lencioni committed Aug 5, 2012
    Configuration menu
    Copy the full SHA
    fb9a792 View commit details
    Browse the repository at this point in the history
  2. Add "google web preview" to bot detection regex

    To generate previews on the fly, Google uses the user-agent "Google Web Preview" (the fully-qualified user-agent you see in your server logs may change from time to time) to render images on demand. This commit adds "google web preview" to the bot detection regex.
    lencioni committed Aug 5, 2012
    Configuration menu
    Copy the full SHA
    a507269 View commit details
    Browse the repository at this point in the history

Commits on Aug 6, 2012

  1. Add yandex to bot-detecting regex

    This commit adds "yandex" to the list of bots, to detect the popular Russian search engine.
    lencioni committed Aug 6, 2012
    Configuration menu
    Copy the full SHA
    1f426c5 View commit details
    Browse the repository at this point in the history
  2. Improve bot detection regex

    This commit improves the bot detection regex by:
    
      - adding a number of missing strings, such as alexa,
        facebookexternalhit, feedburner, nagios, postrank, pingdom, slurp,
        and yahoo!
      - removing redundant strings such as googlebot and robot (redundant
        because of the inclusion of a general "bot" string)   - consolidate
        similar strings ("crawler" and "crawling" became "crawl(er|ing)")
      - alphabetize the list of bots
    lencioni committed Aug 6, 2012
    Configuration menu
    Copy the full SHA
    76dc372 View commit details
    Browse the repository at this point in the history
  3. Add "google web preview" to bot detection regex

    To generate previews on the fly, Google uses the user-agent "Google Web
    Preview" (the fully-qualified user-agent you see in your server logs may
    change from time to time) to render images on demand. This commit adds
    "google web preview" to the bot detection regex.
    lencioni committed Aug 6, 2012
    Configuration menu
    Copy the full SHA
    c65c2c2 View commit details
    Browse the repository at this point in the history
  4. Add yandex to bot-detecting regex

    This commit adds "yandex" to the list of bots, to detect the popular
    Russian search engine.
    lencioni committed Aug 6, 2012
    Configuration menu
    Copy the full SHA
    34318e8 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    97790f7 View commit details
    Browse the repository at this point in the history