COUNTER_Robots_list.json: Escape literal dots #35

Merged
merged 1 commit into from
Jul 22, 2020

Conversation

alanorth

Some patterns are incorrect because they are meant to match a literal dot, but an unescaped dot in a regex matches any character, so we should technically be escaping these. The dot in the virus.detector pattern seems to be intentional, as this user agent sometimes appears as virus-detector and other times as virus_detector from what I can see in other block lists (I have not seen this bot myself). Note that we need to escape the backslash itself in the JSON (writing \\.) so we get a literal backslash for escaping the dots in the output file.

Also I've taken the liberty to add URLs and descriptions where applicable.
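
The double escaping described above can be sketched as follows. This is a minimal illustration that assumes Python's standard json and re modules; the "example\\.bot" entry is a hypothetical pattern made up for the demonstration, not one taken from the actual list.

```python
import json
import re

# Hypothetical entries in the style of COUNTER_Robots_list.json.
# "example\\.bot" is an illustrative pattern, not one from the real list.
raw_json = r'[{"pattern": "example\\.bot"}, {"pattern": "virus.detector"}]'

entries = json.loads(raw_json)
escaped, wildcard = (entry["pattern"] for entry in entries)

# JSON decoding turns the doubled backslash into a single one,
# which is what the regex engine needs to treat the dot literally.
print(escaped)

# The escaped pattern matches only a real dot.
print(bool(re.search(escaped, "example.bot/1.0")))
print(bool(re.search(escaped, "exampleXbot")))

# Without the escape, the dot matches any character.
print(bool(re.search("example.bot", "exampleXbot")))

# The intentionally unescaped dot covers both known spellings.
print(bool(re.search(wildcard, "virus-detector")))
print(bool(re.search(wildcard, "virus_detector")))
```

A single backslash before the dot in the JSON source would not work, since \. is not a valid JSON escape sequence and a strict parser rejects it.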

Some patterns are incorrect because they mean to use a literal dot,
but we should technically be escaping these to use literal dots in
regex. Note that we need to escape the first backslash in the JSON
so we can get a literal backslash for escaping the dots in the
output file. The dot in the "virus.detector" pattern seems to be
intentional, as this user agent sometimes appears as virus-detector
and other times as virus_detector.

Also I've taken the liberty to add URLs and descriptions where
applicable.
@alanorth

Rebased against latest master branch.

alanorth added a commit to ilri/DSpace that referenced this pull request Jul 20, 2020
These are waiting to be merged into the COUNTER-Robots project:

  - New bots: https://github.com/atmire/COUNTER-Robots/pulls
  - Escape literal dots: atmire/COUNTER-Robots#35
  - Remove okhttp anchors: atmire/COUNTER-Robots#33
@davidatmire davidatmire merged commit c6efb58 into atmire:master Jul 22, 2020
@alanorth alanorth deleted the regex-dot-escaping branch July 22, 2020 07:30