Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added TweetedTimes bot #30

Merged
merged 3 commits into from
Nov 3, 2015
Merged

Conversation

romaricdrigon
Copy link
Contributor

Hi,

Here is a new bot I saw.

It is a best practice for "good crawlers" to have as User-Agent such as Mozilla/5.0 (compatible; +http://tweetedtimes.com). No legitimate user will ever have an URL in its UA - maybe we could add http: to the generic regex?

@romaricdrigon
Copy link
Contributor Author

Also added Python-httplib2

@JayBizzle
Copy link
Owner

Good stuff. Will merge later 👍

Regarding the http: regex. It's a good idea, but I want to do some research into it first. Really don't want any false positives.

@JayBizzle
Copy link
Owner

Made an issue for the idea #31

JayBizzle added a commit that referenced this pull request Nov 3, 2015
@JayBizzle JayBizzle merged commit d21da12 into JayBizzle:master Nov 3, 2015
@romaricdrigon romaricdrigon deleted the feat-bots2 branch November 3, 2015 20:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants