bing-bot not filtered out #958

Closed
anonymous-piwik-user opened this Issue · 3 comments

Anonymous Piwik user

As described in [http://forum.piwik.org/index.php?showtopic=1451&hl=], bing-bot is being recognized as a normal user or as a link from an external page.

For example, my log contains these entries:

access.log:65.55.110.25 - - +0000 "GET /piwik/piwik.php?idsite=1&url=http%3A%2F%2Fwww.xxx%2F&res=800x600&h=15&m=52&s=41&cookie=1&urlref=http%3A%2F%2Fwww.bing.com%2Fsearch%3Fq%3Denertrag&rand=0.8916126511009081&pdf=0&qt=0&realp=0&wma=0&dir=0&fla=0&java=0&gears=0&ag=0&action_name=XXX HTTP/1.1" 200 43 "http://www.xxx" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.2; SLCC1; .NET CLR 1.1.4322; .NET CLR 2.0.50727; .NET CLR 3.0.30729)"
access.log:65.55.109.173 - - +0000 "GET /piwik/piwik.php?idsite=1&url=http%3A%2F%2Fwww.xxx2F&res=800x600&h=17&m=23&s=43&cookie=1&urlref=http%3A%2F%2Fwww.bing.com%2Fsearch%3Fq%3Dwebseite&rand=0.2069609280545549&pdf=0&qt=0&realp=0&wma=0&dir=0&fla=0&java=0&gears=0&ag=0&action_name=XXX HTTP/1.1" 200 43 "http://www.xxx" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.2; SLCC1; .NET CLR 1.1.4322; .NET CLR 2.0.40607; .NET CLR 3.0.04506.648)"
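To make the entries above easier to read, here is a minimal Python sketch that decodes the tracking request and pulls out the `urlref` parameter. The request line is abridged from the first log entry, and the helper name `tracking_params` is hypothetical; it is not part of Piwik.

```python
from urllib.parse import urlsplit, parse_qs


def tracking_params(request_line: str) -> dict:
    """Return the decoded query parameters of a 'GET <target> HTTP/1.1' line."""
    _method, target, _protocol = request_line.split(" ", 2)
    query = urlsplit(target).query
    # parse_qs percent-decodes values and returns lists; keep the first value.
    return {key: values[0] for key, values in parse_qs(query).items()}


# Abridged from the first access.log entry above.
request = (
    "GET /piwik/piwik.php?idsite=1&url=http%3A%2F%2Fwww.xxx%2F"
    "&urlref=http%3A%2F%2Fwww.bing.com%2Fsearch%3Fq%3Denertrag HTTP/1.1"
)

params = tracking_params(request)
referrer = urlsplit(params["urlref"])
keyword = parse_qs(referrer.query)["q"][0]

print(referrer.hostname, keyword)  # prints "www.bing.com enertrag"
```

The decoded referrer is an ordinary Bing search URL, which is why the hit is counted as a search-engine visit even though the user agent and source IP suggest a crawler.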

Anthon Pang
Collaborator

handleNewVisit() looks OK.

From what I can piece together, it appears that the Bing bot may have cookies enabled. So, if Piwik logged the bot before the software update, subsequent visits may not be treated as new visits (and thus are not caught by the filter). Since cookies expire after 30 days (the Piwik default in global.ini.php), this should correct itself.
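The hypothesis can be illustrated with a simplified model, written in Python for brevity. It is not Piwik's actual code: the function `record_hit`, the cookie set, and the `filter_enabled` flag are all assumptions made only to show why a filter that runs on new visits would miss a bot that already holds a cookie.

```python
def record_hit(known_cookies: set, cookie_id: str, is_bot: bool,
               filter_enabled: bool) -> str:
    """Toy model: the bot filter is consulted only when a new visit starts."""
    if cookie_id in known_cookies:
        # Returning visitor: the new-visit path, and the filter, are skipped.
        return "existing visit"
    if filter_enabled and is_bot:
        return "filtered"
    known_cookies.add(cookie_id)
    return "new visit"


cookies = set()

# Before the update there is no filter, so the bot receives a cookie.
before = record_hit(cookies, "bot-cookie", is_bot=True, filter_enabled=False)

# After the update the same cookie bypasses the new-visit check.
after = record_hit(cookies, "bot-cookie", is_bot=True, filter_enabled=True)

# A bot without a stored cookie is caught as expected.
fresh = record_hit(cookies, "other-bot", is_bot=True, filter_enabled=True)

print(before, "|", after, "|", fresh)  # prints "new visit | existing visit | filtered"
```

Under this model the stale entries disappear once the stored cookie expires, which matches the expectation that the problem corrects itself.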

However, it looks like Microsoft is now further cloaking the bot by removing the Referer field from the HTTP request. I suggest we add the Microsoft IP to #43 and then remove the hardcoded IP from Visit.php.
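As a rough sketch of filtering by IP through a configurable list rather than a hardcoded address, the Python below checks a visitor IP against a set of excluded networks. The names `EXCLUDED_NETWORKS` and `is_excluded` are hypothetical, and the range shown is a placeholder chosen only because it covers the two addresses in the log excerpt; it is not a verified list of Microsoft crawler ranges.

```python
import ipaddress

# Placeholder range (assumption): covers the two logged addresses only.
EXCLUDED_NETWORKS = [ipaddress.ip_network("65.55.0.0/16")]


def is_excluded(ip: str, networks=EXCLUDED_NETWORKS) -> bool:
    """Return True if the address falls inside any excluded network."""
    address = ipaddress.ip_address(ip)
    return any(address in network for network in networks)


for ip in ("65.55.110.25", "65.55.109.173", "192.0.2.1"):
    print(ip, is_excluded(ip))
```

Keeping the ranges in configuration means that new crawler addresses can be added without touching the visit-handling code.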

Reference: http://www.bing.com/community/forums/t/648805.aspx

Anthon Pang
Collaborator

Re-opening since we have to deal with googlebot.

Anthon Pang
Collaborator

In [1470], fixes #918 and #958 - Filter out Googlebot and Bing bot

Anonymous Piwik user anonymous-piwik-user added this to the Piwik 0.4.4 milestone
This issue was closed.