-
Notifications
You must be signed in to change notification settings - Fork 2
raimonbosch/accesslog.hadoop
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Extract nice statistics from you accesslog. The program analyzes an accesslog with hadoop and extracts information for each visit: number of pages viewed, ip address, referer, url accessed and if the user comes from google it adds more information like google search and google rank. With this information you can easily generate statistics for your keywords and optimize your seo if you merge this visit data with each keyword. You can calculate bounce rate per keyword, average page views, etc... The output generated is in JSON format. You can run this program either using 'hadoop -jar' or directly with java in local mode. TODOs: - Include useful information about bot visits. - Include information about visits from other search engines like Yahoo, Yandex, AOL See more at http://raimonb.wordpress.com/2012/02/26/going-further-than-google-analytics-with-hadoop-and-your-access-log/
About
Get nice statistics analyzing your accesslog with hadoop
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published