Skip to content
Set of Hadoop and Storm based tools for web analytic
Java Python Ruby Shell
Latest commit 7a9455d Mar 27, 2016 Pranab Ghosh config param key prefix
Failed to load latest commit information.
src/main/java/org/visitante config param key prefix Mar 27, 2016
.gitignore python script ti generated visit depth analysis logs Sep 14, 2014 README update Feb 24, 2016 initial commit Mar 22, 2012
pom.xml transaction freq and recency fixes Feb 12, 2016


The goal of visitante is to calculate various web analytic metric as defined by Avinash Kaushik ( on the Hadoop and Storm platform. However, it has evolved into a general purpose log analytic and mining solution, beyond web server logs.

It also includes customer or marketing analytic solution. Since customer behavior data is mostly captured in logs, there is a close relationship between customer analytics and log analytics.


  • Simple and easy to use batch and real time web analytic
  • Highly configurable


The following blogs of mine are good source of details of visitante


  • Hadoop based batch analytic for

    • Num of pages visited
    • Total time spent
    • Last page visited
    • Flow status (e.g., whether checkout flow was entered, entered but not completed or completed)
    • Incident detection
    • Pattern based event detection with context
    • Customer life time value
  • Storm based real time analytic for

    • Bounce rate
    • Visit depth distribution
Something went wrong with that request. Please try again.