Set of Hadoop and Storm based tools for web analytic
Java Python Ruby Shell
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
resource
script
src/main/java/org/visitante
.gitignore python script ti generated visit depth analysis logs Sep 15, 2014
README.md
manifest.mf
pom.xml

README.md

Introduction

The goal of visitante is to calculate various web analytic metric as defined by Avinash Kaushik (http://www.kaushik.net/avinash/) on the Hadoop and Storm platform. However, it has evolved into a general purpose log analytic and mining solution, beyond web server logs.

It also includes customer or marketing analytic solution. Since customer behavior data is mostly captured in logs, there is a close relationship between customer analytics and log analytics.

Philosophy

  • Simple and easy to use batch and real time web analytic
  • Highly configurable

Blogs

The following blogs of mine are good source of details of visitante

Solutions

  • Hadoop based batch analytic for

    • Num of pages visited
    • Total time spent
    • Last page visited
    • Flow status (e.g., whether checkout flow was entered, entered but not completed or completed)
    • Incident detection
    • Pattern based event detection with context
    • Customer life time value
  • Storm based real time analytic for

    • Bounce rate
    • Visit depth distribution