Basic http web log analysis using apache spark and scala

##How to use?

Example

* Example command to run:
 spark-submit
   --class "com.cloudwick.spark.loganalysis.HitsPerHour"
   --master local[4]
    target/scala-2.10/scala-2.10/myspark_2.10-1.0.jar
    /Users/arun/mylogpath/mock_apache_pool-1-thread-1.data

Classes and details

Package

com.cloudwick.spark.loganalysis

Class	Description
HitsPerHour	The HitsPerHour finds the hits happend in a hourly basis
HitsPerUrl	The HitsPerUrl gives the number of hits per URL
LogSizeAggregator	The LogSizeAggregator takes in an apache access log file and computes min, max and avg of content size of the log.
StatusCounter	The StatusCounter aggragate the log messages based on the status code
MsgSizeVsHits	The MsgSizeVsHits calculate the message size and aggregate according to that
TopEndpoints	The TopEndpoints return the top 10 end points.
TopIpaddresses	The TopIpaddresses return top 10 IP Addresses.
ApacheAccessLog	Log Parser

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
ApacheAccessLog.scala		ApacheAccessLog.scala
HitsPerHour.scala		HitsPerHour.scala
HitsPerUrl.scala		HitsPerUrl.scala
LogSizeAggregator.scala		LogSizeAggregator.scala
MsgSizeVsHits.scala		MsgSizeVsHits.scala
OrderingUtils.scala		OrderingUtils.scala
README.md		README.md
StatusCounter.scala		StatusCounter.scala
TopEndpoints.scala		TopEndpoints.scala
TopIpaddresses.scala		TopIpaddresses.scala

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Basic http web log analysis using apache spark and scala

Classes and details

About

Releases

Packages

Languages

SAI2K16/log-analysis-spark

Folders and files

Latest commit

History

Repository files navigation

Basic http web log analysis using apache spark and scala

Classes and details

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages