Log analysis of the NASA 1995 web server log data using Apache Spark (via PySpark) and Hadoop HDFS on a single-node cluster.
The documentation is in the ipynb notebook file, alongside the corresponding code cells.
Use nbviewer from @jupyter to view the ipynb file: https://nbviewer.jupyter.org/github/vivekshah1801/Apache-Spark-Web-Log-Analysis/blob/master/Web%20Log%20Analysis%20-%20Spark.ipynb
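
To give a rough idea of the kind of parsing step a web log analysis like this usually starts with, here is a minimal sketch in plain Python. It assumes the log lines follow the Common Log Format. The names `LOG_PATTERN` and `parse_log_line` and the sample line are illustrative assumptions, not taken from the notebook, which may parse the data differently.

```python
import re

# Assumed layout (Common Log Format):
# host ident user [timestamp] "method path protocol" status size
LOG_PATTERN = re.compile(
    r'^(\S+) (\S+) (\S+) \[([^\]]+)\] "(\S+) (\S+)\s*(\S*)" (\d{3}) (\S+)$'
)


def parse_log_line(line):
    """Return a dict of fields for one log line, or None if it does not match."""
    match = LOG_PATTERN.match(line.strip())
    if match is None:
        return None
    host, _ident, _user, timestamp, method, path, protocol, status, size = match.groups()
    return {
        "host": host,
        "timestamp": timestamp,
        "method": method,
        "path": path,
        "protocol": protocol,
        "status": int(status),
        # The size field is "-" when no bytes were sent; treat that as 0.
        "size": int(size) if size.isdigit() else 0,
    }


# Illustrative sample line in Common Log Format (an assumption for this sketch).
sample = '199.72.81.55 - - [01/Jul/1995:00:00:01 -0400] "GET /history/apollo/ HTTP/1.0" 200 6245'
record = parse_log_line(sample)
print(record)
```

In a PySpark job, a function like this could be applied to the lines of an RDD created with `sc.textFile(...)` using `map`, followed by a `filter` that drops the `None` results.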