Skip to content

ashokjain001/LogAnalysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Log Analysis Project

This project is part of Udacity's Full stack nanodegree program. The objective of this project is to create a reporting tool using Python DB-API that prints out reports based on data in the database. The 'news' database contains newspaper articles, authors, as well as the web server log for the site.

Reporting tool developed in this project will aim to answer these 3 questions:

  1. What are the most popular three articles of all time?
  2. Who are the most popular article authors of all time?
  3. On which days did more than 1% of requests lead to errors?

some of the skills learnt and demonstrated in this project:

  • Joining 2 or more Database tables.
  • Using basic select, where statements.
  • Utilizing aggregations functions to answer metric questions.
  • Writing code with Python DB-API.
  • Effectively using common table expressions.

Requirements

Virtual Machine and Vagrant are required to run the reporting scripts, download and Install Vagrant and VM.

Download and setup

After installation navigate to vagrant subdirectory by running command cd vagrant in your terminal and download required project files by cloning git repository mentioned below and contents will be shared with your virtual machine.

git clone https://github.com/ashokjain001/LogAnalysis.git

Unzipping newsdata.zip will create newsdata.sql file.

If you need to bring the virtual machine online, do so by running the following command in terminal,

vagrant up

Then log into it with,

vagrant ssh

Once logged into VM, cd into the vagrant/LogAnalysis and run the command below to load the data,

psql -d news -f newsdata.sql

Execution

Run this code in vagrant terminal,

python logAnalysis.py

This will run the python script and output 3 reports one after the other. You can access the output file logAnalysisOutput.txt for the output without running the code.

About

A reporting tool using Postgres & Python DB-API.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages