Implementing batch processing techniques using Hadoop on a unique dataset
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
stage_1
stage_2
stage_3
README.md

README.md

data_processing_hadoop

Batch processing was carried out on a unique dataset

  • Stage 1 - Setting up Hadoop clusters locally and carrying out preliminary analysis on the dataset.
  • Stage 2 - Extensive analysis on the dataset.
  • Stage 3 - Visualizing results using Tableau and reporting them.