Skip to content

YFC-ophey/big-data-group-project

Repository files navigation

Big Data Group Project

US superstore opening analysis

As a team of 5, we uploaded an eCommerce CSV file to HDFS and utilized Apache Spark in Zeppelin to establish a connection to the data. Once the data was successfully saved to the memory, we transformed it into a Spark dataframe and used SparkSQL to find out the profitable products and exported the results on Tableau for data visualization.

Zeppelin Notebook

Zeppelin-notebook-us-superstore-new-opening-analysis.pdf

Business Presenetation

Business-presentation-US-SuperStore.pdf