Canestin/big-data-project

Big Data Project

This project processes and analyzes large datasets using Big Data technologies.

Introduction

The goal of this project is to demonstrate the use of Big Data tools and techniques for handling and analyzing large datasets. It covers data processing, transformation, and analysis tasks on a large volume of data.

Technologies Used

  • Apache Spark: A distributed computing framework for processing large datasets.
  • Python: Programming language used for data manipulation and analysis.
  • Pandas: Python library for in-memory data manipulation and analysis.
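
As a rough illustration of the processing, transformation, and analysis steps mentioned in the introduction, here is a minimal sketch using Pandas on a tiny in-memory table. The column names (`category`, `amount`), the sample values, and the `summarize` helper are all hypothetical and are not taken from the project's code; the real dataset and pipeline may look quite different.

```python
import pandas as pd

# Hypothetical sample data; the project's real dataset and schema are not shown here.
raw = pd.DataFrame(
    {
        "category": ["a", "b", "a", "b", None],
        "amount": [10.0, 20.0, 30.0, None, 5.0],
    }
)


def summarize(df: pd.DataFrame) -> pd.DataFrame:
    """Clean, transform, and aggregate a small table (illustrative only)."""
    # Processing: drop rows where either column is missing.
    cleaned = df.dropna(subset=["category", "amount"])
    # Transformation: derive a new column from an existing one.
    transformed = cleaned.assign(amount_cents=cleaned["amount"] * 100)
    # Analysis: total per category.
    return (
        transformed.groupby("category", as_index=False)["amount_cents"]
        .sum()
        .rename(columns={"amount_cents": "total_cents"})
    )


summary = summarize(raw)
print(summary)
```

For data too large to fit in memory on one machine, the same three steps could presumably be expressed with Spark's DataFrame API instead, which distributes the work across a cluster.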