The aim of this project was apply classical data mining algorithms to Big Data using Hadoop (Spark).
The dataset exploit to do so was US Air Pollution 2000-2016, downloaded from Kaggle.
This repository has been archived by the owner on Dec 19, 2022. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 0
MartinaSus/Distributed-Data-Analysis-and-Mining
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
The aim of this project was apply classical data mining algorithms to Big Data using Hadoop (Spark).
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published