This is a repository for Spark related projects.
- Notebook 1:
- Word Count on Moby Dick text
- MOst Common words on Moby Dick text
- Data analysis on US and NY weather data
- Notebook 2
- read parquet file from hdfs
- Documentation 1: Setting Up a Single-Node Hadoop Cluster on Ubuntu and Do some stuff on it