Pyspark Notebook With Docker
Updated Aug 18, 2015 - Python
Store 6M+ CSV records in a containerized MongoDB sharded cluster and run queries
Final project for the IoT: Big Data Processing and Analytics class. Analyzes U.S. nationwide temperature data from IoT sensors in real time
Toying around with data about patients with diabetes and their readmission rates to the hospital.
A series of workshops held by the Cartography and Geovisualization Group @ Oregon State University
Example code for processing the Million Song Dataset and other big music datasets
Coded in Python. This allows the user to find which restaurants offer delicious food and are highly rated by customers. The dataset is based on a survey of people who visited the restaurants and gave their opinions about them.
Implementation of Recommender Systems (RS) using Apache Spark MLlib on the MovieLens dataset