A Docker container with a full Hadoop cluster setup with Spark and Zeppelin
-
Updated
Feb 2, 2020 - Shell
A Docker container with a full Hadoop cluster setup with Spark and Zeppelin
Snorkel - Bootstrap your Data Science
Hadoop3.2 single/cluster mode with web terminal gotty, spark, jupyter pyspark, hive, eco etc.
Generates a Vagratn box using Packer to have Java and Zeppelin installed in the correct version to use with AWS Glue development endpoint as the tutorial needs: https://docs.aws.amazon.com/glue/latest/dg/dev-endpoint-tutorial-local-notebook.html
Reprodicing Census SIPP Reports Using Apache Spark
clusterdock + zeppelin
Quick Jupyter and Zeppelin Setup
Linux shell scripts to quickly get up and running with Apache Zeppelin
A full data science environment for your laptop in a few commands and clicks.
Add a description, image, and links to the zeppelin-notebook topic page so that developers can more easily learn about it.
To associate your repository with the zeppelin-notebook topic, visit your repo's landing page and select "manage topics."