A container image of a Jupyter Notebook development environment with Anaconda3, Python 2.7, and some other runtimes and packages.
Updated Feb 5, 2021 - Python
Apache Spark is an open-source, distributed, general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
Connecting Solr and Spark In An Apache Zeppelin Notebook
Notebooks and examples for the DeepVariant on Spark project.
Template CI-friendly local development environment featuring Spark clusters, blob storage, and a notebook for prototyping data feature delivery.
PySpark and Spark [my notes and all practice notebooks]
Examples of Databricks notebooks
This notebook contains detailed code for Spark, machine learning, and Databricks
Machine learning using Python and Spark
PySpark notebooks
Big Data Management related Zeppelin notebooks
🐍 📊 🎓 Demo notebooks showing Jupyter Notebook capabilities for teaching and learning programming and for research
Repository containing the entire data engineering project carried out on Databricks, connecting to Redshift on AWS
This is a study project. I get analytics/ML examples from Kaggle and use different technologies to re-implement them.
systemctl for Spark and Jupyter Notebook
Created by Matei Zaharia
Released May 26, 2014