Apache Spark

Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Apache Spark

Here are 177 public repositories matching this topic...

mooyxu / notebook

ciwin / LearningSpark-in-Python

BrooksIan / SolrToSparkNotebook

OrangeGenomix / spark-deepvariant-examples

caldempsey / docker-notebook-spark-s3

Animeshsinghiit / Spark-and-Pyspark

data-engineering-helpers / databricks-examples

JonathanPollyn / Spark

AndreaRettaroli / simulated-transactions-big-data

jinudaniel / pyspark-notebook

alexdyysp / SparkScala

riyadparvez / pyspark-datascience

lix90 / Rnotes

AdamJeddy / Zeppelin-Notebook-Archive

zodiacfireworks / talk--jupyter-notebook

jonyroy / data-engineering-notebook

manilabay / spark-notebooks-apps

matheus-conrado / notebook_databricks_cloud_guardians

DenisOgr / kaggle-notebook-to-production

ac-gomes / systemctl_spark_jupyter-notebook

Related Topics