Databricks
- wherever there is data
- https://databricks.com
Pinned repositories
Repositories
-
koalas
Koalas: pandas API on Apache Spark
-
learning-spark
Example code from Learning Spark book
-
spark-deep-learning
Deep Learning Pipelines for Apache Spark
-
-
tech-talks
This repository contains the notebooks and presentations we use for our Databricks Tech Talks
-
spark-xml
XML data source for Spark SQL and DataFrames
-
-
databricks-cli
Command Line Interface for Databricks
-
containers
Sample base images for Databricks Container Services
-
reference-apps
Spark reference applications
-
tensorframes
[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark
-
LearningSparkV2
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
-
spark-redshift
Redshift data source for Apache Spark
-
pig-on-spark
proof-of-concept implementation of Pig-on-Spark integrated at the logical node level
-
spark-knowledgebase
Spark Knowledge Base
-
-
Spark-The-Definitive-Guide
Spark: The Definitive Guide's Code Repository
-
-
jetty.project
Forked from eclipse/jetty.projectEclipse Jetty® - Web Container & Clients - supports HTTP/2, HTTP/1.1, HTTP/1.0, websocket, servlets, and more
-
jarjar
Forked from shevek/jarjarJar Jar Links is a utility that makes it easy to repackage Java libraries and embed them into your own distribution.
-
containerregistry
Forked from google/containerregistryA set of Python libraries and tools for interacting with a Docker Registry.
-
rules_docker
Forked from bazelbuild/rules_dockerRules for building and handling Docker images with Bazel
-
-
subpar
Forked from google/subparSubpar is a utility for creating self-contained python executables. It is designed to work well with Bazel.
-
scala-style-guide
Databricks Scala Coding Style Guide