
Data-drone/db_dcs_toolkit


Databricks DCS Toolkit

A set of useful primitives for building more advanced Databricks Container Services (DCS) containers. See https://docs.databricks.com/clusters/custom-containers.html and https://github.com/databricks/containers

Docker Layer Architecture

  • Foundation: minimal_container conda_python Standard R
  • Rapids: Rapids Container
  • Monitoring: Ganglia

Basic instructions


make build-base

make build-rapids

make build-ganglia

make push-rapids
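
The targets above can also be run in sequence from one script. The following is a minimal sketch, assuming the targets are meant to run in the order listed and that a failed step should stop the run. The `run_targets` function and the `MAKE_CMD` variable are hypothetical helpers for illustration and are not part of this repository.

```shell
#!/usr/bin/env bash
set -euo pipefail

# MAKE_CMD is a hypothetical override (not part of the repo); setting
# MAKE_CMD=echo gives a dry run that only prints the target names.
MAKE_CMD="${MAKE_CMD:-make}"

# Run each given make target in order and stop at the first failure.
run_targets() {
  local target
  for target in "$@"; do
    echo "==> ${MAKE_CMD} ${target}"
    "${MAKE_CMD}" "${target}" || return 1
  done
}
```

After sourcing the script, a full build and push would be `run_targets build-base build-rapids build-ganglia push-rapids`.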

Breakdown of images

build-base builds the base image, adds conda, and installs the standard libraries needed for notebooks to work in Databricks. Note that the image may still be missing some Python or Spark packages that normally ship with a DBR release.
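
One way to spot missing Python packages is to try importing them with the image's interpreter. This is a rough sketch only: `check_modules` is a hypothetical helper, and which modules a given DBR release actually ships is not listed here, so the module names you pass in are your own assumption.

```shell
# Try to import each module with the given Python interpreter.
# Prints one status line per module; returns 1 if any import fails.
check_modules() {
  local py="$1"
  shift
  local missing=0
  local mod
  for mod in "$@"; do
    if "$py" -c "import ${mod}" >/dev/null 2>&1; then
      echo "ok      ${mod}"
    else
      echo "MISSING ${mod}"
      missing=1
    fi
  done
  return "$missing"
}
```

Run inside the container, a call such as `check_modules python pyspark pandas` (module names chosen for illustration) prints one line per module and returns non-zero if any import fails.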

build-rapids builds a functional RAPIDS (rapids.ai) Python container with the current (Feb 2021) RAPIDS release.

This has been lightly tested on DBR 10.3 and DBR 10.4 LTS.

Known Missing Features:

  • R: has issues, possibly related to the init script not being triggered
  • ssh
  • Repos feature in DB
  • Web Terminal

Some more libraries for deep learning etc. still need to be added back.

  • First check whether they are already included in the Rapids installs
