Skip to content

mmm/docker-cloudera-quickstart

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

47 Commits
 
 
 
 
 
 
 
 

Repository files navigation

A Docker Image for Cloudera Quickstart

Run the CDH Quick Start image in a docker container.

The latest CDH release is on master and pinned versions can be found on dedicated branches.

Install

This image is somewhat large, so you might want to do a separate

docker pull svds/cdh

to install.

Use

Spin it up with

docker run -td svds/cdh

get the container id from

docker ps

watch logs with

docker logs -f <container_id>

attach to the container with

docker exec -it <container_id> bash -l

then run any of the usual cdh commands

hadoop fs -ls /
impala-shell
hbase shell
spark-submit

etc.

Components

  • Apache Hadoop (Common, HDFS, MapReduce, YARN)
  • Apache HBase
  • Apache ZooKeeper
  • Apache Oozie
  • Apache Hive
  • Hue (Apache licensed)
  • Apache Flume
  • Cloudera Impala (Apache licensed)
  • Apache Sentry
  • Apache Sqoop
  • Cloudera Search (Apache licensed)
  • Apache Spark

Links

Pull the image on Docker Hub: https://registry.hub.docker.com/u/svds/cdh

Github page: https://github.com/silicon-valley-data-science/docker-cdh

Cloudera Documentation

Credits

This image started life as a modified fork of caioquirino/docker-cloudera-quickstart and is now curated by the Silicon Valley Data Science engineering team.

About

Docker Cloudera Quick Start Image

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Shell 100.0%