Skip to content
Bundle for analyzing data with Apache Hive and Apache Hadoop
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Type Name Latest commit message Commit time
Failed to load latest commit information.

Apache Analytics with SQL

This bundle is an 8 node cluster designed to scale out. Built around Apache Hadoop components and MySQL, it contains the following units:

  • 1 NameNode
  • 1 ResourceManager
  • 3 Slaves
  • 1 Hive
    • 1 Plugin (colocated on the Hive unit)
  • 1 MySQL


Deploy this bundle using juju-quickstart:

juju quickstart apache-analytics-sql

See juju quickstart --help for deployment options, including machine constraints and how to deploy a locally modified version of the apache-analytics-sql bundle.yaml.

Verify the deployment

The services provide extended status reporting to indicate when they are ready:

juju status --format=tabular

This is particularly useful when combined with watch to track the on-going progress of the deployment:

watch -n 0.5 juju status --format=tabular

The charm for each core component (namenode, resourcemanager, hive) also each provide a smoke-test action that can be used to verify that each component is functioning as expected. You can run them all and then watch the action status list:

juju action do namenode/0 smoke-test
juju action do resourcemanager/0 smoke-test
juju action do hive/0 smoke-test
watch -n 0.5 juju action status

Eventually, all of the actions should settle to status: completed. If any go instead to status: failed then it means that component is not working as expected. You can get more information about that component's smoke test:

juju action fetch <action-id>

Scale out

This bundle was designed to scale out. To increase the amount of slaves, you can add units to the slave service. To add one unit:

juju add-unit slave

Or you can add multiple units at once:

juju add-unit -n4 slave

Contact Information


You can’t perform that action at this time.