hoya: HBase on YARN

NAME

hoya - HBase on YARN

SYNOPSIS

Hoya enables HBase and Accumulo clusters do be dynamically created on a YARN-managed datacenter. The program can be used to create, pause, and shutdown Hoya clusters. It can also be used to list current clusters.

CONCEPTS

A Hoya cluster represents a short-lived or long-lived set of HBase servers; one HBase Master and one or more HBase Region Servers
A cluster is built by deploying an image across multiple nodes in a YARN cluster.
An image is a tar.gz file containing a supported version of HBase.
Images are kept in the HDFS filesystem and identified by their path names; filesystem permissions can be used to share images amongst users.
All clusters are private to the user account that created them; images may be shared
An image configuration is a directory that is overlaid file-by-file onto the conf/ directory inside the HBase image.
Users can have multiple image configurations -they too are kept in HDFS, identified by their path names, and can be shared by setting the appropriate permissions, along with a configuration template file.
Only those files provided in the image configuration directory overwrite the default values contained in the image; all other configuration files are retained.
Late-binding properties can also be provided to a cluster at create time.
Hoya will overwrite some of the HBase configuration properties to configure the dynamically created HBase cluster nodes to bind correctly to each other.
A cluster state directory is a directory created in HDFS describing the cluster; it records user-specified properties including the image and image configuration paths, overridden properties, and creation-time node requirements.
The cluster state directory also contains dynamically created information as to the location of HBase region servers -this is used to place the region servers close to their previous locations -ideally on the server used before, falling back to the same rack and then elsewhere in the same cluster.
A user can create a cluster using a named image.
A cluster can be frozen, saving its final state to its cluster state directory. All the HBase processes are shut down.
A frozen cluster can be thawed -a new set of HBase processes are started on or near the servers where the earlier processes were previously running.
A frozen cluster can be destroyed. simply by deleting the cluster state directory.
A frozen cluster can be reimaged. This can update the cluster's HBase version and its configuration. When the cluster is started, the changes are picked up.
Running clusters can be listed.
A cluster consists of a set of role instances;
The supported roles depends upon the provider behind Hoya: HBase only has worker and master
the number of instances of each role must be specified when a cluster is created.
The number of instances of each role can be varied dynamically.
Users can flex a cluster: adding or removing instances of specific roles dynamically. If the cluster is running, the changes will have immediate effect. If the cluster is stopped, the flexed cluster size will be picked up when the cluster is next started.

Invoking Hoya

hoya <ACTION> [<CLUSTER>] [<OPTIONS>]

COMMON COMMAND-LINE OPTIONS

`--conf configuration.xml`

Configure the Hoya client. This allows the filesystem, zookeeper instance and other properties to be picked up from the configuration file, rather than on the command line.

Important: this configuration file is not propagated to the HBase cluster configuration. It is purely for configuring the client itself.

`-D name=value`

Define a Hadoop configuration option which overrides any options in the configuration XML files of the image or in the image configuration directory. The values will be persisted. Configuration options are only passed to the cluster when creating or reconfiguring a cluster.

`-m, --manager url`

URL of the YARN resource manager

`--fs filesystem-uri`

Use the specific filesystem URI as an argument to the operation.

Actions

CLUSTER COMMANDS

`build cluster`

Build a cluster specification of the given name, with the specific options.

The cluster is not started; this can be done later with a thaw command.

`create cluster`

Build and run a cluster of the given name, using the specified image. If a configuration directory is specified, it's configuration files override those in the image.

The --wait parameter, if provided, specifies the time to wait until the YARN application is actually running. Even after the YARN application has started, there may be some delay for the HBase cluster to start up.

Arguments for `build` and `create`

`--option <name> <value>`

Set a cluster option. These are interpreted by specific cluster providers.

Example:

Set an option to be passed into the -site.xml file of the target system, reducing the HDFS replication factor to 2. (

--option site.dfs.blocksize 128m

Increase the number of YARN containers which must fail before the Hoya cluster itself fails.

-O hoya.container.failure.threshold 16

`--appconf dfspath`

A URI path to the configuration directory containing the template cluster specification. The path must be on a filesystem visible to all nodes in the YARN cluster.

Only one configuration directory can be specified.
The contents of the directory will only be read when the cluster is created/built.

Example:

--appconf hdfs://namenode/users/hoya/conf/hbase-template
--appconf file://users/accumulo/conf/template

`--apphome localpath`

A path to the home dir of a pre-installed application. If set when a Hoya cluster is created, the cluster will run with the binaries pre-installed on the nodes at this location

Important: this is a path in the local filesystem which must be present on all hosts in the cluster

Example

--apphome /usr/hadoop/hbase

`--image path`

The full path in Hadoop HDFS to a .tar or .tar.gz file containing the binaries needed to run the target application -HBase or Accumulo as appropriate.

Example

--image hdfs://namenode/shared/binaries/hbase-0.96.tar.gz

`--role <rolename> <count>`

The desired number of instances of a role.

Example

--role worker 16

`--roleopt <rolename> <option> <value>`

Set any role-specific option, such as its YARN memory requirements.

Example

--roleopt master yarn.memory 2048
--roleopt worker yarn.memory max

`--zkport port`

The port on which the zookeeper processes are listening.

Example

    --zkport 29181

`--zkhosts host1[,host2,host3, ...]`

The list of hosts on which the ZK quorum is running.

Example

--zkhosts zk1,zk2,zk3,zk4,zk5,zk6,zk7,zk8,zk8,zk10,zk11

`destroy \<cluster>`

Destroy a (stopped) cluster.

Important: This deletes all the database data as well as the cluster information

Example

hoya destroy cluster1

`exists \<cluster> [--live]`

Probe the existence of the named Hoya cluster. If the --live flag is set, the cluster must be running

If not, an error code is returned.

When the --live` flag is unset, the command looks for the cluster to be defined in the filesystem -its operation state is not checked.

Return codes

 0 : cluster is defined in the filesystem
70 : cluster is unknown

Example:

hoya exists cluster4

Live Tests

When the --live` flag is set, the cluster must be running for the command to succeed

The probe does not check the status of any Hoya-deployed services, merely that a cluster has been deployed
A cluster that is finished or failed is not considered to be live.

Return codes

 0 : cluster is running
-1 : cluster exists but is not running
70 : cluster is unknown

Example:

hoya exists cluster4 --live

`flex <cluster> [--role rolename count]*`

Flex the number of workers in a cluster to the new value. If greater than before -nodes will be added. If less, nodes will be removed from the cluster.

This operation has a return value of 0 if the size of a running cluster was changed.

It returns -1 if there is no running cluster, or the size of the flexed cluster matches that of the original -in which case the cluster state does not change.

Example

hoya flex cluster1 --role worker 8 --filesystem hdfs://host:port
hoya flex cluster1 --role master 2 --filesystem hdfs://host:port

`freeze <cluster> [--force] [--wait time] [--message text]`

freeze the cluster. The HBase cluster is scheduled to be destroyed. The cluster settings are retained in HDFS.

The --wait argument can specify a time in seconds to wait for the cluster to be frozen.

The --force flag causes the HoyaAM to be bypassed, and YARN asked directly to terminate the application. This will freeze a cluster that has hung or is otherwise not responding.

The --message argument supplies an optional text message to be used in the request: this will appear in the application's diagnostics in the YARN RM UI.

If an unknown (or already frozen) cluster is named, no error is returned.

Examples

hoya freeze cluster1 --wait 30
hoya freeze cluster2 --force --message "maintenance session"

`getconf <cluster> [--out file] [--format xml|properties]`

Get the configuration properties needed for hbase clients to connect to the cluster. Hadoop XML format files (the default) and Java properties files can be generated. The output can be streamed to the console in stdout, or it can be saved to a file via the --out parameter

`killcontainer <cluster> --id container-id`

Kill a container in the cluster. This is useful primarily for testing the cluster's resilience to failures.

Container IDs can be determined from the cluster status JSON document.

`list <cluster>`

List running Hoya clusters visible to the user.

If a cluster name is given and there is no running cluster with that name, an error is returned.

Example

hoya list
hoya list cluster1

`status <cluster> [--out <filename>]`

Get the status of the named Hoya cluster in JSON format. A filename can be used to specify the destination.

Examples:

hoya status cluster1 --manager host:port

hoya status cluster2 --manager host:port --out status.json

`thaw <cluster> [--wait time`]

Resume a frozen cluster: recreate the cluster from its previous state. This will include a best-effort attempt to create the same number of nodes as before, though their locations may be different. The same zookeeper bindings as before will be used.

Examples:

hoya thaw cluster2
hoya thaw cluster1 --wait 60

If a cluster is already running, this is a no-op

`version`

The command hoya version prints out information about the compiled Hoya application, the version of Hadoop against which it was built -and the version of Hadoop that is currently on its classpath.

Note that this is the client-side Hadoop version, not that running on the server, though that can be obtained in the cluster status operation

Example

> hadoop version

2013-12-10 14:28:17,624 [JUnit] INFO  client.HoyaClient - Hoya Core-0.7.1-SNAPSHOT Built against 1dd69 on Java 1.7.0_45 by stevel
2013-12-10 14:28:17,624 [JUnit] INFO  client.HoyaClient - Compiled against Hadoop 2.2.0
2013-12-10 14:28:17,625 [JUnit] INFO  client.HoyaClient - Hadoop runtime version branch-2.2.0 with source checksum 79e53ce7994d1628b240f09af91e1af4 and build date 2013-10-07T06:28Z

`emergency-force-kill <applicationID>`

This attempts to force kill any YARN application referenced by application ID. There is no attempt to notify the running AM.

If the application ID is all then all hoya instances belonging to the current user are killed.

These are clearly abnormal operations; they are here primarily for testing -and documented for completeness.

Example

hoya emergency-force-kill application_1386596138212_0001

`am-suicide <cluster> [--exitcode code] [--message message] [--wait time]`

This operation is purely for testing Hoya Application Master restart; it triggers an asynchronous self-destruct operation in the AM -an operation that does not make any attempt to cleanly shut down the process.

If the application has not exceeded its restart limit (as set by hoya.yarn.restart.limit), YARN will attempt to restart the failed application.

Example

hoya am-suicide --exitcode 1 --wait 5000 -message "test"

Cluster Naming

Cluster names must:

be at least one character long
begin with a lower case letter
All other characters must be in the range [a-z,0-9,-, -]
All upper case characters are converted to lower case

Example valid names:

hoya1
hbase-cluster
hbase_cluster
accumulo_m1_tserve4

Cluster Options

Cluster options are intended to apply across a cluster, are set with the --option or -O arguments, and are saved in the options {} clause in the JSON specification.

HBase options

hbase.master.command: The single command to execute on the HBase master, in the command sequence: hbase master start

for example, if the parameter was

-O hbase.master.command version

Hoya would would run the HBase master with the command

hbase version start

This would not actually create the master -as stated, it is for testing purposes.

General

hoya.test

This notifies the application that this is a test run, and that the application should behave in a way to aid testing. Currently all this does is

Limit the number of attempts to start the AM to one.
Enable verbose output in the client-AM RPC

Role Options

Here are some role options that are intended to be common across roles, though it is up to the provider and role whether or not an option is used.

Important: Unknown options are ignored. If an option does not appear to work, check the spelling.
All values are strings; if an integer is required, it should be quoted

generic

role.name name of the role
role.instances number of instances desired
app.infoport: For applications that support a web port that can be externally configured, the web port to use. A value of "0" means that an arbitrary port should be picked.

YARN parameters

YARN parameters are interpreted by Hoya itself -so will always be read, validated and acted on.

yarn.app.retries: number of times to attempt to retry application execution.
yarn.memory: how much memory (in GB) to request
yarn.vcores: number of cores to request; how this is translated into physical core allocation is a YARN-specific (possibly scheduler-specific) feature.

JVM parameters

These should be interpreted by all providers that start a JVM in the specific role

jvm.heapsize: JVM heap size for Java applications in MB.
jvm.opts: JVM options other than heap size

Files

manpage.md

Latest commit

History

manpage.md

File metadata and controls

hoya: HBase on YARN

NAME

SYNOPSIS

CONCEPTS

Invoking Hoya

COMMON COMMAND-LINE OPTIONS

--conf configuration.xml

-D name=value

-m, --manager url

--fs filesystem-uri

Actions

build cluster

create cluster

Arguments for build and create

--option <name> <value>

--appconf dfspath

--apphome localpath

--image path

--role <rolename> <count>

--roleopt <rolename> <option> <value>

--zkport port

--zkhosts host1[,host2,host3, ...]

destroy \<cluster>

exists \<cluster> [--live]

Live Tests

flex <cluster> [--role rolename count]*

freeze <cluster> [--force] [--wait time] [--message text]

getconf <cluster> [--out file] [--format xml|properties]

killcontainer <cluster> --id container-id

list <cluster>

status <cluster> [--out <filename>]

thaw <cluster> [--wait time]

version

emergency-force-kill <applicationID>

am-suicide <cluster> [--exitcode code] [--message message] [--wait time]

Cluster Naming

Cluster Options

HBase options

General

Role Options

generic

YARN parameters

JVM parameters

`--conf configuration.xml`

`-D name=value`

`-m, --manager url`

`--fs filesystem-uri`

`build cluster`

`create cluster`

Arguments for `build` and `create`

`--option <name> <value>`

`--appconf dfspath`

`--apphome localpath`

`--image path`

`--role <rolename> <count>`

`--roleopt <rolename> <option> <value>`

`--zkport port`

`--zkhosts host1[,host2,host3, ...]`

`destroy \<cluster>`

`exists \<cluster> [--live]`

`flex <cluster> [--role rolename count]*`

`freeze <cluster> [--force] [--wait time] [--message text]`

`getconf <cluster> [--out file] [--format xml|properties]`

`killcontainer <cluster> --id container-id`

`list <cluster>`

`status <cluster> [--out <filename>]`

`thaw <cluster> [--wait time`]

`version`

`emergency-force-kill <applicationID>`

`am-suicide <cluster> [--exitcode code] [--message message] [--wait time]`