Skip to content

geodocker/geodocker-accumulo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

66 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GeoDocker Accumulo

Build Status Docker Repository on Quay Join the chat at https://gitter.im/geotrellis/geotrellis

Apache Accumulo container for GeoDocker Cluster

Roles

This container has three roles that can be supplied as CMD:

  • master - master instance, handles synchronization
    • --auto-init - initialize an accumulo instance if one is not found at hdfs://${HADOOP_MASTER_ADDRESS}/accumulo
  • tserver - tablet server, serves data
  • monitor - monitor instance, provides Web GUI at 50095
  • gc - garbage collector for accumulo tablets
  • tracer - trace for debugging

Configuration

Accumulo container requires HDFS as well as Accumulo configuration. This configuration will be generated from a template if you provide the required environment variables.

Additionally you have an option of volume mounting the configuration files to /etc/hadoop/conf to configure HDFS and /opt/accumulo/conf to configure Accumulo.

Environment

  • HADOOP_MASTER_ADDRESS - hostname for accumulo root, required for all roles
  • ZOOKEEPERS - list of zookeeper instance, at least one
  • ACCUMULO_SECRET - secret value for inter-instance communication (default: DEFAULT)
  • INSTANCE_NAME - accumulo instance name (default: accumulo)
  • ACCUMULO_PASSWORD - root password
  • INSTANCE_VOLUME - HDFS path to accumulo instance volume, generated if not provided
  • TSERVER_XMX - JVM -Xmx parameter value for Accumulo tserver (default: 3g)
  • MASTER_XMX - JVM -Xmx parameter value for Accumulo master (default: 2g)
  • MONITOR_XMX - JVM -Xmx parameter value for Accumulo monitor (default: 1g)
  • GC_XMX - JVM -Xmx parameter value for Accumulo garbage collector (default: 1g)
  • TSERVER_CACHE_DATA_SIZE - tserver.cache.data.size in accumulo-site.xml
  • TSERVER_CACHE_INDEX_SIZE - tserver.cache.index.size in accumulo-site.xml
  • TSERVER_MEMORY_MAPS_MAX - tserver.memory.maps.max in accumulo-site.xml

HADOOP_MASTER_ADDRESS need not be provided if core-site.xml is volume mounted in /etc/hadoop/conf or INSTANCE_VOLUME is provided.

Testing

This container should be tested with docker-compose and through make test