Manage Elasticsearch nodes with dedicated subcommands #830

Merged (22 commits) on Dec 4, 2019
9 changes: 9 additions & 0 deletions data-params-hot.json
@@ -0,0 +1,9 @@
{
Contributor: This file was added by accident?

Member Author: Indeed! :)

Member Author: removed in 947c110

  "additional_cluster_settings": {
    "node.master": false,
    "node.data": true,
    "node.ingest": false,
    "cluster.remote.connect": false,
    "node.attr.box_type": "hot"
  }
}
9 changes: 9 additions & 0 deletions data-params-warm.json
@@ -0,0 +1,9 @@
{
Contributor: This file was added by accident?

Member Author: removed in 947c110

  "additional_cluster_settings": {
    "node.master": false,
    "node.data": true,
    "node.ingest": false,
    "cluster.remote.connect": false,
    "node.attr.box_type": "warm"
  }
}
98 changes: 98 additions & 0 deletions docs/cluster_management.rst
@@ -0,0 +1,98 @@
Setting up a Cluster
====================

In this section we cover how to set up an Elasticsearch cluster with Rally. Using Rally for this is by no means required; you can also use existing tooling like Ansible to achieve the same goal. The main difference between such standard tools and Rally is that Rally is capable of setting up a :doc:`wide range of Elasticsearch versions </versions>`.

.. warning::

The following functionality is experimental. Expect the functionality and the command line interface to change significantly even in patch releases.

Overview
--------

You can use the following subcommands in Rally to manage Elasticsearch nodes:

* ``install`` to install a single Elasticsearch node
* ``start`` to start a previously installed Elasticsearch node
* ``stop`` to stop a running Elasticsearch node and remove the installation

Each command needs to be executed locally on the machine where the Elasticsearch node should run. To set up more complex clusters remotely, we recommend using a tool like Ansible to connect to the remote machines and issue these commands via Rally.

Getting Started: Benchmarking a Single Node
-------------------------------------------

In this section we will set up a single Elasticsearch node locally, run a benchmark and then clean up.

First we need to install Elasticsearch::

esrally install --quiet --distribution-version=7.4.2 --node-name="rally-node-0" --network-host="127.0.0.1" --http-port=39200 --master-nodes="rally-node-0" --seed-hosts="127.0.0.1:39300"

The parameter ``--network-host`` defines the network interface this node will bind to and ``--http-port`` defines which port is exposed for HTTP traffic. Rally automatically chooses the transport port as the HTTP port plus 100, i.e. 39300 here. The parameters ``--master-nodes`` and ``--seed-hosts`` are necessary for the discovery process. Please see the Elasticsearch documentation on `discovery <https://www.elastic.co/guide/en/elasticsearch/reference/current/modules-discovery.html>`_ for more details.

This produces the following output (the value will vary for each invocation)::

{
"installation-id": "69ffcfee-6378-4090-9e93-87c9f8ee59a7"
}


We will need the installation id in the next steps to refer to our current installation.

.. note::

You can extract the installation id with `jq <https://stedolan.github.io/jq/>`_ using the following filter: ``jq --raw-output '.["installation-id"]'``.
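For example, the installation and the id extraction can be combined into a single step; this sketch simply pipes the single-node ``install`` command from above through ``jq``::

export INSTALLATION_ID=$(esrally install --quiet --distribution-version=7.4.2 --node-name="rally-node-0" --network-host="127.0.0.1" --http-port=39200 --master-nodes="rally-node-0" --seed-hosts="127.0.0.1:39300" | jq --raw-output '.["installation-id"]')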

After installation, we can start the node. To tie all metrics of a benchmark together, Rally needs a consistent race id across all invocations. The format of the race id does not matter but we suggest using UUIDs. You can generate a UUID on the command line with ``uuidgen``. Issue the following command to start the node::

# generate a unique race id (use the same id from now on)
export RACE_ID=$(uuidgen)
esrally start --installation-id="69ffcfee-6378-4090-9e93-87c9f8ee59a7" --race-id="${RACE_ID}"

After the Elasticsearch node has started, we can run a benchmark. Be sure to pass the same race id so you can match results later in your metrics store::

esrally --pipeline=benchmark-only --target-host=127.0.0.1:39200 --track=geonames --challenge=append-no-conflicts-index-only --on-error=abort --race-id=${RACE_ID}

When the benchmark has finished, we can stop the node again::

esrally stop --installation-id="69ffcfee-6378-4090-9e93-87c9f8ee59a7"

If you only want to shut down the node but don't want to delete the node and the data, additionally pass ``--preserve-install``.
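For instance, the following sketch stops the node from this walkthrough while keeping its installation and data in place::

esrally stop --installation-id="69ffcfee-6378-4090-9e93-87c9f8ee59a7" --preserve-install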
Contributor @dliappis (Nov 28, 2019): "pass additionally --preserve-install" instead?



Levelling Up: Benchmarking a Cluster
------------------------------------

This approach of managing individual cluster nodes shows its power when we want to set up a cluster consisting of multiple nodes. At the moment Rally only supports a uniform cluster architecture, but with this approach we can also set up arbitrarily complex clusters. The following example shows how to set up a uniform three-node cluster on three machines with the IPs ``192.168.14.77``, ``192.168.14.78`` and ``192.168.14.79``. On each machine we will issue the following command (pick the right one per machine)::

# on 192.168.14.77
export INSTALLATION_ID=$(esrally install --quiet --distribution-version=7.4.2 --node-name="rally-node-0" --network-host="192.168.14.77" --http-port=39200 --master-nodes="rally-node-0,rally-node-1,rally-node-2" --seed-hosts="192.168.14.77:39300,192.168.14.78:39300,192.168.14.79:39300" | jq --raw-output '.["installation-id"]')
# on 192.168.14.78
export INSTALLATION_ID=$(esrally install --quiet --distribution-version=7.4.2 --node-name="rally-node-1" --network-host="192.168.14.78" --http-port=39200 --master-nodes="rally-node-0,rally-node-1,rally-node-2" --seed-hosts="192.168.14.77:39300,192.168.14.78:39300,192.168.14.79:39300" | jq --raw-output '.["installation-id"]')
# on 192.168.14.79
export INSTALLATION_ID=$(esrally install --quiet --distribution-version=7.4.2 --node-name="rally-node-2" --network-host="192.168.14.79" --http-port=39200 --master-nodes="rally-node-0,rally-node-1,rally-node-2" --seed-hosts="192.168.14.77:39300,192.168.14.78:39300,192.168.14.79:39300" | jq --raw-output '.["installation-id"]')

Then we pick a random race id, e.g. ``fb38013d-5d06-4b81-b81a-b61c8c10f6e5``, and set it on each machine (including the machine that will generate load)::

export RACE_ID="fb38013d-5d06-4b81-b81a-b61c8c10f6e5"

Now we can start the cluster. Run the following command on each node::

esrally start --installation-id="${INSTALLATION_ID}" --race-id="${RACE_ID}"

Once this has finished, we can check that the cluster is up and running, e.g. with the ``_cat/health`` API::

curl http://192.168.14.77:39200/_cat/health\?v

We should see that our cluster consisting of three nodes is up and running::

epoch timestamp cluster status node.total node.data shards pri relo init unassign pending_tasks max_task_wait_time active_shards_percent
1574930657 08:44:17 rally-benchmark green 3 3 0 0 0 0 0 0 - 100.0%

Now we can start the benchmark on the load generator machine (remember to set the race id there)::

esrally --pipeline=benchmark-only --target-host=192.168.14.77:39200,192.168.14.78:39200,192.168.14.79:39200 --track=geonames --challenge=append-no-conflicts-index-only --on-error=abort --race-id=${RACE_ID}

Similarly to the single-node benchmark, we can now shut down the cluster again by issuing the following command on each node::

esrally stop --installation-id="${INSTALLATION_ID}"

108 changes: 108 additions & 0 deletions docs/command_line_reference.rst
@@ -64,6 +64,54 @@ This will show the path to the default distribution::
"elasticsearch": "/Users/dm/.rally/benchmarks/distributions/elasticsearch-6.8.0.tar.gz"
}

``install``
~~~~~~~~~~~

.. warning::

This subcommand is experimental. Expect the functionality and the command line interface to change significantly even in patch releases.

This subcommand can be used to install a single Elasticsearch node. Example::

esrally install --quiet --distribution-version=7.4.2 --node-name="rally-node-0" --network-host="127.0.0.1" --http-port=39200 --master-nodes="rally-node-0" --seed-hosts="127.0.0.1:39300"

This will output the id of this installation::

{
"installation-id": "69ffcfee-6378-4090-9e93-87c9f8ee59a7"
}

Please see the :doc:`cluster management tutorial </cluster_management>` for the intended use and a complete walkthrough.

``start``
~~~~~~~~~

.. warning::

This subcommand is experimental. Expect the functionality and the command line interface to change significantly even in patch releases.

This subcommand can be used to start a single Elasticsearch node that has been previously installed with the ``install`` subcommand. Example::

esrally start --installation-id=INSTALLATION_ID --race-id=RACE_ID

``INSTALLATION_ID`` is the installation id value as shown in the output of the ``install`` command. The ``RACE_ID`` can be chosen freely but is required to be the same for all nodes in a cluster.

Please see the :doc:`cluster management tutorial </cluster_management>` for the intended use and a complete walkthrough.

``stop``
~~~~~~~~

.. warning::

This subcommand is experimental. Expect the functionality and the command line interface to change significantly even in patch releases.

This subcommand can be used to stop a single Elasticsearch node that has been previously started with the ``start`` subcommand. Example::

esrally stop --installation-id=INSTALLATION_ID

``INSTALLATION_ID`` is the installation id value as shown in the output of the ``install`` command.

Please see the :doc:`cluster management tutorial </cluster_management>` for the intended use and a complete walkthrough.

Command Line Flags
------------------
@@ -164,6 +212,66 @@ A track consists of one or more challenges. With this flag you can specify which

A unique identifier for this race. By default a random UUID is automatically chosen by Rally.

``installation-id``
~~~~~~~~~~~~~~~~~~~

.. warning::

This command line parameter is experimental. Expect the functionality and the command line interface to change significantly even in patch releases.

A unique identifier for an installation. This id is automatically generated by Rally when the ``install`` subcommand is invoked and needs to be provided in the ``start`` and ``stop`` subcommands in order to refer to that installation.

``node-name``
~~~~~~~~~~~~~

.. warning::

This command line parameter is experimental. Expect the functionality and the command line interface to change significantly even in patch releases.

Used to specify the current node's name in the cluster when it is set up via the ``install`` subcommand.
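For example, using the node name from the :doc:`cluster management tutorial </cluster_management>`::

--node-name="rally-node-0"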

``network-host``
~~~~~~~~~~~~~~~~

.. warning::

This command line parameter is experimental. Expect the functionality and the command line interface to change significantly even in patch releases.

Used to specify the IP address to which the current cluster node should bind when it is set up via the ``install`` subcommand.
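For example, binding to one of the addresses used in the tutorial's three-node cluster::

--network-host="192.168.14.77"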

``http-port``
~~~~~~~~~~~~~

.. warning::

This command line parameter is experimental. Expect the functionality and the command line interface to change significantly even in patch releases.

Used to specify the port on which the current cluster node should listen for HTTP traffic when it is set up via the ``install`` subcommand. The corresponding transport port will automatically be chosen by Rally and is always the specified HTTP port + 100.
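For example, with the HTTP port used in the tutorial, Rally will pick 39300 as the transport port::

--http-port=39200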

``master-nodes``
~~~~~~~~~~~~~~~~

.. warning::

This command line parameter is experimental. Expect the functionality and the command line interface to change significantly even in patch releases.

A comma-separated list that specifies the node names of the master nodes in the cluster when a node is set up via the ``install`` subcommand. Example::

--master-nodes="rally-node-0,rally-node-1,rally-node-2"

This will treat the nodes named ``rally-node-0``, ``rally-node-1`` and ``rally-node-2`` as `initial master nodes <https://www.elastic.co/guide/en/elasticsearch/reference/current/discovery-settings.html#initial_master_nodes>`_.

``seed-hosts``
~~~~~~~~~~~~~~

.. warning::

This command line parameter is experimental. Expect the functionality and the command line interface to change significantly even in patch releases.

A comma-separated list of IP:transport port pairs used to specify the seed hosts in the cluster when a node is set up via the ``install`` subcommand. See the `Elasticsearch documentation <https://www.elastic.co/guide/en/elasticsearch/reference/current/discovery-settings.html#unicast.hosts>`_ for a detailed explanation of the seed hosts parameter. Example::

--seed-hosts="192.168.14.77:39300,192.168.14.78:39300,192.168.14.79:39300"

.. _clr_include_tasks:

``include-tasks``
1 change: 1 addition & 0 deletions docs/index.rst
@@ -40,6 +40,7 @@ Rally's source code is available on `Github <https://github.com/elastic/rally>`_
configuration
race
tournament
cluster_management
recipes

.. toctree::
2 changes: 1 addition & 1 deletion esrally/mechanic/__init__.py
@@ -17,4 +17,4 @@
 
 # expose only the minimum API
 from .mechanic import StartEngine, EngineStarted, StopEngine, EngineStopped, ResetRelativeTime, MechanicActor, \
-    cluster_distribution_version, download
+    cluster_distribution_version, download, install, start, stop
12 changes: 3 additions & 9 deletions esrally/mechanic/cluster.py
@@ -21,27 +21,21 @@ class Node:
     Represents an Elasticsearch cluster node.
     """
 
-    def __init__(self, pid, host_name, node_name, telemetry):
+    def __init__(self, pid, binary_path, host_name, node_name, telemetry):
         """
         Creates a new node.
 
         :param pid: PID for this node.
+        :param binary_path: The local path to the binaries for this node.
         :param host_name: The name of the host where this node is running.
         :param node_name: The name of this node.
         :param telemetry: The attached telemetry.
         """
         self.pid = pid
+        self.binary_path = binary_path
         self.host_name = host_name
         self.node_name = node_name
-        self.ip = None
         self.telemetry = telemetry
-        # populated by telemetry
-        self.os = {}
-        self.jvm = {}
-        self.cpu = {}
-        self.memory = {}
-        self.fs = []
-        self.plugins = []
 
     def __repr__(self):
         return self.node_name
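To make the new constructor shape concrete, here is a minimal, hypothetical usage sketch (all values below are made up; a real caller passes the node's actual PID, install path and attached telemetry):

    from esrally.mechanic.cluster import Node

    # Illustrative values only -- not taken from this PR.
    node = Node(pid=12345,
                binary_path="/path/to/installed/elasticsearch",  # new parameter added in this PR
                host_name="rally-host-0",
                node_name="rally-node-0",
                telemetry=None)  # a real caller passes the attached telemetry
    print(repr(node))  # prints: rally-node-0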
11 changes: 5 additions & 6 deletions esrally/mechanic/java_resolver.py
@@ -21,22 +21,21 @@
 from esrally.utils import jvm
 
 
-def java_home(car, cfg):
-    def determine_runtime_jdks(car):
+def java_home(car_runtime_jdks, cfg):
+    def determine_runtime_jdks():
         override_runtime_jdk = cfg.opts("mechanic", "runtime.jdk")
         if override_runtime_jdk:
             return [override_runtime_jdk]
         else:
-            runtime_jdks = car.mandatory_var("runtime.jdk")
             try:
-                return [int(v) for v in runtime_jdks.split(",")]
+                return [int(v) for v in car_runtime_jdks.split(",")]
             except ValueError:
                 raise exceptions.SystemSetupError(
-                    "Car config key \"runtime.jdk\" is invalid: \"{}\" (must be int)".format(runtime_jdks))
+                    "Car config key \"runtime.jdk\" is invalid: \"{}\" (must be int)".format(car_runtime_jdks))
 
     logger = logging.getLogger(__name__)
 
-    runtime_jdk_versions = determine_runtime_jdks(car)
+    runtime_jdk_versions = determine_runtime_jdks()
     logger.info("Allowed JDK versions are %s.", runtime_jdk_versions)
     major, java_home = jvm.resolve_path(runtime_jdk_versions)
     logger.info("Detected JDK with major version [%s] in [%s].", major, java_home)
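To make the refactor concrete: java_home no longer receives the whole car object, only the raw "runtime.jdk" value. A hedged sketch of the caller-side change this implies (the actual call sites are not part of this hunk; "car" and "cfg" stand for the usual car and config objects, and car.mandatory_var is the accessor the old code used internally):

    # Old call shape (before this PR): java_home(car, cfg) -- the function looked up
    # the car variable itself.
    # New call shape (sketch): the caller resolves the variable and passes the raw string,
    # e.g. "13,12,11", which determine_runtime_jdks() then splits and converts to ints.
    java_home(car.mandatory_var("runtime.jdk"), cfg)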