Add single-machine deployment example cfgs and scripts #7590

jon-wei · 2019-05-03T19:09:46Z

This PR adds a new set of reference configurations and launch scripts for single-machine deployments:

micro-quickstart
small
medium
large
xlarge

The micro-quickstart is sized for small machines like laptops and is intended for quick evaluation use-cases.

The other configurations are intended for general use single-machine deployments. They are sized for hardware roughly based on Amazon's i3 series of EC2 instances.

The old tutorial cluster configuration has been replaced with micro-quickstart, with the tutorials also pointing users to the other single-server examples if they wish to use a bigger machine.

This PR also adds example configurations for a clustered deployment based on the master/data/query organization, along with associated launch scripts.

fjy · 2019-05-03T19:12:42Z

docs/content/tutorials/index.md


+You will need:
  * Java 8


can you also add (8u92+)

fjy · 2019-05-04T16:32:53Z

examples/conf/druid/single-server/medium/overlord/runtime.properties

+# under the License.
+#
+
+druid.service=druid/overlord


Out of curiosity, why not run Coordinator asOverlord and move towards the world where the coordinator and overlord are one

I initially felt like the simplicity benefit wasn't that high in this case (since premade configs and startup scripts are being provided), and felt that it would be closer to a full clustered deployment for eventual migration with them separate.

I updated the PR to use that setting, since it would be fine to set that in a larger cluster as well.

fjy · 2019-05-04T16:36:15Z

I think there are still a small set of things missing:

Some transition docs describing how Druid should be evaluated (when to use single server, when to use clustered mode)
In clustered docs, master/query/data servers should be started via scripts instead of two separate command lines

jon-wei · 2019-05-06T19:19:52Z

Some transition docs describing how Druid should be evaluated (when to use single server, when to use clustered mode)

In clustered docs, master/query/data servers should be started via scripts instead of two separate command lines

I'm working on another doc-only update PR that will provide guidance on important performance tuning properties, which will also adjust the current cluster.md docs and address these points.

fjy · 2019-05-06T19:29:54Z

Some transition docs describing how Druid should be evaluated (when to use single server, when to use clustered mode)

In clustered docs, master/query/data servers should be started via scripts instead of two separate command lines

I'm working on another doc-only update PR that will provide guidance on important performance tuning properties, which will also adjust the current cluster.md docs and address these points.

Cool. Maybe the ToC should reflect the path to production. Laptop --> Single Server --> Clustering (non-HA) --> Clustering (HA)

jon-wei · 2019-05-06T20:03:27Z

Cool. Maybe the ToC should reflect the path to production. Laptop --> Single Server --> Clustering (non-HA) --> Clustering (HA)

Sounds good, I'll add that structure in the follow-on PR

gianm · 2019-05-07T01:16:07Z

Cool. Maybe the ToC should reflect the path to production. Laptop --> Single Server --> Clustering (non-HA) --> Clustering (HA)

By the way, I don't think there's a need to separate 'laptop' and 'single server' (people can treat their laptop as a single server if they want to - there is really no meaningful difference). I also don't think there's a need to separate 'Clustering (non HA)' from 'Clustering (HA)' (HA should just be a section at the end of the clustering docs).

gianm

👍

gianm · 2019-05-07T02:09:44Z

docs/content/tutorials/index.md

+
+### Hardware
+
+Druid includes several example [single-server configurations](../operations/single-server.html), along with scripts to start the Druid processes using these configurations.


Please add this somewhere more prominent in a follow-up (ToC, perhaps).

- update Docker entrypoint script in respond to the new directory structure apache#7590 - update Dockerfile to allow dependency caching in multi-stage build Signed-off-by: Khwunchai Jaengsawang <khwunchai.j@ku.th>

knoguchi · 2020-04-28T20:18:42Z

examples/bin/run-druid

@@ -34,7 +34,7 @@ else
  CONFDIR="$2"
 fi

-CONFDIR="$(cd "$CONFDIR" && pwd)/druid"
+CONFDIR="$(cd "$CONFDIR" && pwd)"


Why was the /druid removed? It's a breaking change for older version users. It doesn't seem like an absolutely necessary change. This file is one of the bin scripts in the apache-druid.tar.gz distribution although the directory name is "examples". Please avoid similar changes in the future release. Thanks.

Hi @knoguchi,

It was changed to comport with the new directory structure in this patch, where we ship with multiple potential configurations for various sizes of servers. (It no longer makes sense to add /druid to the conf dir, since we provide the full path to a Druid-specific conf dir in the supervise configs.)

I think we should have noted this in the release notes, though.

Add single-machine deployment example cfgs and scripts

2b646d8

jon-wei added Area - Documentation Ease of Use Area - Operations labels May 3, 2019

fjy reviewed May 3, 2019

View reviewed changes

fjy added this to the 0.15.0 milestone May 3, 2019

Add (8u92+)

677bdc0

fjy reviewed May 4, 2019

View reviewed changes

Use combined coordinator-overlord for single machine confs

b4cdfa8

jon-wei force-pushed the single_server_examples branch from 92db75d to b4cdfa8 Compare May 6, 2019 19:08

RAT fix

f604fa4

gianm approved these changes May 7, 2019

View reviewed changes

gianm merged commit 7c2ca47 into apache:master May 7, 2019

jihoonson mentioned this pull request Jun 8, 2019

0.15.0-incubating release notes #7854

Closed

khwj mentioned this pull request Jun 29, 2019

Update Docker build #7997

Closed

1 task

This was referenced Aug 4, 2019

Update docker build #8237

Merged

Update docker build #8244

Merged

knoguchi reviewed Apr 28, 2020

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add single-machine deployment example cfgs and scripts #7590

Add single-machine deployment example cfgs and scripts #7590

jon-wei commented May 3, 2019

fjy May 3, 2019

jon-wei May 3, 2019

fjy May 4, 2019

jon-wei May 6, 2019

fjy commented May 4, 2019 •

edited

Loading

jon-wei commented May 6, 2019 •

edited

Loading

fjy commented May 6, 2019

jon-wei commented May 6, 2019

gianm commented May 7, 2019

gianm left a comment

gianm May 7, 2019

knoguchi Apr 28, 2020

gianm Apr 30, 2020


		### Hardware

		Druid includes several example [single-server configurations](../operations/single-server.html), along with scripts to start the Druid processes using these configurations.

Add single-machine deployment example cfgs and scripts #7590

Add single-machine deployment example cfgs and scripts #7590

Conversation

jon-wei commented May 3, 2019

fjy May 3, 2019

Choose a reason for hiding this comment

jon-wei May 3, 2019

Choose a reason for hiding this comment

fjy May 4, 2019

Choose a reason for hiding this comment

jon-wei May 6, 2019

Choose a reason for hiding this comment

fjy commented May 4, 2019 • edited Loading

jon-wei commented May 6, 2019 • edited Loading

fjy commented May 6, 2019

jon-wei commented May 6, 2019

gianm commented May 7, 2019

gianm left a comment

Choose a reason for hiding this comment

gianm May 7, 2019

Choose a reason for hiding this comment

knoguchi Apr 28, 2020

Choose a reason for hiding this comment

gianm Apr 30, 2020

Choose a reason for hiding this comment

fjy commented May 4, 2019 •

edited

Loading

jon-wei commented May 6, 2019 •

edited

Loading