Remove manual parsing of JVM options #41962

jasontedor · 2019-05-08T18:26:01Z

This commit removes manual parsing of JVM options when calculating ergonomics. This is to avoid a situation that we parse values differently than the JVM would. In fact, we already have a bug along these lines today. It is possible to start the JVM with the same flag multiple times on the command line. In this case, the last value wins. For example, -Xmx1g -Xmx2g would start the JVM with a heap size of two gigabytes. Our JVM ergonomics ignores this possibility and instead the first value is winning!

Our strategy to avoid manual parsing of the JVM options is to start the Java command line parser (without actually starting a JVM) by invoking java with the same command line flags as presented and request that the JVM tell us what values it would start with. This ensures that we have the correct values when making ergonomic decisions.

Moreover, our strategy also is ignoring ES_JAVA_OPTS which could override the heap size as well leading to incorrect ergonomic choices. This commit address this issue too.

Relates #30684

This commit removes manual parsing of JVM options when calculating ergonomics. This is to avoid a situation that we parse values differently than the JVM would. In fact, we already have a bug along these lines today. It is possible to start the JVM with the same flag multiple times on the command line. In this case, the last value wins. For example, -Xmx1g -Xmx2g would start the JVM with a heap size of two gigabytes. Our JVM ergonomics ignores this possibility and instead the first value is winning! Our strategy to avoid manual parsing of the JVM options is to start yet another JVM with the same command line flags as presented and request that the JVM tell us what values it would start with. This ensures that we have the correct values when making ergonomic decisions. Moreover, our strategy also is ignoring ES_JAVA_OPTS which could override the heap size as well leading to incorrect ergonomic choices. This commit address this issue too.

elasticmachine · 2019-05-08T18:26:04Z

Pinging @elastic/es-core-infra

jasontedor · 2019-05-08T18:29:32Z

To my beloved reviewers: This approach was initially suggested on the original work introducing the ergonomics. Please see the discussion there as part of your review before we consider pulling this in. I think that we can revisit the discussion here if needed.

rjernst · 2019-05-08T20:22:01Z

This seems like a very heavyweight hack. Launching a VM with Xmx pretouched, just to find out what the value was is a huge cost (relative to normal startup).

Both the bugs you mentioned (first vs last wins, missing ES_JAVA_OPTS) seem fixable with the current implementation. IMO keeping our own parsing isn't that expensive vs incurring the added startup cost for every node we start (which amplifies in our integration tests as we start a lot of nodes...).

jasontedor · 2019-05-08T21:21:24Z

I think there is confusion here, introduced by my commit message (I will reword it); I’m sorry for that. If you refer back to the initial discussion in #30684, we do not actually start a JVM here, only invoke java for options parsing. By requesting -XX:+PrintFlagsFinal -version it only causes the Java process to parse the options, display all the JVM flags, print the version, and exit. The JVM itself never bootstraps. This is actually quite cheap then.

While what we are doing today has bugs that can be fixed without resorting to this, there are bugs we have today that can not. Here’s a list of the bugs we have today:

ordering of options
ignore ES_JAVA_OPTS
wrong value if heap size is not specified (there are use cases for this, in containers)
we don’t parse all possible unit specifications

The third of these simply can not be fixed by us. My change fixes all of these, with no additional effort on our end. We don’t have to have a side implementation of the JVM parsing logic.

Additionally, this sets us up to easily extract the value of MaxDirectMemorySize, without having to parse that too.

Thus I maintain this is a superior approach.

jaymode · 2019-05-08T21:41:02Z

I think there is confusion here, introduced by my commit message (I will reword it)

The semantics of invoking java -XX:+PrintFlagsFinal -version with all the JVM flags we are going to start elasticsearch with is not intuitive enough to only have code without comments IMO; I think we need to spell this out in a code comment and probably cover:

Does not start a JVM and hence does not allocate/pretouch GBs of memory
Only invokes the option parser
Is cheap
Allows us to avoid having our own implementation of JVM option parsing logic

I'm personally in favor of this approach.

jasontedor · 2019-05-09T00:09:33Z

@jaymode I pushed a comment in 8aed51e.

rjernst · 2019-05-09T01:12:00Z

we do not actually start a JVM here, only invoke java for options parsing. By requesting -XX:+PrintFlagsFinal -version it only causes the Java process to parse the options, display all the JVM flags, print the version, and exit.

You are right, I completely missed that, I'm sorry. I misinterpreted a comment from Daniel in the previous discussion about startup time of java with 8GB Xmx. Given my new understanding, this approach does sound superior.

rjernst

LGTM. Thanks for the comment in code, it makes much more sense now.

jaymode

LGTM

…rsing * elastic/master: [ML] relax set upgrade mode test to match what is guaranteed (elastic#41958) Add note about ILM action ordering (elastic#41771) Remove Version 6 pre-release constants (elastic#41517) Mute illegal interval rollup tests Add static section whitelist info to api docs generation (elastic#41870) Cleanup RollupSearch exceptions, disallow partial results (elastic#41272)

danielmitterdorfer

I'm fine with the approach. Indeed it seems superior to me.

Additionally, this sets us up to easily extract the value of MaxDirectMemorySize, without having to parse that too.

I think you left out one detail that led me to misinterpret your statement originally. We can properly parse MaxDirectMemorySize provided that the user has specified one. If the user did not specify one, we see a MaxDirectMemorySize of 0 because the JVM determines this value in a much later stage during startup. If I run this:

daniel@io:~ $ java -XX:+PrintFlagsFinal -version | grep MaxDirectMemorySize

I get:

 uint64_t MaxDirectMemorySize                      = 0                                         {product} {default}
openjdk version "12" 2019-03-19
OpenJDK Runtime Environment (build 12+32)
OpenJDK 64-Bit Server VM (build 12+32, mixed mode, sharing)

But as I said: I think your intention was to state that we can parse the user-specified value with this approach as well.

This commit removes manual parsing of JVM options when calculating ergonomics. This is to avoid a situation that we parse values differently than the JVM would. In fact, we already have a bug along these lines today. It is possible to start the JVM with the same flag multiple times on the command line. In this case, the last value wins. For example, -Xmx1g -Xmx2g would start the JVM with a heap size of two gigabytes. Our JVM ergonomics ignores this possibility and instead the first value is winning! Our strategy to avoid manual parsing of the JVM options is to start the Java command line parser (without actually starting a JVM) by invoking java with the same command line flags as presented and request that the JVM tell us what values it would start with. This ensures that we have the correct values when making ergonomic decisions. Moreover, our strategy also is ignoring ES_JAVA_OPTS which could override the heap size as well leading to incorrect ergonomic choices. This commit address this issue too.

jasontedor added >enhancement :Delivery/Packaging RPM and deb packaging, tar and zip archives, shell and batch scripts v8.0.0 v7.2.0 labels May 8, 2019

jasontedor requested review from rjernst, danielmitterdorfer and jaymode May 8, 2019 18:26

jasontedor added :Core/Infra/Core Core issues without another label and removed :Delivery/Packaging RPM and deb packaging, tar and zip archives, shell and batch scripts labels May 8, 2019

jasontedor added 2 commits May 8, 2019 14:49

Fix NPE

5973254

Small refactor

4b4d5a6

Add comment

8aed51e

rjernst approved these changes May 9, 2019

View reviewed changes

jaymode approved these changes May 9, 2019

View reviewed changes

jasontedor added 2 commits May 8, 2019 23:32

Fix handling of ES_JAVA_OPTS being empty

3911d10

danielmitterdorfer approved these changes May 9, 2019

View reviewed changes

jasontedor merged commit 2592b49 into elastic:master May 9, 2019

jasontedor deleted the fix-jvm-options-parsing branch May 9, 2019 10:44

jasontedor mentioned this pull request May 9, 2019

Limit max direct memory size to half of heap size #42006

Merged

davidkyle mentioned this pull request May 9, 2019

NPE in JvmErgonomics fails repository plugin tests #42009

Closed

ebadyano mentioned this pull request Sep 26, 2019

JvmOptionsParser - cannot resolve enviroment variables #47133

Closed

This was referenced Oct 28, 2019

Hardcoded Heap settings for docker environments #48574

Closed

Are Java memory options still needed for docker? #42660

Closed

jdiazdev mentioned this pull request Feb 7, 2020

NPE when launching Elasticsearch eclipse-openj9/openj9#7764

Closed

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove manual parsing of JVM options #41962

Remove manual parsing of JVM options #41962

jasontedor commented May 8, 2019 •

edited

elasticmachine commented May 8, 2019

jasontedor commented May 8, 2019

rjernst commented May 8, 2019

jasontedor commented May 8, 2019

jaymode commented May 8, 2019

jasontedor commented May 9, 2019

rjernst commented May 9, 2019

rjernst left a comment

jaymode left a comment

danielmitterdorfer left a comment

Remove manual parsing of JVM options #41962

Remove manual parsing of JVM options #41962

Conversation

jasontedor commented May 8, 2019 • edited

elasticmachine commented May 8, 2019

jasontedor commented May 8, 2019

rjernst commented May 8, 2019

jasontedor commented May 8, 2019

jaymode commented May 8, 2019

jasontedor commented May 9, 2019

rjernst commented May 9, 2019

rjernst left a comment

Choose a reason for hiding this comment

jaymode left a comment

Choose a reason for hiding this comment

danielmitterdorfer left a comment

Choose a reason for hiding this comment

jasontedor commented May 8, 2019 •

edited