Fix RAM usage estimation of LiveVersionMap. #20123

jpountz · 2016-08-23T14:51:51Z

I was writing tests for RAM usage estimation of LiveVersionMap and found a
couple of issues:

- The `BytesRef` objects used as uids were oversized since they were created
  via `new BytesRef(CharSequence)`, which creates a `byte[]` whose size is 3x
  the length of the provided char sequence. Given that our uids are ASCII
  sequences most of the time, this is a waste of memory.
- `VersionValue` was using `translogLocation.size` instead of
  `translogLocation.ramBytesUsed()` for RAM estimation, even though `size` is
  completely unrelated to the memory footprint of the `Translog.Location`
  object.
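
To illustrate the first point, here is a minimal stand-alone sketch. It does not use Lucene; `UidSizing` and its methods are made-up names, and the uid value is invented. It compares a worst-case UTF-8 buffer (3 bytes per UTF-16 char) with the exact encoded length of an ASCII uid:

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; not part of Lucene or Elasticsearch.
final class UidSizing {

    /** Worst-case UTF-8 buffer size: at most 3 bytes per UTF-16 char. */
    static int worstCaseUtf8Length(CharSequence text) {
        return 3 * text.length();
    }

    /** Exact number of bytes needed to encode the text as UTF-8. */
    static int exactUtf8Length(String text) {
        return text.getBytes(StandardCharsets.UTF_8).length;
    }

    public static void main(String[] args) {
        String uid = "doc#AVa1b2c3d4"; // made-up ASCII uid
        System.out.println("worst-case buffer: " + worstCaseUtf8Length(uid) + " bytes");
        System.out.println("exact encoding:    " + exactUtf8Length(uid) + " bytes");
    }
}
```

For pure ASCII input the exact encoding is one byte per char, so a worst-case buffer holds three times more bytes than needed.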

In particular, the latter issue could cause RAM usage to be significantly
overestimated, especially on large documents.
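
A rough sketch of why the estimate grows with document size. `LocationSketch` is a hypothetical stand-in for `Translog.Location`; its field layout and the 16-byte object header are assumptions, and real footprints depend on the JVM:

```java
// Hypothetical stand-in for Translog.Location; layout and names are assumptions.
final class LocationSketch {
    final long generation;
    final long translogLocation;
    final int size; // number of bytes the operation occupies in the translog

    LocationSketch(long generation, long translogLocation, int size) {
        this.generation = generation;
        this.translogLocation = translogLocation;
        this.size = size;
    }

    /** Approximate shallow heap footprint: assumed header + two longs + one int, 8-byte aligned. */
    long ramBytesUsed() {
        long raw = 16 + 2 * Long.BYTES + Integer.BYTES; // 16-byte header is an assumption
        return (raw + 7) / 8 * 8;
    }

    public static void main(String[] args) {
        // A location pointing at a 1 MiB operation in the translog.
        LocationSketch loc = new LocationSketch(1, 0, 1 << 20);
        System.out.println("accounted via size:           " + loc.size + " bytes");
        System.out.println("accounted via ramBytesUsed(): " + loc.ramBytesUsed() + " bytes");
    }
}
```

The heap footprint of the location object is a small constant, whereas `size` scales with the indexed document, which is why the overestimate is worst for large documents.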

I also added tests for RAM accounting.

Relates #19787

jpountz · 2016-08-23T14:57:19Z

Fortunately this bug was never released; it is 5.0-only.

s1monw · 2016-08-23T19:06:32Z

core/src/main/java/org/elasticsearch/index/engine/LiveVersionMap.java

@@ -146,7 +157,7 @@ VersionValue getUnderLock(final Term uid) {

    /** Adds this uid/version to the pending adds map. */
    void putUnderLock(BytesRef uid, VersionValue version) {
-
+        assert uid.bytes.length == uid.length; // otherwise we are wasting memory


maybe put the actual and expected length in the message
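
One possible shape for such a message, as a sketch only. `BytesRefSketch` is a hypothetical minimal stand-in for Lucene's `BytesRef` (offset omitted), and the wording of the message is an assumption, not what was merged:

```java
// Hypothetical minimal stand-in for Lucene's BytesRef; only the fields used here.
final class BytesRefSketch {
    final byte[] bytes;
    final int length;

    BytesRefSketch(byte[] bytes, int length) {
        this.bytes = bytes;
        this.length = length;
    }
}

final class AssertMessageSketch {

    /** Builds a message that reports both the expected and the actual array length. */
    static String oversizedMessage(BytesRefSketch uid) {
        return "oversized uid: expected a backing array of " + uid.length
            + " bytes but got " + uid.bytes.length;
    }

    /** Only checked when the JVM runs with assertions enabled (-ea). */
    static void checkExactlySized(BytesRefSketch uid) {
        assert uid.bytes.length == uid.length : oversizedMessage(uid);
    }

    public static void main(String[] args) {
        BytesRefSketch oversized = new BytesRefSketch(new byte[42], 14);
        System.out.println(oversizedMessage(oversized));
        checkExactlySized(new BytesRefSketch(new byte[14], 14)); // passes
    }
}
```

The message expression after the colon is only evaluated when the assertion fails, so the string concatenation costs nothing on the normal path.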

s1monw · 2016-08-23T19:07:24Z

left a minor LGTM otherwise - good catch :)


jpountz added >bug :Translog labels Aug 23, 2016

s1monw reviewed Aug 23, 2016

jpountz force-pushed the fix/live_version_map_ram_usage_estimation branch from 85507fa to 5d6c9b0 August 24, 2016 07:54

jpountz merged commit 5d6c9b0 into elastic:master Aug 24, 2016

clintongormley added the v5.0.0-beta1 label Aug 29, 2016
