Memory tracking issue: worker OOM in DictionaryValuesWriter #21745

Closed
davseitsev opened this issue Apr 29, 2024 · 6 comments · Fixed by #21801

davseitsev commented Apr 29, 2024:

Under load, some Trino workers experience memory starvation; on the heap usage graph it looks like a horizontal line at the top.
An OOM doesn't actually occur, but the worker disappears from the cluster and running queries fail. It happens due to long major GC pauses.
[screenshot: worker heap usage graph showing a flat line at the top]
This behaviour persists after upgrading Trino from 409 to 444.

The heap dump looks fine: there is no obvious memory leak, so the problem is probably in memory accounting in the memory context.
I looked over the biggest objects and found something interesting.
[screenshot: largest objects in the heap dump]

DictionaryValuesWriter collects heap byte buffers whose backing byte arrays are much bigger than their logical size. As you can see in the screenshot, the logical size of one buffer is 199 bytes while its backing byte array is almost 200 KB. We collect such buffers in the dictionary and do not account for their "extra" bytes. As you can also see, DictionaryFallbackValuesWriter accepted 620 KB of raw data but actually takes 122 MB of heap space.
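To make the effect concrete, here is a minimal standalone Java sketch (not parquet-mr code) showing how a heap ByteBuffer slice can have a tiny logical size while still pinning a large backing array; the sizes are chosen to mirror the 199-byte / ~200 KB example in the screenshot, they are not taken from the actual writer.

import java.nio.ByteBuffer;

public class SlicedBufferDemo {
    public static void main(String[] args) {
        // Allocate one large heap slab, then expose only a small window of it as a slice.
        ByteBuffer slab = ByteBuffer.allocate(200_000);
        slab.position(1_000);
        slab.limit(1_199);
        ByteBuffer view = slab.slice();

        // The slice logically holds 199 bytes...
        System.out.println("logical size:       " + view.remaining());    // 199
        // ...but it still keeps the whole 200,000-byte backing array reachable.
        System.out.println("backing array size: " + view.array().length); // 200000
    }
}

Any accounting based only on the logical size will undercount such a buffer by roughly the full slab size.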

I went over all instances of PlainBinaryDictionaryValuesWriter and calculated how many extra bytes they take:

select 
  sum(cast(o.this['encodedValues.currentSlabPos'] as int) + retainedSize(o.this['encodedValues.slabs'])) as total_allocated_size,    
  sum(retainedSize(o.this)) total_retained_size
from "org.apache.parquet.column.values.dictionary.DictionaryValuesWriter$PlainBinaryDictionaryValuesWriter" o

Results:

 total_allocated_size | total_retained_size
--------------------------------------------
          149 845 365 |       7 615 653 296
--------------------------------------------

That is about 7.5 GB of "extra" bytes (7,615,653,296 retained − 149,845,365 allocated ≈ 7.47 GB). I went through the source code and didn't find any other place where these bytes are taken into account in the memory context, so they can cause workers to OOM.
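For illustration only, a rough Java sketch of where that accounting happens, assuming a setBytes-style memory context API; the names MemoryContext, DictionaryWriterLike and WriterAccounting are hypothetical stand-ins, not Trino source.

// Hypothetical sketch: if getAllocatedSize() misses the backing arrays of retained
// buffers, everything charged to the memory pool is understated by the same amount.
interface MemoryContext {
    void setBytes(long bytes); // assumed setBytes-style accounting API
}

interface DictionaryWriterLike {
    long getAllocatedSize(); // reports logical buffer sizes only
}

class WriterAccounting {
    private final MemoryContext memoryContext;
    private final DictionaryWriterLike valuesWriter;

    WriterAccounting(MemoryContext memoryContext, DictionaryWriterLike valuesWriter) {
        this.memoryContext = memoryContext;
        this.valuesWriter = valuesWriter;
    }

    void updateMemoryUsage() {
        // Only the reported size reaches the memory context; the ~7.5 GB of
        // unreported backing bytes observed above never shows up here.
        memoryContext.setBytes(valuesWriter.getAllocatedSize());
    }
}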

Also, as far as I can see, PlainBinaryDictionaryValuesWriter does not take the size of the binaryDictionaryContent map into account in getAllocatedSize() at all.

raunaqmorarka (Member) commented:

@davseitsev the image attachment links above are not working.

davseitsev (Author) commented:

Uploaded the images again, should be OK now.

raunaqmorarka (Member) commented:

@davseitsev thanks for reporting this.
I think #21801 should solve it; can you verify that it fixes the problem for you?

davseitsev (Author) commented:

I will port the fix and test it on the same query.

davseitsev (Author) commented:

Thank you @raunaqmorarka, it looks good: there are no more heap buffers, and the difference between the allocated size and the actual retained heap is much smaller.

The query

select 
  sum(cast(o.this['encodedValues.currentSlabPos'] as int) + retainedSize(o.this['encodedValues.slabs'])) as total_allocated_size,    
  sum(retainedSize(o.this)) total_retained_size
from "org.apache.parquet.column.values.dictionary.DictionaryValuesWriter$PlainBinaryDictionaryValuesWriter" o

Returns

 total_allocated_size | total_retained_size
--------------------------------------------
           51 573 871 |       1 341 402 000
--------------------------------------------

As far as I can see, binaryDictionaryContent is still not accounted for in the memory context, but the size of the dictionary is much smaller now, so I think it's OK.

raunaqmorarka (Member) commented:

Thanks for verifying. DictionaryValuesWriter does in fact account for binaryDictionaryContent through dictionaryByteSize, which is updated by PlainBinaryDictionaryValuesWriter#writeBytes. What's missing is that we only account for the size of the values stored in the Object2IntMap<Binary> binaryDictionaryContent and ignore the size of the keys and the overall map structure. This will be possible to improve when we eventually stop using parquet-mr code and write our own implementation.
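As a rough illustration of that remaining gap, a hedged Java sketch of what fuller accounting for the dictionary map could look like; the helper name and the per-entry overhead constant are assumptions for illustration, not parquet-mr or Trino code.

import java.util.Map;
import org.apache.parquet.io.api.Binary;

class DictionarySizeEstimate {
    // Assumed, JVM-dependent guess at per-entry overhead: the hash map slot, the key
    // object header/fields and the int dictionary id. Not a measured figure.
    private static final long ASSUMED_PER_ENTRY_OVERHEAD = 48;

    // Hypothetical helper. The first term is roughly what dictionaryByteSize already
    // tracks (the bytes of the Binary entries); the second term approximates the key
    // objects and map structure that are currently ignored.
    static long estimateRetainedBytes(Map<Binary, Integer> binaryDictionaryContent) {
        long contentBytes = 0;
        for (Binary entry : binaryDictionaryContent.keySet()) {
            contentBytes += entry.length();
        }
        long structureBytes = (long) binaryDictionaryContent.size() * ASSUMED_PER_ENTRY_OVERHEAD;
        return contentBytes + structureBytes;
    }
}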
