MemoryManagement

Background

When the JVM performs heavy garbage collection, request latency is often high and response times become unpredictable. To reduce request latency and response-time jitter, the hugegraph-server graph query engine already uses off-heap memory in most OLTP algorithms.

However, HugeGraph currently cannot limit memory per query, so a single query can exhaust the memory of the entire process and cause an OOM error, or even leave the service unable to respond to other requests. To solve this problem, we can implement a memory management module that works at the level of a single query. Applicants will work with community developers to complete this task; the specific implementation plan, division of labor, and priorities can be adjusted as needed.

Technical Skill Points

Java/JVM Basics: Deep understanding of Java's memory model, including how heap memory and off-heap memory are managed and operated on.
Java NIO: The Java NIO library provides the interfaces for operating on off-heap memory and needs to be mastered. (Familiarity with Netty or other basic memory management libraries is preferred.)
Concurrent Programming: Memory management involves concurrent operations from multiple threads, so knowledge of concurrent programming and thread safety is required.
Data Structures: Understand and apply appropriate data structures, such as queues and stacks, to manage memory blocks.
Operating System: Understand the operating system's memory management mechanisms in order to better understand and optimize Java's off-heap memory management.

Project Output Requirements

Implement a unified memory pool that independently manages JVM off-heap memory, and adapt the memory allocation methods of the various native collections so that the memory mainly used by the algorithms comes from the unified memory pool and is returned to it after release.
Each request is associated with a unified memory pool, and the memory usage of a request can be limited by counting the memory it has taken from that pool.
Complete the related unit tests (UT) and basic documentation.
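The per-request accounting described above could be sketched as follows. This is a minimal illustration only: `QueryMemoryPool`, its method names, and the exception type are hypothetical and not part of HugeGraph, and `ByteBuffer.allocateDirect` stands in for a real pooled off-heap allocation.

```java
import java.nio.ByteBuffer;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical per-request pool: all names here are illustrative assumptions.
final class QueryMemoryPool {
    private final long limitBytes;
    private final AtomicLong usedBytes = new AtomicLong();

    QueryMemoryPool(long limitBytes) {
        this.limitBytes = limitBytes;
    }

    // Reserve `size` bytes against the request limit, then hand out off-heap memory.
    ByteBuffer allocate(int size) {
        if (size < 0) {
            throw new IllegalArgumentException("size must be >= 0: " + size);
        }
        while (true) {
            long current = usedBytes.get();
            long next = current + size;
            if (next > limitBytes) {
                throw new IllegalStateException(
                        "Query memory limit exceeded: used=" + current
                        + ", requested=" + size + ", limit=" + limitBytes);
            }
            // CAS so that concurrent allocations cannot jointly overshoot the limit.
            if (usedBytes.compareAndSet(current, next)) {
                return ByteBuffer.allocateDirect(size);
            }
        }
    }

    // Give the reservation back. This sketch only updates the counter;
    // a real pool would also recycle the block here.
    void release(ByteBuffer buffer) {
        usedBytes.addAndGet(-buffer.capacity());
    }

    long usedBytes() {
        return usedBytes.get();
    }
}
```

With a limit of 1024 bytes, a request that already holds 600 bytes would be refused a further 500-byte allocation instead of pushing the whole process toward OOM.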

Key Technology Analysis and Selection

Where will a large amount of memory be occupied?
- Vertex, edge, and attribute objects allocated when querying/inserting data (optional, but high priority: P2)
- Cache memory (optional, P3)
- Byte arrays allocated when serializing/deserializing, including the receive buffer for data read from storage (optional, P3)
- Collection objects used in graph algorithms (required, P1)
- Collection objects in Gremlin query operators such as group and dedup (optional, P3)
- Objects allocated by REST interface requests (optional, P4)
Should an existing library be used for memory segment/page management, and how should it be chosen?
- Should the Netty memory management library be chosen? (The RPC layer already depends on Netty; the PostgreSQL memory management module can also serve as a reference.)
- Ensure that large, medium, and small allocations can all be managed efficiently
- Ensure that memory limits can be enforced per Query instance
- A memory object pool may be needed (such as a Vertex object pool; this requirement is not yet certain and depends on whether Vertex needs to be completely replaced with a binary access object)
How to manage memory at multiple levels: system memory, multiple request memory, memory at each stage of a single request, temporary memory?
- Adopt a MemoryContext tree structure?
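One possible shape for such a tree is sketched below, assuming each node carries its own limit and that usage is propagated to every ancestor (system, query, stage, temporary). `MemoryContextNode` and its methods are hypothetical names for illustration, not an agreed design; a single lock shared by the whole tree keeps the sketch simple at the cost of contention.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical MemoryContext node: usage counts this node plus all descendants.
final class MemoryContextNode {
    private final String name;
    private final long limitBytes;
    private final MemoryContextNode parent;
    private final Object lock; // one lock shared by the whole tree
    private final List<MemoryContextNode> children = new ArrayList<>();
    private long usedBytes;

    // Creates a root context, e.g. the system-wide budget.
    MemoryContextNode(String name, long limitBytes) {
        this(name, limitBytes, null);
    }

    private MemoryContextNode(String name, long limitBytes, MemoryContextNode parent) {
        this.name = name;
        this.limitBytes = limitBytes;
        this.parent = parent;
        this.lock = (parent == null) ? new Object() : parent.lock;
    }

    MemoryContextNode newChild(String childName, long childLimitBytes) {
        MemoryContextNode child = new MemoryContextNode(childName, childLimitBytes, this);
        synchronized (lock) {
            children.add(child);
        }
        return child;
    }

    // Reserve bytes here and in every ancestor; fail if any level would exceed its limit.
    void reserve(long bytes) {
        synchronized (lock) {
            // Check all levels first so a failure leaves no partial reservation.
            for (MemoryContextNode n = this; n != null; n = n.parent) {
                if (n.usedBytes + bytes > n.limitBytes) {
                    throw new IllegalStateException(
                            "Memory limit exceeded in context '" + n.name + "'");
                }
            }
            for (MemoryContextNode n = this; n != null; n = n.parent) {
                n.usedBytes += bytes;
            }
        }
    }

    // Drop everything this subtree still holds, e.g. when a stage or the query ends.
    void reset() {
        synchronized (lock) {
            long held = usedBytes;
            for (MemoryContextNode n = parent; n != null; n = n.parent) {
                n.usedBytes -= held;
            }
            clearSubtree(this);
        }
    }

    private static void clearSubtree(MemoryContextNode node) {
        node.usedBytes = 0;
        for (MemoryContextNode child : node.children) {
            clearSubtree(child);
        }
    }

    long usedBytes() {
        synchronized (lock) {
            return usedBytes;
        }
    }
}
```

Resetting a query-level node in this sketch also clears its stage and temporary children and subtracts their usage from the system-level root.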
How to unify the interface for allocating memory everywhere?
- Each Query has an Allocator instance, and one Allocator corresponds to multiple MemoryContexts
- How should the Allocator be passed to each call site: as a parameter, or through a context? (Analysis: passing it as a parameter may make the refactoring cost relatively high; putting it in a ThreadLocal context is recommended, but cross-thread propagation needs to be considered)
- How to obtain the corresponding Allocator to return memory to when objects are released manually
- When the JVM automatically reclaims a wrapper object, the underlying data object must also be released; how to obtain the corresponding Allocator to return it to
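The ThreadLocal option and its cross-thread concern can be illustrated with a small sketch. `AllocatorContext` and `QueryAllocatorRef` are hypothetical names; the latter is only a placeholder for whatever Allocator type is eventually designed. The idea is to capture the caller's binding when a task is created and re-bind it on the worker thread that runs the task.

```java
import java.util.concurrent.Callable;

// Placeholder for the per-query Allocator; only carries an id in this sketch.
final class QueryAllocatorRef {
    final String queryId;

    QueryAllocatorRef(String queryId) {
        this.queryId = queryId;
    }
}

// Hypothetical thread-local holder for the current query's allocator.
final class AllocatorContext {
    private static final ThreadLocal<QueryAllocatorRef> CURRENT = new ThreadLocal<>();

    private AllocatorContext() {
    }

    static void bind(QueryAllocatorRef ref) {
        CURRENT.set(ref);
    }

    static void unbind() {
        CURRENT.remove();
    }

    static QueryAllocatorRef current() {
        QueryAllocatorRef ref = CURRENT.get();
        if (ref == null) {
            throw new IllegalStateException("No allocator bound to this thread");
        }
        return ref;
    }

    // Capture the caller's allocator now and re-bind it on whichever thread runs the task.
    static <T> Callable<T> propagate(Callable<T> task) {
        QueryAllocatorRef captured = current();
        return () -> {
            QueryAllocatorRef previous = CURRENT.get();
            CURRENT.set(captured);
            try {
                return task.call();
            } finally {
                // Restore the old state so pooled worker threads do not keep
                // another query's allocator after the task finishes.
                if (previous == null) {
                    CURRENT.remove();
                } else {
                    CURRENT.set(previous);
                }
            }
        };
    }
}
```

A task submitted to an executor without `propagate` would find no allocator on the worker thread, which is exactly the cross-thread problem noted in the analysis.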
Is memory concurrency safe between multiple threads?
- Ensure that multi-threaded concurrent access is safe
- Ensure that multi-threaded concurrent release is safe
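For concurrent release, one common technique is to make release idempotent so that a block's bytes are returned exactly once even if several threads race to free it. The sketch below assumes a hypothetical `MemoryBlockHandle` that shares an `AtomicLong` usage counter with its owning pool; neither the class nor the counter layout is part of any existing design.

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical handle for one allocated block.
final class MemoryBlockHandle {
    private final long sizeBytes;
    private final AtomicLong poolUsedBytes; // usage counter shared with the owning pool
    private final AtomicBoolean released = new AtomicBoolean(false);

    // Creating the handle accounts for the block in the shared counter.
    MemoryBlockHandle(long sizeBytes, AtomicLong poolUsedBytes) {
        this.sizeBytes = sizeBytes;
        this.poolUsedBytes = poolUsedBytes;
        poolUsedBytes.addAndGet(sizeBytes);
    }

    // Safe to call from several threads: only the first caller returns the bytes.
    // Returns true for that caller and false for every later one.
    boolean release() {
        if (released.compareAndSet(false, true)) {
            poolUsedBytes.addAndGet(-sizeBytes);
            return true;
        }
        return false;
    }

    boolean isReleased() {
        return released.get();
    }
}
```

The same guard would also protect against a manual release racing with an automatic release triggered when the wrapper object is reclaimed.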
How to release memory as quickly as possible?
- Memory that is clearly not needed by subsequent steps can be released as soon as possible
- Temporary memory used in a loop can be allocated from a separate MemoryContext
- When the Query ends, release all objects allocated by the corresponding Allocator
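The loop-scoped and query-scoped release points above map naturally onto try-with-resources. The following sketch assumes a hypothetical `MemoryScope` class that is not thread-safe and only drops buffer references on close; a real implementation would return the blocks to the pool instead.

```java
import java.nio.ByteBuffer;
import java.util.ArrayList;
import java.util.List;

// Hypothetical scope: everything allocated through it is dropped on close().
final class MemoryScope implements AutoCloseable {
    private final MemoryScope parent;
    private final List<MemoryScope> children = new ArrayList<>();
    private final List<ByteBuffer> buffers = new ArrayList<>();

    private MemoryScope(MemoryScope parent) {
        this.parent = parent;
    }

    // Top-level scope whose lifetime matches one query.
    static MemoryScope forQuery() {
        return new MemoryScope(null);
    }

    // Child scope for short-lived memory, e.g. one loop iteration.
    MemoryScope temporary() {
        MemoryScope child = new MemoryScope(this);
        children.add(child);
        return child;
    }

    ByteBuffer allocate(int size) {
        ByteBuffer buffer = ByteBuffer.allocateDirect(size);
        buffers.add(buffer);
        return buffer;
    }

    // Bytes held by this scope and all child scopes that are still open.
    long usedBytes() {
        long total = 0;
        for (ByteBuffer buffer : buffers) {
            total += buffer.capacity();
        }
        for (MemoryScope child : children) {
            total += child.usedBytes();
        }
        return total;
    }

    @Override
    public void close() {
        // Iterate over a copy because each child removes itself from `children`.
        for (MemoryScope child : new ArrayList<>(children)) {
            child.close();
        }
        buffers.clear();
        if (parent != null) {
            parent.children.remove(this);
        }
    }
}
```

A usage pattern under these assumptions:

```java
try (MemoryScope query = MemoryScope.forQuery()) {
    query.allocate(256);                 // lives until the query ends
    for (int i = 0; i < 3; i++) {
        try (MemoryScope tmp = query.temporary()) {
            tmp.allocate(64);            // dropped at the end of each iteration
        }
    }
}
```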
Module portability
- In principle, the memory management module itself does not depend on HugeGraph's code
- The module should be smoothly portable to HugeGraph-Computer in the future
Statistics and monitoring (optional)
- Statistics on memory usage
- Statistics on the memory usage of various objects
- Statistics on which objects, and which uses of them, are leaking memory
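A minimal sketch of such statistics, assuming allocations are tagged with a free-form type name such as "vertex" or "edge". `MemoryStats` and its callback names are hypothetical; any bytes still outstanding when a query ends would be candidates for a leak report.

```java
import java.util.Map;
import java.util.TreeMap;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.LongAdder;

// Hypothetical per-query statistics, keyed by an object-type tag.
final class MemoryStats {
    private final Map<String, LongAdder> liveBytesByType = new ConcurrentHashMap<>();

    void onAllocate(String type, long bytes) {
        liveBytesByType.computeIfAbsent(type, k -> new LongAdder()).add(bytes);
    }

    void onRelease(String type, long bytes) {
        liveBytesByType.computeIfAbsent(type, k -> new LongAdder()).add(-bytes);
    }

    // Total bytes currently accounted for across all types.
    long totalBytes() {
        long total = 0;
        for (LongAdder adder : liveBytesByType.values()) {
            total += adder.sum();
        }
        return total;
    }

    // Types that still hold bytes, sorted by name.
    // Called at query end, a non-empty result points at possible leaks.
    Map<String, Long> outstanding() {
        Map<String, Long> result = new TreeMap<>();
        for (Map.Entry<String, LongAdder> entry : liveBytesByType.entrySet()) {
            long bytes = entry.getValue().sum();
            if (bytes != 0) {
                result.put(entry.getKey(), bytes);
            }
        }
        return result;
    }
}
```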

Overall Design

Overall, it can be divided into 3 modules:

Memory management implementation module. Implement the life cycle management of memory objects, memory capacity restrictions and other functions, and provide the Allocator interface (including allocation, release interface). This is a relatively independent module.
Integrate the Allocator module into the HugeGraph context and provide a unified interface for memory transformation.
Transform the places where a large amount of memory is occupied, and adapt to use Allocator for object allocation and release.
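To make the third module concrete, the sketch below shows a collection adapted to take its storage from an allocator instead of the Java heap. Both `BlockAllocator` and `OffHeapLongList` are hypothetical: the real Allocator interface is still to be designed, and the two-method interface here is only an assumed shape for illustration.

```java
import java.nio.ByteBuffer;

// Assumed allocator shape; the actual interface is not yet defined.
interface BlockAllocator {
    ByteBuffer allocate(int sizeBytes);

    void release(ByteBuffer block);
}

// Growable list of longs whose backing storage comes from the allocator.
final class OffHeapLongList implements AutoCloseable {
    private final BlockAllocator allocator;
    private ByteBuffer data;
    private int size;

    OffHeapLongList(BlockAllocator allocator, int initialCapacity) {
        this.allocator = allocator;
        this.data = allocator.allocate(Math.max(1, initialCapacity) * Long.BYTES);
    }

    void add(long value) {
        if ((size + 1) * Long.BYTES > data.capacity()) {
            grow();
        }
        // Absolute put: the buffer's own position is never used.
        data.putLong(size * Long.BYTES, value);
        size++;
    }

    long get(int index) {
        if (index < 0 || index >= size) {
            throw new IndexOutOfBoundsException("index " + index + ", size " + size);
        }
        return data.getLong(index * Long.BYTES);
    }

    int size() {
        return size;
    }

    // Double the capacity, copy the used prefix, and return the old block.
    private void grow() {
        ByteBuffer bigger = allocator.allocate(data.capacity() * 2);
        ByteBuffer source = data.duplicate();
        source.position(0);
        source.limit(size * Long.BYTES);
        bigger.put(source);
        allocator.release(data);
        data = bigger;
    }

    // Return the backing block to the allocator; the list must not be used afterwards.
    @Override
    public void close() {
        allocator.release(data);
        data = null;
        size = 0;
    }
}
```

Because every block passes through the allocator, growth of the collection is visible to the per-query accounting, which is what lets a limit be enforced on algorithm memory.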

Detailed Design

TODO

Points to consider:

Research options
Layering and sub-module division
Interface definition: the interface for creating an Allocator and the interface for using it
