# State of the ART: The Adaptive Radix Tree Index for Main-Memory Databases

## JONAS FRITSCH, Technische Universtität München

With recent trends in hardware, main memory capacities have grown to an extent where most traditional DBMS can fit entirely into memory. This change introduced a new shift of the performance bottleneck from disk-based I/O to main memory access.

While previous index structures like the B-Tree were optimized for minimizing disk access, the adaptive radix tree (ART) is a trie-based index structure designed explicitly for in-memory usage. It utilizes newer architecture features like SIMD and caching effectively and compresses its structure dynamically, both horizontal and vertical. With these measures, ART achieves a performance that beats other state-of-the-art order-preserving indexes in both insertion and single-lookup time while having also having a lower memory footprint.

### 1 INTRODUCTION

The architecture of DBMS has constantly been evolving due to advances in hardware. Over the last few decades, main memory capacities increased from several megabytes up to thousands of gigabytes, such that nowadays, databases can fit entirely into main memory. This change significantly impacted the general architecture of DBMS, which resulted in performance improvements by several factors [4], [11].

The design of index structures used to query a set of data more efficiently was heavily influenced by the main performance bottleneck of disk I/O in traditional disk-based DBMS. Original index structures like the B-Tree designed to minimize disk accesses perform poorly in an in-memory environment. The T-Tree [6] was one of the first index structures proposed for main memory DBMS. However, over the last 35 years, the hardware landscape changed dramatically, causing T-Trees and all other index structures not explicitly designed with caching effects in mind to be rendered inefficient [8]. Further focus on developing cache-sensitive index structures resulted in many different search tree variants.

Cache-sensitive search trees (CSS-Trees) [8], while utilizing cache lines efficiently, introduce a significant overhead for updates as the tree is compactly stored in an array. Similarly, the more recent k-ary search tree [10] and the Fast Architecture Sensitive Tree (FAST) [5] both utilize Single Instruction Multiple Data (SIMD) instructions for data-level parallelism (DLP) to increase performance. However, as static data structures, they do not support incremental updates. A way to circumvent this limitation is to use a delta mechanism where another data structure stores differences and is periodically merged into the static structure. This comes at an additional performance cost. The cache-conscious  $B^+$ -Tree (CSB+) [9] introduced as a variant of  $B^+$ -Trees improves cache utilization by reducing the need to store all different child pointers in each node.

Hash-Tables have been a popular indexing choice for main memory databases as they provide optimal O(1) - as opposed to  $O(\log n)$  for search trees - single-lookup and update time on average. Many different hashing schemes and hash functions have been developed

Author's address: Jonas Fritsch, Technische Universtität München, Fakultät für Informatik, fritsch@in.tum.de.



Fig. 1. A trie with span 8 (1 byte per character) storing the keys HELLO, HAT, and HAVE on the left and its radix tree variant on the right. Nodes marking the end of a word are outlined.

over time, but hash-tables generally do not support any range-based queries due to the nature of hash functions. Additionally, hash-tables can require complete re-hashing with O(n) complexity upon reaching its load balance.

Another possible data structure used for indexing is a trie, also called prefix tree. Tries are search trees with the difference that keys are inserted in pieces (partial keys). This means that two keys sharing a common prefix will have the same path from the root until their next partial key differs. A radix tree is a variant of a trie that further compactifies its structure by compressing nodes for partial keys that only have one child, as shown in Figure 1. With radix trees, some form of leaf nodes holding the compressed rest key is required to restore full keys.

Tries have the following interesting properties:

- Tree height and complexity do not depend on the number of keys *n* stored but rather the key length *k*. As we will see later in Section 4, this means that its performance is mainly impacted not by the amount of keys present as in other search trees but by the skewness of these keys.
- Tries require no rebalancing.
- All insertion orders result in the same tree.
- Keys are stored in lexicographic order.
- Keys are stored implicitly along paths. (This is not directly true for radix trees and can differ with implementations.)

The span of a trie is the number of bits making one partial key. The fanout of a node is the number of children a node can have maximum. The most efficient implementation for a trie of span s is to have a fanout of  $2^s$  on each node. This means that when storing the children in a pointer array of size  $2^s$ , the partial key can be



Fig. 2. Comparison of tree height versus space consumption for different radix tree spans *s* when storing 1M uniformly distributed 32 bit integers with 8 byte pointers. Source: [7]

used directly as index into this array to find the next child without having to make any comparisons. With this, the height of a given trie storing keys of k bits is bound by  $\lceil k/s \rceil$ , and increasing the span results in a lower trie height which is desirable as the height dictates the time complexity of almost all operations.

On the other hand, increasing the span results in an exponentially higher fanout, thus requiring more space as even if few children exist, the array will be filled with null pointers wasting memory. For this reason, having a very high span is generally impractical. The Generalized Prefix Tree (GPT) [2] has a span of 4, and the radix tree implementation used in the Linux kernel uses 6 bits [3]. The Adaptive Radix Tree (ART) [7], as depicted in Figure 3, while using a span of 8, manages to have both fairly low memory consumption as well as a small tree height.

The key idea in lowering memory consumption while maintaining a high span for tries is to adaptively use nodes with different fanouts based on the number of actual children. One of the first data structures to utilize such adaptive nodes was the Judy Array [1] which was invented by Doug Baskins and developed at Hewlett-Packard research labs as a general associative array. However, due to its patent and overall complexity, it has not yet been researched as much. Also, its original, almost two-decade-old design has flaws like assuming 16 byte cache-line sizes or not utilizing SIMD.

The rest of this paper is organized as follows. We first introduce the Adaptive Radix Tree as an index structure and explain its key design features. In Section 3, we show how to convert different key types to binary-comparable keys to index in ART. Section 4 evaluates ART by comparing its performance and memory consumption first in micro-benchmarks and then the TPC-C benchmark. Section 5 discusses related work and research done since the original proposition of ART back in 2013. The final part draws a conclusion and discusses future work.

## 2 ADAPTIVE RADIX TREE

The Adaptive Radix Tree (ART) was first proposed by Leis et al. [7] as a performant, order-preserving in-memory index structure. As a radix tree using multiple different node types with different fanouts, it applies vertical compression to reduce the overall tree height and improve performance, as well as horizontal compression to lower memory consumption.



Fig. 3. Different node structures used by ART. For each node, the partial keys 0, 2, 3, and 255 are mapped to the subtrees a, b, c, and d, respectively. Source: [7]

## 2.1 Horizontal Compression (Adaptive Nodes)

Similar to Judy Arrays, ART uses a number of different node types, each with a different fanout but the same span of 8, to adaptively change between them based on the actual number of child nodes. This way, less space is wasted storing null pointers. Additionally, storing the children in sorted order allows for range scans along nodes. ART uses four different node types illustrated in Figure ?? and named after the maximum number of children they can store. For better cache line utilization during searches ART stores partial keys and child pointers in two seperate arrays as opposed to storing key-pointer pairs in one single array:

**Node4:** Stores up to 4 children by maintaining a sorted array of size 4 of partial keys where the searched index in the key array is directly used as index into the child pointer array.

**Node16:** This node stores up to 16 children in the same way as Node4. Searching for a key can be done efficiently using SIMD instructions.

**Node48:** As the number of partial keys to differentiate increases, searches become more expensive, even in sorted order. Therefore Node48 uses a full 256-sized byte array to hold all possible partial keys. However, as this node only stores up to 48 children, our child pointer array is only of size 48. ART thus stores the index into the child pointer array as value in the key array, which is indexed directly using the partial key.

Node256: Finally, the largest node stores children in the classical trie approach with a child pointer array of size 256 so that, similarly to Node48, the partial key is used directly as index.

## 3 CONSTRUCTING BINARY-COMPARABLE KEYS Lorem ipsum.

## 4 EVALUATION

Lorem ipsum.

#### 5 RELATED WORK

Lorem ipsum.

## 6 CONCLUSION AND FUTURE WORK

Lorem ipsum.

## **REFERENCES**

- [1] Doug Baskins. 2004. Judy arrays. Retrieved July 23, 2022 from http://judy. sourceforge.net/
- [2] Matthias Boehm, Benjamin Schlegel, Peter Benjamin Volk, Ulrike Fischer, Dirk Habich, and Wolfgang Lehner. 2011. Efficient in-memory indexing with generalized prefix trees. In Datenbanksysteme für Business, Technologie und Web (BTW), Theo Härder, Wolfgang Lehner, Bernhard Mitschang, Harald Schöning, and Holger Schwarz (Eds.). Gesellschaft für Informatik e.V., Bonn, 227-246.
- [3] Jonathan Corbet. 2006. Trees I: Radix trees. Retrieved July 23, 2022 from https: //lwn.net/Articles/175432/

- [4] Stavros Harizopoulos, Daniel J. Abadi, Samuel Madden, and Michael Stonebraker. 2008. OLTP through the Looking Glass, and What We Found There. In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data (Vancouver, Canada) (SIGMOD '08). Association for Computing Machinery, New York, NY, USA, 981-992. https://doi.org/10.1145/1376616.1376713
- [5] Changkyu Kim, Jatin Chhugani, Nadathur Satish, Eric Sedlar, Anthony D. Nguyen, Tim Kaldewey, Victor W. Lee, Scott A. Brandt, and Pradeep Dubey. 2010. FAST: Fast Architecture Sensitive Tree Search on Modern CPUs and GPUs. In Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data (Indianapolis, Indiana, USA) (SIGMOD '10). Association for Computing Machinery, New York, NY, USA, 339-350. https://doi.org/10.1145/1807167.1807206
- [6] Tobin J Lehman and Michael J Carey. 1985. A study of index structures for main memory database management systems. Technical Report. University of Wisconsin-Madison Department of Computer Sciences.
- Viktor Leis, Alfons Kemper, and Thomas Neumann. 2013. The adaptive radix tree: ARTful indexing for main-memory databases. In 2013 IEEE 29th International Conference on Data Engineering (ICDE). 38-49. https://doi.org/10.1109/ICDE.2013.
- [8] Jun Rao and Kenneth A Ross. 1998. Cache conscious indexing for decision-support in main memory. (1998). https://doi.org/10.7916/D8T441ZB
- [9] Jun Rao and Kenneth A. Ross. 2000. Making B+- Trees Cache Conscious in Main Memory. In Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data (Dallas, Texas, USA) (SIGMOD '00). Association for Computing Machinery, New York, NY, USA, 475-486. https://doi.org/10.1145/342009.335449
- [10] Benjamin Schlegel, Rainer Gemulla, and Wolfgang Lehner. 2009. K-Ary Search on Modern Processors. In Proceedings of the Fifth International Workshop on Data Management on New Hardware (Providence, Rhode Island) (DaMoN '09). Association for Computing Machinery, New York, NY, USA, 52-60. https://doi. org/10.1145/1565694.1565705
- Hao Zhang, Gang Chen, Beng Chin Ooi, Kian-Lee Tan, and Meihui Zhang. 2015. In-Memory Big Data Management and Processing: A Survey. IEEE Transactions on Knowledge and Data Engineering 27, 7 (2015), 1920-1948. https://doi.org/10. 1109/TKDE.2015.2427795