This is a modified version of PostgreSQL with just-in-time compiler for expressions. Current base version is 9.6.1.
Switch branches/tags
Clone or download
Pull request Compare This branch is 240 commits ahead, 4960 commits behind postgres:master.

README.md

PostgreSQL with JIT compiler for expressions

This is a modified version of PostgreSQL with just-in-time compiler for expressions. Current base version is 9.6.1. Just-in-time compiler (JIT for short) improves database performance significantly on OLAP queries. This version uses LLVM JIT for query compilation and shows up to 20% improvement on TPC-H tests.

Dependencies

  • LLVM version 3.7

Building

The easiest way is to follow the process of building vanilla PostgreSQL. LLVM JIT compiler for expressions is enabled by default.

The most basic procedure is as follows:

$ git clone https://github.com/ispras/postgres
$ cd postgres
$ ./configure [configure options]
$ make [make options] -jN

Configure options

option description
--disable-llvm-jit Disables expression compilation with LLVM JIT.
--with-llvm-config=<llvm_config> Specify your own llvm-config, version 3.7.0 or 3.7.1 required for build.

Make options

option description
LLVM_BACKEND_FILE_LIMIT=<N> Parallelize compilation by splitting LLVM backend file, N — number of functions in each file (50 by default).

PostgreSQL GUC settings

setting description
enable_llvm_jit Enable LLVM JIT of expressions (enabled by default).
enable_llvm_dump Dump compiled LLVM IR (developer option, disabled by default).
debug_llvm_jit Disable optimization of generated LLVM modules (developer option, disabled by default).

Internals

Internally, evaluation of each individual expression in PostgreSQL happens by means of calling a function pointer which is stored in the expression object and initialized during expression initialization phase with the address of the corresponding execution function.

Expression JIT presented here works by hooking into expression initialization phase and replacing this pointer with address of a function generated for the expression at run time.

Advice for developers

  • Enabling assertions (configure with --enable-cassert option) also turns on LLVM module verification.
  • Set enable_llvm_dump and debug_llvm_jit to on to explore compiled code.

LLVM dumps are stored in the llvm_dump subdirectory in the database directory. Each compiled expression is written into three files (substitute ### with expression number):

  • expr.### — expression tree, in the “pretty-print” format.
  • dump.###.ll, dump.###.opt.llLLVM assembly files generated for expression.

Resources

License

PostgreSQL License