Parabix LLVM

This repository contains the implementation for the thesis "High-Performance Regular Expression Matching with Parabix and LLVM" which can also be found here.

This project was done a part of TUM Database Implementation practical course.

Implementation

This repository contains both iterative and LLVM codegen approaches for Parabix, they are located at parabix_cpp and parabix_llvm relatively.

You may also want to check the Parabix compiler (parabix_compiler.cc) that generates a code by LLVM IRBuilder API.

Presentation

You can find the PDF document here used during the presentation.

Benchmark

Relative files are generator and benchmark.

size/algo	std::regex	parabix-ccp	parabix-llvm
10MB	0.22	0.12	0.016
100MB	2.2	1.2	0.12
500MB	11	6	0.6
1GB	23	13	1.2
1.2GB	25	15	1.4

NOTE: Time to read input data from a file is excluded from the elapsed times. The pattern is a[0-9]*z.

Wanna try?

mkdir build
cd build
cmake ../
# generate input file
ninja generator
./generator 1000 ../1gb.txt
# run benchmark
ninja benchmark
./benchmark
# run vgrep
ninja vgrep_llvm
./vgrep_llvm ../1gb.txt "a[0-9]*z"

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
cmake		cmake
generator		generator
include		include
perfevent @ f34b43a		perfevent @ f34b43a
presentation		presentation
src		src
test		test
tools		tools
vendor		vendor
.clang-format		.clang-format
.clang-tidy		.clang-tidy
.gitignore		.gitignore
.gitlab-ci.yml		.gitlab-ci.yml
.gitmodules		.gitmodules
CMakeLists.txt		CMakeLists.txt
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Parabix LLVM

Implementation

Presentation

Benchmark

Wanna try?

About

Releases

Packages

Languages

ahmadov/parabix

Folders and files

Latest commit

History

Repository files navigation

Parabix LLVM

Implementation

Presentation

Benchmark

Wanna try?

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages