Faster Turtle Parsing using Compile Time Regexes #307

- We now have two tokenizer's, one using Google's RE2 and one using hanickadot (Hana Dusikova's) CTRE (compile time regex) library. - The CTRE tokenizer is faster but currently only supports prefixes that contain only ascii character's - for exactly this reason they have to be explicitly activated in the settings file - Added Unit Tests for the CTRE Tokenizer Some of them are commented out, because they test the currently unsupported none-ascii prefixes - Implemented the Regexes for using correct prefixes in CTRE in the UTF8RegexTest.cpp file but don't use them in actual code because they bloat up the compile-time by an unacceptable amount. - Included the two Parsing Modes (CTRE with relaxed prefixes and Google Re2 as before) into the IndexBuilderMain

Also use the ctre parser in the e2e tests.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Faster Turtle Parsing using Compile Time Regexes #307

Faster Turtle Parsing using Compile Time Regexes #307

Commits on Jan 30, 2020