
Peg parser optimizations #2878

Closed
dolik-rce opened this issue Feb 19, 2021 · 9 comments

Comments

@dolik-rce
Contributor

Following up from discussion in #2866 about peg parser speed and possible optimization, I'd like to use this issue to share some of my findings and to discuss possibilities for optimization.

As a first experiment, I wrote a simple script that optimizes the grammar. It reduces the number of rules in the grammar, which in turn means fewer allocations and less overhead. The real-world results are quite good: for the Kotlin parser, the grammar shrinks to less than half its size (from 517 to 222 rules) and the runtime is ~40% faster.
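To illustrate the kind of rewrite such a script can perform, the hypothetical rules below (invented for illustration, not taken from the actual Kotlin grammar) show how trivial wrapper rules that are referenced only once can be inlined into their caller, removing a level of indirection and the allocations that go with it:

```
# before: three rules, two of them trivial wrappers
sign    <- '+' / '-'
digits  <- [0-9]+
number  <- sign? digits

# after: the wrappers are inlined, leaving a single rule
number  <- ('+' / '-')? [0-9]+
```

Fewer rules means fewer rule invocations per input character, which is where the reduced allocation overhead comes from.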

This approach is very basic and could be improved considerably, especially if it were implemented directly in packcc. However, that is probably not going to happen, since one of packcc's goals is to generate readable parsers, which would no longer be true if the grammar were heavily optimized.

Another option is to add a script like this to the ctags build process, preprocessing the peg files before they are compiled. The full original grammar could still be used in debug mode, to make development easier.

The 40% speed-up is nice, but it is still orders of magnitude slower than the custom C parsers. I'll definitely continue to look into the packcc internals; I believe there must be some way to make the generated parsers faster (and to reduce the memory overhead).

@dolik-rce
Contributor Author

I've discovered a couple of places where realloc is called unnecessarily. Fixing that shaves off about 13% of the runtime. I will send a PR to packcc soon.

@masatake: By the way, do you have any plans to migrate to the upstream packcc?

@dolik-rce
Contributor Author

I also have a very strong suspicion that the allocations can be optimized much further. By logging all de/allocations, I discovered that there are significantly more calls to free than to malloc. For a very simple Kotlin script containing just the single line val x=1, there are:

  • 4615 calls to malloc
  • 43 calls to realloc
  • 5273 calls to free

The log file also shows that there is many small (64 bytes or less) objects (lr answers, captures, thunks, chunks, ...) that are alocated only to be immediately freed again as the algortihm backtracks. I have a strong suspicion, that a simple memory pool (or object pool) would help a lot. I'll try to create some proof of concept implementation, at least for some of the structures, just to be able to measure the performance impact.

@masatake
Member

@masatake: By the way, do you have any plans to migrate to the upstream packcc?

Yes. We should use the upstream version.
However, there are some ctags specific changes in our version.
See #2866 (comment) .

@masatake
Member

I found that the object pool must be implemented inside struct foo_context_tag.
A file-scope static pool_t pool64 = ... is not allowed, because being thread-safe and reentrant is one of the features of packcc.

@dolik-rce
Contributor Author

I implemented a very simple fixed-size memory pool and modified packcc to use it for all allocations of pcc_lr_answer_t, which is the most frequently allocated type. The resulting Kotlin parser runs about 10% faster. Converting all of the allocations could yield (by my wild guess) a 30-35% speedup.

However, more testing is needed, since the pool is not yet suitable for general use. Some of the Kotlin files I use for testing require quite a lot of memory, while others need only a few thousand simultaneously allocated objects. There is no reasonable limit to set as a default size, so the pool must be dynamic, which complicates the implementation quite a bit. Only proper testing will reveal whether the pool's overhead ends up lower than using malloc/free in the first place...

@dolik-rce
Contributor Author

Yes. We should use the upstream version.

Ok, I will suggest the optimizations directly to upstream. You can either merge them to ctags later or use them after ctags switches to upstream packcc.

Thread-safe and reentrant is one of the features of packcc.

I wasn't aware of that, but since I followed the general code style used in packcc, the pool actually turned out to be both thread-safe and reentrant :-)

@masatake
Member

All the changes made by @dolik-rce have been merged into the upstream project.
Through the merging process, I watched whether the upstream project is healthy.
I think it is mostly healthy.

Now we must move to the upstream version.
The first thing we have to do is solve universal-ctags/packcc#5:

  1. Export all changes made in https://github.com/universal-ctags/packcc to the upstream project.
  2. Rebase https://github.com/universal-ctags/packcc onto the upstream.
  3. Update the build system of ctags.

@masatake
Member

I listed the changes developed in the u-ctags project:
universal-ctags/packcc#5

@masatake
Member

masatake commented Jun 1, 2021

I think now we can close this.
