Currently, to translate word_id -> word_str (done for each key in each selected row, potentially millions of times per select), we need to read-lock the global dictionary shard.
This incurs significant overhead just for the locks themselves.
And in cases where contention might be high (when per-repacker caching is inefficient, e.g. nginx URLs), it might also slow down the processing of new packets.
The lock is only needed because we use std::deque to find a word by offset, and the deque might be inserted into while we're reading, changing its internal structure.
The proposed idea is to rework the dictionary shard to use a single contiguous mmap()-ed memory region, enabling a fully lockless read-at-offset path (since the pointer to the mmap()-ed region never changes).
The word_t size is smaller than a modern x86 CPU's cache line, but thanks to the strong x86 cache-coherence model this can at worst incur a performance penalty, not compromise correctness.
As far as I understand it, anyway :)