Query latency impact from linux disk read ahead option #12166

jasperjiaguo · 2023-12-16T21:35:24Z

Recently we discovered that Pinot query latency can be impacted by the value of Linux's read_ahead_kb. Specifically, we saw a very high page fault count and a severe tail latency spike when read_ahead_kb was set to a larger value in certain Linux distributions. As read_ahead_kb controls the read-ahead during access to mmapped files, we think a larger read-ahead value harms queries with more random data access patterns. Theoretically it might benefit the opposite pattern, but we have yet to see such a case. I think there are a few things that might be worth doing:

In the short term, we add this as a tip for Pinot admins in the OSS doc, so that it becomes public knowledge
In the long term, we may explore controlling this programmatically (like madvise in C), but it might be harder to do in Java
Revisit the mmap-based segment cache
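
To make the first two items concrete, here is a minimal sketch in Python (not Pinot code, and not how #13721 implements it). It assumes a Linux host where the device's setting is exposed at `/sys/block/<device>/queue/read_ahead_kb`, and Python 3.8+ for `mmap.madvise`. The function names and the `sysfs_root` parameter are hypothetical, introduced only for illustration.

```python
import mmap
from pathlib import Path


def read_ahead_kb(device: str, sysfs_root: str = "/sys/block") -> int:
    """Return the read-ahead setting (in KiB) for a block device such as 'sda'.

    sysfs_root is a hypothetical parameter so the lookup can be pointed
    at a different directory (for example in tests).
    """
    path = Path(sysfs_root) / device / "queue" / "read_ahead_kb"
    return int(path.read_text().strip())


def map_with_random_advice(path: str) -> mmap.mmap:
    """Map a non-empty file read-only and, where supported, advise random access."""
    with open(path, "rb") as f:
        # Length 0 maps the whole file; the mapping stays valid after f closes.
        mapped = mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ)
    # MADV_RANDOM is only defined on platforms that provide it.
    if hasattr(mmap, "MADV_RANDOM"):
        # Hint to the kernel that pages will be touched in random order,
        # so aggressive read-ahead is unlikely to help for this mapping.
        mapped.madvise(mmap.MADV_RANDOM)
    return mapped
```

An admin could call `read_ahead_kb("sda")` to check the current value before tuning it, while the per-mapping `madvise` hint is the kind of programmatic control the second item refers to. Whether the hint actually helps depends on the kernel and the workload.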

Similar issues/analysis:
https://smalldatum.blogspot.com/2014/05/the-impact-of-read-ahead-and-read-size.html
elastic/elasticsearch#27748


jasperjiaguo changed the title ~~Locality impact from linux disk read ahead option~~ Query latency impact from linux disk read ahead option Dec 16, 2023

Jackie-Jiang added the performance label Dec 19, 2023

dario-liberman mentioned this issue May 28, 2024

Segment memory usage for unused columns #13242

Open

dinoocch mentioned this issue Jul 30, 2024

Support madvise for MmapMemory #13721

Merged

3 tasks

siddharthteotia assigned jasperjiaguo Aug 13, 2024

jasperjiaguo closed this as completed in #13721 Aug 15, 2024
