[Performance] Modify FD flags as data become progressively cold

RocketMQ is designed to be **write friendly**, using commit log file sequence to store all topics served by the broker. This works well since the majority of the read are served with data from page-cache, thus random read is not involved. 

Unfortunately, it is not always the case as more workloads are served. If a few subscribers read data that were kicked out of page-cache, severe read amplification and page-cache pollution are observed, bringing substantial impact to IO bandwidth, resulting in broker busy from time to time.

Page-cache pollution, at present, cannot be fixed unless we turn to Direct IO and manage cache manually, now that flag POSIX_FADV_NOREUSE of posix_madvise in Linux is **[no-op](https://github.com/torvalds/linux/blob/master/mm/fadvise.c#L109)**, which is **not** in sync with [its doc](https://linux.die.net/man/2/posix_fadvise).  Read amplification may be mitigated through posix_madvise, yet by way of JNI. 

Along the way to identifying the performance issue, I found Lucene/elastic-search developers came across this issue years ago
https://github.com/elastic/elasticsearch/issues/27748
https://blog.mikemccandless.com/2010/06/lucene-and-fadvisemadvise.html
 



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Performance] Modify FD flags as data become progressively cold #4465

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Performance] Modify FD flags as data become progressively cold #4465

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions