
Write-through inter-block cache #1947

Closed
cwgoes opened this issue Aug 9, 2018 · 15 comments · Fixed by #4748

@cwgoes
Contributor

cwgoes commented Aug 9, 2018

Many values in state are frequently read but infrequently written, such as global parameters, or written at the end of one block and read in the next, such as governance proposal queues. We should have a write-through inter-block (persistent) cache which prevents these kinds of reads from hitting the disk.
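
To make this concrete, here's a minimal sketch of what such a cache could look like, written over a simplified key-value interface. This is an illustration only: the `KVStore` interface and all names below are assumptions for the sketch, not the SDK's actual store types.

```go
package cache

import "sync"

// KVStore is a simplified stand-in for the underlying store interface;
// the SDK's real interface differs.
type KVStore interface {
	Get(key []byte) []byte
	Set(key, value []byte)
	Delete(key []byte)
}

// WriteThroughCache serves repeated reads from memory while writing
// every update through to the underlying (persistent) store, so the
// disk is always as fresh as the cache. Because it is not reset
// between blocks, a value written at the end of block N is read from
// memory in block N+1 instead of hitting the disk.
type WriteThroughCache struct {
	mtx    sync.RWMutex
	parent KVStore
	cache  map[string][]byte
}

func NewWriteThroughCache(parent KVStore) *WriteThroughCache {
	return &WriteThroughCache{parent: parent, cache: make(map[string][]byte)}
}

func (c *WriteThroughCache) Get(key []byte) []byte {
	c.mtx.RLock()
	if v, ok := c.cache[string(key)]; ok {
		c.mtx.RUnlock()
		return v // cache hit: no disk read
	}
	c.mtx.RUnlock()

	v := c.parent.Get(key) // cache miss: read through and remember
	c.mtx.Lock()
	c.cache[string(key)] = v
	c.mtx.Unlock()
	return v
}

func (c *WriteThroughCache) Set(key, value []byte) {
	c.mtx.Lock()
	c.cache[string(key)] = value
	c.mtx.Unlock()
	c.parent.Set(key, value) // write through: parent is always current
}

func (c *WriteThroughCache) Delete(key []byte) {
	c.mtx.Lock()
	delete(c.cache, string(key))
	c.mtx.Unlock()
	c.parent.Delete(key)
}
```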

@ValarDragon
Contributor

Seems like something we should absolutely do. It can be done in a non-breaking manner as well, since it's just caching in memory (hence I've tagged it postlaunch).

@rigelrozanski
Contributor

Trying to understand what's being proposed. Are you suggesting that we add a write-through cache which reflects certain state normally held in the block state, but persists in between blocks (unless, in a rare situation, one of those parameters is updated)? So for each block the cache would still need to be verified once anyway - or maybe we could have special update functionality for any params in this cache.

@cwgoes
Contributor Author

cwgoes commented Aug 28, 2018

So for each block the cache would still need to be verified once anyway.

No, because it's a write-through cache (the underlying IAVL tree is written to as well), and blocks are executed in order. The cache can be implemented below the ABCI abstraction, just over the GoMemDB. The main advantage is in speeding up reads and iteration over frequently accessed parameters.

(maybe I misunderstand your question?)

I actually think we might want to make this prelaunch, it's not too hard and could speed up block execution substantially.

@rigelrozanski
Contributor

If it's easy to do, and you think it will speed things up substantially, then yes, it sounds like a priority - but still a last prelaunch priority which happens after we're feature complete.

@ValarDragon
Contributor

I actually think we might want to make this prelaunch, it's not too hard and could speed up block execution substantially.

Let's get the state machine complete, and mostly bug-free, then consider performance improvements. Those can easily be punted to postlaunch.

Currently we don't even have benchmarks for IAVL or our block execution. Thus I'm extremely hesitant for us to spend a while optimizing code prelaunch, when we don't even have evidence saying it's the problem. I'm currently leaning toward making any backwards-compatible performance improvement postlaunch, as our current block execution time is sufficient, and we don't have evidence of what our bottlenecks are.

The biggest bottleneck in the entire system is our current mempool design, which we have decided to fix postlaunch. The best this change would do is speed up block processing time, by a currently unknown factor, which is not at all a prelaunch concern imo. The best that faster blocks provide is faster network block time. Given that the first 3 weeks will have zero load, block time is not a concern at all, and therefore we could develop this change in that 3-week interim.

@cwgoes
Contributor Author

cwgoes commented Aug 28, 2018

Let's get the state machine complete, and mostly bug-free, then consider performance improvements.

Agreed on benchmarks first. I think we want to avoid needing to push changes within a short timeframe after launch, because we might need to react to unexpected bugs. If we can figure out how much of an issue this will be short-term, that would better inform the pre/post-launch decision.

@ValarDragon
Contributor

Do you have a particular preference for this being a write-through cache vs a write-back cache? I think it'd be a better design to make it write-back (i.e. all writes are batched in EndBlock or Commit). While we may not have a batch-write operation in IAVL, it's reasonable to think that one day we might.
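
For comparison, here is a hedged sketch of the write-back variant, reusing the illustrative `KVStore` interface from the sketch above: writes are buffered in memory and only pushed to the parent as a batch when `Flush` is called (e.g. from EndBlock or Commit). None of this is the SDK's actual API.

```go
// WriteBackCache buffers writes and flushes them to the parent store
// in one batch. Reads consult the dirty set first so uncommitted
// writes remain visible.
type WriteBackCache struct {
	mtx    sync.Mutex
	parent KVStore
	dirty  map[string][]byte // pending writes; nil marks a delete
}

func NewWriteBackCache(parent KVStore) *WriteBackCache {
	return &WriteBackCache{parent: parent, dirty: make(map[string][]byte)}
}

func (c *WriteBackCache) Get(key []byte) []byte {
	c.mtx.Lock()
	defer c.mtx.Unlock()
	if v, ok := c.dirty[string(key)]; ok {
		return v
	}
	return c.parent.Get(key)
}

func (c *WriteBackCache) Set(key, value []byte) {
	c.mtx.Lock()
	defer c.mtx.Unlock()
	c.dirty[string(key)] = value // buffered; nothing hits disk yet
}

func (c *WriteBackCache) Delete(key []byte) {
	c.mtx.Lock()
	defer c.mtx.Unlock()
	c.dirty[string(key)] = nil
}

// Flush writes the whole batch to the parent and clears the buffer.
// If the process crashes before Flush, the buffered writes are lost,
// which is the durability trade-off discussed in this thread.
func (c *WriteBackCache) Flush() {
	c.mtx.Lock()
	defer c.mtx.Unlock()
	for k, v := range c.dirty {
		if v == nil {
			c.parent.Delete([]byte(k))
		} else {
			c.parent.Set([]byte(k), v)
		}
	}
	c.dirty = make(map[string][]byte)
}
```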

@cwgoes
Contributor Author

cwgoes commented Oct 18, 2018

A write-back cache could make sense, depending on what operations the IAVL tree supports (presumably we could change the kind of cache without breaking the state machine?) - I think the more important part is just to cache reads of data that is frequently read (e.g. parameters, validator info) but not updated between blocks, which AFAIK we don't do at all at the moment.

@jackzampolin
Member

Did we implement this? Also applicable for @ultraeric

@rigelrozanski
Contributor

rigelrozanski commented May 28, 2019

not that I'm aware of

@alexanderbez
Contributor

@jackzampolin No, we do not currently have this implemented, and we can and should introduce such functionality soon (in a non-breaking manner, of course).

My general understanding is as follows:

- write-through: Slower writes, but fast/efficient reads. Best for situations that write and then (re)read data frequently.
- write-back: Faster writes (only blocks on I/O to the cache) and efficient reads, but can potentially suffer from data loss if the cache is corrupted/lost and data is not written to disk in time. Best for situations that require low latency and frequent writes.

Given that we want to keep hits to I/O at a minimum, it seems a write-back cache is more suitable. Data loss should not be a problem if we can batch deletes and writes in store/cachekv.Store#Write. But even without batching, this can still be done afaict. Correct me if I'm wrong.

Perhaps this intra-block cache can reside in the BaseApp and be fed into the context when starting a block, then committed when the block finishes.
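
Roughly, that wiring could look like the following; `baseApp`, `beginBlock`, and `commit` here are hypothetical stand-ins, not the SDK's actual BaseApp API, and the types come from the sketches above.

```go
// baseApp is a hypothetical holder for the block-scoped cache.
type baseApp struct {
	db    KVStore         // persistent backing store
	cache *WriteBackCache // created at block start, flushed at the end
}

// beginBlock creates the block's cache and hands it to whatever
// builds the context, so all state access during the block goes
// through memory.
func (app *baseApp) beginBlock() KVStore {
	app.cache = NewWriteBackCache(app.db)
	return app.cache
}

// commit batches the block's accumulated writes down to disk.
func (app *baseApp) commit() {
	app.cache.Flush()
}
```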

@ValarDragon
Contributor

This is an issue for an inter-block cache, and in that setting I think a write-through cache is more suitable; I don't think write-back is suitable there. It's not fine for a node to crash at block N + x with Tendermint state at block N + x while its state machine state is still at block N. That problem is avoided with a write-through cache.

I do agree with a write-back intra-block cache (which you suggested), but I wanted to clarify, since this is an inter-block cache issue. I do think the inter-block cache is more critical, as intra-block caching is at least semi-achieved by L3/L4 caches in servers.

@alexanderbez
Contributor

Ahhh yes, I misunderstood then. If commits aren't happening on a per-block basis, then write-through is more suitable for safety guarantees. I think we can easily achieve this through some caching type in the BaseApp that is passed along to contexts.

But, as I think of it, we already have intra-block caching, don't we? The BaseApp's deliverState and multi-store cache wrapping provide this unless I'm mistaken.
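
(For reference, the intra-block pattern described here is essentially branch-then-write: cache-wrap the store per block and per transaction, and merge changes downward only on success. Below is a sketch using the illustrative types from earlier in the thread; the SDK's real deliverState/cachekv machinery differs.)

```go
// deliverBlock illustrates nested cache wrapping: failed transactions
// simply drop their cache, leaving the block (and disk) state intact.
func deliverBlock(db KVStore, txs [][]byte, apply func(KVStore, []byte) error) {
	block := NewWriteBackCache(db) // block-level cache wrap
	for _, tx := range txs {
		txCache := NewWriteBackCache(block) // per-transaction wrap
		if err := apply(txCache, tx); err != nil {
			continue // failed tx: its writes are discarded
		}
		txCache.Flush() // successful tx: merge writes into the block cache
	}
	block.Flush() // end of block: write everything down to the parent
}
```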

@cwgoes
Contributor Author

cwgoes commented Jul 4, 2019

Bump; I'm pretty sure this will increase performance quite a bit (ref #4641).

What do you think @alexanderbez?

@alexanderbez
Contributor

Yes, this is on the top of my priority list -- plan on getting it started this weekend.

@alexanderbez alexanderbez self-assigned this Jul 5, 2019
@fedekunze fedekunze added this to the v0.37.0 milestone Jul 19, 2019
chillyvee pushed a commit to chillyvee/cosmos-sdk that referenced this issue Mar 1, 2024