
Adds calculate_incremental_accounts_hash() #29734

Merged
1 commit merged on Jan 17, 2023

Conversation

@brooksprumo (Contributor) commented Jan 17, 2023

Problem

It is not possible to calculate an incremental accounts hash.

Summary of Changes

  • Add calculate_incremental_accounts_hash()
  • Split the accounts hash cache into "full" and "incremental"
  • Support including zero-lamport accounts in the accounts hash cache
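The "full"/"incremental" cache split can be sketched as follows. This is an illustrative, hypothetical layout only (the function and enum names here are not the actual solana-labs API); it shows how one constant cache root could fan out into per-flavor subdirectories.

```rust
use std::path::{Path, PathBuf};

// Renamed cache root from this PR; the old name was "calculate_accounts_hash_cache".
const ACCOUNTS_HASH_CACHE_DIR: &str = "accounts_hash_cache";

/// Which flavor of accounts hash a cache directory serves.
/// (Illustrative name, not the actual codebase type.)
enum AccountsHashFlavor {
    Full,
    Incremental,
}

/// Hypothetical helper: map a ledger path and a hash flavor to its cache dir.
fn cache_dir(ledger_path: &Path, flavor: AccountsHashFlavor) -> PathBuf {
    let sub = match flavor {
        AccountsHashFlavor::Full => "full",
        AccountsHashFlavor::Incremental => "incremental",
    };
    ledger_path.join(ACCOUNTS_HASH_CACHE_DIR).join(sub)
}

fn main() {
    let ledger = Path::new("/ledger");
    // Matches the paths called out in the "Node Operators" note below.
    assert_eq!(
        cache_dir(ledger, AccountsHashFlavor::Full),
        Path::new("/ledger/accounts_hash_cache/full")
    );
    assert_eq!(
        cache_dir(ledger, AccountsHashFlavor::Incremental),
        Path::new("/ledger/accounts_hash_cache/incremental")
    );
    println!("ok");
}
```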

Numbers

I ran a modified validator on my dev box that calculated the incremental accounts hash in AHV after calculating the full accounts hash, and the average times were:

  • full accounts hash calculation: 12 seconds
  • incremental accounts hash calculation: 3 seconds

While these are not rigorous benchmark numbers, they do show a significant speedup with the incremental accounts hash (IAH).

Node Operators

Node operators, please delete your <ledger>/calculate_accounts_hash_cache directory (if you have one). It has been moved/renamed to <ledger>/accounts_hash_cache/full.
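The manual cleanup above could also be done programmatically. Below is a hedged sketch (not code from this PR) of removing the obsolete directory if it exists, using only `std::fs`; `remove_old_cache` is a hypothetical helper name.

```rust
use std::fs;
use std::io;
use std::path::Path;

/// Hypothetical helper: delete the pre-rename cache directory, if present.
/// The new location is <ledger>/accounts_hash_cache/full.
fn remove_old_cache(ledger_path: &Path) -> io::Result<()> {
    let old = ledger_path.join("calculate_accounts_hash_cache");
    if old.exists() {
        // Removes the directory and all its contents.
        fs::remove_dir_all(&old)?;
    }
    Ok(())
}

fn main() -> io::Result<()> {
    // Demo against a throwaway ledger directory under the OS temp dir.
    let ledger = std::env::temp_dir().join("ledger_demo");
    let old = ledger.join("calculate_accounts_hash_cache");
    fs::create_dir_all(&old)?;
    remove_old_cache(&ledger)?;
    assert!(!old.exists());
    println!("removed");
    Ok(())
}
```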

@brooksprumo brooksprumo self-assigned this Jan 17, 2023
@brooksprumo brooksprumo marked this pull request as ready for review January 17, 2023 18:03
@jeffwashington (Contributor)

How many slots were in the incremental hash calculation in your benchmark? The # of slots can range from 1–25k, I think.

@@ -2187,7 +2188,7 @@ impl<'a> AppendVecScan for ScanState<'a> {
 }

 impl AccountsDb {
-    pub const ACCOUNTS_HASH_CACHE_DIR: &str = "calculate_accounts_hash_cache";
+    pub const ACCOUNTS_HASH_CACHE_DIR: &str = "accounts_hash_cache";
Contributor

Maybe we need to delete 'calculate_accounts_hash_cache' on users' ledgers?

Contributor Author

Yeah, I was wondering how to handle this as well. I should've called it out here.

Since we'll need separate caches for "full" and "incremental" account hashes, which already breaks/changes the current path, I figured shortening this to "accounts_hash_cache" would be nice.

Want me to add code to check for "calculate_accounts_hash_cache" and delete the directory if it's found?

Contributor

Not sure. It just seems like a gotcha. This folder could be 30 GB today on mainnet-beta?

@jeffwashington (Contributor) left a comment

lgtm

@brooksprumo (Contributor Author) commented Jan 17, 2023

how many slots were in the incremental hash calculation in your benchmark? # of slots can range from 1-25k I think.

Not sure; I'll need to re-run the validator to get exact numbers. I ran it long enough for my node to produce multiple full snapshots, so the whole range of 1–25k should've been exercised. The 3-second number I quoted came from the times I checked my logs, but I'm not sure of the number of storages at each of those datapoints.

The metrics should match the number of slots though. Want me to get specific numbers?

Edit: We talked offline and looked over metrics in the logs.

@brooksprumo brooksprumo merged commit 8c62927 into solana-labs:master Jan 17, 2023
@brooksprumo brooksprumo deleted the iah/calculate-iah branch January 17, 2023 20:04