Memory-based pruning history size #4114

rphmeier · 2017-01-10T14:46:51Z

Might also come with changes to informant + a different default setting (currently 150MB)

rphmeier · 2017-01-10T15:01:07Z

ethcore/src/client/client.rs

+		// prune all ancient eras until we're below the memory target,
+		// but have at least the minimum number of states.
+		loop {
+			if state.journal_db().mem_used() <= self.config.history_mem { break }


Slight concern here about the time taken to perform this check, potentially multiple times per imported block. An optimization would be incremental bookkeeping: track the sizes of key-value pairs inserted into and deleted from the pruning overlay, rather than sweeping the whole overlay each time.
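
To illustrate the incremental bookkeeping idea, here is a minimal sketch assuming a simplified overlay keyed by byte vectors. `SizedOverlay` and its methods are hypothetical names made up for this example, not the actual journal DB types.

```rust
use std::collections::HashMap;

/// Hypothetical overlay (illustration only) that tracks the total size of
/// its entries incrementally, so reading the size never sweeps the map.
#[derive(Default)]
pub struct SizedOverlay {
    entries: HashMap<Vec<u8>, Vec<u8>>,
    /// Running total of key and value bytes (ignores HashMap overhead).
    bytes: usize,
}

impl SizedOverlay {
    /// Insert or overwrite an entry, adjusting the running total.
    pub fn insert(&mut self, key: Vec<u8>, value: Vec<u8>) {
        let key_len = key.len();
        let value_len = value.len();
        match self.entries.insert(key, value) {
            // Overwrite: the key was already counted, so swap value sizes.
            Some(old) => self.bytes = self.bytes - old.len() + value_len,
            // Fresh entry: count both key and value.
            None => self.bytes += key_len + value_len,
        }
    }

    /// Remove an entry, subtracting its size if it was present.
    pub fn remove(&mut self, key: &[u8]) -> Option<Vec<u8>> {
        let removed = self.entries.remove(key);
        if let Some(ref value) = removed {
            self.bytes -= key.len() + value.len();
        }
        removed
    }

    /// Size of all stored keys and values, read in constant time.
    pub fn mem_used(&self) -> usize {
        self.bytes
    }
}
```

The trade-off is that the counter only covers payload bytes, so it will report less than a full heap-size sweep would.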

rphmeier · 2017-01-10T15:16:55Z

Hm, apparently earliest_era isn't actually updated when marking states canonical, leading to an infinite loop...
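
For reference, a rough sketch of the loop shape with an explicit no-progress guard. `History` is a hypothetical stand-in that models each era as a single byte count. It is not the real client or JournalDB code, only an illustration of the termination conditions.

```rust
use std::collections::VecDeque;

/// Hypothetical stand-in for the journalled state history: one size in
/// bytes per era, oldest era at the front.
pub struct History {
    eras: VecDeque<usize>,
}

impl History {
    pub fn new(era_sizes: Vec<usize>) -> Self {
        History { eras: era_sizes.into_iter().collect() }
    }

    /// Total bytes used by all retained eras.
    pub fn mem_used(&self) -> usize {
        self.eras.iter().sum()
    }

    /// Number of eras (states) still retained.
    pub fn len(&self) -> usize {
        self.eras.len()
    }

    /// Drop the oldest era. Returns false if nothing was removed.
    fn prune_earliest(&mut self) -> bool {
        self.eras.pop_front().is_some()
    }

    /// Prune ancient eras until usage is at or below `mem_target`, while
    /// always keeping at least `min_states` eras. Returns the number pruned.
    pub fn prune_ancient(&mut self, mem_target: usize, min_states: usize) -> usize {
        let mut pruned = 0;
        loop {
            if self.mem_used() <= mem_target { break }
            if self.len() <= min_states { break }
            // Guard: stop if pruning made no progress, so a stale
            // "earliest era" can never cause an infinite loop.
            if !self.prune_earliest() { break }
            pruned += 1;
        }
        pruned
    }
}
```

With eras of 40, 30, 20 and 10 bytes, a target of 35 and a minimum of 1 state, two eras are pruned and 30 bytes remain.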

rphmeier · 2017-01-12T10:49:51Z

parity/snapshot.rs

@@ -170,7 +171,20 @@ impl SnapshotCommand {
 		execute_upgrades(&self.dirs.base, &db_dirs, algorithm, self.compaction.compaction_profile(db_dirs.db_root_path().as_path()))?;

 		// prepare client config
-		let client_config = to_client_config(&self.cache_config, Mode::Active, tracing, fat_db, self.compaction, self.wal, VMType::default(), "".into(), algorithm, self.pruning_history, true);
+		let client_config = to_client_config(


the fact that this stuff is duplicated so much is practically a crime :)
Since basically every subcommand needs to initialize a client (and initializing a client needs, at a minimum, some of these parameters), the commands themselves should just take a client config rather than rebuilding it each and every time.
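
A rough sketch of what that could look like, assuming a heavily reduced config. Every type, field and function name here is hypothetical and chosen for illustration. The real `to_client_config` takes many more parameters.

```rust
/// Hypothetical, heavily reduced client configuration (illustration only).
#[derive(Clone, Debug, PartialEq)]
pub struct ClientConfig {
    pub pruning_history: u64,
    pub pruning_memory_mb: usize,
    pub tracing: bool,
}

/// Hypothetical subcommands that receive a ready-made config instead of
/// each rebuilding it from raw CLI parameters.
pub struct SnapshotCommand {
    pub client_config: ClientConfig,
    pub file_path: String,
}

pub struct ExportCommand {
    pub client_config: ClientConfig,
    pub to_block: u64,
}

/// Build the config once, at the CLI boundary.
pub fn client_config_from_args(
    pruning_history: u64,
    pruning_memory_mb: usize,
    tracing: bool,
) -> ClientConfig {
    ClientConfig { pruning_history, pruning_memory_mb, tracing }
}
```

Adding a new parameter such as the memory target then touches one constructor rather than every subcommand.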

coveralls · 2017-01-12T18:06:29Z

Changes Unknown when pulling cc69ab5 on memory-pruning into master.

rphmeier · 2017-01-13T13:43:29Z

This should help with mitigating forks on the testnet: with the default setting of --pruning-memory 150, about 1200 states are stored at the head of the chain.

arkpar · 2017-01-18T10:40:36Z

parity/cli/usage.txt

+  --pruning-memory MB      The ideal amount of memory in megabytes to use to store
+                           recent states. As many states as possible will be kept
+						   within this limit, and at least --pruning-history states
+						   will always be kept. (default: {flag_pruning_memory})


The 2 lines above should be aligned with spaces.

gavofyork · 2017-01-18T11:53:14Z

ethcore/src/client/client.rs

@@ -578,9 +564,49 @@ impl Client {
 		self.db.read().write_buffered(batch);
 		chain.commit();
 		self.update_last_hashes(&parent, hash);
+
+		if let Err(e) = self.prune_ancient(state, &chain) {


just wondering if this breaks an implicit precondition that the ancient block should be marked canon before the new block is committed.

This is still analogous to the prior behavior in the case of 1 entry being pruned: journal_under was previously called before mark_canonical. Nothing in the JournalDB interface (or our implementations) should prohibit "runs" of either call without alternation.

rphmeier · 2017-01-18T12:12:40Z

My main concern about this PR is that it changes the memory usage calculations for the journal DB substantially (they now tend to be 1.5-2x lower than previously) in order to keep them fast enough to call multiple times per-block.

arkpar · 2017-01-18T12:27:42Z

@rphmeier Is that because of the additional check here: https://github.com/ethcore/parity/pull/4114/files#diff-698ad0b84c222f83a6207ea3c6d8bc07R170?
Maybe change emplace to return bool instead?

rphmeier · 2017-01-18T12:37:05Z

@arkpar That check is there because a value may exist in the overlay multiple times, but we only want to count its size once. The lower calculated memory usage is because we no longer use heap_size_of_children on everything; we just incrementally keep track of state items' sizes, so we're essentially ignoring HashMap overhead.
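
To make the "count once" point concrete, here is a small sketch assuming a reference-counted overlay. `CountedOverlay`, this `emplace` signature and `tracked_size` are hypothetical and written for this example. They are not the real MemoryDB or overlay API, though the boolean return mirrors the suggestion above.

```rust
use std::collections::HashMap;

/// Hypothetical reference-counted overlay: a value may be inserted several
/// times, but its size is added to the running total only once.
#[derive(Default)]
pub struct CountedOverlay {
    // key -> (value, reference count)
    entries: HashMap<Vec<u8>, (Vec<u8>, u32)>,
    bytes: usize,
}

impl CountedOverlay {
    /// Insert a value. Returns true if the key was new, which is the only
    /// case where the tracked size grows.
    pub fn emplace(&mut self, key: Vec<u8>, value: Vec<u8>) -> bool {
        if let Some(entry) = self.entries.get_mut(&key) {
            entry.1 += 1;
            return false;
        }
        self.bytes += value.len();
        self.entries.insert(key, (value, 1));
        true
    }

    /// Drop one reference. The size shrinks only when the last one goes.
    pub fn remove(&mut self, key: &[u8]) {
        let last = match self.entries.get_mut(key) {
            Some(entry) => {
                entry.1 -= 1;
                entry.1 == 0
            }
            None => return,
        };
        if last {
            if let Some((value, _)) = self.entries.remove(key) {
                self.bytes -= value.len();
            }
        }
    }

    /// Bytes of distinct values currently held (ignores map overhead).
    pub fn tracked_size(&self) -> usize {
        self.bytes
    }
}
```

Inserting the same 32-byte value twice leaves the tracked size at 32, and it only drops to 0 after the second removal.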

rphmeier · 2017-01-19T16:31:10Z

re: memory calculation concerns.

I've added a journal_size function to JournalDB which reports the amount of memory journalled state objects are currently taking up. The dynamic pruning will use this, and mem_used for fast pruning has been restored to its original slow state. This means that the informant will always print something higher than --pruning-memory in the db section, but this was already the case since the field includes the state cache size as well.

rphmeier added 2 commits January 10, 2017 13:41

prune states based on memory param

0c0ad1c

pruning memory CLI and usage in sync

415cb5e

rphmeier added A0-pleasereview 🤓 Pull request needs code review. M4-core ⛓ Core client code / Rust. labels Jan 10, 2017

rphmeier requested a review from arkpar January 10, 2017 14:46

rphmeier added A3-inprogress ⏳ Pull request is in progress. No review needed at this stage. and removed A0-pleasereview 🤓 Pull request needs code review. labels Jan 10, 2017

rphmeier added 4 commits January 12, 2017 11:56

Merge branch 'master' into memory-pruning

c7f4bcb

return purged value from memorydb

688b943

calculate memory used incrementally in overlayrecentdb

ac2186e

refactor shared history pruning code in client

cc69ab5

rphmeier added A0-pleasereview 🤓 Pull request needs code review. and removed A3-inprogress ⏳ Pull request is in progress. No review needed at this stage. labels Jan 12, 2017

arkpar reviewed Jan 18, 2017

Fixed usage alignment

b852f6f

arkpar added A8-looksgood 🦄 Pull request is reviewed well. and removed A0-pleasereview 🤓 Pull request needs code review. labels Jan 18, 2017

gavofyork reviewed Jan 18, 2017

rphmeier added the B7-releasenotes 📜 Changes should be mentioned in the release notes of the next minor version release. label Jan 19, 2017

journal_size function for fast memory calculation

60798e3

gavofyork merged commit 203fd8a into master Jan 20, 2017

gavofyork deleted the memory-pruning branch January 20, 2017 12:25
