migrate_database_lmdb_to_rocksdb improvements #4647

Open · RickiNano wants to merge 13 commits into develop

Conversation

RickiNano
Copy link
Contributor

@RickiNano RickiNano commented Jun 13, 2024

The migrate_database_lmdb_to_rocksdb option runs for a very long time with the current ledger size (almost 200 million blocks); on my local machine it took 65 minutes to complete. Nothing is written to the screen during this process, so users may think it has stalled.
This PR adds some progress feedback: one update for each of the 7 tables that are migrated.
It also adds a simple disk space check to warn users if they might not have enough space to complete the migration.
The current converted RocksDB database is 73 GB, and the warning is given if the system has less than 75 GB available.
The warning is based on the size of the LMDB database being migrated; the final RocksDB size is approximately 65% of the LMDB space.
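
(For illustration only, a minimal sketch of such a disk space check, assuming the LMDB ledger lives in data.ldb under the data path and using the roughly 65% ratio mentioned above; the helper name, warning text, and fallbacks are mine, not the PR's actual code:)

#include <cstdint>
#include <filesystem>
#include <iostream>

// Warn when free space on the data path is below the estimated size of the
// converted RocksDB database (assumed here to be ~65% of the LMDB file size).
bool enough_space_for_migration (std::filesystem::path const & data_path)
{
	namespace fs = std::filesystem;
	auto const lmdb_file = data_path / "data.ldb"; // assumed LMDB ledger file name
	if (!fs::exists (lmdb_file))
	{
		return true; // nothing to migrate, nothing to warn about
	}
	auto const lmdb_size = fs::file_size (lmdb_file);
	auto const estimated_rocksdb_size = static_cast<std::uintmax_t> (lmdb_size * 0.65);
	auto const available = fs::space (data_path).available;
	if (available < estimated_rocksdb_size)
	{
		std::cerr << "Warning: migration may need ~" << (estimated_rocksdb_size >> 20)
				  << " MiB but only " << (available >> 20) << " MiB is available\n";
		return false;
	}
	return true;
}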

@pwojcikdev
Copy link
Contributor

This should use the node logger, not cout; otherwise the logs aren't saved to disk. There are some places in the existing code that use cout too, but those are leftovers. Also, printing progress every x converted entries would be nice.
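
(For illustration, a minimal, self-contained sketch of the "log progress every N converted entries" idea; it uses generic put/log callbacks rather than the node's actual store and nano::logger types, and the interval is an assumption:)

#include <cstddef>
#include <string>

// Copies entries from [begin, end) via 'put' and reports progress via 'log'
// every 'progress_interval' entries, so long-running conversions show activity.
template <typename Iterator, typename Put, typename Log>
std::size_t copy_with_progress (Iterator begin, Iterator end, Put put, Log log, std::size_t progress_interval = 1'000'000)
{
	std::size_t count = 0;
	for (auto it = begin; it != end; ++it)
	{
		put (*it);
		if (++count % progress_interval == 0)
		{
			log ("Converted " + std::to_string (count) + " entries so far");
		}
	}
	return count;
}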

@RickiNano
Copy link
Contributor Author

I have been unable to get the node logger implemented. I don't think it's important, since the migration is a one-time process.
I've updated the code with more granular updates to indicate that progress is being made.
Here is the output of a full production LMDB to RocksDB migration:

[screenshot: console output of the full production LMDB to RocksDB migration]

@pwojcikdev
Copy link
Contributor

> I have been unable to get the node logger implemented. I don't think it's important, since the migration is a one-time process.

And that's exactly the reason why it should be logged properly, in line with production-ready code. If something goes wrong, there should be a persistent record.

@RickiNano
Copy link
Contributor Author

@pwojcikdev
I got the Nano logger implemented now.
Instead of outputting dots for each step, it now writes the actual counts to the log.
[screenshot: log output showing per-table progress counts]

gr0vity-dev pushed a commit to gr0vity-dev/nano-node that referenced this pull request Jul 1, 2024
gr0vity-dev pushed commits to gr0vity-dev/nano-node that referenced this pull request Jul 2, 2024
@qwahzi qwahzi added this to the V27 milestone Jul 2, 2024
@qwahzi qwahzi added this to In Progress / V27.0 in Nano Roadmap Jul 2, 2024
}

int main (int argc, char * const * argv)
{
nano::set_umask (); // Make sure the process umask is set before any files are created
nano::logger::initialize (nano::log_config::cli_default ());

nano::set_file_descriptor_limit (OPEN_FILE_DESCRIPTORS_LIMIT);
auto const file_descriptor_limit = nano::get_file_descriptor_limit ();
nano::default_logger ().info (nano::log::type::daemon, "File descriptors limit: {}", file_descriptor_limit);
Copy link
Contributor

This shouldn't be moved; it won't log anything.

Copy link
Contributor

We could split setting the file descriptor limit from logging the warning about it being too low. Setting the limit was moved here because it was previously only set when running with "--daemon", and it needs to be set at least for "--migrate_database_lmdb_to_rocksdb" (if not for other commands as well) when RocksDB is used, since RocksDB keeps many files open.

Copy link
Contributor

Setting the limit here is not a problem; it's a good place for it. The problem is trying to log while logging is still in CLI mode. CLI commands usually have a very specific output format, and to avoid polluting it with node details, only critical errors are logged. The warning probably needs to be logged via cerr, and the "File descriptors limit: {}"... line moved back into the daemon code, so it leaves a trace in the log files if we ever need to debug it.
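
(A hedged sketch of the split described above; the helper name and warning wording are illustrative, and it assumes the nano::set_file_descriptor_limit / nano::get_file_descriptor_limit functions and OPEN_FILE_DESCRIPTORS_LIMIT constant quoted earlier from entry.cpp:)

#include <iostream>

// Set the limit for every code path, but only warn via cerr here; the
// informational "File descriptors limit: {}" line would stay in the daemon
// startup, where file logging is enabled, so it leaves a trace in the log files.
void set_and_check_file_descriptor_limit ()
{
	nano::set_file_descriptor_limit (OPEN_FILE_DESCRIPTORS_LIMIT);
	auto const limit = nano::get_file_descriptor_limit ();
	if (limit < OPEN_FILE_DESCRIPTORS_LIMIT)
	{
		// cerr avoids polluting CLI stdout and does not depend on the logger config
		std::cerr << "WARNING: file descriptor limit of " << limit
				  << " is lower than the requested " << OPEN_FILE_DESCRIPTORS_LIMIT << "\n";
	}
}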

nano::log_config nano::log_config::cli_default ()
{
	log_config config{};
	config.default_level = nano::log::level::critical;
	config.console.colors = false; // to avoid printing warning about cerr and colors
	config.console.to_cerr = true; // Use cerr to avoid interference with CLI output that goes to stdout
	config.file.enable = false;
	return config;
}

Copy link
Contributor

I pushed a commit which should fix the logging. Also, since setting the limit was moved out of the daemon class, targets other than the CLI wallet would no longer set it.

auto rocksdb_store = nano::make_store (logger, data_path_a, nano::dev::constants, false, true, rocksdb_config);

if (!rocksdb_store->init_error ())
{
	logger.info (nano::log::type::ledger, "Step 1 of 7: Converting blocks table");
	std::atomic<std::size_t> count = 0;
	auto refresh_interval = 20ms;
Copy link
Contributor

Why such a short refresh interval?
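
(For context, a refresh interval like the 20ms above typically throttles how often progress output is re-emitted, not how often work is done. A standalone sketch of that pattern, with a deliberately longer interval; the loop body stands in for the actual table copy:)

#include <atomic>
#include <chrono>
#include <cstddef>
#include <iostream>

int main ()
{
	using namespace std::chrono_literals;
	std::atomic<std::size_t> count{ 0 };
	auto const refresh_interval = 500ms; // illustrative; the PR uses 20ms
	auto last_refresh = std::chrono::steady_clock::now ();
	for (std::size_t i = 0; i < 10'000'000; ++i)
	{
		++count; // stands in for "one entry converted"
		auto const now = std::chrono::steady_clock::now ();
		if (now - last_refresh >= refresh_interval)
		{
			std::cout << "Converted " << count.load () << " entries\r" << std::flush;
			last_refresh = now;
		}
	}
	std::cout << "\nConverted " << count.load () << " entries total\n";
}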

Labels: None yet
Projects: Nano Roadmap (In Progress / V27.0)
Linked issues: None yet
4 participants