Address performance issues reported with 1.9.0 #4152

nbougalis · 2022-04-23T04:44:47Z

No description provided.

This commit addresses minor bugs introduced with commit 6faaa91:

- The number of threads used by the database engine was incorrectly clamped to the lowest possible value, such that the database was effectively operating in single-threaded mode.
- The number of requests to extract at once was so high that it could result in increased latency. The bundle size is now limited to 4 and can be adjusted by a new configuration option `rq_bundle` in the `[node_db]` stanza. This is an advanced tunable and adjusting it should not be needed.
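As a rough illustration of the two fixes described in this commit message, the sketch below shows a lower-bound clamp on a thread count and a bounded extraction of queued requests. The names `clampReadThreads` and `takeBundle` are hypothetical and do not come from the rippled source; the exact expression that caused the original clamping bug is not shown in this PR, so the comment about `std::min` is only one plausible way such a bug can arise.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <deque>
#include <iterator>
#include <vector>

// Hypothetical helper: guarantee at least one read thread.
// std::max enforces a lower bound. Writing std::min(1, configured) instead
// would cap every positive value at 1, leaving the engine single-threaded.
inline int
clampReadThreads(int configured)
{
    return std::max(1, configured);
}

// Hypothetical helper: move at most `bundle` pending requests out of the
// queue, oldest first, so one worker never grabs an unbounded batch.
template <class Request>
std::vector<Request>
takeBundle(std::deque<Request>& pending, std::size_t bundle)
{
    assert(bundle > 0);
    auto const n =
        static_cast<std::ptrdiff_t>(std::min(bundle, pending.size()));
    std::vector<Request> out(
        std::make_move_iterator(pending.begin()),
        std::make_move_iterator(pending.begin() + n));
    pending.erase(pending.begin(), pending.begin() + n);
    return out;
}
```

With a bundle size of 4, a queue of six requests would be drained in two passes (four, then two), which keeps the time any single worker spends on one batch bounded.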

Several hard-coded parameters control the behavior of the ledger acquisition engine. The values of many of these parameters were set by intuition and have complex and non-intuitive interactions with each other and other parts of the code. An earlier commit attempted to adjust several of these parameters to improve syncing performance; initial testing was promising but a number of operators reported experiencing syncing and stability issues with their servers. As a result, this commit reverts parts of commit 1823506. This commit further adjusts some tunables so as to increase the aggressiveness of the ledger acquisition engine.

greg7mdp · 2022-05-02T11:22:53Z

src/ripple/nodestore/impl/Database.cpp

                            readCondVar_.wait(lock);
+                            runningThreads_++;
+                        }


I think I see a potential deadlock. Suppose one of the threads is interrupted right after checking `while (!isStopping())` at Database.cpp:71, and then another thread calls `Database::stop()`, which sets `readStopping_` to true, takes the lock, and calls `readCondVar_.notify_all()`.

When the thread resumes, it will wait forever on readCondVar_.wait(lock), I think?

This could be addressed by checking isStopping() right after taking the lock (see next suggestion).

greg7mdp · 2022-05-02T11:29:30Z

src/ripple/nodestore/impl/Database.cpp

@@ -68,14 +74,20 @@ Database::Database(
                        std::unique_lock<std::mutex> lock(readLock_);


Suggested change

-                        std::unique_lock<std::mutex> lock(readLock_);
+                        std::unique_lock<std::mutex> lock(readLock_);
+                        if (isStopping())
+                            continue;
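The lost-wakeup concern raised in this review can be sketched in isolation. The class below is a hypothetical stand-in, not the rippled `Database` implementation: it borrows the member names mentioned in the review (`readLock_`, `readCondVar_`, `readStopping_`, `runningThreads_`) but omits the request queue entirely. It assumes, as the review describes, that `stop()` sets the flag before taking the lock and notifying.

```cpp
#include <atomic>
#include <cassert>
#include <condition_variable>
#include <mutex>
#include <thread>
#include <vector>

// Hypothetical stand-in for a pool of read threads (not rippled code).
class ReaderPool
{
    std::mutex readLock_;
    std::condition_variable readCondVar_;
    std::atomic<bool> readStopping_{false};
    std::atomic<int> runningThreads_{0};
    std::vector<std::thread> threads_;  // declared last: started last

    bool
    isStopping() const
    {
        return readStopping_.load();
    }

public:
    explicit ReaderPool(int n)
    {
        for (int i = 0; i < n; ++i)
            threads_.emplace_back([this] {
                ++runningThreads_;
                while (!isStopping())
                {
                    std::unique_lock<std::mutex> lock(readLock_);
                    // Re-check while holding the lock. Without this, a
                    // thread preempted between the loop test and the wait
                    // could miss the single notify_all() from stop().
                    if (isStopping())
                        continue;
                    --runningThreads_;
                    readCondVar_.wait(lock);
                    ++runningThreads_;
                }
                --runningThreads_;
            });
    }

    ~ReaderPool()
    {
        stop();
    }

    void
    stop()
    {
        readStopping_ = true;
        {
            std::lock_guard<std::mutex> lock(readLock_);
            readCondVar_.notify_all();
        }
        for (auto& t : threads_)
            if (t.joinable())
                t.join();
    }

    int
    running() const
    {
        return runningThreads_.load();
    }
};
```

Because the flag is set before `stop()` acquires `readLock_`, a worker that takes the lock first either sees the flag and exits or is already inside `wait()` by the time the notification is sent; a worker that takes the lock afterwards always sees the flag. An alternative with the same effect would be the predicate overload, `readCondVar_.wait(lock, pred)`, which re-evaluates the condition under the lock before blocking.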

nbougalis requested a review from thejohnfreeman April 23, 2022 04:44

nbougalis added 2 commits April 22, 2022 21:48

thejohnfreeman approved these changes Apr 27, 2022


greg7mdp reviewed May 2, 2022


This was referenced May 10, 2022

Propose 1.9.1-b1 #4158

Closed

Proposed 1.9.1-b1 #4161

Merged

manojsdoshi closed this in #4161 May 11, 2022

greg7mdp referenced this pull request in seelabs/rippled Jul 18, 2022

[fold] minor cleanups in database stopping

d28da66

nbougalis deleted the 190fixes branch October 16, 2023 06:00
