New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Make sure we never keep more than 1 ledger state in memory #639

Merged

erikd merged 3 commits into master from kderme/parse-ledger-once

Jun 11, 2021

Contributor

kderme commented Jun 9, 2021 •

edited

Loading

Ledger states have grown to take a lot of memory and we must be very careful with handling them. On rollbacks, db-sync parses a new ledger state from a file. We have to make sure that we don't keep a pointer to the old ledger state while parsing the new ledger state or this can cause big memory spikes.

Rollbacks also happen on startups. We have fixed startups in a slightly different way. Since the first message is always a MsgRollBackward we don't parse any ledger state before this message.

After the fix the spike disappears:

kderme commented

View reviewed changes

cardano-sync/src/Cardano/Sync/LedgerState.hs

+                  -- Ledger states are growing to become very big in memory.
+                  -- Before parsing the new ledger state we need to make sure the old ledger state
+                  -- is or can be garbage collected.
+                  writeLedgerState env Nothing
                   mst <- findStateFromPoint env point delFiles

Contributor Author

kderme Jun 9, 2021

maybe we want to force a gc at this point, since the old ledger state is a very big memory chunk.

Contributor Author

kderme Jun 9, 2021

performMajorGC?

erikd reviewed

View reviewed changes

cardano-sync/src/Cardano/Sync/LedgerState.hs Outdated

+              readStateUnsafe env = do
+                  mState <- readTVar $ leStateVar env
+                  case mState of
+                    Nothing -> panic "ledger state is not found"

Contributor

erikd Jun 10, 2021

I realize this is still a work in progress, but panic message should have the module name in the message so if it ever gets hit we know where it came from.

erikd reviewed

View reviewed changes

cardano-sync/src/Cardano/Sync/LedgerState.hs Outdated

+                  mState <- readTVar $ leStateVar env
+                  case mState of
+                    Nothing -> panic "ledger state is not found"
+                    Just st -> return st

Contributor

erikd Jun 10, 2021

The rest of the code uses pure instead of return in preparation for https://gitlab.haskell.org/ghc/ghc/-/wikis/proposal/monad-of-no-return .

erikd reviewed

View reviewed changes

cardano-sync/src/Cardano/Sync/LedgerState.hs

@@ @@ -128,7 +127,7 @@ data LedgerEnv = LedgerEnv @@
                 { leProtocolInfo :: !(Consensus.ProtocolInfo IO CardanoBlock)
                 , leDir :: !LedgerStateDir
                 , leNetwork :: !Ledger.Network
-                , leStateVar :: !(StrictTVar IO CardanoLedgerState)
+                , leStateVar :: !(StrictTVar IO (Maybe CardanoLedgerState))

Contributor

erikd Jun 10, 2021

I wonder if we could have an empty or minimal ledger state there instead of having the Maybe .

Contributor Author

kderme Jun 10, 2021 •

edited

Loading

leStateVar is read in 1 place: applyBlock. If we have a minimal state at this point it would simply lead to more cryptic errors.

We could try to make this type safe. This would probably need to follow the approach of ouroboros-network with typed protocols and parametrise LedgerEnv over the Nat n. Probably a pretty big refactoring.

leStateVar can be empty in 2 cases: after initiation and if loadLedgerAtPoint returns Left. In both cases we send to the node a FindIntersect message. The node replies with the point we should roll back and so loadLedgerAtPoint is called again.

kderme marked this pull request as ready for review

June 10, 2021 13:21

kderme mentioned this pull request

Bulk insert optimisation #645

Merged

kderme force-pushed the kderme/parse-ledger-once branch from adeef7e to 197df2f Compare

June 10, 2021 18:34

kderme added 3 commits

June 11, 2021 09:33


          Parse ledger state only one on startup

daad14b


          Make sure the old ledger state can be gced on rollbacks

ed73408


          Perform manual gc

1ec52c8

erikd force-pushed the kderme/parse-ledger-once branch from 197df2f to 1ec52c8 Compare

June 10, 2021 23:48

erikd approved these changes

View reviewed changes

cardano-sync/src/Cardano/Sync/LedgerState.hs

+              readStateUnsafe env = do
+                  mState <- readTVar $ leStateVar env
+                  case mState of
+                    Nothing -> panic "LedgerState.readStateUnsafe: Ledger state is not found"

Contributor

erikd Jun 11, 2021

👍

erikd merged commit 57daf09 into master

iohk-bors bot deleted the kderme/parse-ledger-once branch

June 11, 2021 00:07

kderme mentioned this pull request

High memory usage and profiling #634

Closed

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet