Fix memory leaks on `historyHook` and `workspaceHistoryHook` #653

RubenAstudillo · 2021-11-22T19:14:18Z

Description

Both of these hooks leaked memory on long running sessions.

To fix it I had to force the thunks stored on the hooks as the compiler wouldn't be able to see where they would be demanded as they depended on the user interaction. Now the graph looks like this.

The most polemic change on this PR is the inclusion of the parallel library as a dependency. I needed for the Control.Seq module which gave me combinators to force just enough of the Data.Seq and [] data types.

Checklist

[🗸] I've read CONTRIBUTING.md
[🗸] I've considered how to best test these changes (property, unit,
manually, ...) and concluded: It shows the memory improvements!
[🗸] I updated the CHANGES.md file (None)

slotThe

This is neat, thanks!

I wonder if we could get away with just stictifying the respective data structures (since neither is exported); i.e.,

data SP a b = SP !a !b
  deriving (Read, Show)

newtype WorkspaceHistory = WorkspaceHistory
  { history :: [StrictPair ScreenId WorkspaceId]
    -- ^ Workspace Screens in reverse-chronological order.
  } deriving (Read, Show)

---

data HistoryDB = HistoryDB !(Maybe Window) -- currently focused window
                           !(Seq Window)   -- previously focused windows
               deriving (Read, Show)

This is still not ultra-strict but since we're not really interesting in forcing things more than to WHNF it should be fine I think. We may or may not still need the odd call to seq to force the list, not entirely sure about that right now

EDIT: Oh, and

* [🗸] I've considered how to best test these changes (property, unit,
  manually, ...) and concluded: XXX

:)

RubenAstudillo · 2021-11-23T11:42:05Z

I wonder if we could get away with just stictifying the respective data structures (since neither is exported); i.e.,
data SP a b = SP !a !b
  deriving (Read, Show)

newtype WorkspaceHistory = WorkspaceHistory
  { history :: [StrictPair ScreenId WorkspaceId]
    -- ^ Workspace Screens in reverse-chronological order.
  } deriving (Read, Show)

---

data HistoryDB = HistoryDB !(Maybe Window) -- currently focused window
                           !(Seq Window)   -- previously focused windows
               deriving (Read, Show)
This is still not ultra-strict but since we're not really interesting in forcing things more than to WHNF it should be fine I think. We may or may not still need the odd call to seq to force the list, not entirely sure about that right now

We are interested on forcing things to more than WHNF. On the WorkspaceHistory patch I needed to force at least until each element of each pair on the history list. WHNF on that data structure would be the first cons of that list, not enough.

On a related note, given that we are concerned with forcing evaluation until certain "depth", using strict data types as StrictPair as above is not enough. We would also need:

A strict list container so we match to WHNF any StrictPair.
Match on the WorkspaceHistory to WHNF before the put calls. As it is a newtype, it doesn't have extra bottoms. But we still need its head matched, so it forces the strict list and then the StrictPair.

So, instead of doing those two things, we can use the combinators on Control.Seq and match before the put call to get the correct behavior.

The same is true in respect to HistoryDB. We would need a strict Seq data structure. A bang pattern as !(Seq Window) is not enough to deal with this leak as it forces only the head. Every other element remains as a thunk until a undefined point in the future.

EDIT: Oh, and

* [🗸] I've considered how to best test these changes (property, unit,
  manually, ...) and concluded: XXX

Thanks. I should have put something on those XXX. I edited that part on the original post.

updateHistory leaks unfiltered windows from previous states as it is never forced. The consumer of such data structure is not visible to ghc, so the demand analysis has to fallback on pure laziness. We fix this inserting evaluation points on the `historyHook` function. We do this for two reasons, this is the only function calling `updateHistory`. Plus we cannot do it clearly at the `updateHistory` function as we operate inside a continuation on withWindowSet. In respect to the `put`, everything would be a big thunk.

The XS.modify was leaving thunk on the history that the demand analyser could not prove to be neccesary as they depended on the future user interaction. This was bad as the time advance there was less and less neccesity to force such value, so the thunk would be increasing. Since the datatypes that the `WorkspaceHistory` are really simple, we can just evaluate and save a good chunk of memory.

RubenAstudillo · 2021-11-23T20:52:10Z

I was missing the parallel dependency for the test build. Allow the workflows to be re run.

liskin · 2021-11-23T20:58:02Z

The most polemic change on this PR is the inclusion of the parallel library as a dependency. I needed for the Control.Seq module which gave me combinators to force just enough of the Data.Seq and [] data types.

Would https://hackage.haskell.org/package/deepseq work here as well? Both seem to be maintained by the Core Libraries Committee but deepseq is distributed with GHC so adding that dependency is easier.

RubenAstudillo · 2021-11-23T22:03:00Z

Yeah, we are basically doing the same as in deepseq now. It is a better dependency. I will use that, re-run my tests and publish an update.

RubenAstudillo · 2021-11-23T23:56:51Z

@liskin using deepseq exposes other tradeoffs. The NFData typeclass instance has to be derived for WorkspaceHistory. This can be done on two ways per the docs

Derive Generic, Generic1 and then NFData. The Generics instances are just to for using the derive anyclass instance for NFData. But these are heavyweigh for just forcing.
Use GeneralizedNewtypeDeriving for deriving NFData. This requires a NFData instance on ScreenId on the main xmonad project.

I consider both of these alternatives worse than the current status. Specially considering ScreenId is a newtype, so it doesn't introduce an extra bottom between it and the base Int it contains. So the strictness of ScreenId is equivalent to a shallow seq. I use this on the current code but it is not apparent for the NFData deriving mechanism.

I vote for keep using parallel instead of deepseq. Thoughts?

slotThe · 2021-11-24T07:55:48Z

I vote for keep using parallel instead of deepseq. Thoughts?

While GeneralizedNewtypeDeriving is probably a no-go as we would like to keep compatibility with xmonad 0.17.0 a bit longer than a few weeks (well, I would, anyways), I don't really see a problem with deriving Generic. It's not like this will affect a lot of data structures such that actual compile times are affected.

I consider both of these alternatives worse than the current status. Specially considering ScreenId is a newtype, so it doesn't introduce an extra bottom between it and the base Int it contains. So the strictness of ScreenId is equivalent to a shallow seq. I use this on the current code but it is not apparent for the NFData deriving mechanism.

While this is true, I think the idea of what we want to achieve is expressed much more clearly by "this is an instance of NFData".

RubenAstudillo · 2021-11-24T16:01:50Z

@slotThe You are right in that the intent is more clear with an NFData instance!

The memory figures are basically the same. I still had to do an instance by hand but the liftRnf/liftRnf2 and rwhnf from deepseq were useful. Can you give permission to re-run the GH workflows?

RubenAstudillo · 2021-11-24T16:38:47Z

I think is merge ready.

slotThe

LGTM now!

XMonad/Util/ExtensibleState.hs

RubenAstudillo · 2021-11-24T19:15:10Z

Using the latest version with $! the memory profile is the same

Again, let the workflows re-run and I think is ready to merge. Thanks for all you patience and time @slotThe and @liskin 💪 .

slotThe · 2021-11-24T20:03:24Z

Thanks!

Simplify stuff a bit. Prevent memory leaks additionally in: `workspaceHistoryTransaction` and `workspaceHistoryModify`. Related: xmonad#653

Add a modify' function on extensible state

282afef

RubenAstudillo force-pushed the feature/no-leak-history-hook branch from ff1fa01 to 30ab657 Compare November 22, 2021 20:29

RubenAstudillo marked this pull request as ready for review November 22, 2021 20:33

slotThe reviewed Nov 23, 2021

View reviewed changes

RubenAstudillo added 2 commits November 23, 2021 17:50

RubenAstudillo force-pushed the feature/no-leak-history-hook branch from 30ab657 to 44fb597 Compare November 23, 2021 20:50

RubenAstudillo requested a review from slotThe November 23, 2021 20:51

slotThe approved these changes Nov 24, 2021

View reviewed changes

XMonad/Util/ExtensibleState.hs Outdated Show resolved Hide resolved

Use deepseq instead of parallel

b75d0d2

RubenAstudillo force-pushed the feature/no-leak-history-hook branch from 5b150ee to b75d0d2 Compare November 24, 2021 18:24

slotThe merged commit 3d71669 into xmonad:master Nov 24, 2021

slotThe mentioned this pull request Dec 15, 2023

CPU usage is higer than once (Could it may be a memory leak?) #849

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix memory leaks on `historyHook` and `workspaceHistoryHook` #653

Fix memory leaks on `historyHook` and `workspaceHistoryHook` #653

RubenAstudillo commented Nov 22, 2021 •

edited

Loading

slotThe left a comment •

edited

Loading

RubenAstudillo commented Nov 23, 2021

RubenAstudillo commented Nov 23, 2021

liskin commented Nov 23, 2021

RubenAstudillo commented Nov 23, 2021

RubenAstudillo commented Nov 23, 2021 •

edited

Loading

slotThe commented Nov 24, 2021

RubenAstudillo commented Nov 24, 2021 •

edited

Loading

RubenAstudillo commented Nov 24, 2021

slotThe left a comment

RubenAstudillo commented Nov 24, 2021 •

edited

Loading

slotThe commented Nov 24, 2021

Fix memory leaks on historyHook and workspaceHistoryHook #653

Fix memory leaks on historyHook and workspaceHistoryHook #653

Conversation

RubenAstudillo commented Nov 22, 2021 • edited Loading

Description

Checklist

slotThe left a comment • edited Loading

Choose a reason for hiding this comment

RubenAstudillo commented Nov 23, 2021

RubenAstudillo commented Nov 23, 2021

liskin commented Nov 23, 2021

RubenAstudillo commented Nov 23, 2021

RubenAstudillo commented Nov 23, 2021 • edited Loading

slotThe commented Nov 24, 2021

RubenAstudillo commented Nov 24, 2021 • edited Loading

RubenAstudillo commented Nov 24, 2021

slotThe left a comment

Choose a reason for hiding this comment

RubenAstudillo commented Nov 24, 2021 • edited Loading

slotThe commented Nov 24, 2021

Fix memory leaks on `historyHook` and `workspaceHistoryHook` #653

Fix memory leaks on `historyHook` and `workspaceHistoryHook` #653

RubenAstudillo commented Nov 22, 2021 •

edited

Loading

slotThe left a comment •

edited

Loading

RubenAstudillo commented Nov 23, 2021 •

edited

Loading

RubenAstudillo commented Nov 24, 2021 •

edited

Loading

RubenAstudillo commented Nov 24, 2021 •

edited

Loading