'nixops deploy' exits with 'Too many heap sections: Increase MAXHINCR or MAX_HEAP_SECTS' #287

soenkehahn · 2014-06-17T16:37:59Z

After invoking nixops deploy -j4 ... I got this error message:

Too many heap sections: Increase MAXHINCR or MAX_HEAP_SECTS
nix-instantiate killed by signal 6

The same command worked with -j3.

The full command was:

 nixops deploy -d some_deployment -I .. --read-write --option binary-caches http://hydra.nixos.org -j4

The text was updated successfully, but these errors were encountered:

soenkehahn · 2014-06-19T15:35:32Z

I have been running into this more frequently now. It feels very much non-deterministic.

edolstra · 2014-06-19T16:04:37Z

Is this with a very large network?

shlevy · 2014-06-19T16:09:34Z

Single machine.

soenkehahn · 2014-07-03T07:23:58Z

I was hitting this again and was able to work around it by setting the environment variable GC_MAXIMUM_HEAP_SIZE to something big (5G worked).

phunehehe · 2014-08-28T09:29:42Z

For future reference, Nix uses boehm-gc for garbage collection.

[5:23:35 PM] Soenke Hahn: That has a limit for the amount of memory that is allowed to be allocated.
[5:23:56 PM] Soenke Hahn: When the evaluation takes up more memory it crashes.
[5:24:17 PM] Soenke Hahn: Fortunately the library allows to modify that memory limit through GC_MAXIMUM_HEAP_SIZE.

domenkozar · 2014-09-17T12:55:29Z

Now also happen on hydra while doing nix-build nixos/release-combined.nix

jgeerds · 2014-09-19T18:00:43Z

Is someone working on this issue? (especially hydra)

lucabrunox · 2014-09-19T19:34:08Z

It should be fixed in recent master.

On Fri, Sep 19, 2014 at 8:00 PM, Jascha Geerds notifications@github.com
wrote:

Is someone working on this issue? (especially hydra)

—
Reply to this email directly or view it on GitHub
#287 (comment).

www.debian.org - The Universal Operating System

jgeerds · 2014-09-19T19:48:32Z

@lethalman: Great! So nixos-rebuild --upgrade will work again? (or in a few hours/days)

lucabrunox · 2014-09-19T19:51:03Z

The problem with the GC has been solved, but now hydra is faster at
evaluation and thus it queues jobs faster :P So we got another problem
@edolstra
http://hydra.nixos.org/jobset/nixos/trunk-combined#tabs-evaluations

On Fri, Sep 19, 2014 at 9:48 PM, Jascha Geerds notifications@github.com
wrote:

@lethalman https://github.com/lethalman: Great! So nixos-rebuild
--upgrade will work again? (or in a few hours/days)

—
Reply to this email directly or view it on GitHub
#287 (comment).

www.debian.org - The Universal Operating System

jgeerds · 2014-09-22T13:41:40Z

Hopefully this will be fixed :-)

shlevy · 2014-10-08T01:27:16Z

Been hitting this with 1.8pre3823_53b044c

shlevy · 2014-10-15T12:00:16Z

@edolstra Can you suggest anything we can do to profile/investigate this? This keeps hitting us.

edolstra · 2014-10-15T15:40:40Z

Try the latest Nix version. Commit 6bb4c0b should improve garbage collection quite a bit.

Also, you could build boehmgc with enableLargeConfig = true. In my experience, it makes the Too many heap sections message go away, but actually increases memory use. But that was before 6bb4c0b, it might be better now.

shlevy · 2014-10-24T15:29:11Z

@edolstra no help, unfortunately. Any other ideas here?

shlevy · 2014-11-09T03:44:50Z

@edolstra ping

shlevy · 2014-11-24T14:53:46Z

@edolstra ping?

edolstra · 2014-11-24T16:00:46Z

Sorry, no ideas. I haven't seen this message myself in a while. And I don't think I've ever seen it on a single-machine network, only on large Hydra jobset evaluations.

shlevy · 2014-11-24T16:06:01Z

@edolstra Any advice for investigating this ourselves?

edolstra · 2014-11-25T11:48:04Z

Not really, sorry. Have you tried doing what the message suggests (namely increase MAXHINCR or MAX_HEAP_SECTS)?

wmertens · 2014-11-25T12:50:07Z

How about building a vm that reproduces the problem so we can all have a
look?

On Tue, Nov 25, 2014, 12:48 Eelco Dolstra notifications@github.com wrote:

Not really, sorry. Have you tried doing what the message suggests (namely
increase MAXHINCR or MAX_HEAP_SECTS)?

—
Reply to this email directly or view it on GitHub
#287 (comment).

domenkozar · 2014-12-01T11:35:42Z

I can reproduce this in current nixpkgs master, though it doesn't hit the limit.

$ nix-build nixos/release-combined.nix -A tested                                                                                                                                                                                             
GC Warning: Repeated allocation of very large block (appr. size 135168):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 135168):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 131072):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):
        May lead to memory leak and poor performance.
GC Warning: Repeated allocation of very large block (appr. size 151552):

edolstra · 2014-12-02T16:19:24Z

@iElectric Right. But that's a pretty big evaluation (containing dozens of NixOS VMs), not a single machine case.

domenkozar · 2015-03-03T10:54:59Z

The error is back on master: http://hydra.nixos.org/jobset/nixos/trunk-combined

aszlig · 2015-03-06T13:25:28Z

Related: NixOS/nixpkgs#3594

domenkozar · 2015-12-28T08:14:41Z

Back on master: http://hydra.nixos.org/jobset/nixos/trunk-combined#tabs-errors

Otherwise we hit NixOS/nix#287 on Hydra

domenkozar · 2016-11-15T13:41:49Z

I'm got the same error with 100 nodes deployed via NixOps to EC2.

GC_INITIAL_HEAP_SIZE=$((8*1024*1024*1024)) fixes it, but uses almost 13GB of ram (barely to fit on our 16GB machine).

domenkozar · 2017-12-09T12:57:28Z

@volth see https://ac.els-cdn.com/S157106610900396X/1-s2.0-S157106610900396X-main.pdf?_tid=14162726-dce0-11e7-a77d-00000aacb361&acdnat=1512824242_7e9551614d8141a063f6582e02c10e8f

If I understood @edolstra correctly, it's hard to implement GC on top of it. In Nixops it would probably pay off turning GC off and sharing memory.

Short term solution is to get nixops to evaluate each machine separately, in multiprocess manner.

Currently, NixOS evaluation grows linearly, meaning if one machine takes 100MB of memory to evaluate, once you have 100 machines it takes ~10GB of memory.

orivej · 2017-12-13T11:37:03Z

There is a significant memory usage improvement in Nixpkgs staging: NixOS/nixpkgs#32544

domenkozar · 2017-12-26T21:56:11Z

https://nixos.org/~eelco/pubs/laziness-ldta2008-final.pdf

wmertens · 2018-08-06T13:48:04Z

@edolstra A thought: using eval time as a metric for cache eviction

Would it be hard to keep track of how long it took to evaluate an expression, and use that to decide which expressions to memoize?

So if you could somehow say "cache should be below 100MB", and then when the cache is bigger, you evict items sorted by increasing eval time?

(possibly this is a trivial concept to you and not possible to implement, I just thought of it and wondered if that was a worthwhile approach to improving memory usage)

stale · 2021-02-16T00:49:08Z

I marked this as stale due to inactivity. → More info

stale · 2022-04-29T19:09:38Z

I closed this issue due to inactivity. → More info

fricklerhandwerk · 2023-10-10T07:18:22Z

Closing this as it's very likely not relevant any more. Reopen if needed.

domenkozar added a commit to snabblab/snabblab-nixos that referenced this issue May 26, 2016

buildNTimes: scrub most of attributes so we're easy on ram

27bed54

Otherwise we hit NixOS/nix#287 on Hydra

edolstra added the improvement label Feb 1, 2018

shlevy added the backlog label Apr 1, 2018

shlevy assigned edolstra Apr 1, 2018

wmertens mentioned this issue Jun 15, 2018

How to debug memory use of evaluation #2232

Closed

domenkozar removed the backlog label Apr 30, 2020

stale bot added the stale label Feb 16, 2021

stale bot closed this as completed Apr 29, 2022

thufschmitt reopened this Feb 24, 2023

figsoda mentioned this issue Sep 26, 2023

Generating index fails (possible boehm-gc issue) nix-community/nix-index#235

Open

fricklerhandwerk closed this as completed Oct 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

'nixops deploy' exits with 'Too many heap sections: Increase MAXHINCR or MAX_HEAP_SECTS' #287

'nixops deploy' exits with 'Too many heap sections: Increase MAXHINCR or MAX_HEAP_SECTS' #287

soenkehahn commented Jun 17, 2014

soenkehahn commented Jun 19, 2014

edolstra commented Jun 19, 2014

shlevy commented Jun 19, 2014

soenkehahn commented Jul 3, 2014

phunehehe commented Aug 28, 2014

domenkozar commented Sep 17, 2014

jgeerds commented Sep 19, 2014

lucabrunox commented Sep 19, 2014

jgeerds commented Sep 19, 2014

lucabrunox commented Sep 19, 2014

jgeerds commented Sep 22, 2014

shlevy commented Oct 8, 2014

shlevy commented Oct 15, 2014

edolstra commented Oct 15, 2014

shlevy commented Oct 24, 2014

shlevy commented Nov 9, 2014

shlevy commented Nov 24, 2014

edolstra commented Nov 24, 2014

shlevy commented Nov 24, 2014

edolstra commented Nov 25, 2014

wmertens commented Nov 25, 2014

domenkozar commented Dec 1, 2014

edolstra commented Dec 2, 2014

domenkozar commented Mar 3, 2015

aszlig commented Mar 6, 2015

domenkozar commented Dec 28, 2015

domenkozar commented Nov 15, 2016

domenkozar commented Dec 9, 2017

orivej commented Dec 13, 2017

domenkozar commented Dec 26, 2017

wmertens commented Aug 6, 2018

stale bot commented Feb 16, 2021

stale bot commented Apr 29, 2022

fricklerhandwerk commented Oct 10, 2023

'nixops deploy' exits with 'Too many heap sections: Increase MAXHINCR or MAX_HEAP_SECTS' #287

'nixops deploy' exits with 'Too many heap sections: Increase MAXHINCR or MAX_HEAP_SECTS' #287

Comments

soenkehahn commented Jun 17, 2014

soenkehahn commented Jun 19, 2014

edolstra commented Jun 19, 2014

shlevy commented Jun 19, 2014

soenkehahn commented Jul 3, 2014

phunehehe commented Aug 28, 2014

domenkozar commented Sep 17, 2014

jgeerds commented Sep 19, 2014

lucabrunox commented Sep 19, 2014

jgeerds commented Sep 19, 2014

lucabrunox commented Sep 19, 2014

jgeerds commented Sep 22, 2014

shlevy commented Oct 8, 2014

shlevy commented Oct 15, 2014

edolstra commented Oct 15, 2014

shlevy commented Oct 24, 2014

shlevy commented Nov 9, 2014

shlevy commented Nov 24, 2014

edolstra commented Nov 24, 2014

shlevy commented Nov 24, 2014

edolstra commented Nov 25, 2014

wmertens commented Nov 25, 2014

domenkozar commented Dec 1, 2014

edolstra commented Dec 2, 2014

domenkozar commented Mar 3, 2015

aszlig commented Mar 6, 2015

domenkozar commented Dec 28, 2015

domenkozar commented Nov 15, 2016

domenkozar commented Dec 9, 2017

orivej commented Dec 13, 2017

domenkozar commented Dec 26, 2017

wmertens commented Aug 6, 2018

stale bot commented Feb 16, 2021

stale bot commented Apr 29, 2022

fricklerhandwerk commented Oct 10, 2023