Consider available memory and address space for parallel execution #2418

pmatilai · 2023-03-08T08:21:57Z

See commits for details, but short summary: add optional proc/thread argument to %{getncpus} macro which consider the available memory and address space, based on newly added tunables for tasksize.

The goal here is to avoid build failures due to gross overallocation on constrained systems, and to allow packagers to easily adjust for gigantic builds.

Fixes: #804

pmatilai · 2023-03-08T09:01:31Z

In my original version this also added a %getmem macro with similar arguments. Left it out to keep things simple and minimal, but would be trivial to add back if people think that's useful.

Conan-Kudo · 2023-03-20T20:41:28Z

%getmem could be useful for making %limit_build work better in Fedora, so I think it'd be worth having.

pmatilai · 2023-03-21T06:29:45Z

Well, the point of this PR is to make %limit_build and the like redundant entirely.

Conan-Kudo · 2023-03-21T11:36:41Z

That's even better. 👍🏾

rpmio/macro.c

dmnks · 2023-03-22T16:55:33Z

rpmio/macro.c

+{
+    unsigned long mem = getmem_total();
+    /*
+     * Conservative estimates for thread use on 32bit systems where address


Just curious (no nitpick here), where do these estimates come from? They don't seem totally arbitrary so I suppose there's some technical reasoning behind them?

The total 32bit address space is 4GB. On a native 32bit Linux system, the kernel eats 1GB out of that, leaving 3GB for the process. On a 64bit system a 32bit process has nearly all of the 4GB available to it. While running out of physical memory can be handled by virtual memory (aka swap), the address space is a hard limit and rpm can't handle running into it. Which is why the estimates are very conservative.

Just realized the patch uses a misleading 'vmem' name when it's actually address space it's talking about.

dmnks · 2023-03-22T16:56:55Z

Other than the above, the overall changeset looks sane to me.

"total" equals calling with no arguments, "proc" and "thread" consider further constraints, what is implemented here is heuristics based on available physical memory and address-space and %_smp_tasksize_proc / %_smp_tasksize_thread tunables. Change the previous %getncpus related tests to use %getconfdir instead, they are testing unexpected arguments behavior for this type of macro, not %getncpus itself. Add a test for the actual functionality: if nproc is available, test that our total matches with that, and that defining tasksize to total memory only allocates one thread. Optimally we'd test separately for 32bit address space limitations but that gets tough when we have no idea where this will be executed.

Take advantage of the new %{getncpus:thread} functionality when calculating the number of threads to use for io stream compression (when not in parallel region)

…anagement#804) Take advantage of the new %{getncpus:proc/thread} functionality when calculating the number of processes/threads to use during build. The goal here is to avoid gross overallocation of processes/threads on constrained systems. In particular threads on 32bit systems where address space is limited, but also to allow packagers to easily tune for gigantic build jobs such as webkit that may overwhelm otherwise adequate systems. Fixes: rpm-software-management#804, RhBug:1118734

dmnks · 2023-03-27T14:16:08Z

OK, I can confirm the new push fixes the proc calculation, thanks!

pmatilai · 2023-03-30T10:57:59Z

Okay, since there are no further comments...

pmatilai added the RFE label Mar 8, 2023

pmatilai added this to the 4.19.0 milestone Mar 8, 2023

pmatilai force-pushed the memcpus-pr branch 3 times, most recently from 3353d6a to e4d0e62 Compare March 8, 2023 08:58

pmatilai self-assigned this Mar 16, 2023

dmnks reviewed Mar 22, 2023

View reviewed changes

rpmio/macro.c Outdated Show resolved Hide resolved

dmnks reviewed Mar 22, 2023

View reviewed changes

rpmio/macro.c Show resolved Hide resolved

dmnks reviewed Mar 22, 2023

View reviewed changes

pmatilai added 3 commits March 24, 2023 10:57

Make rpmio parallelism memory aware

758e92b

Take advantage of the new %{getncpus:thread} functionality when calculating the number of threads to use for io stream compression (when not in parallel region)

pmatilai force-pushed the memcpus-pr branch from e4d0e62 to 73dad5c Compare March 24, 2023 08:57

pmatilai removed this from the 4.19.0 milestone Mar 28, 2023

pmatilai merged commit a213101 into rpm-software-management:master Mar 30, 2023

pmatilai deleted the memcpus-pr branch March 30, 2023 10:58

pmatilai mentioned this pull request Apr 4, 2023

Document the build macros + tunables #2466

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consider available memory and address space for parallel execution #2418

Consider available memory and address space for parallel execution #2418

pmatilai commented Mar 8, 2023

pmatilai commented Mar 8, 2023

Conan-Kudo commented Mar 20, 2023

pmatilai commented Mar 21, 2023

Conan-Kudo commented Mar 21, 2023

dmnks Mar 22, 2023 •

edited

pmatilai Mar 24, 2023 •

edited

dmnks Mar 27, 2023

dmnks commented Mar 22, 2023

dmnks commented Mar 27, 2023

pmatilai commented Mar 30, 2023

Consider available memory and address space for parallel execution #2418

Consider available memory and address space for parallel execution #2418

Conversation

pmatilai commented Mar 8, 2023

pmatilai commented Mar 8, 2023

Conan-Kudo commented Mar 20, 2023

pmatilai commented Mar 21, 2023

Conan-Kudo commented Mar 21, 2023

dmnks Mar 22, 2023 • edited

Choose a reason for hiding this comment

pmatilai Mar 24, 2023 • edited

Choose a reason for hiding this comment

dmnks Mar 27, 2023

Choose a reason for hiding this comment

dmnks commented Mar 22, 2023

dmnks commented Mar 27, 2023

pmatilai commented Mar 30, 2023

dmnks Mar 22, 2023 •

edited

pmatilai Mar 24, 2023 •

edited