
Use puma worker count equal to processor count in production #46838

Merged
merged 1 commit into main from puma-production-worker-count on Dec 28, 2022

Conversation

dhh
Member

@dhh dhh commented Dec 27, 2022

Use all the processor performance of the host by default in production.
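In practice this means the generated config/puma.rb now defaults the worker count to the machine's core count in production, still overridable via WEB_CONCURRENCY. A minimal sketch of the idea (not the exact template wording):

```ruby
# config/puma.rb -- sketch of the new production default, assuming
# concurrent-ruby is available (it ships as a Rails dependency).
require "concurrent"

if ENV["RAILS_ENV"] == "production"
  # One worker per physical core unless WEB_CONCURRENCY overrides it.
  worker_count = Integer(ENV.fetch("WEB_CONCURRENCY") { Concurrent.physical_processor_count })
  workers worker_count if worker_count > 1
end
```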

@rails-bot rails-bot bot added the railties label Dec 27, 2022
@dhh dhh merged commit 839ac1e into main Dec 28, 2022
@dhh dhh deleted the puma-production-worker-count branch December 28, 2022 02:09
@lazaronixon
Contributor

lazaronixon commented Dec 28, 2022

@natematykiewicz
Contributor

natematykiewicz commented Dec 29, 2022

Optimizing the default production config for the cheapest possible Heroku plan seems pretty unreasonable. People who are on plans that cheap can set their WEB_CONCURRENCY env var if they don't have enough resources to run 2 processes (the dyno has 2 cores) with 512 MB of RAM.

This would be in 7.1, which is a "major" version in Rails terms, and it will only affect people who accept the diff on the generated file. A few commits ago, WEB_CONCURRENCY was in the file but commented out, and you were expected to uncomment it. So a decent number of people (myself included) have already set the value anyway.
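For context, the old template shipped with the workers line commented out, roughly like this (from memory, not an exact quote):

```ruby
# config/puma.rb (previous template, roughly): you had to uncomment this
# line yourself to run more than one worker process.
# workers ENV.fetch("WEB_CONCURRENCY") { 2 }
```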

The article you quoted says this:

to fully make use of multiple cores, your application should have a process count that matches the number of physical cores on the system.

That's exactly what this PR does. Seems like a pretty sane default.

@lazaronixon
Contributor

lazaronixon commented Dec 29, 2022

But even the Performance-M plan only gives you 2.5 GB of memory. For 4 cores the recommended memory is 8 GB, which is not offered by most cloud services in this range. IMO we could leave it as before, as a comment, and use 2 as the default.

The first line of the article makes it clear:

Increasing process count increases RAM utilization, which can be a limiting factor.

If memory usage were the same regardless of the number of workers, I wouldn't have any objections here.

@natematykiewicz
Contributor

natematykiewicz commented Dec 29, 2022

Performance-M is not a good choice for Rails apps. https://judoscale.com/guides/how-many-dynos


@lazaronixon
Contributor

So we are in an even worse situation: just 1024 MB of RAM. Even in the article they recommend a WEB_CONCURRENCY of 2, whereas with the current Puma configuration file this value would be set to 4 or even 8 (I'm not sure).

@natematykiewicz
Contributor

Looks like I misspoke earlier: the Free (now Eco) dynos have 1 CPU, which means this change does nothing for Eco dynos.

@lazaronixon
Contributor

lazaronixon commented Dec 29, 2022

No, it has 1 dyno, 4 physical cores, and 4 virtual cores, but I'm not sure whether you see 4 or 8 using Concurrent.physical_processor_count. 😬

https://devcenter.heroku.com/articles/deploying-rails-applications-with-the-puma-web-server#process-count-value

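An easy way to check which number a given machine or dyno actually reports (illustrative; run from a console on the target host):

```ruby
require "concurrent"

Concurrent.physical_processor_count # physical cores
Concurrent.processor_count          # logical cores, including hyperthreads
```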

@trevorturk
Contributor

trevorturk commented Jan 2, 2023

FWIW, on Heroku, with a Standard-1X dyno, Concurrent.physical_processor_count returns 4, whereas I believe you'll want to set WEB_CONCURRENCY=1 or your Dyno Load would go over 1: https://devcenter.heroku.com/articles/metrics#dyno-load.

I don't have a strong opinion on changing this default, but I do think it may be slightly problematic for Heroku users. Perhaps Heroku could update their documentation, or set a default ENV var for you. Rails Guides could also detail some caveats, and it may be worth considering a comment in this template file for clarity.

@lazaronixon
Contributor

lazaronixon commented Jan 2, 2023

That's the point 👍. On Heroku, you can set SENSIBLE_DEFAULTS and it will set the correct WEB_CONCURRENCY based on the plan, but the fact is that most cloud services don't give you enough memory to run a worker on every processor.

https://devcenter.heroku.com/changelog-items/618

@simi
Contributor

simi commented Jan 2, 2023

ℹ️ This could also be a problem in K8s deploys, where scaling should be done through pods, not by running daemons and multiple processes.

@trevorturk
Contributor

Oh interesting about Heroku's SENSIBLE_DEFAULTS, I hadn't seen that before. Thanks for the tip. In that case, it looks like 2 is Heroku's recommended WEB_CONCURRENCY for a Standard-1x Dyno. (I'm using Falcon instead of Puma, which wants 1 process per CPU.)

I still think it may be fine to change this default, but it seems like some additional documentation for cloud hosting providers might be nice to have.

zzak added a commit that referenced this pull request Jan 7, 2023
@bdewater
Contributor

bdewater commented Jan 8, 2023

ℹ️ This could also be a problem in K8s deploys, where scaling should be done through pods, not by running daemons and multiple processes.

Scaling with pods only means missing out on memory savings via copy-on-write.

I'm no K8s expert, but does it really matter from a container perspective whether it's a single worker or a master process with forked workers that is accepting requests? Based on puma/puma#2645 (comment) it doesn't seem to.
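For what it's worth, the copy-on-write savings come from preloading the app before forking, along these lines (a generic Puma example, not tied to this PR's template):

```ruby
# config/puma.rb -- generic forked-worker setup: preloading the app before the
# workers fork lets the processes share memory pages via copy-on-write.
workers ENV.fetch("WEB_CONCURRENCY") { 2 }
preload_app!

threads 5, 5
```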

@mindlace

mindlace commented Jan 10, 2023

This implementation doesn't return the right values when running in a container (it will always return 1).

To handle the container case, the code should look at /proc/1/sched and if that doesn't start with init, it should look at /sys/fs/cgroup/cpuset/cpuset.cpus to get the number allocated to that container.

For this specific PR, choosing to use the physical_processor_count method will also give incorrect values when running under jruby.
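A rough sketch of the container-aware lookup described above (assuming cgroup v1 paths and a cpuset list like "0-3" or "0,2-5"; a real patch would also need the /proc/1/sched check and cgroup v2 support):

```ruby
require "concurrent"

# Hypothetical helper, not part of concurrent-ruby: count the CPUs allotted to
# this container from the cgroup v1 cpuset, falling back to the host count.
def container_processor_count
  cpuset_path = "/sys/fs/cgroup/cpuset/cpuset.cpus"
  return Concurrent.physical_processor_count unless File.exist?(cpuset_path)

  File.read(cpuset_path).strip.split(",").sum do |part|
    first, last = part.split("-").map(&:to_i)
    (last || first) - first + 1
  end
end
```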

@dhh
Member Author

dhh commented Jan 10, 2023

The old default was 1, so that would amount to the same. But it would be great if we could get this fixed up so that we use all available cores inside a container too by default. Please do investigate a patch ✌️

@simi
Contributor

simi commented Jan 10, 2023

This implementation doesn't return the right values when running in a container (it will always return 1).

To handle the container case, the code should look at /proc/1/sched and if that doesn't start with init, it should look at /sys/fs/cgroup/cpuset/cpuset.cpus to get the number allocated to that container.

For this specific PR, choosing to use the physical_processor_count method will also give incorrect values when running under jruby.

Would you mind reporting this to https://github.com/ruby-concurrency/concurrent-ruby/issues/new?

@dhh
Member Author

dhh commented Jan 21, 2023

I just tried to replicate the problem, but couldn't. Concurrent.physical_processor_count reports the correct number of available cores when called from within a Docker container running on a VM.
