
Increasing and high Server Load Average #4282

Closed
4 tasks done
nicfab opened this issue Dec 16, 2023 · 15 comments
Labels
bug Something isn't working

Comments

@nicfab

nicfab commented Dec 16, 2023

Requirements

  • Is this a bug report? For questions or discussions use https://lemmy.ml/c/lemmy_support
  • Did you check to see if this issue already exists?
  • Is this only a single bug? Do not put multiple bugs in one issue.
  • Is this a backend issue? Use the lemmy-ui repo for UI / frontend issues.

Summary

After upgrading my Lemmy instance to version 0.19.0, I noticed a rapid increase in the server's load average, which has remained high (around 1.30).

Steps to Reproduce

  1. Run docker compose down
  2. Upgrade to version 0.19.0 by modifying the docker-compose.yml file (see the snippet below)
  3. Run docker compose up -d
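
For reference, the version bump in docker-compose.yml looks roughly like this (service and image names follow the standard Lemmy Docker deployment and may differ in your setup):

    services:
      lemmy:
        # assumption: standard upstream image, pinned to the new release
        image: dessalines/lemmy:0.19.0
        # ... rest of the service definition unchanged ...
      lemmy-ui:
        image: dessalines/lemmy-ui:0.19.0
        # ... rest of the service definition unchanged ...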

Technical Details

The server OS is Ubuntu 22.04.3 LTS.
Some files:

  1. Logs - lemmy_log.txt
  2. lemmy.hjson (in txt format)- lemmy.hjson.txt
  3. docker-compose.yml (in txt format) - docker-compose.yml.txt

Version

0.19.0

Lemmy Instance URL

https://community.nicfab.it

@nicfab added the bug (Something isn't working) label on Dec 16, 2023
@arifwn

arifwn commented Dec 16, 2023

I also noticed increased database load after the upgrade. Is this related to the new persistent federation queue?

Edit: setting max_connections to 50 seems to limit the database memory usage on my small instance with 4.5 GB of RAM to a manageable level, though there are now a lot of FATAL: sorry, too many clients already errors in the postgres log. I wonder if there is a proper way to limit the queue from the Lemmy server side.
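
For anyone trying the same workaround, this is the kind of line involved, assuming you use the custom postgres config file from the standard Lemmy Docker setup (customPostgresql.conf, the same file referenced later in this thread):

    # cap concurrent client connections; anything beyond the limit is rejected with
    # "FATAL: sorry, too many clients already"
    max_connections = 50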

@Demigodrick

Just want to +1 this - I've seen the server load metric roughly double on average since the update was applied.

@axeleroy

I have been able to reduce the database load by setting database.pool_size in lemmy.hjson. I still have to tweak its value to get a good balance between Lemmy performance and relatively low database load.
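
For reference, the override looks roughly like this in lemmy.hjson (30 is the value reported later in this thread; tune it to your own hardware):

    {
      # ... rest of lemmy.hjson ...
      database: {
        # maximum number of connections in Lemmy's postgres connection pool
        pool_size: 30
      }
    }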

@Nutomic
Member

Nutomic commented Dec 18, 2023

Yes, this is most likely because of the new federation queue. Previously, outgoing activities were handled entirely in memory in the Lemmy process, but now they get written to the database and then read back. #4285 should help by batching these db queries.

@arifwn

arifwn commented Dec 18, 2023

Thanks! Setting database.pool_size in lemmy.hjson works better than limiting max_connections on postgres.

@phiresky
Collaborator

Please follow these steps to get info about database performance:

  1. enable pg_stat_statements and auto_explain by making sure these lines exist in the postgresql config (customPostgresql.conf):

    shared_preload_libraries=pg_stat_statements,auto_explain
    pg_stat_statements.track = all
    auto_explain.log_min_duration=5000ms
    
  2. open a psql repl by running docker compose exec -it -u postgres db psql -d lemmy and reset the stats by running create extension pg_stat_statements; select pg_stat_statements_reset();

  3. wait an hour

  4. post the outputs of
    docker compose exec -T -u postgres db psql -qtAX -d lemmy -c 'select json_agg(a) from (select * from pg_stat_statements order by total_exec_time desc limit 10) a;' > total_exec_time.json

    and

    docker compose exec -T -u postgres db psql -qtAX -d lemmy -c 'select json_agg(a) from (select * from pg_stat_statements order by mean_exec_time desc limit 10) a;' > mean_exec_time.json

@phiresky
Collaborator

Also, in general a higher server load floor on 0.19 is expected and not really an issue. The baseline server usage is higher (especially for small instances), but it scales better to higher federation loads.

@axeleroy

axeleroy commented Dec 18, 2023

My issue isn't so much the increased CPU load as the increased IO load, which is tanking the performance of the other services I host, along with the increased memory usage from the additional PostgreSQL activity (which filled my host's swap until I set a limit on the connection pool size).

I hope #4285 will resolve my issues.

@phiresky
Collaborator

It won't. Do you have synchronous_commit=off set?

@axeleroy

axeleroy commented Dec 18, 2023

I don't think so, my postgres command is

[
  "postgres",
  "-c",
  "session_preload_libraries=auto_explain",
  "-c",
  "auto_explain.log_min_duration=5ms",
  "-c",
  "auto_explain.log_analyze=true",
  "-c",
  "track_activity_query_size=1048576",
]

I'll try that though
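
For anyone following along, a minimal sketch of that change, appended to the same command array (this assumes postgres is started via the docker-compose command shown above; synchronous_commit=off trades durability of the most recent transactions for lower fsync/IO load):

    [
      "postgres",
      # ... existing "-c" flags from above ...
      "-c",
      "synchronous_commit=off"
    ]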

@arifwn

arifwn commented Dec 18, 2023

Postgres memory usage is down again after setting database.pool_size to 30 in lemmy.hjson. The default value (95?) seems to be too high for my small VPS with 4.5GB RAM.

@nicfab
Author

nicfab commented Dec 18, 2023

I am following your comments and will wait for a solution.
In the meantime, I had to stop my Lemmy Docker containers and take my instance offline.

@linux-cultist

linux-cultist commented Jan 3, 2024

> Postgres memory usage is down again after setting database.pool_size to 30 in lemmy.hjson. The default value (95?) seems to be too high for my small VPS with 4.5GB RAM.

@arifwn You can control postgres with the customPostgresql.conf file and put settings into it tuned to your hardware:

pgtune.leopard.in.ua

If you tell postgres to use 3 GB and 1 CPU (for example), it won't use all your resources. It may use memory while nothing else needs it, and then release it the second something else does. That's normal.

That being said, I also reduced the pool size to 30 but didn't really notice a difference. The postgres settings made the major difference.
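
As an illustration of the kind of output pgtune produces, a customPostgresql.conf tuned for roughly 3 GB of RAM and 1 CPU might look like this (values are indicative only, not taken from this thread; generate your own for your hardware):

    # illustrative pgtune-style values for ~3 GB RAM / 1 CPU / SSD (assumed, not from this thread)
    max_connections = 30
    shared_buffers = 768MB
    effective_cache_size = 2304MB
    maintenance_work_mem = 192MB
    work_mem = 12MB
    checkpoint_completion_target = 0.9
    wal_buffers = 16MB
    random_page_cost = 1.1
    max_worker_processes = 1
    max_parallel_workers = 1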

@phiresky
Collaborator

phiresky commented Jan 4, 2024

I'll close this since it seems to be the same issue as #4334, and that one has more detail (I don't see any info here that's not present there).

@phiresky closed this as not planned (won't fix, can't repro, duplicate, stale) on Jan 4, 2024
@arifwn

arifwn commented Jan 4, 2024

> If you tell postgres to use 3 GB and 1 CPU (for example), it won't use all your resources. It may use memory while nothing else needs it, and then release it the second something else does. That's normal.
>
> That being said, I also reduced the pool size to 30 but didn't really notice a difference. The postgres settings made the major difference.

@linux-cultist My postgres config: https://gist.github.com/arifwn/1c86fe79708dfe3bd43ecabaafc73320

The VPS has 4.5 GB of RAM and postgres is configured to use 2 GB (or did I configure it wrong?). Unless I set Lemmy's database.pool_size to 30 (instead of leaving the default, which was 95 back then in 0.19.0), after two days either lemmy or postgres got OOM-killed because the RAM was exhausted. I haven't tried again on 0.19.1.
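
A rough back-of-the-envelope estimate of why a 95-connection pool can exhaust a 4.5 GB VPS while 30 stays comfortable (all numbers below are assumptions for illustration, not taken from the linked gist):

    # shared_buffers          ~ 512MB  (fixed, shared across backends)
    # per-backend overhead    ~ 10MB   (process + catalog caches)
    # work_mem                ~ 16MB   (can be allocated more than once per query)
    #
    # 95 connections: 95 * (10MB + 16MB) ~ 2.4GB on top of shared_buffers -> ~3GB for postgres alone
    # 30 connections: 30 * (10MB + 16MB) ~ 0.8GB on top of shared_buffers -> ~1.3GB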

@LemmyNet locked as resolved and limited conversation to collaborators on Jan 4, 2024