
Limit number of concurrent preview generations #18210

Merged: 1 commit, Nov 5, 2022

Conversation

@dbw9580 commented Dec 3, 2019

Fixes #15075

I added two semaphores to limit concurrent access to CPU resources. One guards the CPU-intensive calls to GeneratorHelper::getThumbnail and several image-manipulation operations in Generator::generatePreview. The other wraps PreviewManager::getPreview.

The second semaphore is necessary because without it, response delivery is significantly delayed. This is the waterfall graph of a page load of 30 pictures without the second semaphore guarding PreviewManager::getPreview, and with preview_concurrent_new set to 4:
[Screenshot: network waterfall of the page load with only the first semaphore]
You can see most thumbnails are loaded within the last 6 seconds.

This is how it looks with preview_concurrent_new set to 4, and preview_concurrent_all set to 8:
[Screenshot: network waterfall with both limits applied]
The loading process is much more gradual and smooth.
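For readers who want the shape of the change: the guard is essentially a counting semaphore around the expensive code path. Below is a minimal sketch of that pattern using PHP's SysV semaphores; it is illustrative only (the function name, key, and limit are examples, not the actual diff), and it assumes the sysvsem extension is available.

```php
<?php
// Illustrative sketch only -- not the actual diff from this PR.
// Assumes the PHP sysvsem extension; key and limit values are examples.

const SEM_KEY_NEW = 0x6e637031; // arbitrary SysV IPC key for "new previews"

/**
 * Acquire one of $limit semaphore slots, run $callback, then release,
 * so that at most $limit callbacks run concurrently on this host.
 */
function guardWithSemaphore(int $key, int $limit, callable $callback) {
    $sem = sem_get($key, $limit);
    if ($sem === false || !sem_acquire($sem)) {
        return $callback(); // could not guard; fall back to running unguarded
    }
    try {
        return $callback();
    } finally {
        sem_release($sem);
    }
}

// Example usage: wrap the CPU-intensive part of preview generation.
$preview = guardWithSemaphore(SEM_KEY_NEW, 4, function (): string {
    // In the real code this would be the call to GeneratorHelper::getThumbnail
    // and the image manipulation in Generator::generatePreview.
    return 'preview bytes';
});
```

The second semaphore applies the same pattern with a separate key and a higher limit around PreviewManager::getPreview.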

@dbw9580 marked this pull request as ready for review December 4, 2019 12:23
@dbw9580 (Author) commented Dec 4, 2019

I installed my Nextcloud instance inside a Linux container with 3 cores and 2 GB RAM allocated. The host has 4 cores @ 1.6 GHz and 8 GB RAM in total. Without a proper restriction on how many preview generations can happen at a time, a scroll load of 50 pictures eats up all 2 GB of memory and causes the kernel OOM killer to kick in. Sometimes the MySQL process is also killed, leaving the website out of service for a while.

After applying limits on concurrent preview generations (preview_concurrent_new set to 3 and preview_concurrent_all set to 8), the same page load typically uses 800-900 MB of RAM, seldom over 1 GB, and there has been no outage due to OOM since.
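For reference, a config/config.php excerpt matching the settings described above; this is a sketch using the option names from this PR, which could still change before release.

```php
<?php
// config/config.php excerpt -- a sketch using the option names from this PR.
$CONFIG = [
    // at most 3 new previews are generated at the same time
    'preview_concurrent_new' => 3,
    // at most 8 preview requests (new and existing) are handled at the same time
    'preview_concurrent_all' => 8,
];
```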

@szaimen (Contributor) commented Dec 28, 2019

@kesselb why wasn't this PR considered for Nextcloud 18?
Sounds really awesome!

@kesselb (Contributor) commented Dec 28, 2019

No one had time to look into it, I guess. I like the idea in general. The code style is a bit off (we use camelCase here). We could probably deduplicate the code a bit. It needs some testing, of course.

Some feedback and questions:

  • I agree with limiting generation, but the all case will also limit/block the delivery of existing previews. Someone has to look at this (probably with xhprof and Xdebug).
  • What happens if a lock is never released (probably by a crashed process)? By default all locks are released on request_shutdown; see the sketch after this list.
  • Not sure if it's possible, but some tests would be cool.
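On the second point above: if the guard is built on SysV semaphores, PHP's sem_get() has an auto_release flag (enabled by default) that hands the acquired slot back at request shutdown, which should cover the common "lock never released" cases. A minimal sketch of the acquire/release pattern, with an illustrative key and limit:

```php
<?php
// Illustrative only; key and limit are examples. With the default
// $auto_release = true, PHP hands the acquired slot back at request
// shutdown even if the normal release path below is never reached.
$sem = sem_get(0x6e637032, /* max_acquire */ 4, 0666, /* auto_release */ true);
if ($sem !== false && sem_acquire($sem)) {
    try {
        // ... generate the preview ...
    } finally {
        sem_release($sem); // explicit release on the normal path
    }
}
```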

I think using a lock is a good approach. Most of these "improve preview generation" pull requests have had in common that they improve the situation for only one use case (I remember a pull request where the author announced a 200% performance boost; some tests later, it boiled down to NFS-based filesystems only).

As always: Feel free to patch your instance and report back your experiences.

@ariselseng (Member):

@kesselb Maybe I am off here, but couldn't this be mostly solved by limiting how many concurrent previews are requested in the web UI? Like lazy loading and at most 2-3 concurrent previews at a time? Of course, that would not help when many clients connect at the same time.

@skjnldsv (Member):

@dbw9580 please rebase

@dbw9580 (Author) commented Nov 4, 2020

@skjnldsv done. Also deduplicated the code.

@rullzer mentioned this pull request Dec 14, 2020
@rullzer removed this from the Nextcloud 21 milestone Dec 14, 2020
@szaimen (Contributor) commented Jan 5, 2021

Any update here?

@dbw9580 (Author) commented Jan 7, 2021

Hi devs, is there anything I can do to help get this PR merged?

@skjnldsv (Member) commented Jan 7, 2021

> Hi devs, is there anything I can do to help get this PR merged?

Not much! :)
Let's ping @ChristophWurst @juliushaertl @rullzer and @MorrisJobke

@ChristophWurst (Member):

I'm not sure how that would work in clustered environments, e.g. when there is more than one application server. I assume it wouldn't lock across servers but would always be scoped to each server, right?

@dbw9580 (Author) commented Jan 7, 2021 via email

@skjnldsv added the "4. to release (Ready to be released and/or waiting for tests to finish)" label Nov 4, 2022
@skjnldsv (Member) commented Nov 4, 2022

Restarted drone

@skjnldsv (Member) commented Nov 4, 2022

Seems like the unit tests are not OK with this.

@skjnldsv added the "2. developing (Work in progress)" label and removed the "4. to release (Ready to be released and/or waiting for tests to finish)" label Nov 4, 2022
@dbw9580 (Author) commented Nov 4, 2022

Warning: Transient problem: connection refused. Will retry in 10 seconds. 1 retries left.
curl: (7) Failed to connect to localhost port 9000: Connection refused
Error: Process completed with exit code 7.

https://github.com/nextcloud/server/actions/runs/3387563288/jobs/5628546406

I see the error is a network connection issue. How is this related to my change?

@szaimen (Contributor) commented Nov 4, 2022

> I see the error is a network connection issue. How is this related to my change?

@dbw9580 it is not this test that is the problem. The problem is the nodb Drone test, which runs into a timeout. See e.g. https://drone.nextcloud.com/nextcloud/server/25183/9/4

If you have any idea how to fix that, help would be really appreciated! :)

@dbw9580 (Author) commented Nov 4, 2022

I see the nodb test is stuck at 61%. How can I find out exactly which test is running into a problem?

@szaimen (Contributor) commented Nov 4, 2022

Seems like it hangs at Test 'Test\Preview\GeneratorTest::testGetNewPreview' started

@szaimen (Contributor) commented Nov 4, 2022

BTW, @dbw9580 can you please fix the DCO? Then we can merge this once the tests run through :)

@szaimen (Contributor) commented Nov 5, 2022

@dbw9580 so the tests finally pass now. Please fix the DCO and then I will merge this. Thank you! :)

Signed-off-by: Bowen Ding <dbw9580@live.com>
Signed-off-by: szaimen <szaimen@e.mail.de>
@dbw9580 (Author) commented Nov 5, 2022

@szaimen rebased and signed off the commit, PTAL.

@szaimen (Contributor) commented Nov 5, 2022

Thank you very much! The tests pass now and everything looks good! Merging! :)

@szaimen merged commit 779fedd into nextcloud:master Nov 5, 2022
welcome bot commented Nov 5, 2022

Thanks for your first pull request and welcome to the community! Feel free to keep them coming! If you are looking for issues to tackle then have a look at this selection: https://github.com/nextcloud/server/issues?q=is%3Aopen+is%3Aissue+label%3A%22good+first+issue%22

@dbw9580 (Author) commented Nov 5, 2022

@szaimen thank you for the review and all the effort to make the merge happen! Looking forward to seeing this in the next release!

@jospoortvliet (Member):

@dbw9580 thanks for your incredible patience with this! It took a long time, but I'm also super happy it got in!

Successfully merging this pull request may close these issues.

We need to limit the amount of previews generated at once when listing images in Files