Performance is slow for large installations #272

TomNewChao · 2023-08-03T12:29:15Z

Hello,
We have more than 10,000 jobs in Jenkins. When Git triggers a job through generic-webhook-trigger-plugin, the request takes more than 2 minutes. Reading the code, I found that the time is spent in the following code:

I'm wondering whether it's possible to look up the corresponding job based on the content of the request. I modified the code for testing so that it triggers a job at a specified path, and found that efficiency increased significantly. So what is needed is a JobFinder that finds the job to trigger according to the content of the request body.

TomNewChao · 2023-08-03T12:29:52Z

I suggest:
1. Use a radio button to select the JobFinder.
2. The JobFinder is configured with the path in the request content from which the job information to trigger is obtained.

I would be glad to contribute a PR, with your permission.

tomasbjerre · 2023-08-03T13:38:44Z

You are very welcome to fork the repo and open a pull-request.

TomNewChao · 2023-08-04T03:05:11Z

> You are very welcome to fork the repo and open a pull-request.

Thanks for your reply. I think this is feature development and will need a lot of PRs to solve. Would you help me create a branch named feature/issue-272 in this repository?

tomasbjerre · 2023-08-04T14:41:18Z

Only contributions from forks are allowed.

If you want to make a lot of changes, it might be better to keep your feature in your fork without merging it into this main repo, to avoid causing problems for the 30k+ installations that are using it.

TomNewChao · 2023-08-05T12:44:02Z

I see, OK. I will develop this feature in my fork.

tomasbjerre · 2023-08-19T10:24:05Z

I pushed an alternative solution here. Adding a loading cache to JobFinder:
https://github.com/jenkinsci/generic-webhook-trigger-plugin/pull/274/files#diff-6cc9e047aa25546ac7f7dcce5804eb64149b98bb21b7649d477dd05b99e611a1
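
To illustrate what a loading cache for the job list could look like, here is a minimal sketch in plain Java. It is not the code from the linked pull request: the class name `ExpiringJobListCache`, the injectable clock, and the generic `loader` are assumptions made for illustration, with `loader` standing in for an expensive call such as listing all jobs.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.concurrent.TimeUnit;
import java.util.function.LongSupplier;
import java.util.function.Supplier;

/** Hypothetical time-based cache for a job list; not the plugin's actual implementation. */
final class ExpiringJobListCache<T> {
  private final Supplier<List<T>> loader; // expensive call, e.g. listing all jobs
  private final long ttlNanos;            // how long a loaded list stays valid
  private final LongSupplier clock;       // injectable clock (nanoseconds) so expiry is testable
  private List<T> cached;
  private long loadedAt;

  ExpiringJobListCache(Supplier<List<T>> loader, long ttlMinutes, LongSupplier clock) {
    this.loader = loader;
    this.ttlNanos = TimeUnit.MINUTES.toNanos(ttlMinutes);
    this.clock = clock;
  }

  /** Returns the cached list, reloading it only when it is missing or older than the TTL. */
  synchronized List<T> get() {
    long now = clock.getAsLong();
    if (cached == null || now - loadedAt >= ttlNanos) {
      cached = Collections.unmodifiableList(new ArrayList<>(loader.get()));
      loadedAt = now;
    }
    return cached;
  }
}
```

In production code the clock would typically be `System::nanoTime`. Within the TTL every caller gets the same snapshot, so a job created or reconfigured after the last load is not visible until the next reload.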

TomNewChao · 2023-08-19T17:08:42Z

> I pushed an alternative solution here. Adding a loading cache to JobFinder: https://github.com/jenkinsci/generic-webhook-trigger-plugin/pull/274/files#diff-6cc9e047aa25546ac7f7dcce5804eb64149b98bb21b7649d477dd05b99e611a1

Thanks for your commit. But adding a loading cache to JobFinder may not help with 10,000+ jobs, because the same job may not be triggered repeatedly within the cache time. It may be better to find the corresponding job based on the content of the request body.

tomasbjerre · 2023-08-19T17:32:47Z

If you use a token in your jobs, all those jobs will invoke Jenkins.getInstance().getAllItems(ParameterizedJob.class); in exactly the same way.

I made an alternative with the cache config on the global config page. I think that might be better, so that users don't need to change their webhook URLs. #275

tomasbjerre · 2023-08-19T18:47:24Z

I released 1.87.0 with the caching feature.

TomNewChao · 2023-08-21T07:13:23Z

> If you use token in your jobs, all those jobs will invoke Jenkins.getInstance().getAllItems(ParameterizedJob.class); exactly the same way.
>
> I made an alternative with the cache config on the global config page. I think that might be better so that users don't need to change their webhook url:s. #275

I'm sorry, I think you misunderstood me. I won't use a token in my jobs. The precise job finder will call "Jenkins.getInstance().get().getItemByFullName(fullName, ParameterizedJobMixIn.ParameterizedJob.class);" to get the job, where fullName is built from the content of the request body and the configuration. Using a cache causes inconsistency between the original data and the cached data when the original data is modified. So I still think my approach is the best, even though your code has already been merged.

tomasbjerre · 2023-08-21T07:21:44Z

So a solution might be to start using a token and enable the cache? You can use the exact same token in all 10000 jobs.

TomNewChao · 2023-08-21T07:29:34Z

No, I think it is better to use my approach of looking up the precise job. Using a cache layer will waste memory, cause data inconsistency, etc.

TomNewChao · 2023-08-21T07:38:09Z

My idea is to look up the precise job. In more detail:

1. Use the configured path expression and the request body to generate a Jenkins path, called fullName.
2. Use fullName and Jenkins.getInstance().get().getItemByFullName(fullName, ParameterizedJobMixIn.ParameterizedJob.class) to get the job.
3. Filter with "Renderer.isMatching and Renderer.renderText" (already available).
4. Trigger the job.
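
The first two steps above can be sketched roughly as follows. This is a self-contained illustration, not code from either pull request: the `${...}` template syntax, the class `PreciseJobLookup`, and its method names are hypothetical, and a plain `Map` stands in for the `getItemByFullName` call so that the example runs without Jenkins.

```java
import java.util.Map;
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical sketch of a path-based job lookup; names and template syntax are assumptions. */
final class PreciseJobLookup {
  // Matches placeholders such as ${repository.name} in the configured path template.
  private static final Pattern PLACEHOLDER = Pattern.compile("\\$\\{([A-Za-z0-9_.]+)\\}");

  /** Step 1: fill the placeholders with values already extracted from the request body. */
  static String renderFullName(String template, Map<String, String> requestValues) {
    Matcher m = PLACEHOLDER.matcher(template);
    StringBuffer out = new StringBuffer();
    while (m.find()) {
      String value = requestValues.get(m.group(1));
      if (value == null) {
        throw new IllegalArgumentException("No request value for " + m.group(1));
      }
      // quoteReplacement keeps '$' and '\' in the value from being treated as group references.
      m.appendReplacement(out, Matcher.quoteReplacement(value));
    }
    m.appendTail(out);
    return out.toString();
  }

  /** Step 2: direct lookup by full name; the map is a stand-in for Jenkins' item lookup. */
  static <J> Optional<J> findJob(Map<String, J> jobsByFullName, String fullName) {
    return Optional.ofNullable(jobsByFullName.get(fullName));
  }
}
```

Because the full name is derived from request content, a real implementation would still need the existing matching and filtering step before triggering anything.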

tomasbjerre · 2023-08-21T17:03:05Z

How much memory does the cache allocate in your case?

TomNewChao · 2023-08-22T03:08:33Z

> How much memory does the cache allocate in your case?

In my case, a path expression finds at most one ParameterizedJob. When I print the ParameterizedJob, it allocates 128 bytes.

TomNewChao · 2023-08-22T03:37:01Z

When I reviewed your code again, I found that our solutions may be different. You use the cache to store the whole List of ParameterizedJob and then filter out the matching ParameterizedJob by token, which has O(n) time complexity. My solution uses the path to find the corresponding job, which has O(1) time complexity. The two are not mutually exclusive and can coexist.
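
A rough sketch of how the two lookups could coexist, under stated assumptions: the `Job` class, `CoexistingJobFinder`, and the `byFullName` index are hypothetical stand-ins rather than plugin code. The hash map only models the O(1) claim made above and says nothing about the actual cost of a lookup inside Jenkins.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch: token scan and full-name lookup side by side. */
final class CoexistingJobFinder {
  /** Minimal stand-in for a job with a full name and a webhook token. */
  static final class Job {
    final String fullName;
    final String token;

    Job(String fullName, String token) {
      this.fullName = fullName;
      this.token = token;
    }
  }

  /** Token approach: every job is inspected, so the work grows linearly with the job count. */
  static List<Job> findByToken(List<Job> allJobs, String token) {
    List<Job> matches = new ArrayList<>();
    for (Job job : allJobs) {
      if (token != null && token.equals(job.token)) {
        matches.add(job);
      }
    }
    return matches;
  }

  /** Uses the direct lookup when a full name is available, otherwise falls back to the scan. */
  static List<Job> find(
      List<Job> allJobs, Map<String, Job> byFullName, String fullName, String token) {
    if (fullName != null) {
      List<Job> result = new ArrayList<>();
      Job job = byFullName.get(fullName); // single hash lookup, independent of list size
      if (job != null) {
        result.add(job);
      }
      return result;
    }
    return findByToken(allJobs, token);
  }
}
```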

tomasbjerre · 2023-08-22T04:06:58Z

If one job needs 128 bytes, I do not see a problem with "waste of memory resources".

By "data inconsistency" you probably mean that any changes to the configuration will be delayed by the cache. I don't see a problem with that, as the configurations are (in my experience) rarely changed.

As I stated in your first PR:

> As I see it a PR is really just a way of asking for free maintenance. There is nothing stopping you from adjusting the code to your needs and using that in your Jenkins. But for me to maintain it, it has to be as simple as possible. If a complex feature only helps a few users I would rather not merge it.

So I don't want another solution here. The caching is the solution until I see some convincing motivation as to why it is not.

TomNewChao · 2023-08-22T08:40:57Z

> If one job needs 128 bytes, I do not se a problem with "waste of memory resources".
>
> By "data inconsistency" you probably mean that any changes to the configuration will be delayed by the cache. I dont see a problem with that as the configurations are (in my experience) rarely changed.
>
> As I stated in your first PR:
>
> As I see it a PR is really just a way of asking for free maintenance. There is nothing stopping you from adjusting the code to your needs and use that in you Jenkins. But for me to maintain it, it has to be as simple as possible. If a complex feature only helps a few users I would rather not merge it.
>
> So I dont want another solution here, the caching is the solution until I see some convincing motivation to why it is not.

Yeah, you are right. But when I used the cached job finder for a performance comparison in my development environment, I found that it did not reduce the request time. The cache layer seemed to have no effect, and I don't know why. Test scenario: Jenkins version 2.361.4, with Cache Get Jobs Minutes set to 5.

TomNewChao mentioned this issue Aug 8, 2023

add precise job finder #273

Closed

tomasbjerre added a commit that referenced this issue Aug 19, 2023

feat: add caching feature to jobfinder (refs #272)

087ac6a

tomasbjerre added a commit that referenced this issue Aug 19, 2023

feat: add caching feature to jobfinder (refs #272)

8c264f3

tomasbjerre added a commit that referenced this issue Aug 19, 2023

feat: add caching feature to jobfinder (refs #272)

0436b11

tomasbjerre added a commit that referenced this issue Aug 19, 2023

feat: add caching feature to jobfinder (refs #272)

80e08c2

tomasbjerre added a commit that referenced this issue Aug 19, 2023

feat: add caching feature to jobfinder (refs #272)

8cccf1b

tomasbjerre added a commit that referenced this issue Aug 19, 2023

feat: add caching feature to jobfinder (refs #272)

dcb9f98

tomasbjerre added a commit that referenced this issue Aug 19, 2023

feat: add caching feature in global config (refs #272)

fbad1b6

tomasbjerre added a commit that referenced this issue Aug 19, 2023

feat: add caching feature in global config (refs #272)

26fdff0

tomasbjerre added a commit that referenced this issue Aug 19, 2023

feat: add caching feature in global config (refs #272)

64e4bad

tomasbjerre added a commit that referenced this issue Aug 19, 2023

feat: add caching feature in global config (refs #272)

0a07ea4

tomasbjerre added a commit that referenced this issue Aug 19, 2023

feat: add caching feature in global config (refs #272)

a22625c

tomasbjerre added a commit that referenced this issue Aug 19, 2023

feat: add caching feature in global config (refs #272)

f5ca1d8

tomasbjerre added a commit that referenced this issue Aug 19, 2023

feat: add caching feature in global config (refs #272)

1922d49

tomasbjerre added a commit that referenced this issue Aug 19, 2023

feat: add caching feature in global config (refs #272) (#275)

811e50b

TomNewChao mentioned this issue Aug 21, 2023

Implement precise job in the simplest way #276

Closed

tomasbjerre changed the title ~~Optional jobfinder is required~~ Performance is slow for large installations Aug 22, 2023

tomasbjerre closed this as completed Aug 22, 2023
