Remove resource allocator #1990

kraih · 2019-02-07T10:07:35Z

When I started replacing the dbus methods in OpenQA::ResourceAllocator with a Redis alternative, I noticed that dbus was only used for blocking RPC calls. That means we could just as well have used OpenQA::Resource::Locks/Jobs directly from the API controllers, which makes the resource allocator obsolete. So that's what I ended up doing. This pull request removes the resource allocator completely, and I will have to use another service for introducing Redis into openQA (most likely the scheduler).

The only noticeable difference should be the removal of the single-process bottleneck that blocking RPC calls to OpenQA::ResourceAllocator caused. That also means there is a small risk of new race conditions now that locks/barriers can be managed by the prefork daemon. But looking through the code in OpenQA::Resource::Locks/Jobs (and having done a few tests), it seems rather defensive and I'm cautiously optimistic that it will "just work". 😉 Worst case, we need to add one or two transactions to OpenQA::Resource::Locks later.
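To illustrate the race-condition concern, here is a minimal sketch of taking a mutex safely when several prefork worker processes share one database. It is not openQA code: it uses Python with SQLite rather than Perl with DBIx::Class, and the `demo_locks` table and the `create_mutex`, `try_lock` and `unlock` helpers are hypothetical names made up for this example. The idea is that a single conditional UPDATE inside a transaction both checks and takes the lock, so two processes cannot both see the lock as free.

```python
import sqlite3


def create_schema(conn: sqlite3.Connection) -> None:
    """Create the hypothetical demo table (not the real openQA schema)."""
    with conn:
        conn.execute(
            "CREATE TABLE IF NOT EXISTS demo_locks ("
            " name TEXT PRIMARY KEY,"
            " locked_by INTEGER)"
        )


def create_mutex(conn: sqlite3.Connection, name: str) -> None:
    """Register an unlocked mutex; do nothing if it already exists."""
    with conn:
        conn.execute(
            "INSERT OR IGNORE INTO demo_locks (name, locked_by) VALUES (?, NULL)",
            (name,),
        )


def try_lock(conn: sqlite3.Connection, name: str, job_id: int) -> bool:
    """Take the mutex if it is free; return True only if this call got it."""
    with conn:  # commits on success, rolls back on an exception
        cur = conn.execute(
            "UPDATE demo_locks SET locked_by = ? "
            "WHERE name = ? AND locked_by IS NULL",
            (job_id, name),
        )
        # The check and the write happen in one statement, so exactly one
        # caller can change the row from NULL to its own job id.
        return cur.rowcount == 1


def unlock(conn: sqlite3.Connection, name: str, job_id: int) -> bool:
    """Release the mutex, but only if this job currently holds it."""
    with conn:
        cur = conn.execute(
            "UPDATE demo_locks SET locked_by = NULL "
            "WHERE name = ? AND locked_by = ?",
            (name, job_id),
        )
        return cur.rowcount == 1


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    create_schema(conn)
    create_mutex(conn, "demo_mutex")
    print(try_lock(conn, "demo_mutex", 1))  # first job takes the lock
    print(try_lock(conn, "demo_mutex", 2))  # second job is refused
```

A read-then-write sequence ("SELECT, then UPDATE if free") would need an explicit transaction with suitable locking to be safe across processes, which is presumably the kind of change meant by "one or two transactions" above.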

Progress: https://progress.opensuse.org/issues/46778

codecov · 2019-02-07T11:28:09Z

Codecov Report

Merging #1990 into master will increase coverage by 17.09%.
The diff coverage is 96%.

@@             Coverage Diff             @@
##           master    #1990       +/-   ##
===========================================
+ Coverage    71.8%   88.89%   +17.09%     
===========================================
  Files         132      153       +21     
  Lines        9614    10356      +742     
===========================================
+ Hits         6903     9206     +2303     
+ Misses       2711     1150     -1561

| Impacted Files | Coverage Δ | |
|---|---|---|
| lib/OpenQA/IPC.pm | `83.76% <ø> (-0.28%)` | ⬇️ |
| lib/OpenQA/WebAPI/Controller/API/V1/Command.pm | `25% <ø> (ø)` | |
| lib/OpenQA/Task/Asset/Download.pm | `16.94% <ø> (-5.78%)` | ⬇️ |
| lib/OpenQA.pm | `70.27% <ø> (ø)` | |
| lib/OpenQA/Scheduler/Scheduler.pm | `90.69% <ø> (+4.98%)` | ⬆️ |
| lib/OpenQA/WebAPI/Controller/API/V1/Iso.pm | `95.34% <ø> (+15.98%)` | ⬆️ |
| lib/OpenQA/WebAPI.pm | `97.14% <ø> (+0.33%)` | ⬆️ |
| lib/OpenQA/WebAPI/Controller/API/V1/Job.pm | `87.18% <100%> (+38.06%)` | ⬆️ |
| lib/OpenQA/Resource/Jobs.pm | `100% <100%> (ø)` | ⬆️ |
| lib/OpenQA/WebAPI/Controller/API/V1/Asset.pm | `98.07% <100%> (+64.74%)` | ⬆️ |

... and 83 more

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 7179521...893ae52. Read the comment docs.

codecov · 2019-02-07T11:28:09Z

Codecov Report

Merging #1990 into master will increase coverage by 17.17%.
The diff coverage is 96%.

@@             Coverage Diff             @@
##           master    #1990       +/-   ##
===========================================
+ Coverage    71.8%   88.97%   +17.17%     
===========================================
  Files         132      153       +21     
  Lines        9614    10340      +726     
===========================================
+ Hits         6903     9200     +2297     
+ Misses       2711     1140     -1571

| Impacted Files | Coverage Δ | |
|---|---|---|
| lib/OpenQA/IPC.pm | `83.76% <ø> (-0.28%)` | ⬇️ |
| lib/OpenQA/WebAPI/Controller/API/V1/Command.pm | `25% <ø> (ø)` | |
| lib/OpenQA/Task/Asset/Download.pm | `23.25% <ø> (+0.52%)` | ⬆️ |
| lib/OpenQA.pm | `70.27% <ø> (ø)` | |
| lib/OpenQA/Scheduler/Scheduler.pm | `88.37% <ø> (+2.65%)` | ⬆️ |
| lib/OpenQA/WebAPI/Controller/API/V1/Iso.pm | `95.34% <ø> (+15.98%)` | ⬆️ |
| lib/OpenQA/WebAPI.pm | `97.14% <ø> (+0.33%)` | ⬆️ |
| lib/OpenQA/WebAPI/Controller/API/V1/Job.pm | `87.18% <100%> (+38.06%)` | ⬆️ |
| lib/OpenQA/Resource/Jobs.pm | `100% <100%> (ø)` | ⬆️ |
| lib/OpenQA/WebAPI/Controller/API/V1/Asset.pm | `98.07% <100%> (+64.74%)` | ⬆️ |

... and 82 more

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 7179521...893ae52. Read the comment docs.

coolo · 2019-02-07T12:17:09Z

Please give this a good test with multi-machine jobs on staging.

Martchus · 2019-02-07T14:47:16Z

This already looks good and is also complete, with all the adjustments to the spec file and bootstrap script. Let's see how it works on staging.

okurz

+1

kraih · 2019-02-08T21:47:12Z

Multi-machine tests will take a little more time, I'm afraid, since our staging machines aren't working properly at the moment (and I'm still learning while trying to fix them 😉).

coolo · 2019-02-09T06:00:33Z

You learning about them has priority over merging this :)

kraih · 2019-02-12T17:14:22Z

Small progress update: I have run the CaaSP multi-machine tests so far and everything looks fine.

Martchus · 2019-02-13T09:10:06Z

I'm wondering on which instance. Only openqa-staging-1 has CaaSP jobs but they failed as incomplete.

kraih · 2019-02-13T10:54:25Z

@Martchus I restarted the CaaSP jobs right away yesterday to try again after they were successful, but it looks like the API keys expired around the same time...

kraih · 2019-02-13T12:32:52Z

@ldevulder also successfully ran some independent HA/HPC tests in his lab with this pull request applied.

Martchus

Not sure why http://openqa-staging-1.qa.suse.de/tests/756 failed but I guess it is beyond the scope of removing the resource allocator :-)

commit 0ef8f6a
Merge: 40a74f5 893ae52
Author:     Martchus <martchus@gmx.net>
AuthorDate: Fri Mar 1 16:17:22 2019 +0100
Commit:     GitHub <noreply@github.com>
CommitDate: Fri Mar 1 16:17:22 2019 +0100

    Merge pull request #1990 from kraih/remove_resource_allocator

    Remove resource allocator

kraih added 7 commits February 7, 2019 11:22

Move OpenQA::Resource::Jobs handling from the resource allocator to the webapi

348d1b9

Declare the DBIx::Class dependency properly

0886cb9

Remove first dbus method and use OpenQA::Schema directly

53fb393

Move mutex handling out of the resource allocator

f99c828

Move barrier handling out of the resource allocator

ad45128

Remove the resource allocator

b3049f0

Remove unused imports and IPC objects

893ae52

kraih force-pushed the remove_resource_allocator branch from ed00bcc to 893ae52 Compare February 7, 2019 10:25

coolo added the acceptance-tests-needed label (needed for code that is required to be tested on a production-like environment) Feb 7, 2019

okurz reviewed Feb 8, 2019

kraih mentioned this pull request Feb 12, 2019

Cache service migrations #1995

Merged

Martchus approved these changes Feb 13, 2019

foursixnine approved these changes Feb 27, 2019

Martchus merged commit 0ef8f6a into os-autoinst:master Mar 1, 2019

kraih deleted the remove_resource_allocator branch May 12, 2020 13:19

perlpunk added a commit to perlpunk/openQA that referenced this pull request Mar 19, 2022

a3665b26 2022-03-15 Merge pull request os-autoinst#1990

f60dac8

perlpunk added a commit to perlpunk/openQA that referenced this pull request Mar 19, 2022

a3665b26 2022-03-15 Merge pull request os-autoinst#1990

e6fdb93
