Release CPUs reminders #475

Merged
merged 1 commit into oVirt:master from release_allocation on Jun 23, 2022
Conversation

liranr23
Member

When we allocate CPUs we are basing upon the list we return back. This
is correct for the scheduling phase. But now we run the same logic on
canSchedule. There, when having a group of VMs, we are basing on the
cpuTopology list, not the list we selected to allocate. The overtaken
CPU wasn't released from the pinning itself and that harmed the
canSchedule logic for VM groups.

Also, when grouping the VMs, when the group is having an affinity set,
it is reordered. Therefore, it is required to sort them there as well.

Change-Id: Idcae90fed953c4798f2c4bf8893bff0a1b7086ea
Signed-off-by: Liran Rotenberg <lrotenbe@redhat.com>

@liranr23
Member Author

/ost

@ahadas
Member

ahadas commented Jun 20, 2022

The code changes LGTM, but the description should clarify the reason/context for the changes and it's not easy to parse; see the comments below.

When we allocate CPUs we are basing upon the list we return back.

by 'we are basing upon' you mean 'we rely on'?

There, when having a group of VMs, we are basing on the cpuTopology list, not the list we selected to allocate.

do you mean "we use the cpuTopology list"?

The overtaken CPU wasn't released from the pinning itself and that harmed the canSchedule logic for VM groups.

"The overtaken CPU remained pinned"?

Also, when grouping the VMs, when the group is having an affinity set, it is reordered. Therefore, it is required to sort them there as well.

"when grouping the VMs and having an affinity set"? what does "there" refer to?

When we allocate CPUs we rely on the list we return. This is
correct for the scheduling phase. But now we run the same logic in
canSchedule. There, when handling a group of VMs, we use the cpuTopology
list, not the list we selected to allocate. The overtaken CPU remained
pinned and that broke the canSchedule logic for VM groups.

When grouping the VMs, if the group has an affinity set, it is
reordered. Therefore, the VMs need to be sorted again after grouping.

Change-Id: Idcae90fed953c4798f2c4bf8893bff0a1b7086ea
Signed-off-by: Liran Rotenberg <lrotenbe@redhat.com>
@liranr23
Member Author


Thanks, commit changed.
"there" referred to the start of groupVms, where we sort the VM list. But that is only correct when there is no affinity.

@ljelinkova
Collaborator

I still do not understand what this PR is about. Do you have an example of the incorrect behavior?

@liranr23
Member Author

liranr23 commented Jun 20, 2022

I still do not understand what this PR is about. Do you have an example of the incorrect behavior?

The example is the one in the bug: https://bugzilla.redhat.com/show_bug.cgi?id=2079351
The 1st problem: we thought we sort the VM list in groupVms (under SchedulingManager), and we do, just not when there is an affinity group. When an affinity group is involved, the sort needs to be done at the end of the function, as in the sketch below.
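
As a rough illustration of that first point, here is a minimal sketch with made-up names (not the actual SchedulingManager code): the initial sort of the VM list is lost when a group is rebuilt from affinity-group membership, so the sort has to be repeated after grouping.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Map;

// Hypothetical sketch, not the real oVirt code: the up-front sort is discarded
// when affinity members are pulled in, so we sort again at the end.
public class GroupVmsSketch {

    record Vm(String name) {}

    static List<Vm> groupByAffinity(List<Vm> vms, Map<String, List<Vm>> affinityMembers) {
        // The VM list is sorted up front (this part was already there).
        List<Vm> sorted = new ArrayList<>(vms);
        sorted.sort(Comparator.comparing(Vm::name));

        // When a VM belongs to an affinity group, the whole group is pulled in,
        // in whatever order the group lists its members, which discards the
        // initial ordering.
        List<Vm> grouped = new ArrayList<>();
        for (Vm vm : sorted) {
            List<Vm> members = affinityMembers.get(vm.name());
            if (members != null) {
                members.stream().filter(m -> !grouped.contains(m)).forEach(grouped::add);
            } else if (!grouped.contains(vm)) {
                grouped.add(vm);
            }
        }

        // The fix described here: sort again at the end of the grouping step.
        grouped.sort(Comparator.comparing(Vm::name));
        return grouped;
    }

    public static void main(String[] args) {
        Vm a = new Vm("vm-a"), b = new Vm("vm-b"), c = new Vm("vm-c");
        // An affinity group that lists its members in non-sorted order.
        Map<String, List<Vm>> affinity = Map.of("vm-a", List.of(c, a), "vm-c", List.of(c, a));
        System.out.println(groupByAffinity(List.of(a, b, c), affinity));
        // Without the final sort the order would be vm-c, vm-a, vm-b.
    }
}
```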

The 2nd problem: when we enter CpuPinningPolicyUnit, we use the same cpuTopology for the whole group of VMs. When allocating (this may actually hit CPUPolicyUnit as well now), the logic is to allocate and "drop" the remainder CPU(s) if there are any; the returned list from the allocation function (cpusToAllocate) was the only place we looked.
But in canSchedule, using the same cpuTopology, we don't only look at cpusToAllocate; we keep passing cpuTopology along, and it never unPinned the remainders.

Example from the bug: a host with 4:4:1, VM1 with 4:3:1 and VM2 with 2:1:1.
In the allocation function we actually pin 15 CPUs (only the one used by VDSM is left out), while we only take 12 into cpusToAllocate (3 sockets each have 1 CPU left over as a remainder). With this change we also call unPin on those CPUs' VdsCpuUnit, so they can be used for the next VM (in our case VM2).
Does that make more sense?
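
To make the arithmetic of that example concrete, here is a toy sketch with assumed names (not the real VdsCpuUnit/cpuTopology API): 15 CPUs get pinned during selection, only 12 are kept in cpusToAllocate, and unless the 3 remainders are unpinned the shared topology leaves no room for VM2.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Toy model of the example above, with assumed names (not the real oVirt API):
// host of 4 sockets x 4 cores x 1 thread = 16 CPUs, CPU 0 taken by VDSM,
// VM1 asks for 4 sockets x 3 cores = 12 CPUs, VM2 asks for 2 CPUs.
public class RemainderReleaseSketch {

    static final int SOCKETS = 4;
    static final int CORES_PER_SOCKET = 4;
    static final int VDSM_CPU = 0;

    public static void main(String[] args) {
        Set<Integer> pinned = new HashSet<>();
        List<Integer> cpusToAllocate = new ArrayList<>();

        for (int socket = 0; socket < SOCKETS; socket++) {
            List<Integer> takenInSocket = new ArrayList<>();
            for (int core = 0; core < CORES_PER_SOCKET; core++) {
                int cpu = socket * CORES_PER_SOCKET + core;
                if (cpu == VDSM_CPU) {
                    continue;                      // the VDSM CPU is left out
                }
                pinned.add(cpu);                   // the selection pass pins the CPU
                takenInSocket.add(cpu);
            }
            // VM1 needs only 3 cores per socket; anything beyond that is a
            // remainder and is dropped from the returned list.
            cpusToAllocate.addAll(takenInSocket.subList(0, Math.min(3, takenInSocket.size())));
        }

        // Before the fix, 15 CPUs stayed pinned in the shared cpuTopology even
        // though only 12 were returned, leaving 16 - 1 - 15 = 0 free CPUs for
        // VM2 in canSchedule. The fix unpins the remainders:
        Set<Integer> remainders = new HashSet<>(pinned);
        remainders.removeAll(cpusToAllocate);
        pinned.removeAll(remainders);

        System.out.println("pinned during selection: 15"
                + ", kept in cpusToAllocate: " + cpusToAllocate.size()
                + ", remainders released: " + remainders.size()
                + ", CPUs free for VM2: " + (SOCKETS * CORES_PER_SOCKET - 1 - pinned.size()));
    }
}
```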

@ljelinkova
Collaborator

Yes, it is starting to make sense to me. Do I understand correctly that at first we pin all CPUs in cpusToBeAllocated and then remove and unpin some of them? Why is that? What does coresReminder mean?

@liranr23
Member Author

Yes, it is starting to make sense to me. Do I understand correctly that at first we pin all CPUs in cpusToBeAllocated and then remove and unpin some of them? Why is that? What does coresReminder mean?

When selecting CPUs to allocate we go over cpuTopology and pin them; cpusToBeAllocated holds the selected ones we really take. First we select and pin, then we remove any remainder left from cpusToBeAllocated. Now we also need to unPin those CPUs.

Why it's like that is a good question; I don't remember the reason (it was discussed when this was implemented). To me it's more of a greedy approach: when we see that a core doesn't have enough free threads to be used, we just release it.
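
A rough sketch of that greedy idea, using made-up types rather than the real VdsCpuUnit/CpuPinningPolicyUnit logic: threads are pinned core by core, and whatever is pinned but not kept for the VM is released again right away instead of staying taken in the shared topology.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical sketch, not the real oVirt code: a core that cannot supply the
// VM's full threads-per-core count is treated as a remainder and unpinned.
public class GreedyCoreSketch {

    record Core(int id, List<Integer> freeThreads) {}

    static List<Integer> selectThreads(List<Core> cores, int threadsPerCore, Set<Integer> pinned) {
        List<Integer> selected = new ArrayList<>();
        for (Core core : cores) {
            pinned.addAll(core.freeThreads());          // pin while the core is considered
            if (core.freeThreads().size() >= threadsPerCore) {
                List<Integer> kept = core.freeThreads().subList(0, threadsPerCore);
                selected.addAll(kept);
                // Any extra threads on this core are remainders.
                core.freeThreads().stream().filter(t -> !kept.contains(t)).forEach(pinned::remove);
            } else {
                core.freeThreads().forEach(pinned::remove);  // the whole core is a remainder
            }
        }
        return selected;
    }

    public static void main(String[] args) {
        Set<Integer> pinned = new HashSet<>();
        List<Core> cores = List.of(
                new Core(0, List.of(0, 1)),   // enough free threads: kept
                new Core(1, List.of(2)));     // too few threads: released again
        System.out.println(selectThreads(cores, 2, pinned));  // [0, 1]
        System.out.println(pinned);                           // [0, 1]; thread 2 was unpinned
    }
}
```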

Collaborator

@ljelinkova left a comment


I still do not completely understand how cpusToBeAllocated is created, but since that part has already been tested and works, the changes in this PR make sense.

@liranr23
Member Author

liranr23 commented Jun 23, 2022

OST passed last time; the only change since then is to the commit message.

@ahadas ahadas merged commit 9347171 into oVirt:master Jun 23, 2022
@liranr23 liranr23 deleted the release_allocation branch June 23, 2022 09:01