YUNIKORN-2196 Optimize aggregated resources tracking feature to not add overhead to scheduling #739

zhuqi-lucas · 2023-11-28T14:06:34Z

What is this PR for?

Currently, all tracking calculations are done in the scheduling cycle, which adds overhead to scheduling. We need to rethink and optimize this.

We may use the events that are generated to allow calculating this outside of the scheduler.

What type of PR is it?

Todos

- Task

What is the Jira issue?

https://issues.apache.org/jira/browse/YUNIKORN-2196

How should this be tested?

Screenshots (if appropriate)

Questions:

- The license files need an update.
- There are breaking changes for older versions.
- It needs documentation.


zhuqi-lucas · 2023-11-28T14:13:25Z

Hi @wilfred-s @pbacsko, this is a draft PR to move resource tracking out of the scheduling cycle. I would like your opinion on whether doing this in the event publishing cycle is the right approach.

A few more questions:

- The event storage seems to lose events once it reaches its size limit. Do we need to change it to a storage that never loses events, since we want resource tracking to be accurate?
- Do we need to add more fields to the EventRecord? For now I have put the app resource tracking fields into the EventRecord message field, but that does not seem good for performance.

pbacsko · 2023-11-28T15:31:15Z

@zhuqi-lucas I think we should hold off with this for a while. We need to benchmark and see how expensive resource tracking is. In this case, I'm not sure that benchmarking is actually needed.

As far as I can see, aggregation is called from three places:
- application.go L#1780
- application.go L#1787
- application.go L#1839

When does this happen? Allocation removal or placeholder replacement. It's not a very frequent operation. TrackedResource.AggregateTrackedResource() is not a particularly expensive method, there's a loop for the resources and some basic math.

The overhead of tracking is very low, I think almost unmeasurable. My performance unit test does not trigger it because it does not send pod completed events, but I'd be surprised if the call stack for this were visible.

I'd skip this one completely.

pbacsko · 2023-11-28T15:33:18Z

pkg/events/event_publisher.go

@@ -68,6 +77,43 @@ func (sp *EventPublisher) Stop() {
 	sp.stop.Store(true)
 }

+func (sp *EventPublisher) AggregateAppTrackedResourceFromEvents(messages []*si.EventRecord) {


This is very complicated for something which isn't even a problem in my opinion.

I'm a strong -1 for optimizing tracking this way.

pbacsko

I'm -1 on this.

We've talked about this recently. After checking and understanding the actual costs of the tracking, I would not touch the existing code at all. The new, event-based aggregation is VERY complicated.

zhuqi-lucas · 2023-11-29T02:00:28Z

Hi @pbacsko, thank you for the review. I agree that application tracking currently does not have much impact on scheduling performance.

For further resource aggregation, such as user/group tracking, could we have a separate service that does all the tracking calculation asynchronously? I am not sure if that is a good option.
And besides the external access to events through the REST API, do we want to do some internal aggregation and/or filtering of the events?

wilfred-s · 2023-11-29T03:40:38Z

I think we can simply use the existing events and push the whole aggregation out of YuniKorn. The only thing that seems to be missing is the node instance type. If that is available just listening to the app and node events would allow creating the summary outside of the scheduler.
When we have the user and group links to the applications as events we can do the same for those without any impact on the scheduling cycle.

So instead of adding that inside the event system we should push this out. Keep the scheduler for scheduling. Make all this an add on...

That is where my remarks came from when I looked at the aggregation for users and groups.

wilfred-s · 2023-11-29T03:46:44Z

> For further resource aggregation, such as user/group tracking, could we have a separate service that does all the tracking calculation asynchronously? I am not sure if that is a good option.

That is the best option. A scheduler should schedule. It is not for providing statistics.

> And besides the external access to events through the REST API, do we want to do some internal aggregation and/or filtering of the events?

No, because whatever we come up with will not fit all use cases. The only thing we can think of, which has been discussed before, is splitting the streams into just one type i.e. only application events. Not even sure that it will help much.
If filtering or something needs to happen

zhuqi-lucas · 2023-11-29T04:37:17Z

Thank you @wilfred-s. I am thinking about how events flow through the system. If we can extend the event so that a trackingService can use it, we could do the following (cc @pbacsko):

- When we add an event, we pass it to the eventSystem, which stores it for pushing events to the shim.
- When we add an event, we also notify the trackingService so it can aggregate.
- So the same event is passed to both the eventSystem and the trackingService.

This way the trackingService does not need its own event storage and does not need to push events to the shim. However, should we have storage so that the REST API can query the aggregated results? That probably requires a design as well. We could also expose some historical aggregation results in the UI and provide the duration covered by the history data?
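The flow proposed in this comment, where one event is handed to two consumers, can be sketched as a small fan-out. Everything here is hypothetical: the `event` and `publisher` types and the channel names are invented for the example and do not correspond to existing YuniKorn code. The send is non-blocking so that a slow consumer cannot stall the caller, at the price of dropping events when a consumer's buffer is full.

```go
package main

import "fmt"

// event is a hypothetical, minimal event payload.
type event struct {
	ObjectID string
	Message  string
}

// publisher delivers each event to every registered sink without blocking.
type publisher struct {
	sinks []chan<- event
}

// publish offers ev to all sinks and reports how many accepted it.
func (p *publisher) publish(ev event) (delivered int) {
	for _, s := range p.sinks {
		select {
		case s <- ev:
			delivered++
		default:
			// Sink buffer is full: drop the event rather than stall the caller.
		}
	}
	return delivered
}

func main() {
	toEventSystem := make(chan event, 2) // stands in for the existing event storage
	toTracking := make(chan event, 1)    // stands in for the proposed trackingService
	p := &publisher{sinks: []chan<- event{toEventSystem, toTracking}}

	fmt.Println(p.publish(event{ObjectID: "app-1", Message: "allocation added"}))
	// Nothing has drained toTracking yet, so the second event reaches only one sink.
	fmt.Println(p.publish(event{ObjectID: "app-1", Message: "allocation removed"}))
}
```

The dropped second event shows the same accuracy concern raised earlier about the event storage: any in-process consumer with a bounded buffer has to choose between blocking the publisher and losing events.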

zhuqi-lucas · 2023-11-29T13:26:54Z

Anyway, I created a Jira to extend the EventRecord first, so we can build on that:
https://issues.apache.org/jira/browse/YUNIKORN-2208

zhuqi-lucas added 5 commits November 28, 2023 11:52

YUNIKORN-2196 Optimize aggregated resources tracking feature to not a…

c494436

…dd overhead to scheduling

Merge remote-tracking branch 'upstream/master' into YUNIKORN-2196

c00efa4

fix

0487b15

Move to tracking package.

351735a

Remove scheduling tracking logic

0d35672

zhuqi-lucas marked this pull request as draft November 28, 2023 14:06

pbacsko reviewed Nov 28, 2023


pbacsko requested changes Nov 28, 2023


pbacsko assigned zhuqi-lucas Nov 28, 2023
