
Conversation

@zuston (Member) commented Sep 25, 2024

What changes were proposed in this pull request?

Fix the resource leak of the YARN allocator.

Why are the changes needed?

When the target number of executors is lower than the number of running containers, newly assigned containers from the ResourceManager are skipped, but they are never released by invoking amClient.releaseAssignedContainer. That leaves these containers reserved in the YARN ResourceManager for at least 10 minutes, so cluster resources are wasted at a high ratio.

This also shows up as the vcore * seconds statistics reported by YARN being greater than the numbers derived from the Spark event logs.

From my statistics, the cluster resource waste ratio is ~25% when the cluster runs Spark jobs exclusively.
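
In sketch form, the essence of the fix is a single release call on each surplus container. The helper below is illustrative only (handleSurplusContainer is a hypothetical name, not the actual YarnAllocator diff); amClient.releaseAssignedContainer is the real AMRMClient API involved:

```scala
import org.apache.hadoop.yarn.api.records.Container
import org.apache.hadoop.yarn.client.api.AMRMClient
import org.apache.hadoop.yarn.client.api.AMRMClient.ContainerRequest

// Hypothetical helper: when the allocator already has enough executors for
// the current target, hand the surplus container back to the ResourceManager
// instead of silently skipping it.
def handleSurplusContainer(
    amClient: AMRMClient[ContainerRequest],
    container: Container,
    runningExecutors: Int,
    targetExecutors: Int): Unit = {
  if (runningExecutors >= targetExecutors) {
    // Without this call, the skipped container stays reserved in the
    // YARN ResourceManager until it expires.
    amClient.releaseAssignedContainer(container.getId)
  }
}
```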

Does this PR introduce any user-facing change?

No

How was this patch tested?

In our internal Hadoop cluster

Was this patch authored or co-authored using generative AI tooling?

No

@github-actions github-actions bot added the YARN label Sep 25, 2024
@zuston zuston changed the title [SPARK-49783] fix(yarn): Resource leak of yarn allocator [SPARK-49783][YARN] Fix resource leak of yarn allocator Sep 25, 2024
@zuston (Member, Author) commented Sep 26, 2024

@LuciferYang (Contributor) commented Sep 26, 2024

also cc @tgravescs and @pan3793

@LuciferYang (Contributor) commented
Is it possible to add a new test case with MiniYARNCluster? @zuston

@zuston (Member, Author) commented Sep 26, 2024

> Is it possible to add a new test case with MiniYARNCluster? @zuston

From my perspective, this case is hard to simulate and reproduce in a test case. But I have verified it in our internal cluster; the detailed verification can be found in this blog post: https://zuston.vercel.app/publish/resource-leak-of-spark-yarn-allocator#Verification
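
For context on the difficulty: booting a MiniYARNCluster itself is straightforward, but deterministically getting the ResourceManager to assign containers after the executor target has already dropped below the running count depends on allocation timing. A rough sketch of the harness side (using the MiniYARNCluster API from hadoop-yarn-server-tests; the assertion step is the part that is hard to script):

```scala
import org.apache.hadoop.yarn.conf.YarnConfiguration
import org.apache.hadoop.yarn.server.MiniYARNCluster

// Boots an in-process ResourceManager plus one NodeManager.
val yarnCluster = new MiniYARNCluster("resource-leak-test", 1, 1, 1)
yarnCluster.init(new YarnConfiguration())
yarnCluster.start()
// Hard part: submit an app, shrink the executor target while container
// requests are still in flight, then assert that surplus containers are
// released rather than left reserved. That race is timing-dependent.
yarnCluster.stop()
```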

@LuciferYang (Contributor) commented Sep 26, 2024

The code looks OK to me, but it would be even better if we could continuously ensure this behavior meets expectations by adding a test case.

Also, could you provide more detailed information in the "How was this patch tested?" section? The description "In our internal Hadoop cluster" is too much of a black box; we need a reproducible verification method so that reviewers can confirm the issue truly exists and has been fixed.

@pan3793 (Member) commented Sep 26, 2024

The changes make sense to me, but I'm confused about the impact.

> cluster resource waste ratio is ~25%

Does it mean that, with this change, the wasted resources could be leveraged so that all Spark jobs can use more resources and execute faster? Or does it just shrink the metric gap between the Spark event logs and YARN?

@zuston (Member, Author) commented Sep 26, 2024

> Does it mean that, with this change, the wasted resources could be leveraged so that all Spark jobs can use more resources and execute faster? Or does it just shrink the metric gap between the Spark event logs and YARN?

The unreleased resources will still be occupied for at least 10 minutes in the YARN ResourceManager, but they are not used by Spark, so these resources are wasted.
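
For reference, the 10-minute figure matches YARN's default container-allocation expiry interval; a sketch of reading it (assuming the standard YarnConfiguration constants):

```scala
import org.apache.hadoop.yarn.conf.YarnConfiguration

// yarn.resourcemanager.rm.container-allocation.expiry-interval-ms,
// 600000 ms (10 minutes) by default in yarn-default.xml.
val expiryMs = new YarnConfiguration().getLong(
  YarnConfiguration.RM_CONTAINER_ALLOC_EXPIRY_INTERVAL_MS,
  YarnConfiguration.DEFAULT_RM_CONTAINER_ALLOC_EXPIRY_INTERVAL_MS)
```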

And the gap between the vcore * seconds metrics collected by YARN and the vcore * seconds computed from the Spark event logs of all finished jobs is exactly the wasted resource.
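
As a purely hypothetical illustration (made-up numbers, not measured values):

```scala
// If YARN charged the queue 1,000,000 vcore-seconds while the Spark event
// logs account for only 750,000, the leaked share is 25%.
val yarnVcoreSeconds  = 1000000L
val sparkVcoreSeconds = 750000L
val wasteRatio = (yarnVcoreSeconds - sparkVcoreSeconds).toDouble / yarnVcoreSeconds
// wasteRatio == 0.25, i.e. the ~25% ratio mentioned above
```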

I will attach some online cluster reports if possible.

@mridulm (Contributor) commented Dec 15, 2024

Looks reasonable to me, but it would be better if @tgravescs could take a look.

@zuston (Member, Author) commented Dec 19, 2024

Could you help review this? @tgravescs

@tgravescs (Contributor) left a comment

The change looks fine to me. I can't think of any reason not to do this, but it's been a while since I did YARN stuff.

Can you clarify what testing you have done? I just see "tested on internal cluster". Was that a single job, or is it running on thousands of jobs every day, etc.?

@zuston (Member, Author) commented Dec 20, 2024

> The change looks fine to me. I can't think of any reason not to do this, but it's been a while since I did YARN stuff.
>
> Can you clarify what testing you have done? I just see "tested on internal cluster". Was that a single job, or is it running on thousands of jobs every day, etc.?

This patch has been applied to our internal Spark 3.5 version and has been running on a Hadoop 3.2.1 cluster for 2+ months, with 150K+ Spark jobs daily.

@tgravescs (Contributor) commented
Thanks, +1.

@cxzl25 (Contributor) commented Dec 23, 2024

This PR looks good. YARN-11702 proposes a general method. I am not sure whether it is related to this PR.

JIRA

YARN-11702: Fix Yarn over allocating containers
https://issues.apache.org/jira/browse/YARN-11702
Fix Version/s: 3.5.0

@zuston (Member, Author) commented Dec 23, 2024

> This PR looks good. YARN-11702 proposes a general method. I am not sure whether it is related to this PR.
>
> JIRA
>
> YARN-11702: Fix Yarn over allocating containers https://issues.apache.org/jira/browse/YARN-11702 Fix Version/s: 3.5.0

Thanks for your reply. I have seen YARN-11702; from my side, it solves the concurrent-allocation problem on the AM-RM connection, and it is only effective within the scope of the resource request (not including the scheduling request).

@zuston (Member, Author) commented Jan 7, 2025

> Looks reasonable to me, but it would be better if @tgravescs could take a look.

Could you help take another look? @mridulm

@dongjoon-hyun (Member) left a comment

+1, LGTM for Improvement.

Thank you, @zuston, @mridulm, @tgravescs, @LuciferYang, @pan3793, @cxzl25.

Merged to master for Apache Spark 4.0.0 (Feature Freeze on January 15th).

turboFei added a commit to turboFei/spark that referenced this pull request Nov 6, 2025
…pache#672)

[SPARK-49783][YARN] Fix resource leak of yarn allocator


Closes apache#48238 from zuston/patch-1.

Authored-by: Junfan Zhang <zuston@apache.org>

Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
Co-authored-by: Junfan Zhang <zuston@apache.org>
