[SPARK-55093][CORE] Handle TaskRunner construction failures in launchTask #53865

ChuckLin2025 · 2026-01-20T07:56:22Z

What changes were proposed in this pull request?

- Move createTaskRunner into try-catch block to handle construction failures
- Add cleanup to remove TaskRunner from runningTasks if threadPool.execute throws
- Prevent potential memory leak by cleaning up TaskRunner when threadPool.execute fails
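
The shape of the change can be sketched roughly as follows. This is a minimal, self-contained illustration and not the actual Spark code: `MiniExecutor`, the simplified `TaskRunner`, the `reportFailure` callback, and the choice to catch `NonFatal` are all assumptions made for the example.

```scala
import java.util.concurrent.{ConcurrentHashMap, ExecutorService}
import scala.util.control.NonFatal

// Hypothetical stand-in for Spark's TaskRunner; it only carries a task id.
final class TaskRunner(val taskId: Long) extends Runnable {
  override def run(): Unit = ()
}

// Hypothetical stand-in for the Executor; only the parts relevant to launchTask.
final class MiniExecutor(
    threadPool: ExecutorService,
    createTaskRunner: Long => TaskRunner,
    reportFailure: (Long, Throwable) => Unit) {

  val runningTasks = new ConcurrentHashMap[Long, TaskRunner]()

  def launchTask(taskId: Long): Unit = {
    try {
      // Construction happens inside the try block, so a throwing
      // constructor is caught instead of escaping launchTask.
      val runner = createTaskRunner(taskId)
      runningTasks.put(taskId, runner)
      threadPool.execute(runner)
    } catch {
      case NonFatal(e) =>
        // If the runner was registered before the failure, remove it so
        // the map does not keep an entry for a task that will never run.
        runningTasks.remove(taskId)
        // Assumed reporting hook; the real code reports to the driver.
        reportFailure(taskId, e)
    }
  }
}
```

Passing a thread pool that has already been shut down is one way to exercise the cleanup branch, since `execute` then throws `RejectedExecutionException`.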

Why are the changes needed?

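createTaskRunner may throw an exception. Before this change it was called outside the try-catch block in launchTask, so a construction failure was not handled there.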

Does this PR introduce any user-facing change?

No

How was this patch tested?

Added a unit test.

Was this patch authored or co-authored using generative AI tooling?

Generated-by: claude-4.5

…on failure

- Move createTaskRunner into try-catch block to handle construction failures
- Add cleanup to remove TaskRunner from runningTasks if threadPool.execute throws
- Prevent potential memory leak by cleaning up TaskRunner when threadPool.execute fails
- Update test to use current TaskDescription API

- Use reflection to mock runningTasks.put to throw exception
- Tests cleanup logic when TaskRunner is created but fails to be added to runningTasks
- Verify exception is properly caught and reported to driver

github-actions · 2026-01-20T07:56:33Z

JIRA Issue Information

=== Bug SPARK-55093 ===
Summary: Handle TaskRunner construction failures in launchTask
Assignee: None
Status: Open
Affected: ["4.1.1"]

This comment was automatically generated by GitHub Actions

ChuckLin2025 · 2026-01-20T07:58:37Z

@cloud-fan @Ngone51 What do you think about this PR?

Ngone51

LGTM

…ng mocked Executor fields

The test was failing because the mocked Executor's val fields (runningTasks, threadPool, conf, env, killMarks) were not initialized when using mock[Executor](CALLS_REAL_METHODS). This caused NullPointerExceptions when the real launchTask method tried to access them.

Solution: Use reflection to manually set these val fields on the mocked Executor object after creation, allowing the real launchTask method to execute properly and add tasks to the runningTasks map.

Test now passes consistently in ~1.4 seconds.
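
The reflection approach described in that commit message might look roughly like the sketch below. It is not the test code from this PR: `FakeExecutor` and `setField` are hypothetical names, and the example assumes the Scala compiler stores a `val` in a private field with the same name as the `val`. A null constructor argument stands in for a mock whose constructor never ran.

```scala
import java.util.concurrent.ConcurrentHashMap

// Hypothetical stand-in for a class whose val field ends up null,
// as happens when a mock is created without running the constructor.
class FakeExecutor(init: ConcurrentHashMap[Long, String]) {
  val runningTasks: ConcurrentHashMap[Long, String] = init
  def launch(id: Long): Unit = runningTasks.put(id, s"task-$id")
}

object ReflectionFields {
  // Sets a (possibly final) instance field by name. The declaring class is
  // passed explicitly because a mock is usually a generated subclass, so
  // target.getClass would not declare the field itself.
  def setField(declaring: Class[_], target: AnyRef, name: String, value: AnyRef): Unit = {
    val field = declaring.getDeclaredField(name)
    field.setAccessible(true)
    field.set(target, value)
  }
}
```

In use, the field is injected after the object exists and before the real method is invoked:

```scala
val executor = new FakeExecutor(null) // runningTasks starts out null
ReflectionFields.setField(
  classOf[FakeExecutor], executor, "runningTasks", new ConcurrentHashMap[Long, String]())
executor.launch(7L) // no longer throws a NullPointerException
```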

ChuckLin2025 · 2026-01-26T02:54:38Z

@Ngone51 Could you please take another look? I fixed the test case "track allocated resources by taskId" in a new commit.

The test case didn't mock killMarks, so it ran into a NullPointerException. It passed before this PR because launchTask did not yet handle the exception and run the cleanup.

Zequn Lin added 2 commits January 20, 2026 04:01

[SPARK-55093] Fix test for TaskRunner construction failures

e1c8d7d


github-actions bot added the CORE label Jan 20, 2026

ChuckLin2025 changed the title ~~55093~~ [SPARK 55093] Handle TaskRunner construction failures in launchTask #1 Jan 20, 2026

ChuckLin2025 changed the title ~~[SPARK 55093] Handle TaskRunner construction failures in launchTask #1~~ [SPARK 55093] Handle TaskRunner construction failures in launchTask Jan 20, 2026

Ngone51 approved these changes Jan 20, 2026

View reviewed changes

Ngone51 changed the title ~~[SPARK 55093] Handle TaskRunner construction failures in launchTask~~ [SPARK 55093][CORE] Handle TaskRunner construction failures in launchTask Jan 20, 2026

cloud-fan changed the title ~~[SPARK 55093][CORE] Handle TaskRunner construction failures in launchTask~~ [SPARK-55093][CORE] Handle TaskRunner construction failures in launchTask Jan 21, 2026

ChuckLin2025 added 2 commits January 26, 2026 02:17

Merge branch 'apache:master' into 55093

90adbc6
