Skip to content

[CELEBORN-1457][0.4] Avoid NPE during shuffle data cleanup#2565

Closed
cfmcgrady wants to merge 1 commit intoapache:branch-0.4from
cfmcgrady:CELEBORN-1457-branch-0.4
Closed

[CELEBORN-1457][0.4] Avoid NPE during shuffle data cleanup#2565
cfmcgrady wants to merge 1 commit intoapache:branch-0.4from
cfmcgrady:CELEBORN-1457-branch-0.4

Conversation

@cfmcgrady
Copy link
Contributor

What changes were proposed in this pull request?

backport #2553 to branch-0.4

Avoid NPE during shuffle data cleanup by checking for null LevelDB.

Why are the changes needed?

If the LevelDB in StorageManager fails to initialize, the db will be null. This will cause a java.lang.NullPointerException when storageManager.cleanupExpiredShuffleKey(expiredShuffleKeys) is called, and the shuffle data in expiredShuffleKeys will not be cleaned up. The worker's disk may be filled up as a result.

Does this PR introduce any user-facing change?

No

How was this patch tested?

Manual Testing

### What changes were proposed in this pull request?
Avoid NPE during shuffle data cleanup by checking for null LevelDB.

### Why are the changes needed?
If the LevelDB in StorageManager fails to initialize, the db will be null. This will cause a java.lang.NullPointerException when storageManager.cleanupExpiredShuffleKey(expiredShuffleKeys) is called, and the shuffle data in expiredShuffleKeys will not be cleaned up. The worker's disk may be filled up as a result.

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
Manual Testing

Closes apache#2553 from jiang13021/celeborn-1457.

Authored-by: jiang13021 <jiangyanze.jyz@antgroup.com>
Signed-off-by: SteNicholas <programgeek@163.com>
@cfmcgrady cfmcgrady force-pushed the CELEBORN-1457-branch-0.4 branch from 54b7de2 to d9478e5 Compare June 14, 2024 05:23
@cfmcgrady
Copy link
Contributor Author

thanks, merging to branch-0.4(v0.4.2).

cfmcgrady pushed a commit that referenced this pull request Jun 14, 2024
### What changes were proposed in this pull request?

backport #2553 to `branch-0.4`

Avoid NPE during shuffle data cleanup by checking for null LevelDB.

### Why are the changes needed?
If the LevelDB in StorageManager fails to initialize, the db will be null. This will cause a java.lang.NullPointerException when storageManager.cleanupExpiredShuffleKey(expiredShuffleKeys) is called, and the shuffle data in expiredShuffleKeys will not be cleaned up. The worker's disk may be filled up as a result.

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?
Manual Testing

Closes #2565 from cfmcgrady/CELEBORN-1457-branch-0.4.

Authored-by: jiang13021 <jiangyanze.jyz@antgroup.com>
Signed-off-by: Fu Chen <cfmcgrady@gmail.com>
@cfmcgrady cfmcgrady closed this Jun 14, 2024
@cfmcgrady cfmcgrady deleted the CELEBORN-1457-branch-0.4 branch June 14, 2024 07:01
cfmcgrady pushed a commit to cfmcgrady/incubator-celeborn that referenced this pull request Aug 21, 2025
### What changes were proposed in this pull request?

backport apache#2553 to `branch-0.4`

Avoid NPE during shuffle data cleanup by checking for null LevelDB.

### Why are the changes needed?
If the LevelDB in StorageManager fails to initialize, the db will be null. This will cause a java.lang.NullPointerException when storageManager.cleanupExpiredShuffleKey(expiredShuffleKeys) is called, and the shuffle data in expiredShuffleKeys will not be cleaned up. The worker's disk may be filled up as a result.

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?
Manual Testing

Closes apache#2565 from cfmcgrady/CELEBORN-1457-branch-0.4.

Authored-by: jiang13021 <jiangyanze.jyz@antgroup.com>
Signed-off-by: Fu Chen <cfmcgrady@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants