-
Notifications
You must be signed in to change notification settings - Fork 2.9k
Open
Labels
bugSomething isn't workingSomething isn't working
Description
Apache Iceberg version
1.8.1
Query engine
Flink
Please describe the bug 🐞
ENV:
flink version: 1.19.0
parallelism: 128
Problem:
When running a flink job with RANGE distribution mode, after several cycles of running the job for a while, stopping with a savepoint, and restarting from that savepoint, the range shuffle operator stuck at the “INITIALIZING” status without any error or warn logs, while all other operators successfully transition to the RUNNING state.
Steps to Reproduce:
- Run a job in range distribution mode.
- Stop with savepoint.
- Restart from the savepoint.
- Repeat steps 1–3 multiple times.
- Eventually, after a restart, the job gets stuck in the INITIALIZING stage.
The hot thread of range-shuffle operator is:

Willingness to contribute
- I can contribute a fix for this bug independently
- I would be willing to contribute a fix for this bug with guidance from the Iceberg community
- I cannot contribute a fix for this bug at this time
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't working