Skip to content

[Bug] [server] task can not be submitted when the worker is down #12938

@ChPi

Description

@ChPi

Search before asking

  • I had searched in the issues and found no similar issues.

What happened

version 3.1.0.

i run a flow on dolphinscheduler k8s cluster, then delete the worker which task running,the task cannot be resubmitted.
the task state is always Need fault tolerance.

when a worker is down, the master will set TASK_STATE_CHANGE and NEED_FAULT_TOLERANCE for the task, then call action(run) at TaskStateEventHandler, should it call action(resubmit) for NEED_FAULT_TOLERANCE ?

What you expected to happen

resubmit the task.

How to reproduce

  1. The task is running in worker1
  2. delete worker1
  3. The task need fault tolerance, but it cannot be resubmitted

Anything else

No response

Version

3.1.x

Are you willing to submit PR?

  • Yes I am willing to submit a PR!

Code of Conduct

Metadata

Metadata

Assignees

No one assigned

    Labels

    Waiting for replyWaiting for replybugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions