[Bug][master] After batch deleting the executing process instances, the master cannot get the worker feedback results #3621

Closed
xingchun-chen opened this issue Aug 28, 2020 · 4 comments
Labels
bug Something isn't working

@xingchun-chen
Contributor

xingchun-chen commented Aug 28, 2020

  1. Run more than 40 workflows in a batch, so that the workflow instances and task instances are all in the executing state
  2. Click the batch delete button on the process instance page
  3. After a worker task executes successfully, the master does not get the result fed back by the worker
  4. Run a workflow again; the master does not assign tasks to the workers

Workaround: after restarting the master service, all tasks run normally



Which version of Dolphin Scheduler:
-[1.3.2-release]

@xingchun-chen xingchun-chen added the bug Something isn't working label Aug 28, 2020
@xingchun-chen xingchun-chen added this to the 1.3.3-release milestone Aug 28, 2020
@yangyichao-mango
Contributor

I will try to reproduce this bug later.

@xingchun-chen
Contributor Author

After the worker returns a result, the master needs to check whether the process instance still exists: if it exists, update it; if it does not exist, throw the result away. The other problem is that running a new workflow does not issue any tasks.
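The existence check described above could look roughly like the sketch below. This is only an illustration, not code from the DolphinScheduler master: `TaskResponseGuard`, `TaskResponse`, the in-memory map standing in for the process instance table, and the state strings are all hypothetical names chosen for the example.

```java
import java.util.Map;
import java.util.Objects;
import java.util.concurrent.ConcurrentHashMap;

/** Hypothetical sketch; none of these names come from the DolphinScheduler code base. */
final class TaskResponseGuard {

    /** Minimal stand-in for the result a worker reports back to the master. */
    static final class TaskResponse {
        final int processInstanceId;
        final String state;

        TaskResponse(int processInstanceId, String state) {
            this.processInstanceId = processInstanceId;
            this.state = Objects.requireNonNull(state, "state");
        }
    }

    /** In-memory stand-in for the process instance table: id -> last known state. */
    private final Map<Integer, String> processInstances = new ConcurrentHashMap<>();

    void register(int processInstanceId) {
        processInstances.put(processInstanceId, "RUNNING");
    }

    void delete(int processInstanceId) {
        processInstances.remove(processInstanceId);
    }

    String stateOf(int processInstanceId) {
        return processInstances.get(processInstanceId);
    }

    /**
     * Applies a worker response only if the process instance still exists.
     * Returns true when the state was updated, false when the response was discarded.
     */
    boolean handle(TaskResponse response) {
        // computeIfPresent checks and updates atomically and does nothing for a missing key,
        // so a response for a deleted instance is dropped instead of blocking later work.
        return processInstances.computeIfPresent(
                response.processInstanceId, (id, oldState) -> response.state) != null;
    }

    public static void main(String[] args) {
        TaskResponseGuard guard = new TaskResponseGuard();
        guard.register(1);
        guard.register(2);
        guard.delete(2); // simulates the batch delete while tasks are still running

        System.out.println(guard.handle(new TaskResponse(1, "SUCCESS"))); // instance exists: updated
        System.out.println(guard.handle(new TaskResponse(2, "SUCCESS"))); // instance deleted: discarded
    }
}
```

In a real master the lookup would go to the database rather than a map, so the check and the update would need to happen in one transaction to avoid racing with a concurrent delete.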

@yangyichao-mango
Contributor

> After the worker returns a result, the master needs to check whether the process instance still exists: if it exists, update it; if it does not exist, throw the result away. The other problem is that running a new workflow does not issue any tasks.

I could not reproduce step 3. I tried it with 45 process instances and batch deleted 10 of them; the master still got the right result for the other 35 process instances.

I did reproduce step 4: after I batch deleted process instances and started a new process instance, the log showed that it was assigned to a worker and that the worker ran it successfully, but the master did not get the result response. I will trace it later.

@break60
Contributor

break60 commented Sep 17, 2020

If a workflow instance is in the executing state, it cannot be selected.
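As a rough illustration of that rule, the sketch below filters a selection so that executing instances never reach a batch delete. It is an assumption-laden example rather than the actual change: `BatchDeleteGuard`, `Instance`, and the `State` enum are hypothetical, and the real restriction may live only in the UI.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

/** Hypothetical sketch; the names are illustrative and not taken from DolphinScheduler. */
final class BatchDeleteGuard {

    /** Simplified, assumed set of workflow instance states. */
    enum State { RUNNING, SUCCESS, FAILURE, STOPPED }

    static final class Instance {
        final int id;
        final State state;

        Instance(int id, State state) {
            this.id = id;
            this.state = state;
        }
    }

    /** An instance may be selected for deletion only when it is not executing. */
    static boolean isSelectable(Instance instance) {
        return instance.state != State.RUNNING;
    }

    /** Returns the ids that a batch delete may act on, skipping executing instances. */
    static List<Integer> deletableIds(List<Instance> selected) {
        return selected.stream()
                .filter(BatchDeleteGuard::isSelectable)
                .map(instance -> instance.id)
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<Instance> selected = Arrays.asList(
                new Instance(1, State.RUNNING),
                new Instance(2, State.SUCCESS),
                new Instance(3, State.FAILURE));
        System.out.println(deletableIds(selected)); // the RUNNING instance is skipped
    }
}
```

Applying the same check on the server side as well as in the UI would also cover direct API calls that bypass the page.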

3 participants