New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug] [运维中心] 在dinky正常运行,作业发布后,如果作业一直running,但是flink集群关闭情况下,运维中心重启后获取到作业状态一直running #700
Closed
2 of 3 tasks
Comments
请附截图 |
+1 |
这是正常的,因为由于意外原因无法请求到jobmanager,所以无法获取最新的算子状态,默认不更新,即还是RUNNING。 |
意外原因无法请求到jobmanager,个人认为默认更新为UNKNOWN可能更合理些,是希望平台能够管理监控任务的,变为UNKNOWN告警好去检查异常 |
任务实例状态已经更新为 UNKNOWN 了 |
我认为job instance的状态 和job history的状态应该保持一致才对,不然会出现状态不一致导致用户出现迷惑 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Search before asking
What happened
在dinky正常运行,作业发布后,如果作业状态一直running,但是flink集群关闭情况下,运维中心重启后获取到作业状态一直running。
What you expected to happen
希望运维中心的状态根据实时的进行,同时支持以dinky端数据为主,即如果dinky原来的作业状态为running状态,但是flink集群没有对应job或者已经关闭,那么dinky可以自动重启一个job。
How to reproduce
直接在dinky-admin web端提交任务,然后直接关闭flink集群,然后重新启动dinky就可以复现。
Anything else
No response
Version
0.6.5
Are you willing to submit PR?
Code of Conduct
The text was updated successfully, but these errors were encountered: