Pod are repeatedly created and deleted #3601

Wang-Kai · 2024-07-16T09:17:39Z

What happened:

Due to the bug, the job was deleted from etcd, but it still remains in the cache. This causes the Volcano controller to create a pod, and then the GC controller deletes the pod instantly, resulting the operation being executed repeatedly. This causes a lot of load on the apiserver.

企业微信截图_2243befd-7db5-44f9-9ffe-347384317ebd

What you expected to happen:

The volcano controller's cache keep same with etcd, and should not fight with the GC controller about pod.

How to reproduce it (as minimally and precisely as possible):

When a pod is updating, delete the owner job instantly.

Anything else we need to know?:

Environment: linux

Volcano Version: v1.8.2
Kubernetes version (use kubectl version): v1.20
Cloud provider or hardware configuration:
OS (e.g. from /etc/os-release): Debian GNU/Linux 9 (stretch)
Kernel (e.g. uname -a): Linux 5.10.0-103-bili-colo x86_64
Install tools:
Others:

The text was updated successfully, but these errors were encountered:

Wang-Kai added the kind/bug Categorizes issue or PR as related to a bug. label Jul 16, 2024

Monokaix mentioned this issue Jul 17, 2024

Performance improvement #3502

Open

2 tasks

Wang-Kai linked a pull request Aug 21, 2024 that will close this issue

remove deletedJobs queue in cache model #3686

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pod are repeatedly created and deleted #3601

Pod are repeatedly created and deleted #3601

Wang-Kai commented Jul 16, 2024

Pod are repeatedly created and deleted #3601

Pod are repeatedly created and deleted #3601

Comments

Wang-Kai commented Jul 16, 2024