Replies: 8 comments
-
deploy across two Kubernetes clusters.
|
Beta Was this translation helpful? Give feedback.
-
|
The management cluster part of this architecture diagram is almost the same as the deployment of our actual intranet, and is managed based on the deployment capabilities of k8s. The difference is that we manage the compute cluster part based on an internal OSS system, and it would be a good best practice if someone could run through the K8S management of the compute cluster |
Beta Was this translation helpful? Give feedback.
-
|
Thank you for the reference chart @aisjca. Is there a reference k8s deployment example? |
Beta Was this translation helpful? Give feedback.
-
|
K8s boasts a large user ecosystem. We recommend prioritizing the capability of managing CubeSandbox sandboxes via K8s, which will greatly expand CubeSandbox's ecosystem. The recently open-sourced project https://github.com/agent-substrate/substrate by Google natively supports K8s management, delivering a friendly experience for users within the K8s ecosystem. |
Beta Was this translation helpful? Give feedback.
-
|
Thank you for your suggestion, we will provide some guidelines later for convenient cluster deployment using terrorform or k8s configuration methods |
Beta Was this translation helpful? Give feedback.
-
|
I have been experimenting with running CubeSandbox in a Kubernetes environment recently, so I wanted to add a few notes from my tests. Using Kubernetes to manage CubeSandbox makes sense to me. It would let us reuse the existing ecosystem for deployment, scheduling, rollouts, monitoring, service discovery, and general operations. From what I have seen so far, the control-plane services are probably not the hard part. Components such as My main concern is on the compute/data-plane side, especially In my cluster, the CNI is Cilium. I saw CubeSandbox’s eBPF programs being loaded, but the datapath did not always become effective. For example, the I also noticed issue #443, which reports a similar networking problem in a Kubernetes environment using Calico. That makes me think this may not be Cilium-specific, but a broader compatibility question between CubeSandbox’s networking components and Kubernetes CNI implementations. It would be useful to clarify the recommended architecture here: can |
Beta Was this translation helpful? Give feedback.
-
|
@zyl1121 thanks for writing this up in detail, this matches what I'd expect from mixing two independent network-management stacks on the same host. I think the real issue isn't "CNI vs no CNI" but ownership of the host network stack. Given that, I'd suggest the recommended architecture is to not run a general-purpose CNI on dedicated compute nodes at all, and let On how to actually run these three components: kubelet + static pods would work mechanically — static pods bypass the scheduler entirely (kubelet reads them straight from the local manifest directory), so they'll start even if the node is NotReady due to no CNI being configured, and with So in short: dedicate compute nodes, drop the general CNI (and kube-proxy) there, let |
Beta Was this translation helpful? Give feedback.
-
|
Thanks, this makes sense to me. I agree that framing this as ownership of the host networking stack is more accurate than just “CNI vs no CNI”.
My concern is that removing the general CNI and kube-proxy from compute nodes solves only the local datapath conflict, not the full architecture. That may still be a valid direction, but it makes the compute pool much more special. The design would need to clearly document which components are managed by Kubernetes, which nodes run CNI/kube-proxy, how compute nodes reach the control plane, how failover/LB is handled, and what networking assumptions are required on the compute side. So I think the “no CNI on compute nodes” model is worth considering, but it should be treated as part of a larger architecture rather than just a deployment detail. Ideally we can either find a safe coexistence model with common Kubernetes networking stacks, or document a dedicated-compute-pool model with clear control-plane connectivity, failover, and operational assumptions. |
Beta Was this translation helpful? Give feedback.

Uh oh!
There was an error while loading. Please reload this page.
-
Taking advantage of k8s resource elasticity and resource management capabilities, cube is made easier to use at the resource level, and control plane service management is more elastic and highly available.
Beta Was this translation helpful? Give feedback.
All reactions