Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Longhorn Storage - right sizing and monitoring #129

Closed
johnml1135 opened this issue Sep 12, 2023 · 1 comment
Closed

Longhorn Storage - right sizing and monitoring #129

johnml1135 opened this issue Sep 12, 2023 · 1 comment
Assignees

Comments

@johnml1135
Copy link
Collaborator

Disk usage can explode when there are many SMT engines being made and not deleted: Disk usage: min(kubelet_volume_stats_used_bytes{namespace=~"nlp|serval"}) by (persistentvolumeclaim, namespace)/ on (persistentvolumeclaim, namespace) min(kube_persistentvolumeclaim_resource_requests_storage_bytes{namespace=~"nlp|serval"}) by (persistentvolumeclaim, namespace) * 100

The following was from some load testing - that did have an out-of-disk space error. We need to to both right-size the longhorn storage based upon expected usage and monitor it's growth to not crash the system.

image

@johnml1135
Copy link
Collaborator Author

If there are issues, we can make them bigger.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: ✅ Done
Development

No branches or pull requests

1 participant