docs: add 8 new FAQ entries covering GPU virtualization, scheduling, and ecosystem integration (#416) #426
Open
mesutoezdil wants to merge 3 commits into
…-HAMi#416)

Adds entries covering the three topic areas defined in the issue:

GPU virtualization model:
- How HAMi enforces limits via libvgpu.so CUDA interception
- HAMi vGPU vs NVIDIA MIG comparison and decision guide
- Why nvidia-smi shows less memory inside container than on host
- Why gpumem limits are not enforced (CUDA_DISABLE_CONTROL, DinD, direct driver API calls, misconfigured container runtime)

Scheduling interaction:
- Whether HAMi replaces or extends kube-scheduler (extender model)

Ecosystem integration:
- HAMi with vLLM multi-GPU tensor parallelism (tp>1 NCCL fix in v2.7)
- HAMi with NVIDIA GPU Operator and DCGM metrics
- Prometheus and Grafana monitoring setup with bundled dashboard JSON

Each entry follows the existing FAQ format: direct answer in the first sentence, supporting detail, links to relevant doc pages. All internal links use the correct ./path format for the faq/faq.md URL depth.

Sourced from issue Project-HAMi#415 research output. Closes Project-HAMi#416.

Signed-off-by: mesutoezdil <mesudozdil@gmail.com>
[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: mesutoezdil

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing
✅ Deploy Preview for project-hami ready!
Signed-off-by: mesutoezdil <mesudozdil@gmail.com>
Replace incorrect LD_PRELOAD claim with accurate /etc/ld.so.preload hostPath mount mechanism, matching docs/core-concepts/gpu-virtualization.md.

Update vLLM tensor parallelism section: full support for vLLM > 0.18 landed in v2.9.0 (CHANGELOG), not v2.7.0 as previously stated.

Signed-off-by: mesutoezdil <mesudozdil@gmail.com>
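For reviewers who want to sanity-check the corrected mechanism from inside a container, here is a minimal diagnostic sketch, not part of this PR. It assumes that a working setup lists libvgpu.so in /etc/ld.so.preload and that any non-empty CUDA_DISABLE_CONTROL value turns enforcement off; both the matching rule and the value semantics are assumptions. The helper names `preload_entries` and `diagnose` are hypothetical, and the parser treats whitespace and colons as separators with `#` starting a comment, which is also an assumption about the file format.

```python
import os
from pathlib import PurePosixPath


def preload_entries(preload_text):
    """Return the library paths listed in an ld.so.preload-style file."""
    entries = []
    for line in preload_text.splitlines():
        line = line.split("#", 1)[0]  # drop anything after a '#' (assumed comment marker)
        # Assumption: entries are separated by whitespace or colons.
        entries.extend(line.replace(":", " ").split())
    return entries


def diagnose(preload_text, env):
    """Collect hints about why a gpumem limit might not be enforced."""
    hints = []
    names = [PurePosixPath(p).name for p in preload_entries(preload_text)]
    # Assumption: the interception library's file name starts with "libvgpu.so".
    if not any(name.startswith("libvgpu.so") for name in names):
        hints.append("libvgpu.so is not listed in the preload file")
    # Assumption: any non-empty value disables enforcement.
    if env.get("CUDA_DISABLE_CONTROL"):
        hints.append("CUDA_DISABLE_CONTROL is set")
    return hints


if __name__ == "__main__":
    try:
        with open("/etc/ld.so.preload") as fh:
            text = fh.read()
    except OSError:
        text = ""  # a missing or unreadable file is treated as empty
    for hint in diagnose(text, os.environ) or ["no obvious problem found"]:
        print(hint)
```

Run inside the affected container, the script prints one line per suspected cause. It covers only two of the causes listed in the FAQ entry; DinD, direct driver API calls, and container runtime misconfiguration are not detected.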
Adds 8 new FAQ entries to docs/faq/faq.md covering the three topic areas defined in the issue. All questions were sourced from the research compiled in #415.

New entries
GPU virtualization model
Scheduling interaction
Ecosystem integration
static/grafana/gpu-dashboard.json dashboard.

Closes #416.
Refs #415.