MAF-19283: feat(e2e): add Heimdall tag support and enhance inference service configurations #56

Merged
hhk7734 merged 10 commits into main from MAF-19283_add_toleration_affinity on Feb 13, 2026

ghost commented Feb 12, 2026

No description provided.

…service configurations

- Introduced a new environment variable `HEIMDALL_TAG` for Heimdall configuration.
- Updated inference service data structure to include `Replicas` and `HeimdallTag`.
- Modified performance and quality test files to utilize the new `HeimdallTag` variable.
- Enhanced YAML templates for Heimdall to dynamically set the image tag based on the new variable.
- Improved resource management in inference service configurations for better clarity and maintainability.
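A minimal sketch of how an environment-driven image tag could flow into a values template, based only on the commit message above. The struct name `heimdallValuesData`, the helper `renderHeimdallValues`, the template fragment, and the `latest` fallback are all assumptions for illustration; the repository's real template and env var registry are not shown in this thread.

```go
package main

import (
	"bytes"
	"fmt"
	"os"
	"text/template"
)

// heimdallValuesData is a hypothetical stand-in for the data passed to
// heimdall-values.yaml.tmpl; only the HeimdallTag field is named in the commit.
type heimdallValuesData struct {
	HeimdallTag string
}

// valuesTemplate is an assumed fragment, not the repository's actual template.
const valuesTemplate = `image:
  tag: {{ printf "%q" .HeimdallTag }}
`

// renderHeimdallValues renders the fragment with the given image tag.
func renderHeimdallValues(tag string) (string, error) {
	tmpl, err := template.New("heimdall-values").Parse(valuesTemplate)
	if err != nil {
		return "", err
	}
	var buf bytes.Buffer
	if err := tmpl.Execute(&buf, heimdallValuesData{HeimdallTag: tag}); err != nil {
		return "", err
	}
	return buf.String(), nil
}

func main() {
	// HEIMDALL_TAG is the variable named in the commit; the fallback is assumed.
	tag := os.Getenv("HEIMDALL_TAG")
	if tag == "" {
		tag = "latest"
	}
	out, err := renderHeimdallValues(tag)
	if err != nil {
		fmt.Fprintln(os.Stderr, "render failed:", err)
		os.Exit(1)
	}
	fmt.Print(out)
}
```

Quoting the tag with `%q` keeps values such as a bare number from being read as a non-string YAML scalar.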
…lude HEIMDALL_TAG environment variable for enhanced configuration management
ghost self-assigned this Feb 12, 2026
ghost self-requested a review as a code owner February 12, 2026 10:46
ghost requested review from TaeSooRhee-moreh, Copilot and nulledge February 12, 2026 10:46
…del PV and PVC management. Moved model PV and PVC creation to a conditional block based on the SkipKind environment variable, improving resource handling. Streamlined the deletion of Heimdall to ensure clarity in resource management during test teardown.
Contributor

Copilot AI left a comment

Pull request overview

This PR enhances the E2E performance/quality suites by allowing the Heimdall image tag to be configured via an environment variable and by making InferenceService manifests more configurable (notably replica counts and non-kind scheduling constraints).

Changes:

  • Add HEIMDALL_TAG env var support and pass it into Heimdall Helm values templates.
  • Parameterize InferenceService spec.replicas via InferenceServiceData.Replicas and update tests accordingly.
  • Extend performance InferenceService configuration with additional node affinity/tolerations, and add extra volumes/mounts in performance Heimdall values.
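The parameterized replicas and the non-kind scheduling constraints described above could be sketched roughly as follows. This is an illustration under assumptions, not the PR's template: the `inferenceServiceData` struct, the `IsKind` field placement, and the `example.com/dedicated` toleration key are hypothetical, and the real manifest contains far more than this fragment.

```go
package main

import (
	"bytes"
	"fmt"
	"os"
	"text/template"
)

// inferenceServiceData is a hypothetical reduction of the render data;
// the thread only confirms that a Replicas field exists.
type inferenceServiceData struct {
	Replicas int
	IsKind   bool
}

// inferenceServiceTemplate is an assumed fragment. The toleration key is a
// placeholder, not the value used by the real cluster.
const inferenceServiceTemplate = `spec:
  replicas: {{ .Replicas }}
{{- if not .IsKind }}
  tolerations:
    - key: example.com/dedicated
      operator: Exists
      effect: NoSchedule
{{- end }}
`

// renderInferenceService renders the fragment for one service.
func renderInferenceService(data inferenceServiceData) (string, error) {
	tmpl, err := template.New("inference-service").Parse(inferenceServiceTemplate)
	if err != nil {
		return "", err
	}
	var buf bytes.Buffer
	if err := tmpl.Execute(&buf, data); err != nil {
		return "", err
	}
	return buf.String(), nil
}

func main() {
	cases := []inferenceServiceData{
		{Replicas: 1, IsKind: true},
		{Replicas: 2, IsKind: false},
	}
	for _, data := range cases {
		out, err := renderInferenceService(data)
		if err != nil {
			fmt.Fprintln(os.Stderr, "render failed:", err)
			os.Exit(1)
		}
		fmt.Printf("--- IsKind=%t ---\n%s", data.IsKind, out)
	}
}
```

The `{{-` trim markers keep the kind rendering free of blank lines when the tolerations block is skipped.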

Reviewed changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 4 comments.

| File | Description |
| --- | --- |
| test/utils/inference_service.go | Adds Replicas to template render data and removes the now-unused helper constructor. |
| test/e2e/quality/quality_test.go | Passes Heimdall tag to values template and sets replicas explicitly for the quality InferenceService. |
| test/e2e/quality/config/inference-service.yaml.tmpl | Replaces hardcoded replicas with templated .Replicas. |
| test/e2e/quality/config/heimdall-values.yaml.tmpl | Introduces image.tag templating via .HeimdallTag. |
| test/e2e/performance/performance_test.go | Passes Heimdall tag to values template and sets replicas explicitly for prefill/decode services. |
| test/e2e/performance/config/inference-service.yaml.tmpl | Replaces hardcoded replicas with .Replicas and adds non-kind affinity/tolerations. |
| test/e2e/performance/config/heimdall-values.yaml.tmpl | Adds image.tag templating and configures extra volumes/mounts. |
| test/e2e/envs/env_vars.go | Registers HEIMDALL_TAG in centralized E2E env var management. |
| .github/workflows/e2e-inference-perf-p-cluster.yaml | Sets HEIMDALL_TAG for the performance workflow run. |

Comment thread test/e2e/performance/config/heimdall-values.yaml.tmpl Outdated
Comment thread test/e2e/quality/config/heimdall-values.yaml.tmpl Outdated
Comment thread test/e2e/performance/config/heimdall-values.yaml.tmpl
Comment thread test/e2e/performance/config/inference-service.yaml.tmpl
…e YAML template to enhance resource scheduling
ghost marked this pull request as draft February 12, 2026 10:54
ghost requested a review from hyeongyun0916 February 12, 2026 11:02
Author

ghost commented Feb 12, 2026

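@hyeongyun0916 With the current changes, running the Action works correctly.

After inference-perf runs, it saves the experiment results to S3 and then immediately cleans up the entire namespace, so to inspect the results, pick one of the two options below.

  1. Add SKIP_CLEANUP: "true" to the Action's environment variables so the resources are not cleaned up, then connect to the namespace and check the pod logs.
  2. Fetch the files stored in AWS S3 and inspect them.
    i. aws s3 ls --profile s3 s3://moreh-benchmark/ --recursive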
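For readers unfamiliar with this kind of switch, the first option could be implemented along these lines. This is a sketch under assumptions: the helper name `shouldSkipCleanup` and the teardown messages are hypothetical, and the suite's real handling of `SKIP_CLEANUP` is not shown in this thread.

```go
package main

import (
	"fmt"
	"os"
	"strconv"
)

// shouldSkipCleanup reports whether the raw SKIP_CLEANUP value asks for
// teardown to be skipped. Unset or unparsable values mean "clean up".
func shouldSkipCleanup(raw string) bool {
	skip, err := strconv.ParseBool(raw)
	return err == nil && skip
}

func main() {
	if shouldSkipCleanup(os.Getenv("SKIP_CLEANUP")) {
		// Leave the namespace in place so pod logs can still be inspected.
		fmt.Println("SKIP_CLEANUP is set: keeping namespace resources")
		return
	}
	// Placeholder for the real teardown, which is not part of this sketch.
	fmt.Println("cleaning up namespace resources")
}
```

Defaulting to cleanup on a missing or malformed value keeps shared clusters from accumulating leftover namespaces.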

ghost marked this pull request as ready for review February 12, 2026 11:17
Copilot AI review requested due to automatic review settings February 12, 2026 11:17
Contributor

Copilot AI left a comment

Pull request overview

Copilot reviewed 9 out of 9 changed files in this pull request and generated 1 comment.

Comment thread test/e2e/performance/config/heimdall-values.yaml.tmpl
Contributor

hyeongyun0916 commented Feb 13, 2026

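> @hyeongyun0916 With the current changes, running the Action works correctly.
>
> After inference-perf runs, it saves the experiment results to S3 and then immediately cleans up the entire namespace, so to inspect the results, pick one of the two options below.
>
>   1. Add SKIP_CLEANUP: "true" to the Action's environment variables so the resources are not cleaned up, then connect to the namespace and check the pod logs.
>   2. Fetch the files stored in AWS S3 and inspect them.
>     i. aws s3 ls --profile s3 s3://moreh-benchmark/ --recursive

Please add this to an md file, as a comment at the top of the action file, or somewhere else suitable.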

Member

hhk7734 left a comment

heimdall tag -> heimdall version
Remove the explicit image.tag setting
Change the heimdall helm version

…o include result retrieval instructions for S3, enhancing clarity on post-test result management.
Author

ghost commented Feb 13, 2026

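> heimdall tag -> heimdall version
> Remove the explicit image.tag setting
> Change the heimdall helm version

Should I understand this as: "tests must always run against a version value, even if that means releasing a minor version first, and running them by changing the tag is not allowed"?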

…ment variables and test configurations to streamline setup and enhance clarity in performance and quality tests.
Author

ghost commented Feb 13, 2026

> Please add this to an md file, as a comment at the top of the action file, or somewhere else suitable.

b35ae98

Author

ghost commented Feb 13, 2026

> heimdall tag -> heimdall version
> Remove the explicit image.tag setting
> Change the heimdall helm version

ab61b26

Copilot AI review requested due to automatic review settings February 13, 2026 01:31
Member

hhk7734 commented Feb 13, 2026

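> heimdall tag -> heimdall version
> Remove the explicit image.tag setting
> Change the heimdall helm version
>
> Should I understand this as: "tests must always run against a version value, even if that means releasing a minor version first, and running them by changing the tag is not allowed"?

E2E will not cover tests for anything other than vX.Y.Z and vX.Y.Z-rc.N.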
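One way to enforce the two accepted version shapes is a small validation check like the sketch below. The function name `isSupportedE2EVersion` and the exact pattern are assumptions: the thread states only the `vX.Y.Z` and `vX.Y.Z-rc.N` forms, and this sketch takes X, Y, Z, and N to be decimal integers.

```go
package main

import (
	"fmt"
	"regexp"
)

// e2eVersionPattern accepts vX.Y.Z and vX.Y.Z-rc.N, where X, Y, Z and N are
// assumed to be decimal integers.
var e2eVersionPattern = regexp.MustCompile(`^v\d+\.\d+\.\d+(-rc\.\d+)?$`)

// isSupportedE2EVersion reports whether v matches one of the two accepted forms.
func isSupportedE2EVersion(v string) bool {
	return e2eVersionPattern.MatchString(v)
}

func main() {
	for _, v := range []string{"v1.2.3", "v1.2.3-rc.1", "latest", "v1.2.3-dev"} {
		fmt.Printf("%-12s supported=%t\n", v, isSupportedE2EVersion(v))
	}
}
```

Rejecting anything else early, before a Helm install is attempted, would surface a mistyped or ad hoc tag as a clear configuration error.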

Contributor

Copilot AI left a comment

Pull request overview

Copilot reviewed 7 out of 7 changed files in this pull request and generated no new comments.

…tests by introducing a conditional check for the IsKind variable, improving clarity in resource management and template rendering based on the environment configuration.
Copilot AI review requested due to automatic review settings February 13, 2026 01:38
Author

ghost commented Feb 13, 2026

> E2E will not cover tests for anything other than vX.Y.Z and vX.Y.Z-rc.N.

I'll add this to AGENTS.md for the record.
40e5bcb

Contributor

Copilot AI left a comment

Pull request overview

Copilot reviewed 8 out of 8 changed files in this pull request and generated no new comments.

hhk7734 merged commit 0a44ee6 into main Feb 13, 2026
8 of 9 checks passed
hhk7734 deleted the MAF-19283_add_toleration_affinity branch February 13, 2026 06:10

4 participants