MAF-19232: feat(e2e): use template on e2e tests #51
Merged
…plates for meta-llama model
- Changed the test model to "meta-llama/Llama-3.2-1B-Instruct" and added a new test template "quickstart-vllm-meta-llama-llama-3.2-1b-instruct-amd-mi250-tp2" in e2e workflows.
- Removed deprecated InferenceServiceTemplates and updated the InferenceService data structure to include template references.
- Introduced new inference service YAML templates for performance and quality benchmarks, enhancing the testing framework.
- Updated environment variable handling and resource management in the e2e tests for improved clarity and maintainability.
…mance and quality tests
- Added functions to create and delete model Persistent Volumes (PV) and Persistent Volume Claims (PVC) in the e2e testing framework.
- Updated performance and quality test cases to utilize the new PV and PVC management, enhancing resource handling during tests.
- Refactored existing code to remove deprecated PV and PVC creation methods, ensuring cleaner and more maintainable test scripts.
- Updated the ParseImage function to return an error if the image format is invalid, ensuring better validation.
- Modified the createInferencePerfJob function to handle parsing errors gracefully, improving robustness in performance tests.
- Reorganized the deletion of Heimdall in quality tests for better clarity and consistency.
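The validation described in this commit could look roughly like the sketch below. It is a minimal illustration, not the PR's code: the name `parseImage`, its return values, and the error text are assumptions, and digest references (`@sha256:...`) are deliberately not handled.

```go
package main

import (
	"fmt"
	"strings"
)

// parseImage splits an image reference of the form "repository:tag".
// Hypothetical sketch: the PR's ParseImage may use a different signature.
func parseImage(image string) (repo, tag string, err error) {
	// The tag separator is the last colon that comes after the last slash,
	// so a registry port such as "localhost:5000/app" is not mistaken for a tag.
	slash := strings.LastIndex(image, "/")
	colon := strings.LastIndex(image, ":")
	// Reject a missing tag separator, an empty repository, or an empty tag.
	if colon <= slash+1 || colon == len(image)-1 {
		return "", "", fmt.Errorf("invalid image format: %q", image)
	}
	return image[:colon], image[colon+1:], nil
}

func main() {
	// The image names below are placeholders, not values from the PR.
	repo, tag, err := parseImage("ghcr.io/example/vllm:v0.1.0")
	fmt.Println(repo, tag, err)

	_, _, err = parseImage("ghcr.io/example/vllm")
	fmt.Println(err)
}
```

Returning an error instead of an empty tag lets a caller such as createInferencePerfJob fail the test early with a clear message.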
… tests
- Changed the paths for ModelPV and ModelPVC constants to reflect the new directory structure, ensuring alignment with the latest configuration standards.
- This update enhances maintainability and clarity in the e2e testing framework.
…functions and streamline gateway service retrieval
- Eliminated the CreateInferenceServiceTemplate and DeleteInferenceServiceTemplate functions to reduce redundancy.
- Introduced a new GetGatewayServiceName function for improved clarity in retrieving the Gateway service name.
- Enhanced error handling in the GetGatewayServiceName function to ensure better feedback when the service is not found.
…AML templates for performance and quality tests
- Modified GetInferenceServiceData function to include an 'isKind' parameter for better context handling.
- Updated performance and quality test files to utilize the new parameter in data retrieval.
- Enhanced inference service YAML templates to conditionally include resource requests and limits based on the 'isKind' flag, improving resource management in tests.
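A conditional block of this kind can be expressed with Go's `text/template`. The sketch below is illustrative only: the struct, its field names, the manifest fragment, and the choice to omit resources when running on kind are all assumptions, since the PR's actual template is not shown here.

```go
package main

import (
	"fmt"
	"strings"
	"text/template"
)

// inferenceServiceData is a hypothetical stand-in for the PR's InferenceServiceData.
type inferenceServiceData struct {
	Name   string
	IsKind bool
}

// manifestTmpl is an illustrative fragment, not the PR's actual manifest.
// Assumption: resource limits are emitted only when NOT running on kind.
const manifestTmpl = `name: {{ .Name }}
{{- if not .IsKind }}
resources:
  limits:
    amd.com/gpu: "2"
{{- end }}
`

// render executes the template with the given data and returns the YAML text.
func render(data inferenceServiceData) (string, error) {
	tmpl, err := template.New("inference-service").Parse(manifestTmpl)
	if err != nil {
		return "", err
	}
	var sb strings.Builder
	if err := tmpl.Execute(&sb, data); err != nil {
		return "", err
	}
	return sb.String(), nil
}

func main() {
	for _, isKind := range []bool{true, false} {
		out, err := render(inferenceServiceData{Name: "demo", IsKind: isKind})
		if err != nil {
			panic(err)
		}
		fmt.Printf("isKind=%v\n%s\n", isKind, out)
	}
}
```

The `{{-` trim markers keep the rendered YAML free of blank lines when the block is skipped.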
…te data retrieval for inference tests
- Added new environment variables for test templates: TEST_TEMPLATE_PREFILL and TEST_TEMPLATE_DECODE.
- Updated performance and quality test files to utilize the new template variables for data retrieval, improving clarity and maintainability in test configurations.
- Ensured consistency in the handling of inference service data across different test scenarios.
…ource management
- Introduced new YAML templates for Persistent Volume (PV) and Persistent Volume Claim (PVC) to support model storage in the e2e testing framework.
- Updated constants to reflect the new template paths, ensuring consistency and maintainability in test configurations.
- These additions improve the handling of storage resources during performance and quality tests.
Author
When running the quality benchmark in the PD disaggregation environment, a tokenizer-related issue causes the problem below, so for now I am switching the test to the aggregated environment.
Contributor
Pull request overview
Updates the E2E test harness to create InferenceService resources using templateRefs directly (instead of creating/deleting InferenceServiceTemplate CRs), and centralizes model PV/PVC creation into reusable test utils. This aligns the E2E suites with the new “template-based” flow and updates CI inputs accordingly.
Changes:
- Refactor quality/performance E2E suites to create InferenceService from new per-suite templates that use templateRefs.
- Add reusable utils + templates for creating/deleting model PV/PVC for product-cluster runs.
- Update E2E env var set (model + template names) and workflows for product-cluster executions.
Reviewed changes
Copilot reviewed 19 out of 21 changed files in this pull request and generated 5 comments.
| File | Description |
|---|---|
| test/utils/settings/constants.go | Adds model PV/PVC template paths; removes legacy inference service template constants. |
| test/utils/models.go | New utils for creating/deleting model PV/PVC via templates. |
| test/utils/kind.go | Skips kind cluster creation if it already exists. |
| test/utils/inference_service.go | Simplifies InferenceServiceData to templateRefs-based flow; adds helper to fetch container image. |
| test/utils/common.go | Adds ParseImage helper for extracting image tag. |
| test/utils/config/model-pv.yaml.tmpl | New model PV template used by E2E. |
| test/utils/config/model-pvc.yaml.tmpl | New model PVC template used by E2E. |
| test/e2e/envs/env_vars.go | Adds TEST_TEMPLATE_PREFILL/DECODE env vars; updates defaults. |
| test/e2e/quality/quality_test.go | Switches to new inference-service template flow and utils-based PV/PVC management. |
| test/e2e/quality/config/inference-service.yaml.tmpl | New InferenceService template using templateRefs. |
| test/e2e/performance/performance_test.go | Switches to new inference-service template flow; uses live image tag for perf job. |
| test/e2e/performance/config/inference-service.yaml.tmpl | New InferenceService template using templateRefs. |
| test/e2e/performance/config/heimdall-values.yaml.tmpl | Updates Heimdall plugin chain / scheduling profiles for PD-style routing. |
| test/config/base/resources/inference-service-template-*.yaml.tmpl | Removes legacy InferenceServiceTemplate manifests no longer used by tests. |
| test/config/base/resources/inference-service-{prefill,decode}.yaml.tmpl | Removes legacy prefill/decode InferenceService manifests. |
| .github/workflows/e2e-quality-p-cluster.yaml | Updates quality workflow env vars (model + template name). |
| .github/workflows/e2e-pd-p-cluster.yaml | Updates PD workflow env vars (model + template names). |
hhk7734
requested changes
Feb 10, 2026
…larity
- Revised comments in CreateModelPV and DeleteModelPV functions to specify that they handle PersistentVolumes, enhancing clarity for future developers.
…ence service templates
- Eliminated the 'parallelism' section from the performance and quality inference service YAML templates to streamline configuration and improve clarity.
Member
Are the tests finished?
Author
Yes, both performance and quality passed.
hhk7734
approved these changes
Feb 10, 2026