feat: Feature check quota before dep #568
Open
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Purpose
This pull request introduces significant changes to enable validation of Azure model deployment quotas for OpenAI models during provisioning. It adds scripts for both Windows and POSIX environments, updates deployment parameters, and includes quota validation logic for specific AI models. The main themes are provisioning enhancements and quota validation functionality.
Provisioning Enhancements:
azure.yaml
: Addedpreprovision
hooks for both POSIX and Windows environments to validate model deployment quotas before provisioning. The scripts ensure proper permissions and execute validation logic for AI models based on subscription and location parameters.infra/main.parameters.json
: Added deployment parameters for AI models, including configurations forgpt-4o-mini
andtext-embedding-ada-002
. These parameters specify model names, versions, formats, and resource capacities required for deployment.Quota Validation Functionality:
infra/scripts/validate_model_deployment_quota.ps1
: Introduced a PowerShell script to validate Azure quotas for AI model deployments. The script checks subscription details, retrieves model configurations, validates parameters, and ensures sufficient quota for deployment. It supports fallback regions and updates environment variables dynamically.infra/scripts/validate_model_deployment_quota.sh
: Added a Bash script for POSIX environments to validate Azure quotas for AI model deployments. Similar to the PowerShell script, it handles parameter validation, quota checks, and fallback region selection.infra/scripts/validate_model_quota.ps1
: Implemented a detailed quota validation script to check available resources for AI models (gpt-4o
andtext-embedding-ada-002
) in specific Azure regions. It provides a summary table of quota availability and supports manual region selection for deployment.Does this introduce a breaking change?
Golden Path Validation
Deployment Validation
What to Check
Verify that the following are valid
Other Information