Skip to content

feat: Feature check quota before dep #568

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 2 commits into
base: dev
Choose a base branch
from

Conversation

Priyanka-Microsoft
Copy link
Contributor

Purpose

  • ...
    This pull request introduces significant changes to enable validation of Azure model deployment quotas for OpenAI models during provisioning. It adds scripts for both Windows and POSIX environments, updates deployment parameters, and includes quota validation logic for specific AI models. The main themes are provisioning enhancements and quota validation functionality.

Provisioning Enhancements:

  • azure.yaml: Added preprovision hooks for both POSIX and Windows environments to validate model deployment quotas before provisioning. The scripts ensure proper permissions and execute validation logic for AI models based on subscription and location parameters.

  • infra/main.parameters.json: Added deployment parameters for AI models, including configurations for gpt-4o-mini and text-embedding-ada-002. These parameters specify model names, versions, formats, and resource capacities required for deployment.

Quota Validation Functionality:

  • infra/scripts/validate_model_deployment_quota.ps1: Introduced a PowerShell script to validate Azure quotas for AI model deployments. The script checks subscription details, retrieves model configurations, validates parameters, and ensures sufficient quota for deployment. It supports fallback regions and updates environment variables dynamically.

  • infra/scripts/validate_model_deployment_quota.sh: Added a Bash script for POSIX environments to validate Azure quotas for AI model deployments. Similar to the PowerShell script, it handles parameter validation, quota checks, and fallback region selection.

  • infra/scripts/validate_model_quota.ps1: Implemented a detailed quota validation script to check available resources for AI models (gpt-4o and text-embedding-ada-002) in specific Azure regions. It provides a summary table of quota availability and supports manual region selection for deployment.

Does this introduce a breaking change?

  • Yes
  • No

Golden Path Validation

  • I have tested the primary workflows (the "golden path") to ensure they function correctly without errors.

Deployment Validation

  • I have validated the deployment process successfully and all services are running as expected with this change.

What to Check

Verify that the following are valid

  • ...

Other Information

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant