Commit ea5837b

Update token capacity (optimal 500k) and remove manual schema registration fallback
1 parent 8fd3834 commit ea5837b

3 files changed

Lines changed: 4 additions & 10 deletions

docs/CustomizingAzdParameters.md

Lines changed: 1 addition & 1 deletion
@@ -16,7 +16,7 @@ By default this template will use the environment name as the prefix to prevent
 | `AZURE_ENV_MODEL_DEPLOYMENT_TYPE` | string | `GlobalStandard` | Defines the model deployment type. Allowed: `Standard`, `GlobalStandard`. |
 | `AZURE_ENV_MODEL_NAME` | string | `gpt-5.1` | Specifies the GPT model name. Default: `gpt-5.1`. |
 | `AZURE_ENV_MODEL_VERSION` | string | `2025-04-14` | Specifies the GPT model version. |
-| `AZURE_ENV_MODEL_CAPACITY` | integer | `300` | Sets the model capacity (minimum 1). Recommended: 300 for multi-document claim processing. |
+| `AZURE_ENV_MODEL_CAPACITY` | integer | `300` | Sets the model capacity (minimum 1). Default: 300. Optimal: 500 for multi-document claim processing. |
 | `AZURE_ENV_CONTAINER_REGISTRY_ENDPOINT` | string | `cpscontainerreg.azurecr.io` | Sets the public container image endpoint for pulling pre-built images. |
 | `AZURE_ENV_CONTAINER_IMAGE_TAG` | string | `latest` | Sets the container image tag (e.g., `latest`, `dev`, `hotfix`). |
 | `AZURE_ENV_LOG_ANALYTICS_WORKSPACE_ID` | string | Guide to get your [Existing Workspace ID](/docs/re-use-log-analytics.md) | Reuses an existing Log Analytics Workspace instead of provisioning a new one. |
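
Reviewer note: the default for `AZURE_ENV_MODEL_CAPACITY` stays at 300, so anyone who wants the 500 value now documented as optimal has to set it themselves. A minimal sketch of one way to do that before running `azd up`, assuming `azd` is installed and an environment is already selected. `validate_capacity` is a hypothetical helper, not part of this repository; it only enforces the "integer, minimum 1" constraint from the table above.

```bash
# Hypothetical helper (not in the repo): succeed only for integers >= 1,
# mirroring the documented "integer, minimum 1" constraint.
validate_capacity() {
  case "$1" in
    '' | *[!0-9]*) return 1 ;;  # reject empty input or any non-digit character
  esac
  [ "$1" -ge 1 ]
}

capacity=500  # value the docs list as optimal; the default remains 300

if ! validate_capacity "$capacity"; then
  echo "invalid capacity: $capacity" >&2
elif command -v azd >/dev/null 2>&1; then
  # Store the value in the current azd environment for the next `azd up`.
  azd env set AZURE_ENV_MODEL_CAPACITY "$capacity"
else
  echo "azd not found; would run: azd env set AZURE_ENV_MODEL_CAPACITY $capacity"
fi
```

The validation runs first so that a mistyped value is caught locally rather than surfacing later as a provisioning error.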

docs/DeploymentGuide.md

Lines changed: 2 additions & 8 deletions
@@ -66,7 +66,7 @@ Ensure you have access to an [Azure subscription](https://azure.microsoft.com/fr

 **Recommended Configuration:**
 - **Default:** 300k tokens
-- **Optimal:** 300k tokens (recommended for multi-document claim processing)
+- **Optimal:** 500k tokens (recommended for multi-document claim processing)

 > **Note:** When you run `azd up`, the deployment will automatically show you regions with available quota, so this pre-check is optional but helpful for planning purposes. You can customize these settings later in [Step 3.3: Advanced Configuration](#33-advanced-configuration-optional).

@@ -306,7 +306,7 @@ After successful deployment:

 > Want to customize the schemas for your own documents? [Learn more about adding your own schemas here.](./CustomizeSchemaData.md)

-Schema registration happens **automatically** as part of the `azd up` post-provisioning hook. After infrastructure is deployed, the hook:
+Schema registration happens **automatically** as part of the `azd up` post-provisioning hook — no manual steps required. After infrastructure is deployed, the hook:

 1. Waits for the API container app to be ready
 2. Registers the sample schema files (auto claim, damaged car image, police report, repair estimate)
@@ -317,12 +317,6 @@ You should see output like this in the terminal:

 ![schema file registration](./images/SchemaFileRegistration.png)

-> **If registration was skipped** (e.g. the API wasn't ready in time), you can run it manually:
-> ```bash
-> cd src/ContentProcessorAPI/samples/schemas
-> python register_schema.py https://<< API Service Endpoint>> schema_info.json
-> ```
-
 ### 5.2 Configure Authentication (Required)

 **This step is mandatory for application access:**

docs/quota_check.md

Lines changed: 1 addition & 1 deletion
@@ -1,7 +1,7 @@
 ## Check Quota Availability Before Deployment

 Before deploying the accelerator, **ensure sufficient quota availability** for the required model.
-> **For Global Standard | GPT-5.1 — the capacity to at least 300K tokens for optimal multi-document claim processing.**
+> **For Global Standard | GPT-5.1 — the capacity to at least 300K tokens (default). 500K tokens recommended for optimal multi-document claim processing.**

 ### Login if you have not done so already
 ```
