title | titleSuffix | description | author | manager | ms.service | ms.topic | ms.date | ms.author | ms.custom |
---|---|---|---|---|---|---|---|---|---|
How to deploy a model for conversational language understanding |
Azure AI services |
Use this article to learn how to deploy models for conversational language understanding. |
jboback |
nitinme |
azure-ai-language |
how-to |
12/19/2023 |
jboback |
language-service-clu, |
Once you are satisfied with how your model performs, it's ready to be deployed, and query it for predictions from utterances. Deploying a model makes it available for use through the prediction API.
- A successfully created project
- Labeled utterances and successfully trained model
- Reviewed the model performance to determine how your model is performing.
See project development lifecycle for more information.
After you have reviewed the model's performance and decide it's fit to be used in your environment, you need to assign it to a deployment to be able to query it. Assigning the model to a deployment makes it available for use through the prediction API. It is recommended to create a deployment named production
to which you assign the best model you have built so far and use it in your system. You can create another deployment called staging
to which you can assign the model you're currently working on to be able to test it. You can have a maximum on 10 deployments in your project.
[!INCLUDE Deploy a model using Language Studio]
[!INCLUDE deploy model]
[!INCLUDE get deployment status]
After you are done testing a model assigned to one deployment, you might want to assign it to another deployment. Swapping deployments involves:
- Taking the model assigned to the first deployment, and assigning it to the second deployment.
- taking the model assigned to second deployment and assign it to the first deployment.
This can be used to swap your production
and staging
deployments when you want to take the model assigned to staging
and assign it to production
.
[!INCLUDE Swap deployments]
[!INCLUDE Swap deployments]
[!INCLUDE Delete deployment]
[!INCLUDE Delete deployment]
You can deploy your project to multiple regions by assigning different Language resources that exist in different regions.
[!INCLUDE Assign resource]
[!INCLUDE Assign resource]
When unassigning or removing a deployment resource from a project, you will also delete all the deployments that have been deployed to that resource's region.
[!INCLUDE Unassign resource]
[!INCLUDE Unassign resource]