page_type | languages | products | name | urlFragment | description | azureDeploy | |||
---|---|---|---|---|---|---|---|---|---|
sample |
|
|
Build a Copilot app using Azure Cosmos DB & Azure OpenAI Service |
chat-app |
Sample application that implements a Generative AI chat application that demonstrates context windows, semantic cache and Semantic Kernel integration. |
This sample application shows how to build a Generative-AI application using Azure Cosmos DB using its new vector search capabilities and Azure OpenAI Service and Semantic Kernel. The sample provides practical guidance on many concepts you will need to design and build these types of applications.
This application demonstrates the following concepts and how to implement them:
- The basics of building a highly scalable Generative-AI chat application using Azure Cosmos DB for NoSQL.
- Generating completions and embeddings using Azure OpenAI Service.
- Managing a context window (chat history) for natural conversational interactions with an LLM.
- Manage token consumption and payload sizes for Azure OpenAI Service requests.
- Building a semantic cache using Azure Cosmos DB for NoSQL vector search for improved performance and cost.
- Using the Semantic Kernel SDK for completion and embeddings generation.
- Azure Subscription
- Subscription access to Azure OpenAI service. Start here to Request Access to Azure OpenAI Service
- Visual Studio, VS Code, GitHub Codespaces or another editor to edit or view the source for this sample.
- Azure Cosmos DB for NoSQL Vector Search Preview enrollment
This lab utilizes a preview feature, Vector search for Azure Cosmos DB for NoSQL which requires preview feature registration. Follow the below steps to register. You must be enrolled before you can deploy this solution:
- Navigate to your Azure Cosmos DB for NoSQL resource page.
- Select the "Features" pane under the "Settings" menu item.
- Select for “Vector Search in Azure Cosmos DB for NoSQL”.
- Read the description to confirm you want to enroll in the preview.
- Select "Enable" to enroll in the preview.
Click one of the Deploy to Azure buttons belows and follow the prompts in Azure Portal to deploy this solution. The first option deploys a new Azure OpenAI account. The second allows you to use an existing Azure OpenAI account.
The provided ARM Templates will provision the following resources:
- Azure Cosmos DB Serverless account with database and container with a vector embedding policy on the container and vector indexes defined.
- Azure OpenAI Service You must also specify a name for the deployment of the "completion" and "embedding" models used by this application.
- Azure App service Web application host for the ASP.NET Blazor application.
Note: You must have access to Azure Open AI service from your subscription before attempting to deploy this application.
All connection information for Azure Cosmos DB and Azure Open AI is zero-touch and injected as environment variables in the Azure App Service instance at deployment time.
Deploy with existing Azure OpenAI account: Use a pre-existing Azure OpenAI service account with GPT 3.5 Turbo and ADA-002. Use this Deploy to Azure button below. Provide Azure OpenAI account name, key, and deployment names for GPT 3.5 Turbo and ADA-002 models.
This solution has a number of quickstarts than you can run through to learn about the features in this sample and how to implement them yourself.
Please see Quickstarts
To remove all the resources used by this sample, delete the resource group for your deployment.
To learn more about the services and features demonstrated in this sample, see the following: