Skip to content

Latest commit

 

History

History
53 lines (33 loc) · 2.49 KB

get-started-app-chat-scaling-with-azure-container-apps.md

File metadata and controls

53 lines (33 loc) · 2.49 KB
title description ms.date ms.topic ms.custom
Scale Azure OpenAI for Java chat sample using RAG
Learn how to add load balancing to your Java solution to extend the chat app beyond the Azure OpenAI token and model quota limits.
05/13/2024
get-started
devx-track-java, devx-track-java-ai, devx-track-extended-java, build-2024-intelligent-apps

Scale Azure OpenAI for Java chat using RAG with Azure Container Apps

[!INCLUDE aca-load-balancer-intro]

Prerequisites

[!INCLUDE scaling-load-balancer-aca-procedure.md]

[!INCLUDE py-deployment-procedure]

[!INCLUDE logs]

[!INCLUDE capacity.md]

[!INCLUDE py-aca-cleanup]

Sample code

Samples used in this article include:

Next step