The RAG pattern enables businesses to use the reasoning capabilities of LLMs, using their existing models to process and generate responses based on new data. RAG facilitates periodic data updates without the need for fine-tuning, thereby streamlining the integration of LLMs into businesses.

The Enterprise RAG Solution Accelerator (GPT-RAG) offers a robust architecture tailored for enterprise-grade deployment of the RAG pattern. It ensures grounded responses and is built on Zero-trust security and Responsible AI, ensuring availability, scalability, and auditability. Ideal for organizations transitioning from exploration and PoC stages to full-scale production and MVPs.

Enterprise RAG Community

Discord Channel Enterprise RAG - Connected Community

Components

Data ingestion Optimizes data preparation for Azure OpenAI.
Orchestrator The system's dynamic backbone ensuring scalability and a consistent user experience.
App Front-End Built with Azure App Services and the Backend for Front-End pattern, offers a smooth and scalable user interface.

Concepts

RAG Pattern: What and Why?
Solution Architecture Overview

Enterprise RAG +Prompt Engineering+Finetuning+Train (Video in Spanish)

Getting Started

This guide will walk you through the deployment of Enterprise RAG. Before you begin, ensure you have the necessary tools and services installed as listed in the Pre-reqs section.

Pre-reqs

Azure Developer CLI: Download azd for Windows, Other OS's.
Powershell (Windows only): Powershell
Git: Download Git
Node.js 16+ windows/mac linux/wsl
Python 3.11: Download Python
Initiate an Azure AI service creation and agree to the Responsible AI terms **

** If you have not created an Azure AI service resource in the subscrption before

Basic Architecture

For quick demos or Proof of Concept (PoC) without network isolation, you can deploy the accelerator with the basic architecture.

The deployment procedure is quite simple, just install the prerequisites and follow these four steps using Azure Developer CLI (azd) in a terminal:

1 Download the Repository:

azd init -t azure/gpt-rag

2 Login to Azure:

azd auth login

3 Start Building the infrastructure and components deployment:

azd up

4 Add source documents to object storage

Upload your documents to the 'documents' folder located in the storage account. The name of this account should start with 'strag'. This is the default storage account, as shown in the sample image below.

Zero Trust Architecture

For more secure and isolated deployments, you can opt for the Zero Trust architecture. This architecture is ideal for production environments where network isolation and stringent security measures are highly valued.

Deploying the Zero Trust architecture follows a similar procedure to the Basic Architecture deployment, but includes some additional steps. Refer to the instructions below for a detailed guide on deploying this option:

1 Download the Repository

azd init -t azure/gpt-rag

2 Enable network isolation

azd env set AZURE_NETWORK_ISOLATION true

3 Login to Azure:

azd auth login

4 Start Building the infrastructure and components deployment:

azd up

After the infrastructure is provisioned and before starting the deployment of the components, you will be asked the following question:

Zero Trust Infrastructure enabled. Confirm you are using a connection where resources are reachable (like VM+Bastion)? [Y/n]:

Initially, you will not be connected to the same vnet where the resources can be accessed, so answer n.

5 Next, you will use the Virtual Machine with the Bastion connection (created during step 4) to continue the deployment.

Log into the created VM with the user gptrag and authenticate with the password stored in the keyvault, similar to the figure below:

6 Upon accessing Windows, install Powershell, as the other prerequisites are already installed on the VM.

7 Open the command prompt and run the following command to update azd to the latest version:

choco upgrade azd

After updating azd, simply close and reopen the terminal.

8 Create a new directory, for example, deploy then enter the created directory.

mkdir deploy  
cd deploy

To finalize the procedure, execute the subsequent commands in the command prompt to successfully complete the deployment:

azd init -t azure/gpt-rag  
azd auth login   
azd env refresh  
azd package  
azd deploy

Note: when running the azd init ... and azd env refresh, use the same environment name, subscription, and region used in the initial provisioning of the infrastructure.

Done! Zero trust deployment is completed.

Additional Customizations

Refer to the Custom Deployment section to learn about additional customization options you can make.

Additional Resources

Troubleshooting

Look at the Troubleshooting page in case you face some error in the deployment process.

Evaluating

Performance Testing

Pricing Estimation

Pricing Model

Governance

Governance Model

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Enterprise RAG Community

Components

Concepts

Getting Started

Basic Architecture

Zero Trust Architecture

Additional Customizations

Additional Resources

Troubleshooting

Evaluating

Pricing Estimation

Governance

Contributing

Trademarks

Files

README.md

Latest commit

History

README.md

File metadata and controls

Enterprise RAG Community

Components

Concepts

Getting Started

Basic Architecture

Zero Trust Architecture

Additional Customizations

Additional Resources

Troubleshooting

Evaluating

Pricing Estimation

Governance

Contributing

Trademarks