
Azure OpenAI/Langchain solution for Federal customers using Azure Gov and higher sovereign clouds. Virtual Assistant - GPT Smart Search Engine - Bot Framework + Azure OpenAI + Azure Search + Azure SQL + LangChain + CosmosDB


FederalCSUMission/Azure-OpenAI-Accelerator-Federal

 
 


Azure OpenAI VBD - Microsoft Federal

GPT Powered Search Accelerator built for Azure Government

Azure OpenAI + Langchain + Vector Database + Microsoft Teams + Azure SQL + CosmosDB + Azure Bot Framework

Open in VS Code Dev Containers

Your organization requires a multi-channel smart chatbot and a search engine capable of comprehending diverse types of data scattered across various locations. Additionally, the conversational chatbot should be able to answer inquiries along with the source and an explanation of how and where the answer was obtained. In other words, you want a private and secure ChatGPT for your organization that can interpret, comprehend, and answer questions about your business and mission data with high accuracy.

This repo helps you accelerate building an enterprise- and mission-focused GPT Virtual Assistant with Azure services, using your own data in your own environment. The solution consists of:

  1. Backend Bot API built with the Bot Framework and exposed to multiple channels (Web Chat, MS Teams, SMS, email, Slack, etc.).
  2. Frontend web application with a Search and a Bot UI.
  3. Jupyter notebooks that data scientists and developers can use to get started building their own use cases.
  4. Data science and development environment using Azure Machine Learning.

The repo teaches you, step by step, how to build an OpenAI-based Smart Search Engine. Each notebook builds on the previous one, culminating in the two applications.


Prerequisites

  • An Azure Commercial subscription and an Azure Government subscription.
  • An approved application to Azure OpenAI, including GPT-4 (mandatory).
  • A Resource Group (RG) set up for this workshop POC in the customer's Azure tenant.
  • A storage account in place in the RG.
  • Data/documents uploaded to the blob storage account.
  • For IDE collaboration during the workshop, Jupyter Lab will be used; for this, an Azure Machine Learning workspace must be deployed in the RG.
    • Note: Please ensure you have enough compute core quota in your Azure Machine Learning workspace.
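The prerequisite resources can be provisioned with the Azure CLI along the lines of the sketch below. All resource names here (rg-openai-poc, stopenaipoc, mlw-openai-poc, the ./data folder) are hypothetical placeholders, and the commands assume you have run `az login` against the target subscription:

```shell
# Placeholder names throughout -- adjust to your environment before running
az cloud set --name AzureUSGovernment            # target the Azure Government cloud
az group create --name rg-openai-poc --location usgovvirginia
az storage account create --name stopenaipoc --resource-group rg-openai-poc --sku Standard_LRS
az storage container create --account-name stopenaipoc --name documents
# Upload your data/documents to blob storage
az storage blob upload-batch --account-name stopenaipoc --destination documents --source ./data
# Requires the `ml` CLI extension (az extension add -n ml)
az ml workspace create --name mlw-openai-poc --resource-group rg-openai-poc
```

After the workspace is created, verify your compute core quota in the Azure Machine Learning studio before creating compute instances.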

Architecture

(architecture diagram)

Flow

  1. Data scientists and developers use an Azure Machine Learning workspace to experiment with and develop Azure OpenAI solutions.
  1a/1b. The user asks a question from a web UI or Microsoft Teams.
  2. In the app, an OpenAI LLM uses a clever prompt to determine which source contains the answer to the question.
  3. Three types of sources are available:
    • 3a. Azure SQL Database - contains COVID-related statistics for the US.
    • 3b. External Vector DB - contains AI-enriched documents from Blob Storage (10k PDFs and 52k articles).
      • 3b.1. Uses an OpenAI LLM to vectorize the top K document chunks.
      • 3b.2. Uses the external vector DB's cosine similarity to get the top N chunks.
      • 3b.3. Uses an OpenAI GPT model to craft the response by combining the question and the top N chunks.
    • 3c. CSV Tabular File - contains COVID-related statistics for the US.
  4. The app retrieves the result from the source and crafts the answer.
  5. The tuple (Question and Answer) is saved to CosmosDB to keep a record of the interaction.
  6. The answer is delivered to the user.
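The routing in steps 2-5 can be sketched in plain Python. Everything below is an illustrative stand-in, not the repo's actual code: the keyword heuristic stands in for the LLM router, a tiny in-memory list stands in for the vector DB (with real cosine-similarity ranking, as in step 3b.2), and a Python list stands in for CosmosDB:

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b))
    return dot / norm if norm else 0.0

def route_question(question):
    """Step 2: an OpenAI LLM picks the source; a keyword heuristic stands in here."""
    if "covid" in question.lower() and "statistic" in question.lower():
        return "sql"
    return "vectordb"

def search_vector_db(question_vec, index, top_n=2):
    """Step 3b.2: rank stored chunks by cosine similarity and keep the top N."""
    ranked = sorted(index, key=lambda c: cosine_similarity(question_vec, c["vec"]),
                    reverse=True)
    return [c["text"] for c in ranked[:top_n]]

# Toy index of pre-vectorized chunks (the real pipeline embeds with an OpenAI model)
index = [
    {"text": "chunk about vaccines", "vec": [0.9, 0.1]},
    {"text": "chunk about finance",  "vec": [0.1, 0.9]},
    {"text": "chunk about masks",    "vec": [0.8, 0.3]},
]

memory = []  # stands in for CosmosDB (step 5)

def answer(question, question_vec):
    source = route_question(question)
    if source == "vectordb":
        chunks = search_vector_db(question_vec, index)
        # Step 3b.3 would call a GPT model with the question plus these chunks
        result = "Answer grounded in: " + ", ".join(chunks)
    else:
        result = "Answer from Azure SQL"
    memory.append((question, result))  # step 5: persist the (question, answer) tuple
    return result

print(answer("What do the documents say about vaccines?", [0.9, 0.2]))
```

The sketch shows the key design choice: one router decides per question which tool answers it, and every (question, answer) pair is persisted regardless of which source was used.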

Demo

https://webapp-frontend-a3pkv5rdhszks.azurewebsites.us/

To open the bot in GCC-H MS Teams, click HERE. You need to have an account and permissions created prior to using Teams.


🔧Features

  • Implements the AOAI application hosted in the Azure Government cloud connecting to an Azure OpenAI instance in the Azure Commercial cloud, based on the recommended Microsoft architecture.
  • Enables a search/chat experience in Microsoft Teams through the Bot Framework and Bot Service.
  • 100% Python.
  • Incorporates an external vector store (Weaviate) deployed on Azure Kubernetes Service.
  • Uses LangChain as a wrapper for interacting with Azure OpenAI and vector stores, constructing prompts, and creating agents.
  • Tabular data Q&A with CSV files and SQL databases.
  • Uses CosmosDB as persistent memory to save users' conversations.
  • Uses Streamlit to build the frontend web application in Python.
  • Optional: uses Azure Cognitive Services to index and enrich unstructured documents: language detection, OCR of images, key-phrase extraction, and entity recognition (persons, emails, addresses, organizations, URLs).

Steps to Run the POC/Accelerator

Note: (Pre-requisite) You need to have an Azure OpenAI service already created

  1. Fork this repo to your Github account.
  2. In the Azure Commercial cloud, in Azure OpenAI Studio, deploy these two models (make sure the deployment name is the same as the model name):
    • "gpt-35-turbo" for the model "gpt-35-turbo (0301)". If you have access to "gpt-4", use it (it performs noticeably better).
    • "text-embedding-ada-002"
  3. In the Azure Government cloud, create a Resource Group to hold all the assets of this accelerator. Use the Azure OpenAI endpoints and keys to configure and deploy the resources in Azure Government.
  4. Click below to create all the Azure infrastructure needed to run the notebooks (Cognitive Services, SQL Database, CosmosDB):

Deploy To Azure Gov

Note: If you have never created an Azure Cognitive Services multi-service account before, create one manually in the Azure portal so you can read and accept the Responsible AI terms. Once it is deployed, delete it, then use the deployment button above.

  5. Clone your forked repo to your local machine or an AML compute instance. If your repo is private, see the Troubleshooting section below for how to clone a private repo.

  6. Make sure you run the notebooks in a Python 3.10 conda environment.

  7. Install the dependencies (run the pip command below in the same conda environment in which you will run the notebooks). For example, on an AML compute instance run:

conda activate azureml_py310_sdkv2
pip install -r ./common/requirements.txt

  8. Edit the file credentials.env with your own values from the services created in step 4.
  9. Run the notebooks in order; they build on top of each other.
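As a hypothetical illustration of the credentials.env edit, the file collects the endpoints and keys of the services created earlier. The variable names below are placeholders, not necessarily the keys the notebooks expect; use the exact names from the template shipped in the repo:

```shell
# Placeholder variable names -- consult the repo's credentials.env for the real keys
AZURE_OPENAI_ENDPOINT="https://<your-aoai-resource>.openai.azure.com/"   # Azure Commercial
AZURE_OPENAI_API_KEY="<your-aoai-key>"
SQL_SERVER_NAME="<your-sql-server>.database.usgovcloudapi.net"           # Azure Government
SQL_SERVER_DATABASE="<your-database>"
COSMOSDB_ENDPOINT="https://<your-cosmos-account>.documents.azure.us:443/"
COSMOSDB_KEY="<your-cosmos-key>"
VECTOR_DB_ENDPOINT="https://<your-weaviate-host>"                        # Weaviate on AKS
```

Note that the Azure OpenAI values come from the Commercial cloud while the remaining services live in the Government cloud, matching the cross-cloud architecture above.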

FAQs


  1. Why do we use Prompt Engineering rather than the fine-tune approach?

A: Quoting the OpenAI documentation: "GPT-3 has been pre-trained on a vast amount of text from the open internet. When given a prompt with just a few examples, it can often intuit what task you are trying to perform and generate a plausible completion. This is often called 'few-shot learning.' Fine-tuning improves on few-shot learning by training on many more examples than can fit in the prompt, letting you achieve better results on a wide number of tasks. Once a model has been fine-tuned, you won't need to provide examples in the prompt anymore. This saves costs and enables lower-latency requests."

However, fine-tuning the model requires providing hundreds or thousands of Prompt and Completion tuples, which are essentially query-response samples. The purpose of fine-tuning is not to give the LLM knowledge of the company's data but to provide it with examples so it can perform tasks really well without requiring examples on every prompt.

There are cases where fine-tuning is necessary, such as when the examples contain proprietary data that should not be exposed in prompts or when the language used is highly specialized, as in healthcare, pharmacy, or other industries or use cases where the language used is not commonly found on the internet.
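By contrast, the prompt-engineering approach used here injects the company's data into the prompt at query time, so no model weights ever change. A minimal sketch of such a grounded prompt (the template wording and function name are illustrative, not the repo's actual prompts):

```python
def build_grounded_prompt(question, chunks):
    """Assemble a retrieval-augmented prompt: numbered sources first, then the question."""
    context = "\n\n".join(f"[Source {i + 1}]\n{c}" for i, c in enumerate(chunks))
    return (
        "Answer the question using ONLY the sources below, and cite the source number.\n\n"
        f"{context}\n\n"
        f"Question: {question}\nAnswer:"
    )

prompt = build_grounded_prompt(
    "When did cases peak?",
    ["Cases peaked in January 2021.", "Hospitalizations lagged cases by two weeks."],
)
print(prompt)
```

Because the knowledge lives in the retrieved chunks rather than the model, updating the answers only requires re-indexing documents, not retraining.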

Troubleshooting


Steps to clone a private repo:

ssh-keygen -t ed25519 -C "your_email@example.com"
cat ~/.ssh/id_ed25519.pub
# Then select and copy the contents of the id_ed25519.pub file
# displayed in the terminal to your clipboard
  • On GitHub, go to Settings -> SSH and GPG Keys -> New SSH Key.
  • In the "Title" field, add a descriptive label for the new key, e.g. "AML Compute". In the "Key" field, paste your public key.
  • Clone your private repo:
git clone git@github.com:YOUR-USERNAME/YOUR-REPOSITORY.git
