Document Processing with Azure AI Samples

page_type

languages

products

name

description

sample

python

csharp

bicep

azure

ai-services

azure-openai

document-intelligence

language-service

azure-translator

Document Processing with Azure AI Samples

This collection of samples demonstrates how to use various Azure AI capabilities to build a solution to extract structured data, classify, redact, and analyze documents.

Document Processing with Azure AI Samples

This repository contains a collection of code samples that demonstrate how to use various Azure AI capabilities to process documents.

The samples are intended to help engineering teams establish techniques with Azure AI Foundry, Azure OpenAI, Azure AI Document Intelligence, and Azure AI Language services to build solutions to extract structured data, classify, and analyze documents.

The techniques demonstrated take advantage of various capabilities from each service to:

Reduce complexity of custom model training by taking advantage of the capabilities of Generative AI models to analyze and classify documents.
Improve reliability in document processing by utilizing combining AI service capbilities to extract structured data from any document type, with high accuracy and confidence.
Simplify document processing workflows by providing reusable code and patterns that can be easily modified and evaluated for most use cases.

Samples

Document Classification

Sample	Link	Description	Example Use Cases
Vision-based Classification with Azure OpenAI GPT-4.1	Python \| .NET	Use Azure OpenAI GPT-4.1 models to classify documents using their built-in vision capabilities.	Processing multiple documents types or documents with varying purposes, such as contracts, legal documents, and emails.
Semantic Similarity Classification with Vector Embeddings	Python \| .NET	Use Azure OpenAI embedding models to convert document text and classify them based on similarity to pre-defined classification lists.	Processing multiple documents types or documents with varying purposes, such as contracts, legal documents, and emails.

Document Redaction

Sample	Link	Description	Example Use Cases
LLM-enabled Redaction with Azure AI Document Intelligence, Azure OpenAI GPT-4.1, and Post-Processing	Python \| .NET	Use Azure AI Document Intelligence `prebuilt-layout` and Azure OpenAI GPT models to redact sensitive information from documents using natural language instruction to determine redaction areas.	Require specific redaction rules, such as redacting based on context or relationships. Also works for redacting PII, including names, addresses, and phone numbers.
Document Redaction with Azure AI Language PII Native Document Analysis	Python \| .NET	Use Azure AI Language Native Document Analysis to redact personally identifiable information (PII) from documents.	Redacting sensitive information from documents, such as names, addresses, and phone numbers.

Document Extraction

Note

All data extraction samples provide both an accuracy and confidence score for the extracted data. The accuracy score is calculated based on the similarity between the extracted data and the ground truth data. The confidence score can be calculated based on OCR analysis confidence and logprobs in Azure OpenAI responses.

Sample	Link	Description	Example Use Cases
Text-based Extraction with Azure AI Document Intelligence and Azure OpenAI GPT-4.1	Python \| .NET	Use Azure AI Document Intelligence `prebuilt-layout` and Azure OpenAI GPT models to extract structured data from documents using text.	Predominantly text-based documents such as invoices, receipts, and forms.
Text-based Extraction with Azure AI Document Intelligence and Microsoft Phi	Python \| .NET	Use Azure AI Document Intelligence `prebuilt-layout` and Microsoft's Phi models to extract structured data from documents using text.	Predominantly text-based documents such as invoices, receipts, and forms.
Vision-based Extraction with Azure OpenAI GPT-4.1	Python \| .NET	Use Azure OpenAI GPT-4.1 models to extract structured data from documents using vision capabilities.	Complex documents with a mix of text and images, including diagrams, signatures, selection marks, etc. such as reports and contracts.
Multi-Modal (Text and Vision) Extraction with Azure AI Document Intelligence and Azure OpenAI GPT-4.1	Python \| .NET	Improve the accuracy and confidence in extracting structured data from documents by combining text and images with LLMs.	Any structured or unstructured document type.

Use Case Scenarios

This repo also contains a collection of end-to-end use case scenarios that demonstrate how to combine the various samples to create a real-world scenario for document processing.

Scenario	Link	Description
Invoice	Python \| .NET	Using a structured Invoice object (Python \| .NET), invoice documents can be extracted into a standard Invoice schema by first classifying which pages to extract from using boundary detection.
US Tax 1040	Python	Using Azure AI Document Intelligence prebuilt-tax.us.1040 models, US Tax 1040 documents can be extracted into a standard schema for each form type by first classifying which pages to extract from using boundary detection with Azure OpenAI.

Getting Started

The sample repository comes with a Dev Container that contains all the necessary tools and dependencies to run the sample. Please review the container and it's dependencies to understand all of the necessary components required to run these in a real-world environment, including the use of Poppler.

Important

An Azure subscription is required to run these samples. If you don't have an Azure subscription, create an account.

Setup on GitHub Codespaces

Preferred Method: You can use GitHub Codespaces to quickly set up a development environment without needing to install anything on your local machine.

Note

After the environment has loaded, you may need to run the following command in the terminal to install the necessary Python dependencies: pip --disable-pip-version-check --no-cache-dir install --user -r requirements.txt

Once the Dev Container is up and running, continue to the deployment section.

Setup on Local Machine

Alternative Method: If you prefer to run the project on your local machine, you can set up a development environment using Docker and Visual Studio Code.

To use the Dev Container, you need to have the following tools installed on your local machine:

Install Visual Studio Code
Install Docker Desktop
Install Remote - Containers extension for Visual Studio Code

To setup a local development environment, follow these steps:

Important

Ensure that Docker Desktop is running on your local machine.

Clone the repository to your local machine.
Open the repository in Visual Studio Code.
Press F1 to open the command palette and type Dev Containers: Reopen in Container.

Note

After the environment has loaded, you may need to run the following command in the terminal to install the necessary Python dependencies: pip --disable-pip-version-check --no-cache-dir install --user -r requirements.txt

Once the Dev Container is up and running, continue to the Azure environment setup section.

Deploy the Azure environment

Once the Dev Container is up and running, you can setup the necessary Azure services and run the samples in the repository by running the following command in a bash terminal:

Note

For the most optimal sample experience, it is recommended to run the samples in East US which will provide support for all the services used in the samples. Find out more about region availability for Azure AI Document Intelligence, and GPT-4.1, Phi-4, and text-embedding-3-large models.

az login

bash ./infra/scripts/deploy.sh {unique-deployment-name} {resource-group-name} {location}

Alternatively, you can run the PowerShell script to deploy the Azure environment in a pwsh terminal:

az login

./infra/scripts/Deploy-Infrastructure.ps1 -DeploymentName {unique-deployment-name} -ResourceGroupName {resource-group-name} -Location {location}

Note

If a specific Azure tenant is required, use the --tenant <TenantId> parameter in the az login command. az login --tenant <TenantId>

The script will deploy the following resources to your Azure subscription:

Azure AI Foundry Hub & Project, a development platform for building AI solutions that integrates with Azure AI Services in a secure manner using Microsoft Entra ID for authentication.
- Note: Phi-4 will be deployed as a PAYG serverless endpoint in the Azure AI Foundry Project with its primary key stored in the associated Azure Key Vault.
Azure AI Services, a managed service for all Azure AI Services, including Azure OpenAI, Azure AI Document Intelligence, and Azure AI Language services.
- Note: GPT-4.1 will be deployed as a Global Standard model. text-embedding-3-large will be deployed as a Standard model. These can be adjusted based on your quota availability in the main.bicep file.
Azure Storage Account, required by Azure AI Foundry.
Azure Monitor, used to store logs and traces for monitoring and troubleshooting purposes.

Note

All resources are secured by default with Microsoft Entra ID using Azure RBAC. Your user client ID will be added with the necessary least-privilege roles to access the resources created.

After the script completes, you can run any of the samples in the repository by following their instructions.

For more information on the deployment, see the infrastructure deployment README.

Contributing

You can contribute to the repository by opening an issue or submitting a pull request. For more information, see the Contributing guide.

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 186 Commits
.devcontainer		.devcontainer
.github		.github
.vscode		.vscode
images		images
infra		infra
samples		samples
.editorconfig		.editorconfig
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
CODEOWNERS		CODEOWNERS
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.md		LICENSE.md
README.md		README.md
azure.yaml		azure.yaml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Document Processing with Azure AI Samples

Contents

Samples

Document Classification

Document Redaction

Document Extraction

Use Case Scenarios

Getting Started

Setup on GitHub Codespaces

Setup on Local Machine

Deploy the Azure environment

Contributing

License

About

Uh oh!

Uh oh!

Contributors 5

Uh oh!

Languages

License

Azure-Samples/azure-ai-document-processing-samples

Folders and files

Latest commit

History

Repository files navigation

Document Processing with Azure AI Samples

Contents

Samples

Document Classification

Document Redaction

Document Extraction

Use Case Scenarios

Getting Started

Setup on GitHub Codespaces

Setup on Local Machine

Deploy the Azure environment

Contributing

License

About

Topics

Resources

License

Code of conduct

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors 5

Uh oh!

Languages