# Getting Started with Azure Document Intelligence Pipeline

This notebook walks you through the prerequisites and initial setup for the Azure Document Intelligence PDF Processing Pipeline.

## Prerequisites

Before you begin, make sure you have the following tools installed and an Azure subscription available:

- [Azure CLI](https://docs.microsoft.com/cli/azure/install-azure-cli) (v2.50+)
- [Azure Functions Core Tools](https://docs.microsoft.com/azure/azure-functions/functions-run-local) (v4.x)
- [UV](https://docs.astral.sh/uv/) (Python package manager)
- [.NET 8+ SDK](https://dotnet.microsoft.com/download) (for Polyglot Notebooks)
- Python 3.10+
- Azure subscription with appropriate permissions

## 1. Verify Prerequisites

Run the following cells to verify your environment is correctly configured.

In [None]:
# Check Azure CLI version
Write-Host "Checking Azure CLI..." -ForegroundColor Cyan
az --version | Select-Object -First 3

In [None]:
# Check Azure Functions Core Tools
Write-Host "Checking Azure Functions Core Tools..." -ForegroundColor Cyan
func --version

In [None]:
# Check Python and UV
Write-Host "Checking Python..." -ForegroundColor Cyan
python --version

Write-Host "`nChecking UV..." -ForegroundColor Cyan
uv --version
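
The cells above print each tool's version but leave the comparison against the minimums in the Prerequisites list to the reader. A minimal sketch of automating that comparison follows. `Test-MinimumVersion` is a hypothetical helper, not part of the project. It assumes the tool's output contains a dotted version number such as `3.11.4`.

```powershell
# Hypothetical helper (not part of the project): pulls the first dotted
# version number out of a tool's output and compares it to a minimum.
function Test-MinimumVersion {
    param(
        [Parameter(Mandatory)] [string]  $VersionText,
        [Parameter(Mandatory)] [version] $Minimum   # must be at least "major.minor"
    )
    # Match the first "major.minor[.build[.revision]]" sequence in the text.
    if ($VersionText -match '\d+\.\d+(\.\d+){0,2}') {
        return ([version] $Matches[0]) -ge $Minimum
    }
    return $false  # no recognizable version number in the text
}

# Sample strings stand in for live command output here.
Test-MinimumVersion -VersionText 'Python 3.11.4' -Minimum '3.10'   # True
Test-MinimumVersion -VersionText 'Python 3.9.18' -Minimum '3.10'   # False
```

In a cell you would pass live output instead, for example `Test-MinimumVersion -VersionText (python --version) -Minimum '3.10'` or `Test-MinimumVersion -VersionText (dotnet --version) -Minimum '8.0'`. The `[version]` cast compares components numerically, so `3.9` sorts below `3.10`, which a plain string comparison would get wrong.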

## 2. Azure Authentication

Log in to Azure and select the subscription you want to deploy into.

In [None]:
# Log in to Azure (opens a browser for authentication)
az login

In [None]:
# List available subscriptions
Write-Host "Available Azure Subscriptions:" -ForegroundColor Cyan
az account list --output table

In [None]:
# Set your target subscription (UPDATE THIS VALUE)
$SUBSCRIPTION_ID = "<YOUR_SUBSCRIPTION_ID>"

az account set --subscription $SUBSCRIPTION_ID

# Verify current subscription
Write-Host "Current subscription:" -ForegroundColor Cyan
az account show --query "{Name:name, ID:id}" --output table
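
It is easy to run the cell above with the `<YOUR_SUBSCRIPTION_ID>` placeholder still in place. Since Azure subscription IDs are GUIDs, a quick format check catches that before `az account set` is called. The sketch below uses a hypothetical `Test-SubscriptionId` helper. It only validates the format, not that the subscription exists, and it does not apply if you pass a subscription name rather than an ID.

```powershell
# Hypothetical helper (not part of the project): returns $true only when
# the value parses as a GUID, which is the format of a subscription ID.
function Test-SubscriptionId {
    param([string] $SubscriptionId)
    $parsed = [guid]::Empty
    return [guid]::TryParse($SubscriptionId, [ref] $parsed)
}

Test-SubscriptionId '<YOUR_SUBSCRIPTION_ID>'                  # False
Test-SubscriptionId '00000000-0000-0000-0000-000000000000'    # True (valid format only)
```

One way to use it is to guard the call: `if (Test-SubscriptionId $SUBSCRIPTION_ID) { az account set --subscription $SUBSCRIPTION_ID }`.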

## 3. Install Project Dependencies

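Install the project's Python dependencies with UV. `uv sync` creates a virtual environment (`.venv`) if one does not exist and installs the dependencies declared in `pyproject.toml`.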

In [None]:
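# Navigate to project root (adjust path as needed; run this cell only once).
# $PSScriptRoot is empty in notebook cells, so use a path relative to the
# current directory. This assumes the working directory is notebooks/.
Set-Location -Path ".."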

# Install dependencies with UV
Write-Host "Installing Python dependencies..." -ForegroundColor Cyan
uv sync
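
`uv sync` needs to run from a directory inside the project. If you are unsure where the notebook's working directory is, one option is to search upward for the project file. The sketch below assumes the project root contains a `pyproject.toml`, which `uv sync` relies on but which is not shown in this notebook. `Find-ProjectRoot` is a hypothetical helper, not part of the project.

```powershell
# Hypothetical helper (not part of the project): walks up from a starting
# directory until it finds one containing pyproject.toml (an assumption).
function Find-ProjectRoot {
    param([string] $StartPath = (Get-Location).Path)
    $dir = Get-Item -LiteralPath $StartPath
    while ($null -ne $dir) {
        if (Test-Path -LiteralPath (Join-Path $dir.FullName 'pyproject.toml')) {
            return $dir.FullName
        }
        $dir = $dir.Parent   # becomes $null once the drive root is passed
    }
    return $null             # no pyproject.toml found in any parent
}

# Move to the project root only if one was found.
$root = Find-ProjectRoot
if ($root) { Set-Location -Path $root }
```

Unlike a fixed relative path, this gives the same result whether the cell runs from `notebooks/` or from the project root.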

## 4. Project Structure Overview

```
azure-doc-intel-pipeline/
├── infra/                          # Bicep infrastructure templates
│   ├── main.bicep                  # Main deployment orchestrator
│   ├── modules/                    # Reusable Bicep modules
│   └── parameters/                 # Environment configs
├── src/
│   ├── functions/                  # Azure Functions (Python)
│   └── synapse/                    # Synapse artifacts
├── scripts/                        # Deployment scripts
├── tests/                          # Unit and integration tests
├── notebooks/                      # Polyglot notebooks (you are here!)
└── README.md
```

In [None]:
# View actual project structure (UPDATE THIS PATH to your local clone)
$PROJECT_ROOT = "E:\Repos\GitHub\MyDemoRepos\FormExtraction"
Set-Location -Path $PROJECT_ROOT
Get-ChildItem -Directory -Recurse -Depth 2 | Select-Object FullName
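
To compare a local clone against the layout shown above without reading the listing by eye, a short check can report which expected folders are missing. This is a sketch only. `Get-MissingProjectPath` is a hypothetical helper, and its directory list is copied from the tree in this section.

```powershell
# Hypothetical helper (not part of the project): returns the directories
# from the overview tree that do not exist under the given root.
function Get-MissingProjectPath {
    param([string] $Root = (Get-Location).Path)
    $expected = 'infra', 'src/functions', 'src/synapse', 'scripts', 'tests', 'notebooks'
    $expected | Where-Object {
        -not (Test-Path -LiteralPath (Join-Path $Root $_) -PathType Container)
    }
}

# Lists any missing directories; prints nothing when all are present.
Get-MissingProjectPath
```

An empty result means every directory in the overview was found under the current location.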

## 5. Deployment Options

Choose the deployment option that best fits your scenario:

| Option | Use Case | What Gets Deployed | Notebook |
|--------|----------|-------------------|----------|
| **A** | Fresh start, no existing resources | All resources | `01-Deployment-New-Resources.ipynb` |
| **B** | Have existing resources + Function App | Function code only | `02-Deployment-Existing-Resources.ipynb` |
| **C** | Have existing resources, need new Function App | Function App + Plan | `02-Deployment-Existing-Resources.ipynb` |

Continue with the appropriate notebook based on your scenario.
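
The table above reduces to two questions: do you already have Azure resources, and do you already have a Function App? The sketch below writes that decision out as code. `Get-DeploymentOption` is a hypothetical helper for illustration, and the option letters and notebook names are taken from the table.

```powershell
# Hypothetical helper (not part of the project): maps the two questions
# behind the deployment table to an option letter and its notebook.
function Get-DeploymentOption {
    param(
        [bool] $HasExistingResources,
        [bool] $HasFunctionApp
    )
    if (-not $HasExistingResources) {
        # Option A: nothing exists yet, so everything is deployed.
        return [pscustomobject]@{ Option = 'A'; Notebook = '01-Deployment-New-Resources.ipynb' }
    }
    # Options B and C share a notebook; they differ in whether a Function App exists.
    $option = if ($HasFunctionApp) { 'B' } else { 'C' }
    return [pscustomobject]@{ Option = $option; Notebook = '02-Deployment-Existing-Resources.ipynb' }
}

# Existing resources but no Function App yet corresponds to option C.
Get-DeploymentOption -HasExistingResources $true -HasFunctionApp $false
```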

## Notebooks Index

| Notebook | Description |
|----------|-------------|
| `00-Getting-Started.ipynb` | Prerequisites and setup (this notebook) |
| `01-Deployment-New-Resources.ipynb` | Deploy all Azure resources from scratch |
| `02-Deployment-Existing-Resources.ipynb` | Deploy with existing Azure resources |
| `03-Local-Development.ipynb` | Set up local development environment |
| `04-Testing-Linting.ipynb` | Run tests and lint code |
| `05-Synapse-Pipeline.ipynb` | Configure and run Synapse pipeline |
| `06-Analytics-SynapseLink.ipynb` | Synapse Link and Delta Lake analytics |
| `07-Monitoring-Troubleshooting.ipynb` | Logging, monitoring, and troubleshooting |