nikcholer/csharp-semantic-document-processor

C# Semantic Document Processor

.NET 8 Web API portfolio project that classifies document images, extracts typed invoice or receipt data, and applies deterministic business policy checks through Microsoft Semantic Kernel.

The business scenario is a lightweight accounts-payable intake workflow: receive a synthetic invoice or receipt image, classify it, extract only the fields needed by downstream systems, match vendors against an approved list, and return an auditable policy decision with token usage for cost tracking.

Demo frontend showing a processed receipt

The target architecture is Microsoft-centric at the application layer:

  • ASP.NET Core Minimal API
  • dependency injection
  • options binding with IOptions<T>
  • Microsoft Semantic Kernel
  • provider-portable LLM configuration through an OpenAI-compatible endpoint

The initial provider target is Together AI using a configurable vision-capable open model.

Current Status

The portfolio slice is complete:

  • solution file
  • Web API project
  • pinned Semantic Kernel connector package
  • AiSettings options model
  • environment-based API key provider
  • lazy Semantic Kernel registration
  • /health endpoint
  • initial domain model for invoices, receipts, shared document metadata, and policy decisions
  • image intake endpoint with multipart validation and file metadata response
  • live image classification through Semantic Kernel and Together AI
  • typed invoice and receipt extraction through Semantic Kernel and Together AI
  • deterministic Semantic Kernel native plugins for vendor matching and policy evaluation
  • DocumentProcessingOrchestrator for classify, route, extract, evaluate, and aggregate response flow
  • xUnit tests for deterministic policy, validation, parsing, and orchestration paths
  • synthetic demo assets for the current invoice and receipt scope
  • browser frontend for upload, workflow inspection, policy results, token totals, and raw JSON
  • consistent API error responses with correlation IDs
  • Dockerfile and GitHub Actions build/test workflow

Policy evaluation is implemented for the v1 invoice and receipt samples. The API endpoint handles upload validation and delegates the semantic workflow to the orchestrator.

Architecture

flowchart LR
    Client["Client / curl / future UI"] --> Api["ASP.NET Core Minimal API<br/>POST /api/documents/process"]
    Api --> Intake["DocumentImageValidator<br/>content type, extension, size"]
    Intake --> Orchestrator["DocumentProcessingOrchestrator"]
    Orchestrator --> Classifier["IDocumentClassificationService<br/>Semantic Kernel chat completion"]
    Classifier --> Model["Together AI<br/>OpenAI-compatible vision model"]
    Orchestrator --> Extractor["IDocumentExtractionService<br/>invoice or receipt prompt"]
    Extractor --> Model
    Orchestrator --> Policy["IPolicyEvaluationService"]
    Policy --> Plugins["Semantic Kernel native plugins<br/>VendorPolicyPlugin + ApprovalPolicyPlugin"]
    Plugins --> VendorStore["In-memory vendor policies"]
    Orchestrator --> Response["DocumentProcessingResponse<br/>typed data, policy result, token usage"]

The application keeps the model-facing behavior behind app-owned service interfaces. Semantic Kernel is used for the chat-completion integration and the native C# business plugins, while the approval decisions themselves stay deterministic and testable.
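To make the "deterministic and testable" point concrete, here is a hypothetical sketch of vendor alias matching in plain C#. The names and vendor data are illustrative, not the repository's actual `VendorPolicyPlugin`, which exposes logic of this kind to Semantic Kernel as a native plugin:

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Illustrative approved-vendor list with known aliases (not the seeded data).
var approvedVendors = new Dictionary<string, string[]>
{
    ["Workspace Interiors Ltd"] = new[] { "workspace interiors", "workspace interiors ltd" },
};

// Normalize the model-extracted vendor name and match it against aliases.
// Returns the canonical vendor name, or null when no alias matches.
string? MatchVendor(string extractedName)
{
    var normalized = extractedName.Trim().ToLowerInvariant();
    return approvedVendors.FirstOrDefault(v => v.Value.Contains(normalized)).Key;
}

Console.WriteLine(MatchVendor(" Workspace Interiors LTD ") ?? "no match"); // Workspace Interiors Ltd
Console.WriteLine(MatchVendor("Unknown Co") ?? "no match");                // no match
```

Because the matching is ordinary C#, it can be unit-tested without any model call, which is exactly what the deterministic test suite relies on.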

Microsoft / Semantic Kernel Vocabulary

| Concept | In this project | Microsoft-centric vocabulary |
| --- | --- | --- |
| API host | SemanticDocumentProcessor.Api | ASP.NET Core Minimal API |
| Configuration | AiSettings, DocumentIntakeSettings, PolicySettings | Options pattern with IOptions&lt;T&gt; |
| Model connector | AddOpenAIChatCompletion with a custom endpoint | Semantic Kernel chat completion service |
| Workflow coordinator | DocumentProcessingOrchestrator | Application service / orchestration layer |
| Model prompt boundary | classification and extraction services | Semantic Kernel chat history plus prompt execution settings |
| Business functions | VendorPolicyPlugin, ApprovalPolicyPlugin | Semantic Kernel native plugins / kernel functions |
| Typed outputs | InvoiceData, ReceiptData, ProcessedDocument | C# records with predictable JSON serialization |
| Policy output | InvoicePolicyResult, ReceiptPolicyResult | deterministic domain service result |
| Cost signal | ModelTokenUsage, DocumentModelUsage | structured logging and response telemetry |

Configuration

Default AI settings live in src/SemanticDocumentProcessor.Api/appsettings.json:

{
  "Ai": {
    "Provider": "TogetherAI",
    "Endpoint": "https://api.together.xyz/v1",
    "ModelId": "google/gemma-4-31B-it",
    "ApiKeyEnvironmentVariable": "TOGETHER_API_KEY",
    "ServiceId": "together-vision",
    "RequestTimeoutSeconds": 180
  }
}

Do not put API keys in source-controlled configuration files.
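Binding the "Ai" section with the options pattern is a small amount of code. The following is a sketch with property names taken from the JSON above; the repository's actual AiSettings type may differ in shape:

```csharp
// Sketch of an options class matching the "Ai" configuration section;
// the real AiSettings record in the repository may differ.
public sealed class AiSettings
{
    public string Provider { get; set; } = "";
    public string Endpoint { get; set; } = "";
    public string ModelId { get; set; } = "";
    public string ApiKeyEnvironmentVariable { get; set; } = "";
    public string ServiceId { get; set; } = "";
    public int RequestTimeoutSeconds { get; set; }
}

// In Program.cs the section is bound once, and consumers then receive
// IOptions<AiSettings> through constructor injection:
// builder.Services.Configure<AiSettings>(builder.Configuration.GetSection("Ai"));
```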

Set the Together key as a user-level environment variable:

[Environment]::SetEnvironmentVariable("TOGETHER_API_KEY", "your_key_here", "User")

Restart the terminal or Codex session after setting the variable.

For a one-session smoke test:

$env:TOGETHER_API_KEY = "your_key_here"

Run

dotnet run --project .\src\SemanticDocumentProcessor.Api\SemanticDocumentProcessor.Api.csproj

Demo frontend:

GET http://localhost:5275/

Health check:

GET http://localhost:5275/health

The health response reports whether the configured API key environment variable is present, without exposing the key.
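The presence check itself can be as simple as the following sketch (the method name is assumed; the environment variable name comes from the default configuration). The point is that only a boolean leaves the process, never the key:

```csharp
using System;

// Report whether the configured key variable is set, without echoing it.
bool IsApiKeyConfigured(string environmentVariableName) =>
    !string.IsNullOrWhiteSpace(Environment.GetEnvironmentVariable(environmentVariableName));

// Process-local demo value for illustration only.
Environment.SetEnvironmentVariable("TOGETHER_API_KEY", "demo-value");
Console.WriteLine(IsApiKeyConfigured("TOGETHER_API_KEY"));        // True
Console.WriteLine(IsApiKeyConfigured("SURELY_NOT_SET_VAR_42"));   // False
```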

Process an image:

curl.exe -F "image=@assets/sample-doc1.png;type=image/png" -F "sourceId=sample-doc1" http://localhost:5275/api/documents/process

Try all included demo assets:

curl.exe -F "image=@assets/sample-doc1.png;type=image/png" -F "sourceId=sample-doc1" http://localhost:5275/api/documents/process
curl.exe -F "image=@assets/sample-doc2.png;type=image/png" -F "sourceId=sample-doc2" http://localhost:5275/api/documents/process
curl.exe -F "image=@assets/sample-doc3.png;type=image/png" -F "sourceId=sample-doc3" http://localhost:5275/api/documents/process

The current processing endpoint validates and reads the uploaded image, then delegates to DocumentProcessingOrchestrator. The orchestrator classifies it as Invoice, Receipt, or Unknown, routes to the correct extractor, evaluates deterministic C# business policy through Semantic Kernel native plugins where applicable, and returns a single typed response.

Responses include modelUsage with token counts for each model call and per-document totals when the provider returns usage data:

{
  "modelUsage": {
    "calls": [
      {
        "operation": "classification",
        "modelId": "google/gemma-4-31B-it",
        "inputTokens": 439,
        "outputTokens": 150,
        "totalTokens": 589
      }
    ],
    "totalInputTokens": 439,
    "totalOutputTokens": 150,
    "totalTokens": 589
  }
}

The API also emits structured log events named ModelTokenUsage and DocumentModelUsage with FileName, SourceId, ModelId, and token fields for downstream cost analysis.
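The per-document roll-up is a simple aggregation. A minimal sketch matching the response shape above (the tuple here stands in for the repository's ModelTokenUsage record):

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// One entry per model call; values mirror the sample response above.
var calls = new List<(string Operation, int InputTokens, int OutputTokens)>
{
    ("classification", 439, 150),
};

// Per-document totals, as reported in modelUsage.
var totalInput = calls.Sum(c => c.InputTokens);
var totalOutput = calls.Sum(c => c.OutputTokens);
var totalTokens = totalInput + totalOutput;

Console.WriteLine($"totalTokens: {totalTokens}"); // totalTokens: 589
```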

Every request receives an X-Correlation-ID response header. If the caller sends X-Correlation-ID, that value is used as the ASP.NET Core trace identifier; otherwise the server-generated trace identifier is returned. Error responses use a shared shape:

{
  "code": "invalid_document_upload",
  "message": "Unsupported content type 'application/pdf'.",
  "target": "image",
  "traceId": "00-..."
}
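The correlation-ID rule reduces to a small pure function; a sketch with assumed names:

```csharp
using System;

// Echo the caller's X-Correlation-ID when present; otherwise fall back to
// the server-generated ASP.NET Core trace identifier.
string ResolveCorrelationId(string? callerHeader, string serverTraceId) =>
    string.IsNullOrWhiteSpace(callerHeader) ? serverTraceId : callerHeader;

Console.WriteLine(ResolveCorrelationId("client-abc-123", "00-server-trace")); // client-abc-123
Console.WriteLine(ResolveCorrelationId(null, "00-server-trace"));             // 00-server-trace
```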

The included sample assets currently process as:

  • assets/sample-doc1.png: Invoice, vendor Workspace Interiors Ltd, total 967.20 GBP
  • assets/sample-doc2.png: Receipt, store Meadow Vale Supermarket, total 21.02 GBP
  • assets/sample-doc3.png: Receipt, store South Coast Rail Services, total 32.30 GBP

All current samples evaluate to Approved under the seeded policies. Invoice policy checks vendor alias matching, active vendor status, currency, and max auto-approved value. Receipt policy checks the review threshold and visible payment method.
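A hedged sketch of how such deterministic checks compose. The threshold, currency, and reason strings below are illustrative, not the repository's seeded policy values:

```csharp
using System;

// Illustrative invoice policy: each check is ordinary C#, so the decision
// is reproducible and unit-testable without any model call.
string EvaluateInvoice(bool vendorActive, string currency, decimal total, decimal maxAutoApproved)
{
    if (!vendorActive) return "NeedsReview: vendor inactive or unmatched";
    if (currency != "GBP") return $"NeedsReview: unsupported currency '{currency}'";
    if (total > maxAutoApproved) return "NeedsReview: total exceeds auto-approval limit";
    return "Approved";
}

Console.WriteLine(EvaluateInvoice(true, "GBP", 967.20m, 1000m)); // Approved
```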

Provider Portability

The code is intentionally not tied to a direct OpenAI account. The current configuration uses Together AI through an OpenAI-compatible endpoint because Semantic Kernel can speak to that shape via AddOpenAIChatCompletion.
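For reference, wiring the connector to a custom base endpoint looks roughly like this. This is a sketch against recent Microsoft.SemanticKernel versions, not the project's actual registration code; the exact overload and parameter names may vary with the pinned connector package:

```csharp
using System;
using Microsoft.SemanticKernel;

// Sketch only: point the OpenAI chat-completion connector at an
// OpenAI-compatible endpoint such as Together AI. Values shown here come
// from the README's default configuration.
var builder = Kernel.CreateBuilder();
builder.AddOpenAIChatCompletion(
    modelId: "google/gemma-4-31B-it",
    endpoint: new Uri("https://api.together.xyz/v1"),
    apiKey: Environment.GetEnvironmentVariable("TOGETHER_API_KEY") ?? "",
    serviceId: "together-vision");
var kernel = builder.Build();
```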

Provider-specific details are concentrated in configuration:

  • Ai:Provider
  • Ai:Endpoint
  • Ai:ModelId
  • Ai:ApiKeyEnvironmentVariable
  • Ai:ServiceId
  • Ai:RequestTimeoutSeconds

The rest of the application depends on IDocumentClassificationService, IDocumentExtractionService, IPolicyEvaluationService, and IDocumentProcessingOrchestrator. A later provider profile can swap endpoint/model/key settings when the provider supports the same chat-completion/image payload shape. If a provider needs a different payload contract, the change should stay behind the classification and extraction service interfaces, leaving the API contract, domain records, and policy plugins intact.

PDF Scope

PDF input is deliberately out of scope for v1. The portfolio point here is the semantic processing workflow, not document rasterization. Keeping the public endpoint image-only avoids mixing two concerns:

  • image intake, classification, extraction, and policy evaluation
  • PDF page rendering, page selection, and multi-page document handling

PDF support can be added later as an adapter in front of the existing pipeline:

  1. Accept a PDF upload through a separate intake path.
  2. Render selected pages to images using a PDF-to-image library or service.
  3. Submit each rendered image to the existing DocumentProcessingOrchestrator.
  4. Add aggregation rules for multi-page documents if needed.

That keeps the current model prompts, typed extraction records, policy plugins, and response model reusable.

Portfolio Narrative

This project demonstrates a pragmatic enterprise AI pattern for C# teams: use Microsoft-native application structure and Semantic Kernel integration points while keeping the LLM provider replaceable. The model handles fuzzy visual understanding and field extraction; C# owns validation, routing, vendor policy, approval thresholds, logging, and tests.

The result is a small but realistic document-processing slice: it shows how to add AI into a .NET workflow without handing core business decisions to the model, and it produces typed, auditable responses that a finance or operations system could consume.

Additional portfolio notes:

Policy verification without live model calls:

dotnet run --project .\spikes\PolicyPluginVerifier\PolicyPluginVerifier.csproj --no-restore

The verifier invokes the Semantic Kernel native policy plugins directly with the current sample extraction values.

Tests

Run the unit test suite:

dotnet test .\tests\SemanticDocumentProcessor.Tests\SemanticDocumentProcessor.Tests.csproj --no-restore

The tests cover deterministic vendor matching, approval policy boundaries, upload image validation, model JSON parsing failure paths, and orchestrator routing using fake classifier/extractor/policy services. The test project is run by project path so the existing solution build remains focused on the API project.

Demo Frontend

The API serves a minimal browser UI from src/SemanticDocumentProcessor.Api/wwwroot. It supports image selection or drag-and-drop, optional sourceId, API health display, workflow status, extracted document fields, policy reasons, token totals, and raw JSON inspection.

The frontend calls the same POST /api/documents/process endpoint used by the curl examples, so screenshots reflect the real API workflow rather than mocked data.

Container

Build the API container:

docker build -t semantic-document-processor .

Run it with the Together key supplied from the host environment:

docker run --rm -p 8080:8080 -e TOGETHER_API_KEY=$env:TOGETHER_API_KEY semantic-document-processor

The repository includes a GitHub Actions workflow at .github/workflows/build.yml that restores, builds, and runs the unit tests.

License

This project is licensed under the MIT License. See LICENSE.

About

.NET 8 + Semantic Kernel demo for multimodal document classification, extraction, deterministic policy checks, and reviewable outputs.
