XafRag — RAG Sample for DevExpress XAF

2026-03-21.11-38-12.mp4

XafRag — RAG Sample for DevExpress XAF

XafRag is a tutorial and reference implementation showing how to add Retrieval-Augmented Generation (RAG) to a DevExpress XAF Blazor Server application. It uses PostgreSQL with the PGVector extension to store and query vector embeddings, OpenAI to generate embeddings and LLM responses, and the DevExpress DxAIChat component to provide a polished in-app chat interface — all wired together through Microsoft.Extensions.AI abstractions.

Tech Stack

Layer	Technology
Framework	.NET 8, DevExpress XAF 25.2.3
UI	Blazor Server, DevExpress DxAIChat 25.2.4
ORM	EF Core 8
Vector store	PostgreSQL 18 + PGVector
AI	OpenAI `text-embedding-3-small` (embeddings), `gpt-4o` (chat)
AI abstractions	`Microsoft.Extensions.AI` 9.7.1
Document parsing	DevExpress Document Processor (PDF, DOCX), plain text (TXT, MD)
Logging	Serilog (console + rolling file)

Prerequisites

.NET 8 SDK
Docker Desktop (for PostgreSQL + PGVector)
DevExpress license (25.2.x) with the DevExpress NuGet feed configured
OpenAI API key

Quick Start

Clone the repository

git clone https://github.com/your-org/xafrag.git
cd xafrag

Start PostgreSQL with PGVector
```
docker compose up -d
```
This starts a pgvector/pgvector:pg18 container on port 5435 with database XafRag.
Set your OpenAI API key

Add your key to appsettings.Development.json (gitignored):
```
{
  "OpenAI": {
    "ApiKey": "sk-your-key-here"
  }
}
```
Run the application
```
dotnet run --project XafRag/XafRag.Blazor.Server
```
On first run, XAF creates all database tables automatically (including the knowledge_chunks vector table).
Log in

Open https://localhost:5001 and log in as Admin with an empty password.
Add knowledge

Navigate to Knowledge Base > Document and upload files. Supported formats: .txt, .md, .pdf, .docx. The file name is captured automatically from the upload. Each document is chunked, embedded, and stored in PGVector in the background.

Alternatively, navigate to Knowledge Base > Knowledge Article and write articles directly. They are ingested on save.

Sample documents covering .NET, Blazor, EF Core, XAF, and the Dark Forest theory are included in the docs/ folder — upload them to get started quickly.
Chat

Navigate to Knowledge Base > RAG Chat and ask questions. The assistant retrieves relevant chunks from your knowledge base, cites the source document and chunk position, and generates a grounded response. Use the Clear Chat button in the header to start a new conversation.

Project Structure

xafrag/
├── docker-compose.yml                        # PostgreSQL + PGVector (port 5435)
├── docs/                                     # Sample documents for demo upload
│   ├── blazor-components.md
│   ├── blazor-fundamentals.md
│   ├── dark-forest-theory.md
│   ├── dotnet-rag-quickstart.md
│   ├── ef-core-getting-started.md
│   ├── ef-core-querying.md
│   ├── ef-core-relationships.md
│   ├── fermi-paradox.md
│   ├── xaf-crud-operations.md
│   ├── xaf-security-passwords.md
│   ├── how_to_implement.md                   # Step-by-step implementation guide
│   └── architecture.excalidraw              # Editable architecture diagram
├── XafRag/
│   ├── XafRag.Module/
│   │   └── BusinessObjects/
│   │       ├── KnowledgeArticle.cs           # XAF entity (title + content)
│   │       ├── Document.cs                   # XAF entity (file upload, auto filename)
│   │       ├── DocumentStatus.cs             # Pending → Processing → Completed/Failed
│   │       ├── RagChatHolder.cs              # Non-persistent object backing the chat view
│   │       ├── KnowledgeChunk.cs             # EF Core entity with vector(1536) column
│   │       ├── ChunkSourceType.cs            # Article or Document enum
│   │       ├── RagDbContext.cs               # Separate DbContext for vector operations
│   │       └── XafRagDbContext.cs            # XAF-managed EF Core DbContext
│   └── XafRag.Blazor.Server/
│       ├── Configuration/
│       │   ├── OpenAiOptions.cs              # API key + model names
│       │   └── RagOptions.cs                 # Chunk size, overlap, search thresholds
│       ├── Controllers/
│       │   ├── KnowledgeArticleIngestionController.cs
│       │   ├── DocumentIngestionController.cs
│       │   └── RagChatWindowController.cs    # Redirects ListView → DetailView for chat
│       ├── Editors/
│       │   ├── RagChatViewItem.cs            # Custom XAF ViewItem (IComponentContentHolder)
│       │   └── RagChatComponent.razor        # DxAIChat wrapper with Markdown rendering
│       ├── Services/
│       │   ├── ChunkingService.cs            # Paragraph-aware text splitter
│       │   ├── EmbeddingService.cs           # Wraps IEmbeddingGenerator
│       │   ├── DocumentProcessingService.cs  # PDF/DOCX/TXT/MD text extraction
│       │   ├── IngestionService.cs           # Background ingestion with status tracking
│       │   └── RagService.cs                 # Vector search + source resolution + LLM streaming
│       ├── RagChatDetailViewUpdater.cs       # Programmatic model layout for chat view
│       ├── BlazorModule.cs                   # Registers the detail view updater
│       ├── Startup.cs                        # DI wiring
│       ├── Program.cs                        # Serilog configuration
│       └── appsettings.json                  # Configuration (API key in Development.json)

How It Works

Ingestion pipeline

When a KnowledgeArticle is saved or a Document is uploaded, an XAF ViewController fires after ObjectSpace.Committed. The document status moves to Processing and the IngestionService runs the pipeline on a background thread:

Upload document / Save article
  → DocumentProcessingService  — extract text from PDF/DOCX/TXT/MD
  → ChunkingService            — split into ~500-token paragraphs with 100-token overlap
  → EmbeddingService           — call OpenAI text-embedding-3-small → float[1536] vectors
  → RagDbContext               — delete old chunks, insert new KnowledgeChunk rows
  → Update Document.Status     — Completed or Failed

Retrieval pipeline

When the user sends a message in the RAG Chat:

User question
  → EmbeddingService           — embed the question (same model)
  → RagDbContext               — cosine distance search via PGVector, top 5 under threshold
  → Source resolution           — resolve document filenames / article titles
  → RagService                 — build system prompt: "[Part N of "filename.md"] chunk text..."
  → OpenAI gpt-4o              — streaming chat response
  → DxAIChat                   — render Markdown with source citations in the browser

Configuration

All configuration lives in appsettings.json. The OpenAI API key should be placed in appsettings.Development.json (gitignored).

"OpenAI": {
  "ApiKey": "",
  "EmbeddingModel": "text-embedding-3-small",
  "ChatModel": "gpt-4o"
},
"Rag": {
  "ChunkTokenLimit": 500,
  "ChunkOverlap": 100,
  "MaxResults": 5,
  "DistanceThreshold": 1.0
}

Setting	Description
`OpenAI:ApiKey`	Your OpenAI API key (use `appsettings.Development.json`)
`ChunkTokenLimit`	Maximum estimated tokens per chunk (1 token ~ 4 characters)
`ChunkOverlap`	Overlap in tokens carried from the previous chunk
`MaxResults`	Maximum chunks returned by vector search
`DistanceThreshold`	Cosine distance ceiling — chunks beyond this are excluded

Logging

Serilog writes to both the console and rolling log files at logs/xafrag-YYYY-MM-DD.log. The ingestion pipeline logs every step (text extraction, chunking, embedding, saving) so you can diagnose issues without a debugger.

Sample Documents

The docs/ folder contains 10 ready-to-upload documents covering three domains:

Domain	Documents
.NET / Blazor / EF Core	`ef-core-getting-started.md`, `ef-core-querying.md`, `ef-core-relationships.md`, `blazor-fundamentals.md`, `blazor-components.md`, `dotnet-rag-quickstart.md`
DevExpress XAF	`xaf-security-passwords.md`, `xaf-crud-operations.md`
Cosmology / Sci-Fi	`dark-forest-theory.md`, `fermi-paradox.md`

Upload these through the Document view to populate the knowledge base and test cross-domain retrieval.

Future Extensions

Web search — integrate Tavily, Bing, or OpenAI Responses API for answers beyond the knowledge base
Hybrid search — combine BM25 full-text search with vector search and re-rank results
Query expansion — generate multiple query variants to improve recall
Chat history persistence — store conversation threads in the database
Background job queue — replace fire-and-forget Task.Run with Hangfire for reliable ingestion

Implementation Guide

See docs/how_to_implement.md for a step-by-step guide on adding RAG to your own XAF application.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
XafRag		XafRag
docs		docs
.gitignore		.gitignore
README.md		README.md
XafRag.slnx		XafRag.slnx
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

XafRag — RAG Sample for DevExpress XAF

Tech Stack

Prerequisites

Quick Start

Project Structure

How It Works

Ingestion pipeline

Retrieval pipeline

Configuration

Logging

Sample Documents

Future Extensions

Implementation Guide

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

XafRag — RAG Sample for DevExpress XAF

Tech Stack

Prerequisites

Quick Start

Project Structure

How It Works

Ingestion pipeline

Retrieval pipeline

Configuration

Logging

Sample Documents

Future Extensions

Implementation Guide

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages