Template repository -- Use this as a starting point for building your own RAG chatbot with the Embabel Agent Framework. Click "Use this template" on GitHub to create your own copy.
A RAG-powered document chatbot with a Vaadin web interface, built on the Embabel Agent Framework.
Upload documents, ask questions, and get intelligent answers grounded in your content -- powered by agentic Retrieval-Augmented Generation with Neo4j graph-backed vector search.
```mermaid
graph TB
    subgraph UI["Vaadin Web UI"]
        CV[Chat View]
        UD[User Drawer<br/>Personal Documents]
        GD[Global Drawer<br/>Global Documents]
    end

    subgraph App["Spring Boot Application"]
        subgraph Agent["Embabel Agent Platform"]
            CA[ChatActions<br/>Agentic RAG]
        end
        subgraph Docs["Document Service"]
            TP[Tika Parser]
            CH[Chunking + Metadata]
        end
    end

    subgraph Store["Neo4j + Drivine"]
        EMB[Vector Embeddings + Graph Search]
    end

    LLM[(LLM Provider<br/>OpenAI / Anthropic)]

    CV --> CA
    UD --> TP
    GD --> TP
    TP --> CH
    CH --> EMB
    CA --> EMB
    CA --> LLM
```
Unlike traditional RAG pipelines where retrieval is a fixed preprocessing step, Urbot uses the Embabel Agent Framework's Utility AI pattern to make retrieval agentic. The LLM autonomously decides when and how to search your documents.
```mermaid
sequenceDiagram
    participant U as User
    participant C as ChatActions
    participant L as LLM
    participant T as ToolishRag
    participant S as Neo4j Store

    U->>C: Ask a question
    C->>L: Send message + tools + system prompt
    Note over L: LLM reasons about approach
    L->>T: Call vectorSearch("relevant query")
    T->>S: Embed query + similarity search<br/>(filtered by user context)
    S-->>T: Matching chunks with metadata
    T-->>L: Retrieved context
    Note over L: May search again to refine
    L->>T: Call vectorSearch("follow-up query")
    T->>S: Embed + search
    S-->>T: More chunks
    T-->>L: Additional context
    L-->>C: Synthesized answer grounded in documents
    C-->>U: Display response with markdown
```
Key aspects of the agentic approach:
- Autonomous tool use -- The LLM decides whether to search and what to search for
- Iterative retrieval -- Multiple searches can refine results before answering
- Context-aware filtering -- Results are scoped to the user's current workspace context
- Template-driven prompts -- Jinja2 templates separate persona, objective, and guardrails
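The iterative tool loop described above can be sketched in plain Java. The interfaces below (`Llm`, `SearchTool`, `Step`) are hypothetical stand-ins for illustration only, not the Embabel API; the real orchestration is handled by the framework's Utility AI runtime.

```java
/** Hedged sketch of the agentic retrieval loop: the model may call the
 *  search tool zero or more times before producing a final answer.
 *  All types here are illustrative, not Embabel's actual interfaces. */
interface SearchTool { String vectorSearch(String query); }
interface Llm { Step next(String transcript); }
record Step(boolean wantsSearch, String payload) {}

class AgentLoopSketch {
    static String answer(Llm llm, SearchTool tool, String question) {
        StringBuilder transcript = new StringBuilder(question);
        while (true) {
            Step step = llm.next(transcript.toString());
            if (!step.wantsSearch()) {
                return step.payload();                 // final synthesized answer
            }
            // Append retrieved context so the next LLM turn can refine or answer
            transcript.append("\n[context] ")
                      .append(tool.vectorSearch(step.payload()));
        }
    }
}
```

The loop terminates when the model stops requesting searches, which is what makes retrieval agentic rather than a fixed preprocessing step.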
Urbot supports two document scopes:
| Scope | Access | Ingestion | Description |
|---|---|---|---|
| Personal | Per-user context | User Drawer (click profile) | Documents scoped to a user's named context (e.g. 2_personal). Users can create and switch between multiple contexts. |
| Global | Shared across all users | Global Drawer (... toggle) | Documents available to everyone, stored under the global context. |
RAG search filters results to the user's current effective context, so personal and global documents are searched independently based on which context is active.
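A minimal illustration of this scoping, using hypothetical types (in Urbot the equivalent filter is applied inside the Drivine-backed vector query, not in application code):

```java
import java.util.List;

/** Illustrative sketch: retrieval candidates are narrowed to the user's
 *  effective context before ranking. Types and context names are
 *  hypothetical, not Urbot's real data model. */
class ContextFilterSketch {
    record Chunk(String text, String context) {}

    static List<Chunk> scopedTo(List<Chunk> candidates, String activeContext) {
        // Only chunks tagged with the active context are eligible for search
        return candidates.stream()
                .filter(c -> c.context().equals(activeContext))
                .toList();
    }
}
```

Because the filter keys on a single active context, personal and global documents never mix in one search: switching contexts switches the searchable corpus.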
| Layer | Technology | Role |
|---|---|---|
| UI | Vaadin 24 | Server-side Java web framework with real-time push updates |
| Backend | Spring Boot 3 | Application framework, dependency injection, security |
| Agent Framework | Embabel Agent | Agentic AI orchestration with Utility AI pattern |
| Graph + Vector Store | Neo4j via Drivine | Graph-backed vector embeddings, semantic search, and document relationships |
| Document Parsing | Apache Tika | Extract text from PDF, DOCX, HTML, and 1000+ formats |
| LLM | OpenAI / Anthropic | Chat completion and text embedding models |
| Auth | Spring Security | Form-based authentication with role-based access |
Urbot is built on the Embabel Agent Framework, which provides:
- `AgentProcessChatbot` -- Wires actions into a conversational agent using the Utility AI pattern, where the LLM autonomously selects which `@Action` methods to invoke
- `ToolishRag` -- Exposes vector search as an LLM-callable tool, enabling agentic retrieval
- `DrivineStore` -- Neo4j-backed RAG store with vector indexes and graph relationships (Lucene and pgvector backends are also available)
- Jinja2 prompt templates -- Composable system prompts with persona/objective/guardrails separation
The frontend is built entirely in server-side Java using Vaadin Flow:
- ChatView -- Main chat interface with message bubbles, markdown rendering, and real-time tool call progress indicators
- UserDrawer -- Click the profile chip to manage personal documents, switch contexts, and log out
- DocumentsDrawer -- Right-side toggle panel for uploading and managing global documents
- Dark theme -- Custom Lumo theme with responsive design
- Push updates -- Async responses stream to the browser via long polling
Documents are chunked, embedded, and stored in Neo4j via Drivine:
- Chunking -- 800-character chunks with 100-character overlap for context continuity
- Embeddings -- Generated via OpenAI `text-embedding-3-small` (configurable)
- Metadata filtering -- Chunks tagged with user/context metadata for scoped search
- Graph relationships -- Document → section → chunk hierarchy preserved as graph edges
- Persistent storage -- Neo4j container via Docker Compose, survives restarts
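The windowing arithmetic behind the 800/100 defaults can be sketched as follows. This is illustrative only; Urbot's actual chunker ships with the framework and is configured via `chunker-config` rather than implemented in application code.

```java
import java.util.ArrayList;
import java.util.List;

/** Sketch of fixed-size chunking with overlap, matching the 800/100
 *  defaults above. Each window starts (maxChunkSize - overlap) characters
 *  after the previous one, so adjacent chunks share `overlap` characters. */
class ChunkerSketch {
    static List<String> chunk(String text, int maxChunkSize, int overlap) {
        List<String> chunks = new ArrayList<>();
        int step = maxChunkSize - overlap;          // advance per window
        for (int start = 0; start < text.length(); start += step) {
            int end = Math.min(start + maxChunkSize, text.length());
            chunks.add(text.substring(start, end));
            if (end == text.length()) break;        // last window reached the end
        }
        return chunks;
    }
}
```

With the defaults, a 2,000-character document yields three chunks: characters 0-800, 700-1500, and 1400-2000, with 100 characters of shared context at each boundary.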
- Document upload -- PDF, DOCX, XLSX, TXT, MD, HTML, ODT, RTF (up to 10MB)
- URL ingestion -- Fetch and index web pages directly
- Personal & global documents -- Personal documents scoped per user context; global documents shared across all users
- Multi-context workspaces -- Create and switch between named contexts to organize personal documents
- Markdown chat -- Responses render with full markdown and code highlighting
- Tool call visibility -- See real-time progress as the agent searches your documents
- Session persistence -- Conversation history preserved across page reloads
- Configurable persona -- Switch voice and objective via configuration
```
src/main/java/com/embabel/urbot/
├── UrbotApplication.java            # Spring Boot entry point + Drivine bootstrap
├── ChatActions.java                 # @Action methods for agentic RAG chat
├── ChatConfiguration.java           # Utility AI chatbot wiring
├── RagConfiguration.java            # Neo4j/Drivine vector store setup
├── UrbotProperties.java             # Externalized configuration
├── rag/
│   └── DocumentService.java         # Document ingestion, context management
├── security/
│   ├── SecurityConfiguration.java   # Spring Security setup
│   └── LoginView.java               # Login page
├── user/
│   ├── UrbotUser.java               # User model with context
│   └── UrbotUserService.java        # User service interface
└── vaadin/
    ├── ChatView.java                # Main chat interface
    ├── ChatMessageBubble.java       # User/assistant message rendering
    ├── DocumentsDrawer.java         # Global document management panel
    ├── UserDrawer.java              # Personal document management + context selector
    ├── DocumentListSection.java     # Document list component
    ├── FileUploadSection.java       # File upload component (reusable)
    ├── UrlIngestSection.java        # URL ingestion component (reusable)
    ├── UserSection.java             # Clickable user profile chip
    └── Footer.java                  # Document/chunk statistics

src/main/resources/
├── application.yml                  # Server, LLM, Neo4j, and chunking config
└── prompts/
    ├── urbot.jinja                  # Main prompt template
    ├── elements/
    │   ├── guardrails.jinja         # Safety guidelines
    │   └── personalization.jinja    # Dynamic persona/objective loader
    ├── personas/
    │   └── assistant.jinja          # Default assistant persona
    └── objectives/
        └── general.jinja            # General knowledge base objective

docker-compose.yml                   # Neo4j container with vector index support
```
- Java 21+
- Maven 3.9+
- Docker (for Neo4j)
- An OpenAI or Anthropic API key
```shell
# Start Neo4j
docker compose up -d

# Set your API key
export OPENAI_API_KEY=sk-...   # or ANTHROPIC_API_KEY for Claude

# Start the application
mvn spring-boot:run
```

Open http://localhost:9000 and log in:
| Username | Password | Roles |
|---|---|---|
| `admin` | `admin` | ADMIN, USER |
| `user` | `user` | USER |
- Click your profile chip (top right) to open the personal documents drawer -- upload files or paste URLs scoped to your current context
- Click the `...` toggle on the right edge to open the global documents drawer -- uploads here are shared across all users
- Ask questions -- the agent will search your documents and synthesize answers
All settings live in `src/main/resources/application.yml`:
```yaml
urbot:
  chunker-config:
    max-chunk-size: 800          # Characters per chunk
    overlap-size: 100            # Overlap between chunks
    embedding-batch-size: 800
  chat-llm:
    model: gpt-4.1-mini          # LLM for chat responses
    temperature: 0.0             # Deterministic responses
  voice:
    persona: assistant           # Prompt persona template
    max-words: 250               # Target response length
    objective: general           # Prompt objective template

embabel:
  models:
    default-llm:
      model: gpt-4.1-mini
    default-embedding-model:
      model: text-embedding-3-small

# Neo4j connection (matches docker-compose.yml)
database:
  datasources:
    neo:
      type: NEO4J
      host: localhost
      port: 7891
      user-name: neo4j
      password: urbot123
```

The LLM provider is selected automatically based on which API key is set:

- `OPENAI_API_KEY` activates OpenAI models
- `ANTHROPIC_API_KEY` activates Anthropic Claude models
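The selection amounts to a simple precedence check on which key is present, sketched here with a hypothetical helper (the framework performs this wiring internally; this is not its API):

```java
/** Illustrative sketch of key-based provider selection: whichever API key
 *  is set determines the active provider. Hypothetical helper, not part
 *  of Embabel or Urbot. */
final class ProviderSelector {
    enum Provider { OPENAI, ANTHROPIC, NONE }

    static Provider select(String openAiKey, String anthropicKey) {
        if (openAiKey != null && !openAiKey.isBlank()) return Provider.OPENAI;
        if (anthropicKey != null && !anthropicKey.isBlank()) return Provider.ANTHROPIC;
        return Provider.NONE;   // no key set: no LLM available
    }
}
```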
Urbot is one of several example applications built on the Embabel Agent Framework:
| Project | Description |
|---|---|
| Ragbot | CLI + web RAG chatbot demonstrating the core agentic RAG pattern with multiple personas and pluggable vector stores |
| Impromptu | Classical music discovery chatbot with Spotify/YouTube integration, Matryoshka tools, and DICE semantic memory |
Apache 2.0 -- Copyright 2024-2026 Embabel Software, Inc.
