🧿 Vision Agent (Local Image Analysis)

A lightweight local AI vision agent built with Streamlit + LangGraph + Ollama that analyzes images and answers user questions using a multi-step pipeline.

Features

Upload any image (JPG, PNG, WebP, GIF)
Multi-step reasoning pipeline:
- Vision → image description (LLaVA)
- Research → reasoning over description
- Writer → clean final answer
Fully local (no API costs)
Clean modern UI with Streamlit

Architecture

User Input
   │
   ▼
[ Vision Model (llava-phi3) ]
   │
   ▼
[ Research Agent (llama3.2) ]
   │
   ▼
[ Writer Agent (llama3.2) ]
   │
   ▼
Final Answer

Built using LangGraph state machine.

Requirements

Python 3.9+
Ollama installed → https://ollama.com
8GB RAM recommended

Setup

# Install dependencies
pip install -r requirements.txt

# Start Ollama
ollama serve

# Pull models
ollama pull llava-phi3
ollama pull llama3.2:1b

Run App

streamlit run frontend.py

Project Structure

.
├── frontend.py   # Streamlit frontend
├── backend.py    # LangGraph pipeline
└── README.md

How It Works

Image is converted to base64
Sent to LLaVA (vision model) via Ollama
Output is analyzed by LLM (llama3.2)
Final answer is generated and displayed

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
public		public
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
backend.py		backend.py
frontend.py		frontend.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧿 Vision Agent (Local Image Analysis)

Features

Architecture

Requirements

Setup

Run App

Project Structure

How It Works

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🧿 Vision Agent (Local Image Analysis)

Features

Architecture

Requirements

Setup

Run App

Project Structure

How It Works

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages