konjoai/miru

🐍 Miru

👁️ Multimodal reasoning tracer: see what models see, and why they decide.


👁️ Meaning

Miru (見る): to see, to observe.

Not just outputs, but perception and reasoning.


🚀 What it is

Miru is a multimodal explainability engine:

  • Input: image or document + question

  • Output:

    • answer
    • reasoning trace
    • attention visualization
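
The three outputs listed above could be carried in a single result object. The sketch below is only an illustration of that shape: `TraceStep`, `MiruResult`, `top_region`, and all sample values are hypothetical and are not taken from the Miru codebase.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class TraceStep:
    """One step of a reasoning trace (hypothetical shape)."""
    index: int
    description: str


@dataclass
class MiruResult:
    """Hypothetical container for answer, reasoning trace, and attention."""
    answer: str
    reasoning_trace: List[TraceStep] = field(default_factory=list)
    # Row-major grid of attention weights over image regions.
    attention_map: List[List[float]] = field(default_factory=list)

    def top_region(self) -> Optional[Tuple[int, int]]:
        """Return (row, col) of the most attended cell, or None if empty."""
        best = None
        best_weight = float("-inf")
        for r, row in enumerate(self.attention_map):
            for c, weight in enumerate(row):
                if weight > best_weight:
                    best, best_weight = (r, c), weight
        return best


# Made-up sample values, for illustration only.
result = MiruResult(
    answer="a stop sign",
    reasoning_trace=[
        TraceStep(0, "Locate red octagonal region"),
        TraceStep(1, "Read text inside region"),
    ],
    attention_map=[[0.05, 0.10], [0.70, 0.15]],
)
print(result.answer, result.top_region())
```

Keeping the trace and the attention grid next to the answer means a reviewer can audit all three from one object.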

❗ The problem

Multimodal models are black boxes:

  • No visibility into reasoning
  • No auditability
  • No explainability

This is a critical issue for:

  • compliance
  • medical
  • enterprise AI

🧠 What you learn

  • Vision-language models (VLMs)
  • Cross-attention mechanisms
  • Saliency & interpretability
  • Multimodal reasoning
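
As a rough illustration of the cross-attention item above, the sketch below computes scaled dot-product attention weights from text-token queries to image-patch keys in plain Python. It is a generic textbook formulation, not Miru's implementation; the function names and the toy vectors are assumptions made for the example.

```python
import math
from typing import List

Vector = List[float]


def softmax(scores: Vector) -> Vector:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention_weights(queries: List[Vector], keys: List[Vector]) -> List[Vector]:
    """Scaled dot-product attention weights.

    queries: one vector per text token.
    keys: one vector per image patch.
    Returns one row per text token; each row sums to 1 over the patches.
    """
    scale = math.sqrt(len(keys[0]))
    weights = []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / scale for k in keys]
        weights.append(softmax(scores))
    return weights


# Toy example: 2 text tokens attending over 3 image patches (made-up values).
text_queries = [[1.0, 0.0], [0.0, 1.0]]
patch_keys = [[2.0, 0.0], [0.0, 2.0], [1.0, 1.0]]
attn = cross_attention_weights(text_queries, patch_keys)

for token_index, row in enumerate(attn):
    most_attended = row.index(max(row))
    print(f"token {token_index} attends most to patch {most_attended}")
```

Each row of `attn` is the kind of per-token distribution over image patches that a saliency view would render.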

⚙️ Stack

  • 🐍 Python (FastAPI backend)
  • 🎨 Visualization layer (attention maps, overlays)
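
One way an attention overlay might be produced is to normalize a coarse patch-level attention grid, stretch it to image resolution, and alpha-blend it with the image. The sketch below does this for a grayscale image using only the standard library. The helper names (`normalize`, `upsample_nearest`, `overlay`), the nearest-neighbour upsampling, and the sample data are assumptions for illustration; the README does not describe how Miru renders its overlays.

```python
from typing import List

Grid = List[List[float]]


def normalize(attention: Grid) -> Grid:
    """Rescale attention values to [0, 1] using min-max normalization."""
    flat = [v for row in attention for v in row]
    low, high = min(flat), max(flat)
    span = high - low
    if span == 0:
        # A uniform map carries no contrast; return all zeros.
        return [[0.0 for _ in row] for row in attention]
    return [[(v - low) / span for v in row] for row in attention]


def upsample_nearest(attention: Grid, height: int, width: int) -> Grid:
    """Stretch a coarse patch grid to image resolution (nearest neighbour)."""
    rows, cols = len(attention), len(attention[0])
    return [
        [attention[y * rows // height][x * cols // width] for x in range(width)]
        for y in range(height)
    ]


def overlay(image: Grid, attention: Grid, alpha: float = 0.5) -> Grid:
    """Blend a grayscale image (values in [0, 1]) with an attention heatmap."""
    height, width = len(image), len(image[0])
    heat = upsample_nearest(normalize(attention), height, width)
    return [
        [(1 - alpha) * image[y][x] + alpha * heat[y][x] for x in range(width)]
        for y in range(height)
    ]


# Made-up 4x4 grayscale image and 2x2 patch attention grid.
image = [[0.2] * 4 for _ in range(4)]
patch_attention = [[0.1, 0.9], [0.1, 0.1]]
blended = overlay(image, patch_attention, alpha=0.5)
```

Here the top-right quadrant of `blended` is brighter than the rest, marking the patch that received the most attention.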

🚀 Quick Start

uvicorn miru.main:app --reload

🎯 Vision

Make AI reasoning visible.
