0.1.0
First release of Image to Markdown — transcribe the embedded images of an Obsidian note to Markdown with a local, OpenAI-compatible vision LLM. Fully offline, non-destructive, streaming live into the sidebar.
Highlights
- Sidebar view — one card per image with live-streaming transcription (plus an expandable thinking block for reasoning models) and a copy button. Create one transcript note per image, or all at once.
- Commands — "Transcribe images in the active note" (batch) and "Open sidebar"; plus an editor context-menu entry for a single image.
- Bilingual UI (English / Deutsch) — every user-facing string follows Obsidian's language setting; English is canonical, German is provided automatically. Detected once at load (reload to switch).
- Offline-first — configurable OpenAI-compatible vision endpoint (LM Studio / MLX / Ollama); connection status + "Test connection", vision-capability detection + active "Test vision".
- Non-destructive & idempotent — one transcript note per image; the image embed in the source note is replaced by an embed of the new note. The original text is never overwritten.
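As a rough illustration of what "OpenAI-compatible vision endpoint" implies, the sketch below builds a streaming chat-completions request body that carries one image as a base64 data URL. It is a minimal sketch, not the plugin's actual code: the function name `buildVisionRequest`, its parameters, and the `VisionRequest` type are hypothetical and chosen only for this example. Only the message shape (`type: "text"` / `type: "image_url"`) follows the OpenAI chat-completions convention.

```typescript
// Hypothetical types and helper for illustration only; not taken from the plugin source.
type ContentPart =
  | { type: "text"; text: string }
  | { type: "image_url"; image_url: { url: string } };

interface VisionRequest {
  model: string;
  stream: boolean;
  messages: Array<{ role: "user"; content: ContentPart[] }>;
}

// Build a request body in the OpenAI chat-completions format with one embedded image.
function buildVisionRequest(
  model: string,
  imageBase64: string,
  mimeType: string,
  prompt: string
): VisionRequest {
  return {
    model,
    stream: true, // ask the server to stream tokens so the UI can update live
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: prompt },
          // The image travels inline as a data URL, so nothing leaves the machine
          // except the request to the locally configured server.
          { type: "image_url", image_url: { url: `data:${mimeType};base64,${imageBase64}` } },
        ],
      },
    ],
  };
}
```

A body like this would typically be sent as JSON in a POST to the chat-completions route of the configured server. The exact base URL and model name depend on the local setup (LM Studio, MLX, or Ollama).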
Install
- Manual: download main.js, manifest.json and styles.css from this release into <vault>/.obsidian/plugins/image-to-markdown/, then enable under Settings → Community plugins.
- BRAT: add johannes-kaindl/image-to-markdown.
Requires Obsidian 1.4.0+. Code: AGPL-3.0-or-later. See the manual in the repository for details.