Releases: johannes-kaindl/image-to-markdown
0.1.3
0.1.2
Image to Markdown transcribes the embedded images of an Obsidian note to Markdown with a local, OpenAI-compatible vision LLM. Fully offline, non-destructive, streaming live into the sidebar.
Highlights
- Sidebar view — one card per image with live-streaming transcription (plus an expandable thinking block for reasoning models) and a copy button. Create one transcript note per image, or all at once.
- Commands — "Transcribe images in the active note" (batch) and "Open sidebar"; plus an editor context-menu entry for a single image.
- Bilingual UI (English / Deutsch) — every user-facing string follows Obsidian's language setting; English is canonical, German is provided automatically.
- Offline-first — configurable OpenAI-compatible vision endpoint (LM Studio / MLX / Ollama); connection status + "Test connection", vision-capability detection + active "Test vision".
- Non-destructive & idempotent — one transcript note per image; the image embed in the source note is replaced by an embed of the new note. The original text is never overwritten.
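The embed swap described in the last bullet can be sketched as a pure string transformation. This is a minimal illustration, not the plugin's actual code: `replaceImageEmbed`, `escapeRegExp`, and the note names are hypothetical, and only the wikilink embed form `![[...]]` is handled.

```typescript
// Hypothetical helper: escape characters that are special in a regular expression.
function escapeRegExp(text: string): string {
  return text.replace(/[.*+?^${}()|[\]\\]/g, "\\$&");
}

// Hypothetical helper: return a copy of `source` in which every embed of
// `imagePath` is replaced by an embed of `transcriptNote`.
// The input string itself is never mutated.
function replaceImageEmbed(
  source: string,
  imagePath: string,
  transcriptNote: string
): string {
  // Matches ![[imagePath]] and ![[imagePath|alias-or-size]].
  const pattern = new RegExp(
    `!\\[\\[${escapeRegExp(imagePath)}(\\|[^\\]]*)?\\]\\]`,
    "g"
  );
  // A replacer function avoids "$" in the note name being read as a pattern.
  return source.replace(pattern, () => `![[${transcriptNote}]]`);
}
```

Running the helper a second time finds no image embed left to replace, so the result stays the same, which is one way to read "idempotent" here.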
Changes in 0.1.2
- Live streaming now uses `activeWindow.fetch` (injected stream transport) instead of the global `fetch`, satisfying the Obsidian plugin linter without suppressions. Behavior unchanged.
- README: replaced the Community-plugins placeholder with real install instructions.
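To illustrate what an injected stream transport can look like, here is a minimal sketch under stated assumptions: the endpoint streams OpenAI-style server-sent events (`data: {...}` lines ending with `data: [DONE]`), and the fetch function is passed in by the caller rather than read from a global. `FetchLike`, `SseDeltaParser`, and `streamTranscription` are hypothetical names, not the plugin's API.

```typescript
// Minimal structural types so the sketch does not depend on a specific fetch implementation.
interface StreamReader {
  read(): Promise<{ done: boolean; value?: Uint8Array }>;
}
interface StreamResponse {
  ok: boolean;
  status: number;
  body: { getReader(): StreamReader } | null;
}
type FetchLike = (
  url: string,
  init: { method: string; headers: Record<string, string>; body: string }
) => Promise<StreamResponse>;

// Incremental parser for "data: ..." lines; a partial line is kept between chunks.
class SseDeltaParser {
  private buffer = "";

  // Feed one decoded text chunk; returns the content deltas completed by it.
  push(chunk: string): string[] {
    this.buffer += chunk;
    const lines = this.buffer.split("\n");
    this.buffer = lines.pop() ?? ""; // last element may be an incomplete line
    const deltas: string[] = [];
    for (const raw of lines) {
      const line = raw.trim();
      if (!line.startsWith("data:")) continue;
      const payload = line.slice(5).trim();
      if (payload === "[DONE]") continue;
      const parsed = JSON.parse(payload) as {
        choices?: { delta?: { content?: string } }[];
      };
      const text = parsed.choices?.[0]?.delta?.content;
      if (text) deltas.push(text);
    }
    return deltas;
  }
}

// The transport is injected, so no global fetch is referenced here.
async function streamTranscription(
  fetchImpl: FetchLike,
  url: string,
  body: unknown,
  onDelta: (text: string) => void
): Promise<void> {
  const response = await fetchImpl(url, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(body),
  });
  if (!response.ok || !response.body) {
    throw new Error(`Stream request failed: ${response.status}`);
  }
  const reader = response.body.getReader();
  const decoder = new TextDecoder();
  const parser = new SseDeltaParser();
  for (;;) {
    const { done, value } = await reader.read();
    if (done) break;
    const text = decoder.decode(value, { stream: true });
    for (const delta of parser.push(text)) onDelta(delta);
  }
}
```

Inside Obsidian, a caller might pass something like `(url, init) => activeWindow.fetch(url, init)` as `fetchImpl`; that wiring is an assumption for illustration. Injecting the transport also makes the streaming logic testable with a fake fetch.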
Install
- Manual: download `main.js`, `manifest.json` and `styles.css` from this release into `<vault>/.obsidian/plugins/image-to-markdown/`, then enable under Settings → Community plugins.
- BRAT: add `johannes-kaindl/image-to-markdown`.
Requires Obsidian 1.8.7+. Code: AGPL-3.0-or-later.
0.1.1
Image to Markdown transcribes the embedded images of an Obsidian note to Markdown with a local, OpenAI-compatible vision LLM. Fully offline, non-destructive, streaming live into the sidebar.
Highlights
- Sidebar view — one card per image with live-streaming transcription (plus an expandable thinking block for reasoning models) and a copy button. Create one transcript note per image, or all at once.
- Commands — "Transcribe images in the active note" (batch) and "Open sidebar"; plus an editor context-menu entry for a single image.
- Bilingual UI (English / Deutsch) — every user-facing string follows Obsidian's language setting; English is canonical, German is provided automatically.
- Offline-first — configurable OpenAI-compatible vision endpoint (LM Studio / MLX / Ollama); connection status + "Test connection", vision-capability detection + active "Test vision".
- Non-destructive & idempotent — one transcript note per image; the image embed in the source note is replaced by an embed of the new note. The original text is never overwritten.
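For readers unfamiliar with OpenAI-compatible vision endpoints, the request body for a single image typically looks like the sketch below: a chat-completions message whose content combines a text part with an `image_url` part carrying a base64 data URL. `buildVisionRequest`, the model name, and the prompt are hypothetical; the plugin's real request may differ.

```typescript
interface TextPart {
  type: "text";
  text: string;
}
interface ImagePart {
  type: "image_url";
  image_url: { url: string };
}
interface VisionRequest {
  model: string;
  stream: boolean;
  messages: { role: "user"; content: [TextPart, ImagePart] }[];
}

// Hypothetical helper: build a streaming chat-completions body for one image.
function buildVisionRequest(
  model: string,
  prompt: string,
  imageBase64: string,
  mimeType: string
): VisionRequest {
  return {
    model,
    stream: true, // ask the server to stream tokens as they are generated
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: prompt },
          {
            type: "image_url",
            // The image travels inline as a data URL, so no upload step is needed.
            image_url: { url: `data:${mimeType};base64,${imageBase64}` },
          },
        ],
      },
    ],
  };
}
```

With a local server, such a body would usually be sent as JSON to the server's `/v1/chat/completions` route; the exact base URL depends on the server and is configured by the user.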
Changes in 0.1.1
- Network calls go through Obsidian's `requestUrl` (live streaming keeps `fetch`, which `requestUrl` cannot do).
- `minAppVersion` raised to 1.8.7 (official `getLanguage()` API).
- Clean against the Obsidian plugin linter (`eslint-plugin-obsidianmd`): no unsupported-API usage, `activeDocument`, no floating promises.
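How a language code, such as the value returned by `getLanguage()`, might drive string lookup with English as the canonical table is sketched below. The keys, the German string, and `makeTranslator` are hypothetical, and falling back to English for a missing German entry is an assumption rather than documented plugin behavior.

```typescript
// Canonical English strings (hypothetical keys and texts).
const en = {
  openSidebar: "Open sidebar",
  testConnection: "Test connection",
};

type StringKey = keyof typeof en;

// German overrides; deliberately incomplete to show the fallback.
const de: Partial<Record<StringKey, string>> = {
  openSidebar: "Seitenleiste öffnen",
};

// Returns a lookup function for the given language code.
// Any code not starting with "de" resolves to the English table.
function makeTranslator(language: string): (key: StringKey) => string {
  const table: Partial<Record<StringKey, string>> =
    language.toLowerCase().startsWith("de") ? de : {};
  return (key) => table[key] ?? en[key];
}
```

In a plugin, the translator would be created once at load, for example `const t = makeTranslator(getLanguage())`, and then used for every user-facing string.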
Install
- Manual: download `main.js`, `manifest.json` and `styles.css` from this release into `<vault>/.obsidian/plugins/image-to-markdown/`, then enable under Settings → Community plugins.
- BRAT: add `johannes-kaindl/image-to-markdown`.
Requires Obsidian 1.8.7+. Code: AGPL-3.0-or-later.
0.1.0
First release of Image to Markdown — transcribe the embedded images of an Obsidian note to Markdown with a local, OpenAI-compatible vision LLM. Fully offline, non-destructive, streaming live into the sidebar.
Highlights
- Sidebar view — one card per image with live-streaming transcription (plus an expandable thinking block for reasoning models) and a copy button. Create one transcript note per image, or all at once.
- Commands — "Transcribe images in the active note" (batch) and "Open sidebar"; plus an editor context-menu entry for a single image.
- Bilingual UI (English / Deutsch) — every user-facing string follows Obsidian's language setting; English is canonical, German is provided automatically. Detected once at load (reload to switch).
- Offline-first — configurable OpenAI-compatible vision endpoint (LM Studio / MLX / Ollama); connection status + "Test connection", vision-capability detection + active "Test vision".
- Non-destructive & idempotent — one transcript note per image; the image embed in the source note is replaced by an embed of the new note. The original text is never overwritten.
Install
- Manual: download `main.js`, `manifest.json` and `styles.css` from this release into `<vault>/.obsidian/plugins/image-to-markdown/`, then enable under Settings → Community plugins.
- BRAT: add `johannes-kaindl/image-to-markdown`.
Requires Obsidian 1.4.0+. Code: AGPL-3.0-or-later. See the manual in the repository for details.