Embed YAML source in PNG output (port of upstream PR #234) by SomethingNew71 · Pull Request #5 · ClassicMiniDIY/WireViz

SomethingNew71 · 2026-05-05T00:25:30Z

Summary

Ports upstream PR #234 — renders embed the source YAML in PNG output as a zlib-compressed iTXt chunk under the key `wireviz:yaml`. The CLI auto-detects `.png` inputs and pulls the YAML back out, so a single PNG file is enough to re-render or edit a harness — no sidecar `.yml` needed.

This is the load-bearing capability for the upcoming wireviz-gui: drag a PNG into the editor and recover the source. Without it, every PNG in the wild is an opaque artifact divorced from its model.

Workflow

```bash
wireviz harness.yml # produces harness.png with YAML inside
wireviz harness.png # round-trips: extract YAML, re-render
wireviz --no-embed-yaml harness.yml # plain PNG, no metadata
```

API

Surface	Change
`wireviz.parse()`	New `embed_yaml: bool = True`
`Harness.output()`	New `yaml_source: Optional[str] = None`
`Harness._render()`	New `yaml_source: Optional[str] = None`
`Harness` module	New `read_yaml_from_png(path)` and `_embed_yaml_in_png(bytes, yaml)` helpers
`wv_cli.py`	`--no-embed-yaml` flag; `.png` input auto-detected

Implementation notes

iTXt chunk, not zTXt — international text with zlib compression, so unicode YAML (Asian characters in part numbers etc.) round-trips cleanly.
Key prefix `wireviz:` namespaces the chunk against PNG software-defined keywords.
Dict inputs to `parse()` are `yaml.safe_dump`'d back for embedding — round-trip readable, but without original comments/formatting (those don't survive dict conversion regardless).
`build_examples.py` opts out (`embed_yaml=False`) so regression baseline PNGs stay deterministic.

Differences from upstream PR wireviz#234

The 2021-era PR was heavily bit-rotted — argparse, the old `parse_cmdline`/`parse_file` layer, conceal-input enum, etc. Only the load-bearing idea (zTXT/iTXt embed in PNG, `.png` input recovery) was preserved. Reworked against current master's click CLI, the in-memory render dict from PR wireviz#321 stdin/stdout, and integrated cleanly with `source_path` from earlier PRs in this chain.

The upstream's `ConcealEnum` (none/all/prepend/main) was reduced to a single `--no-embed-yaml` boolean — simpler API surface, same privacy escape hatch.

Verification

✅ Round-trip: `harness.yml` → `harness.png` → extract YAML → byte-identical to source
✅ `--no-embed-yaml` produces a PNG without the chunk (verified via PIL)
✅ `wireviz chunkless.png` raises a clean `click.UsageError`
✅ `build_examples.py` runs cleanly; `.gv` and `.bom.tsv` byte-identical to baseline

Test plan

CI workflow passes
Round-trip a non-trivial harness (with images, custom colors, BOM additions) and verify all rendering matches
Verify GUI prototype can extract via `Harness.read_yaml_from_png()`

🤖 Generated with Claude Code

SomethingNew71 · 2026-05-05T00:25:44Z

@jacobian91 — heads up, your upstream #234 (Embed YAML data into PNG) is being incorporated here.

Context: We use WireViz to document Classic Mini Cooper wiring harnesses at Classic Mini DIY and are building a GUI on top of it. Upstream wireviz/WireViz looks abandoned (last real master commit was 2021), so we're forking and pulling in the open PRs we want.

Your PR is the most consequential one for the GUI direction — it's what makes a PNG a self-contained editable artifact instead of an opaque pixel dump. Round-trip drag-and-drop editing in the GUI depends on this exact mechanism.

The 2021-era patch had bit-rotted significantly against current master (argparse → click, parse_cmdline/parse_file gone, in-memory render dict from wireviz#321, etc.) so I rebuilt the implementation from scratch against current master while preserving the load-bearing ideas: iTXt chunk for YAML embedding, .png as a first-class input format that auto-extracts.

Two simplifications I made:

Switched the chunk type from zTXt to iTXt so unicode YAML round-trips cleanly (some part-number metadata in our use case has non-ASCII).
Reduced the ConcealEnum (none/all/prepend/main) down to a single --no-embed-yaml boolean. Same privacy escape hatch, less API surface; if anyone ever needs the granular variant we can grow it back.

Commit attribution links back to your PR. Thanks for the work — really appreciate it.

gemini-code-assist

Code Review

This pull request implements a round-trip editing feature by embedding the source YAML into rendered PNG files using iTXt chunks. It introduces utility functions for metadata manipulation, updates the core parsing and output logic to support this embedding, and enhances the CLI to accept PNG files as input by extracting the embedded YAML. Review feedback highlights opportunities to make the new Pillow dependency optional, optimize the PNG re-encoding process to preserve existing metadata, and improve error handling when processing invalid image files.

gemini-code-assist · 2026-05-05T00:27:23Z

+from PIL import Image as PILImage
+from PIL.PngImagePlugin import PngInfo


Importing PIL at the top level makes it a mandatory dependency for all wireviz operations. If the intention is to keep Pillow as an optional dependency for users who do not need the YAML embedding feature, consider moving these imports inside the functions that use them (_embed_yaml_in_png and read_yaml_from_png) or wrapping them in a try...except block.

gemini-code-assist · 2026-05-05T00:27:23Z

+def _embed_yaml_in_png(png_bytes: bytes, yaml_source: str) -> bytes:
+    """Re-encode PNG bytes with the YAML source stored in an iTXt chunk."""
+    with PILImage.open(io.BytesIO(png_bytes)) as im:
+        im.load()
+        chunks = PngInfo()
+        chunks.add_itxt(PNG_YAML_CHUNK_KEY, yaml_source, zip=True)
+        out = io.BytesIO()
+        im.save(out, format="PNG", pnginfo=chunks)
+        return out.getvalue()


The current implementation of _embed_yaml_in_png decodes the PNG into raw pixels and then re-encodes it to add the metadata chunk. This is computationally expensive and can lead to the loss of original image metadata (such as DPI settings or color profiles) generated by Graphviz. If re-encoding is kept, consider passing **im.info to im.save() to preserve as much metadata as possible. Alternatively, manipulating the PNG byte stream directly to insert the iTXt chunk would be more efficient and non-destructive.

gemini-code-assist · 2026-05-05T00:27:23Z

+                # PNG input: try to recover the YAML embedded by an
+                # earlier WireViz render. Falls through with a clear
+                # error if no chunk is present.
+                embedded = read_yaml_from_png(file)


The call to read_yaml_from_png is not protected against invalid or corrupted image files. If the input file is not a valid image, PIL.Image.open will raise an UnidentifiedImageError. Wrapping this in a try...except block would allow for a cleaner error message to the user.

try: embedded = read_yaml_from_png(file) except Exception as e: raise click.UsageError(f"Could not read YAML from {file}: {e}")

@jacobian91

…eam PR wireviz#234) Renders now embed the source YAML in PNG output as a zlib-compressed iTXt chunk under the key ``wireviz:yaml``. The CLI auto-detects ``.png`` inputs and pulls the YAML back out, so a single PNG file is enough to re-render or edit a harness — no sidecar .yml needed. The headline workflow: wireviz harness.yml # produces harness.png with yaml inside wireviz harness.png # round-trips: extract YAML, re-render This is the load-bearing capability for the upcoming wireviz-gui: drag a PNG into the editor and recover the source. Without it, every PNG in the wild is an opaque artifact divorced from its model. API surface: * wireviz.parse() gains ``embed_yaml: bool = True``. The default embeds; pass False to render plain PNGs without source-bearing metadata. * Harness.output() / _render() gain ``yaml_source: Optional[str]``. When non-None and PNG is in the requested formats, the rendered PNG bytes are post-processed through PIL to attach the iTXt chunk. * New module-level helpers in Harness.py: - PNG_YAML_CHUNK_KEY = "wireviz:yaml" - _embed_yaml_in_png(png_bytes, yaml_source) -> bytes - read_yaml_from_png(png_path) -> Optional[str] * CLI ``--no-embed-yaml`` flag opts out of embedding when desired (e.g. before sharing a diagram externally without source). Implementation notes: * The chunk uses ``iTXt`` (international text, zip-compressed) rather than ``zTXt`` so unicode YAML round-trips cleanly. Key prefix ``wireviz:`` namespaces the chunk against PNG software-defined keywords. * When parse() is called with a Dict input, we yaml.safe_dump it back for embedding — round-trip-readable, but without the original comments or formatting (those don't survive the dict-conversion step regardless of embedding). * build_examples.py opts out (``embed_yaml=False``) so the regression baseline PNGs stay deterministic. Adapted from wireviz#234 (originally by @jacobian91, targeting upstream ``dev``). The 2021-era PR was heavily bit-rotted — argparse, the old parse_cmdline / parse_file layer, conceal-input enum — only the load-bearing idea (zTXT/iTXt embed in PNG, .png input recovery) was preserved. Reworked against current master's click CLI, the in-memory render dict from PR wireviz#321 stdin/stdout, and threaded source_path / template_dir from earlier PRs in this chain. Verified: * round-trip: harness.yml → harness.png → re-extract → identical YAML * --no-embed-yaml produces a PNG without the chunk (verified via PIL) * ``wireviz harness.png`` on a chunk-less PNG raises a clean click.UsageError * build_examples.py runs cleanly; .gv and .bom.tsv byte-identical to baseline. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

gemini-code-assist Bot reviewed May 5, 2026

View reviewed changes

SomethingNew71 force-pushed the port-234-yaml-in-png branch from d9c733f to dee9e67 Compare May 5, 2026 00:50

SomethingNew71 merged commit 885ae5e into port-321-stdin-stdout May 5, 2026

SomethingNew71 mentioned this pull request May 5, 2026

Bring 6 upstream PR ports onto upstream-fixes #9

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Embed YAML source in PNG output (port of upstream PR #234)#5

Embed YAML source in PNG output (port of upstream PR #234)#5
SomethingNew71 merged 1 commit into
port-321-stdin-stdoutfrom
port-234-yaml-in-png

SomethingNew71 commented May 5, 2026

Uh oh!

SomethingNew71 commented May 5, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 5, 2026

Uh oh!

gemini-code-assist Bot May 5, 2026

Uh oh!

gemini-code-assist Bot May 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		from PIL import Image as PILImage
		from PIL.PngImagePlugin import PngInfo

Conversation

SomethingNew71 commented May 5, 2026

Summary

Workflow

API

Implementation notes

Differences from upstream PR wireviz#234

Verification

Test plan

Uh oh!

SomethingNew71 commented May 5, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 5, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 5, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 5, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant