feat: SSA dataflow analysis with Graphviz DOT export by sii-xinglong · Pull Request #2 · primatrix/strix

sii-xinglong · 2026-05-01T03:48:51Z

Summary

Add --dataflow-output CLI flag to generate Graphviz DOT files showing SSA def-use dependency graphs
Extract data flow from simulated OpEvent tree with time-partitioned layout, hardware swim lanes (VPU/DMA), loop clusters, and cross-stream edge highlighting
Persist SSA inputs/outputs on OpEvent attributes during simulation for downstream analysis

Design

See docs/plans/2026-05-01-dataflow-analysis-design.md for the full design document.

RFC: primatrix/wiki#143

Changes

dataflow.py — DFNode/DFEdge/DataFlowGraph data model + extract_dataflow() extraction
dataflow_exporter.py — DataFlowDotExporter renders DOT with time buckets, hardware clusters, loop nesting
simulator.py — persist SSA boundary info on scheduled events
cli.py — --dataflow-output flag

Usage

python -m strix.cli path/to/post-finalize-llo.txt \
  --default-sld-value 128 \
  --dataflow-output dataflow.dot

dot -Tsvg dataflow.dot -o dataflow.svg

Test plan

24 tests (unit + integration), all passing
Smoke test on real fused-moe kernel: 8,419 nodes, 779 cross-stream edges
Spec compliance + code quality reviews passed

🤖 Generated with Claude Code

Design for analyzing LLO VLIW Bundles ↔ Pallas source code correspondence. Adds new bundle_parser, bundle_domain, and bundle_exporter modules with analyze-bundles CLI subcommand. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…tion Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Preserve the SSA boundary (inputs/outputs) computed by group_boundary_ssa() as attributes on every scheduled OpEvent, enabling downstream dataflow analysis to build dependency graphs. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Introduces DFNode, DFEdge, and DataFlowGraph dataclasses along with extract_dataflow() which walks the simulated OpEvent tree, creates nodes for LEAF/BLOCK/STALL events, tracks LOOP containers, and builds deduplicated SSA dependency edges including cross-stream (DMA<->VPU). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…raph Renders DataFlowGraph as DOT format with hardware clusters (VPU/DMA), loop subgraph clusters, time-bucket rank constraints, cross-stream edge highlighting (red dashed), and node styling by stream/kind. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Wire the dataflow extraction and DOT exporter into the CLI so users can generate Graphviz dataflow graphs with a single flag. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Exercises the full pipeline (parse LLO -> simulate -> extract dataflow -> export DOT) using real post-finalize-llo files from ir_dumps/mosaic/. Tests skip gracefully when simulation requires arg overrides. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

gemini-code-assist

Code Review

This pull request implements SSA dataflow analysis and Graphviz DOT visualization. It adds a data model for dataflow graphs, extraction logic from simulated events, and a DOT exporter featuring hardware/loop clustering and time-bucketed alignment. The simulator now persists SSA metadata, and a --dataflow-output CLI flag is introduced. Review feedback highlights a bug in edge construction when variable names are reused, the omission of control-flow nodes in the graph extraction, and an escaping issue in the DOT exporter that breaks multi-line labels.

gemini-code-assist · 2026-05-01T03:51:15Z

+    # Map variable name -> producer node id (last writer wins)
+    producer: Dict[str, int] = {}
+    for node in nodes:
+        for var in node.ssa_outputs:
+            producer[var] = node.id
+
+    seen: Set[Tuple[int, int, str]] = set()
+    edges: List[DFEdge] = []
+
+    for node in nodes:
+        for var in node.ssa_inputs:
+            src_id = producer.get(var)
+            if src_id is None:
+                continue  # external input (e.g. %arg0)
+            key = (src_id, node.id, var)
+            if key in seen:
+                continue
+            seen.add(key)
+            edges.append(DFEdge(src=src_id, dst=node.id, variable=var))


The current implementation of _build_edges builds a global map of producers (producer) before iterating to create edges. This logic is incorrect for programs where variable names are reused, such as in expanded loops where the same SSA name appears in multiple iterations. In such cases, every consumer of a variable will incorrectly point to the last definition in the entire program rather than the most recent one in execution order.

To fix this, you should build the producer map while iterating through the nodes, ensuring that each use links to the definition that was most recently encountered.

Suggested change

# Map variable name -> producer node id (last writer wins)

producer: Dict[str, int] = {}

for node in nodes:

for var in node.ssa_outputs:

producer[var] = node.id

seen: Set[Tuple[int, int, str]] = set()

edges: List[DFEdge] = []

for node in nodes:

for var in node.ssa_inputs:

src_id = producer.get(var)

if src_id is None:

continue # external input (e.g. %arg0)

key = (src_id, node.id, var)

if key in seen:

continue

seen.add(key)

edges.append(DFEdge(src=src_id, dst=node.id, variable=var))

producer: Dict[str, int] = {}

seen: Set[Tuple[int, int, str]] = set()

edges: List[DFEdge] = []

for node in nodes:

# 1. Find producers for inputs using the current state of the map

for var in node.ssa_inputs:

src_id = producer.get(var)

if src_id is not None:

key = (src_id, node.id, var)

if key not in seen:

seen.add(key)

edges.append(DFEdge(src=src_id, dst=node.id, variable=var))

# 2. Update the map with this node's outputs for subsequent consumers

for var in node.ssa_outputs:

producer[var] = node.id

return edges

Good catch — accepted. The incremental producer map correctly handles expanded loops where SSA names are reused across iterations. Fixed in the next commit.

gemini-code-assist · 2026-05-01T03:51:15Z

+    def _walk(
+        ev: OpEvent,
+        loop_depth: int,
+        parent_loop_id: Optional[int],
+    ) -> None:
+        nonlocal next_id, next_loop_id
+
+        if ev.kind in _NODE_KINDS:
+            node_id = next_id
+            next_id += 1
+            node = DFNode(
+                id=node_id,
+                name=ev.name,
+                stream=ev.stream,
+                kind=ev.kind,
+                start_ns=ev.start_time_ns,
+                end_ns=ev.end_time_ns,
+                flops=ev.flops,
+                bytes=ev.bytes,
+                ssa_outputs=list(ev.attributes.get("ssa_outputs", [])),
+                ssa_inputs=list(ev.attributes.get("ssa_inputs", [])),
+                attributes=dict(ev.attributes),
+                loop_depth=loop_depth,
+                parent_loop_id=parent_loop_id,
+            )
+            nodes.append(node)
+            if parent_loop_id is not None:
+                loops.setdefault(parent_loop_id, []).append(node_id)
+            return
+
+        if ev.kind == OpKind.LOOP:
+            loop_id = next_loop_id
+            next_loop_id += 1
+            loops.setdefault(loop_id, [])
+            for child in ev.children:
+                _walk(child, loop_depth + 1, loop_id)
+            return
+
+        # ROOT, IF -- just recurse
+        for child in ev.children:
+            _walk(child, loop_depth, parent_loop_id)


The _walk function currently treats LOOP and IF events as containers but not as nodes in the dataflow graph. However, the design document specifies that control-flow operations should be represented as nodes (e.g., as diamonds) to capture dependencies like loop bounds or branch conditions.

Additionally, the current logic returns early when an event kind is in _NODE_KINDS, which would prevent recursion into loop or branch bodies if those kinds were added to the set. The function should be restructured to both create a node for the event and recurse into its children when appropriate.

Intentional design decision: LOOP/IF events are pure containers in the OpEvent tree — they don't define or use SSA variables themselves. Adding them as DFNodes would create orphan diamond nodes with no data-dependency edges, adding visual noise without information. The diamond styling for `OpStream.CONTROL` in the exporter targets leaf/block control operations (e.g. `scf.yield`) that do carry SSA semantics, not structural containers. Loop structure is already represented via `subgraph cluster_loop_N` clusters.

gemini-code-assist · 2026-05-01T03:51:15Z

+def _escape_dot(text: str) -> str:
+    """Escape characters that are special inside DOT label strings."""
+    return text.replace("\\", "\\\\").replace('"', '\\"')


The _escape_dot function escapes backslashes, which transforms the \n newline sequences used in _node_label into \\n. This causes Graphviz to render a literal backslash and 'n' instead of a newline in the node labels.

Since DOT labels allow literal newlines and the \n sequence is specifically intended for Graphviz, you should avoid escaping backslashes here, or only escape them if they are not part of a known escape sequence.

Suggested change

def _escape_dot(text: str) -> str:

"""Escape characters that are special inside DOT label strings."""

return text.replace("\\", "\\\\").replace('"', '\\"')

def _escape_dot(text: str) -> str:

"""Escape characters that are special inside DOT label strings."""

return text.replace('"', '\\"')

Good catch — the backslash escaping indeed breaks the `\n` DOT newline sequences produced by `_node_label`. Fixed by removing the backslash replacement; only double-quote escaping is needed for our labels.

- Build SSA producer map incrementally during edge construction so expanded loops with reused variable names link to the most recent producer, not the global last writer. - Remove backslash escaping in _escape_dot that was breaking \n newline sequences in multi-line DOT node labels. Addresses review comments on PR #2. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…tion fix - Add GCS upload/download path formula contract test (reviewer item #2) - Add AC2 download trigger chain test: kubectl wait → gcloud cp → tar (item #3) - Fix tautological assertion in test_kernel_is_positional_before_all_flags (item #5) - Add shell script download dir convention test Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

sii-xinglong and others added 10 commits April 30, 2026 20:25

docs: add design doc for SSA dataflow analysis and Graphviz visualiza…

fc0f591

…tion Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

docs: add implementation plan for SSA dataflow analysis

cf847c7

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix: address spec review issues for DOT exporter

25de1ed

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat: add --dataflow-output CLI flag for DOT graph export

8d596cb

Wire the dataflow extraction and DOT exporter into the CLI so users can generate Graphviz dataflow graphs with a single flag. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix: add SSA attrs to STALL events and utf-8 encoding to DOT export

27da59e

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

gemini-code-assist Bot reviewed May 1, 2026

View reviewed changes

sii-xinglong merged commit 9809b48 into main May 1, 2026

sii-xinglong mentioned this pull request May 1, 2026

fix: address dataflow review — edge ordering, LOOP/IF nodes, DOT escape #3

Merged

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: SSA dataflow analysis with Graphviz DOT export#2

feat: SSA dataflow analysis with Graphviz DOT export#2
sii-xinglong merged 11 commits into
mainfrom
feat/dataflow-analysis

sii-xinglong commented May 1, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 1, 2026

Uh oh!

sii-xinglong May 1, 2026

Uh oh!

gemini-code-assist Bot May 1, 2026

Uh oh!

sii-xinglong May 1, 2026

Uh oh!

gemini-code-assist Bot May 1, 2026

Uh oh!

sii-xinglong May 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

sii-xinglong commented May 1, 2026

Summary

Design

Changes

Usage

Test plan

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 1, 2026

Choose a reason for hiding this comment

Uh oh!

sii-xinglong May 1, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 1, 2026

Choose a reason for hiding this comment

Uh oh!

sii-xinglong May 1, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 1, 2026

Choose a reason for hiding this comment

Uh oh!

sii-xinglong May 1, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant