aruco emit detection3D by bogwi · Pull Request #2278 · dimensionalOS/dimos

bogwi · 2026-05-28T04:54:50Z

This PR builds upon #2242 and closes awaited #1654

Make marker detection emit vision_msgs/Detection3DArray so downstream detection consumers - filtering - debug overlays - tracking - spatial memory - and Rerun visualization can treat markers like other 3D detections.

Architecture

 LIVE (#1654)
  ==========
  Camera.color_image ──┐
  Camera.camera_info ──┼──► MarkerDetectionStreamModule.start()
  TF (world<-optical frame) ──┘  │  optical = camera_optical_frame_id(image, info)
                                 ▼
                      _append_image_with_pose
                      skip if: no CameraInfo | no TF
                      out: stream.append(Image, pose=7-tuple)  # no frame name in tuple
                                 │
                                 ▼
                      QualityWindow(img.sharpness, 0.5s)
                      -> sharpest Image / window
                                 │
                                 ▼
                      [optional] SpeedLimit(mps, dps)
                                 │
                                 ▼
                      DetectMarkers -> detect_markers_in_image
                      (camera_optical_frame_id + pose -> world_T_optical)
                      in:  Observation[Image] + pose + CameraInfo
                      out: 0..N Observation[Detection3DMarker]  (track_id=-1 unless smoothing)
                            OR sentinel None (empty frame)
                                 │
                                 ▼
                      MarkersPerFrame
                      in:  fan-out markers + tags
                      out: Observation[Detection3DArray]
                            (empty detections_length OK)
                                 │
                ┌────────────────┴────────────────┐
                ▼                                 ▼
      LCM Detection3DArray              MarkerTfModule
      (wire: bbox center/orient)        world->markers->marker_{id}
                │
                ▼
      rerun bridge: msg.to_rerun() -> rr.Boxes3D


  OFFLINE (PR #2242 + reused by live module)
  ==========================================
  SqliteStore.color_image (obs.pose from recording)
      -> QualityWindow -> SpeedLimit? -> DetectMarkers
      -> list[Observation[Detection3DMarker]]  (no MarkersPerFrame in map CLI)
      -> direct rr.log / PGO-corrected poses

codecov · 2026-05-28T05:02:44Z

Codecov Report

❌ Patch coverage is 93.35233% with 70 lines in your changes missing coverage. Please review.
✅ All tests successful. No failed tests found.

Files with missing lines	Patch %	Lines
dimos/perception/fiducial/marker_pose.py	71.83%	16 Missing and 4 partials ⚠️
...ception/fiducial/marker_detection_stream_module.py	78.48%	17 Missing ⚠️
dimos/perception/fiducial/marker_transformer.py	87.27%	6 Missing and 8 partials ⚠️
dimos/perception/fiducial/marker_detect.py	80.48%	4 Missing and 4 partials ⚠️
dimos/msgs/vision_msgs/Detection3DArray.py	77.41%	5 Missing and 2 partials ⚠️
...os/perception/detection/type/detection3d/marker.py	86.66%	1 Missing and 1 partial ⚠️
dimos/perception/fiducial/marker_tf_module.py	93.10%	0 Missing and 2 partials ⚠️

📢 Thoughts on this report? Let us know!

greptile-apps · 2026-05-28T05:03:54Z

Greptile Summary

This PR refactors live ArUco/AprilTag detection to emit Detection3DArray messages, enabling downstream consumers (tracking, filtering, Rerun, TF) to treat fiducial markers like any other 3D detection. Marker detection logic is extracted from MarkerTfModule into dedicated modules (marker_pose.py, marker_detect.py, marker_transformer.py, marker_detection_stream_module.py), and MarkerTfModule is narrowed to TF-only publication from an incoming Detection3DArray port.

MarkerDetectionStreamModule is a new StreamModule that gates images on CameraInfo + TF availability, quality-windows them, runs DetectMarkers, fans out per-marker observations through MarkersPerFrame, and publishes one Detection3DArray per source frame via LCMTransport.
MarkersPerFrame collapses per-marker fan-out into a single Detection3DArray per source frame using a count-based fast-path flush; Detection3DArray.to_rerun() lets the Rerun bridge log oriented boxes directly from the wire type.
MarkerTfModule is simplified to subscribe to detections: In[Detection3DArray] and mirror bbox.center poses into the TF tree, fully decoupled from camera and detection logic.

Confidence Score: 5/5

Safe to merge; the refactoring is well-structured and all call sites are consistently updated.

The core detection and streaming logic is correctly implemented. MarkersPerFrame flush semantics handle all expected frame orderings. The two findings are both speculative edge cases that do not affect the current deployed configurations.

No files require special attention; the two suggestions in marker_detection_stream_module.py and desk_marker_tf.py are minor hardening improvements.

Important Files Changed

Filename	Overview
dimos/perception/fiducial/marker_detection_stream_module.py	New live-pipeline StreamModule; gates on CameraInfo+TF, builds DetectMarkers+MarkersPerFrame pipeline. Minor: camera_info is snapshotted at pipeline creation rather than passed as a callable, so post-start config updates are silently ignored by DetectMarkers.
dimos/perception/fiducial/marker_transformer.py	DetectMarkers and MarkersPerFrame transformers; flush logic (count fast-path + timestamp fallback) is correct for sequential frame delivery.
dimos/perception/fiducial/marker_tf_module.py	Successfully narrowed to TF-only publication; subscribes to Detection3DArray and mirrors bbox.center poses. All detection/camera logic removed cleanly.
dimos/perception/fiducial/marker_detect.py	New stateless per-frame detection helper; correctly handles size-mismatch guard, fisheye/radtan branching, and builds Detection3DMarker list.
dimos/msgs/vision_msgs/Detection3DArray.py	Adds to_rerun() and _label_for_detection(); slices detections_length correctly and handles empty arrays safely.
dimos/perception/detection/type/detection3d/marker.py	Detection3DMarker dataclass; post_init sets name from marker_label; to_detection3d_msg correctly overrides class_id and id.
dimos/robot/unitree/go2/blueprints/smart/unitree_go2.py	unitree_go2_markers blueprint updated to wire MarkerDetectionStreamModule → MarkerTfModule with LCMTransport. All call sites updated correctly.
dimos/perception/fiducial/marker_pose.py	Extracted shared pose helpers; fisheye path undistorts before solvePnP correctly.
dimos/perception/detection/type/detection3d/imageDetections3D.py	New helper converting Detection3DBBox list to Detection3DArray ROS message; clean and straightforward.
dimos/perception/fiducial/blueprints/desk_marker_tf.py	Blueprint updated to include MarkerDetectionStreamModule. create_desk_camera_info() is called at module import time rather than lazily.

Sequence Diagram

sequenceDiagram
    participant Cam as CameraModule
    participant MDSM as MarkerDetectionStreamModule
    participant DM as DetectMarkers
    participant MPF as MarkersPerFrame
    participant LCM as LCMTransport
    participant MTF as MarkerTfModule
    participant RR as Rerun

    Cam->>MDSM: color_image (Image)
    MDSM->>MDSM: _append_image_with_pose (TF lookup, gate on CameraInfo)
    MDSM->>DM: Stream[Observation[Image]] with pose tuple
    DM->>DM: QualityWindow / SpeedLimit
    DM->>DM: detect_markers_in_image
    DM-->>MPF: Observation[Detection3DMarker] x N
    DM-->>MPF: None sentinel (empty frame)
    MPF->>MPF: flush on count or ts-change
    MPF->>LCM: Detection3DArray (one per frame)
    LCM-->>MTF: Detection3DArray (in-process)
    LCM-->>RR: Detection3DArray (via LCM wire)
    MTF->>MTF: "_process_detections -> tf.publish"
    RR->>RR: "msg.to_rerun() -> rr.Boxes3D"

_{Reviews (9): Last reviewed commit: "fix test pydantic None" | Re-trigger Greptile}

file)

leshy

super mega clean, thanks

Integrate origin/main (PR #2242 loop_closure rewrite, #2278 aruco Detection3D, #2316 docs/coding-agents rename, mem2 time windowing, etc.). Took origin's marker-free PGO/PoseGraph rewrite for loop_closure (eval, pgo, test_pgo, markers_rrd), marker_transformer, and the map CLI; kept branch map_rrd. Stripped remaining # ---- section markers from map_rrd.

bogwi requested review from arkluc, leshy, mustafab0, paul-nechifor and spomichter as code owners May 28, 2026 04:54

greptile-apps Bot reviewed May 28, 2026

View reviewed changes

Comment thread dimos/memory2/type/observation.py Outdated

Comment thread dimos/utils/cli/map.py Outdated

Comment thread dimos/utils/cli/map.py Outdated

Comment thread dimos/utils/cli/map.py

bogwi marked this pull request as draft May 28, 2026 05:10

bogwi changed the title ~~feat/aruco emit detection3 d~~ aruco emit detection3 d May 28, 2026

bogwi force-pushed the danvi/feat/aruco-emit-detection3D branch from f8d4f3c to 9846741 Compare May 28, 2026 06:02

bogwi marked this pull request as ready for review May 28, 2026 06:03

bogwi marked this pull request as draft May 28, 2026 06:09

bogwi force-pushed the danvi/feat/aruco-emit-detection3D branch from 7d38835 to da3c529 Compare May 28, 2026 06:17

bogwi marked this pull request as ready for review May 28, 2026 06:48

bogwi marked this pull request as draft May 28, 2026 06:49

bogwi changed the title ~~aruco emit detection3 d~~ aruco emit detection3D May 28, 2026

bogwi marked this pull request as ready for review May 28, 2026 10:54

leshy added the PlzReview label May 29, 2026

bogwi added 11 commits May 29, 2026 14:49

Extract shared fiducial helpers into marker_pose.py

4f01581

Harden Detection3DMarker (extend PR #2242)

df48c5e

Refactor DetectMarkers into a shared core + thin transformer (PR #2242

7a052b3

file)

make no track_id as marker identity

14f78f9

Add ImageDetections3D (bbox markers)

aa1c362

Live detector: memory2 pipeline + StreamModule

e2ac28d

Make TF consume detections

8bc235a

Composite deploy, blueprints, wire rerun

58f7b39

File layout target - refactor

e59ef08

File layout target - refactor 2

6c8080d

remove redundant deploy() foos from fiducial

f7026a0

bogwi added 3 commits May 29, 2026 15:06

fix: mypy; .remove rdd from .gitignore

0038072

fix: CI fails, tests 3, 4

c135419

fix: pre-commit

1830e75

bogwi force-pushed the danvi/feat/aruco-emit-detection3D branch from dd9b101 to 1830e75 Compare May 29, 2026 06:23

bogwi added 2 commits May 29, 2026 18:09

remove camera_info_source config parameter

f7ab00a

fix: mypy

4e112db

leshy previously approved these changes May 29, 2026

View reviewed changes

leshy enabled auto-merge (squash) May 29, 2026 09:29

fix test pydantic None

3e0ea52

bogwi dismissed leshy’s stale review via 3e0ea52 May 29, 2026 09:33

leshy approved these changes May 29, 2026

View reviewed changes

leshy merged commit db8ac9f into main May 29, 2026
30 of 35 checks passed

leshy deleted the danvi/feat/aruco-emit-detection3D branch May 29, 2026 10:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

aruco emit detection3D#2278

aruco emit detection3D#2278
leshy merged 17 commits into
mainfrom
danvi/feat/aruco-emit-detection3D

bogwi commented May 28, 2026 •

edited

Loading

Uh oh!

codecov Bot commented May 28, 2026 •

edited

Loading

Uh oh!

greptile-apps Bot commented May 28, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

leshy left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

bogwi commented May 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

This PR builds upon #2242 and closes awaited #1654

Uh oh!

codecov Bot commented May 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

greptile-apps Bot commented May 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Sequence Diagram

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

leshy left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

bogwi commented May 28, 2026 •

edited

Loading

codecov Bot commented May 28, 2026 •

edited

Loading

greptile-apps Bot commented May 28, 2026 •

edited

Loading