2026.04.15- #67 - #70

@changh95

ZED X Nano

Boxer: Robust Lifting of Open-World 2D Bounding Boxes to 3D

  • Pipeline: open-vocabulary 2D detector (OWLv2) -> BoxerNet lifts each 2D detection to a 3D oriented bounding box -> temporal fusion
  • Requires camera intrinsics and the gravity direction
  • Optional input: semi-dense depth
  • Temporal fusion is done with the Hungarian algorithm
  • https://facebookresearch.github.io/boxer/
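
A minimal sketch of how Hungarian matching can associate per-frame 3D boxes with existing tracks during temporal fusion. The cost used here (Euclidean distance between box centers), the `max_dist` gate, and the function names `hungarian` and `associate` are illustrative assumptions, not details taken from the Boxer project.

```python
import math


def hungarian(cost):
    """Minimum-cost assignment for a cost matrix with rows <= columns.

    Returns (row, col) pairs. Standard O(n^2 * m) potentials formulation.
    """
    n, m = len(cost), len(cost[0])
    inf = float("inf")
    u = [0.0] * (n + 1)      # row potentials
    v = [0.0] * (m + 1)      # column potentials
    p = [0] * (m + 1)        # p[j] = 1-based row matched to column j (0 = none)
    way = [0] * (m + 1)      # back-pointers along the augmenting path
    for i in range(1, n + 1):
        p[0] = i
        j0 = 0
        minv = [inf] * (m + 1)
        used = [False] * (m + 1)
        while True:
            used[j0] = True
            i0, delta, j1 = p[j0], inf, 0
            for j in range(1, m + 1):
                if not used[j]:
                    cur = cost[i0 - 1][j - 1] - u[i0] - v[j]
                    if cur < minv[j]:
                        minv[j], way[j] = cur, j0
                    if minv[j] < delta:
                        delta, j1 = minv[j], j
            for j in range(m + 1):
                if used[j]:
                    u[p[j]] += delta
                    v[j] -= delta
                else:
                    minv[j] -= delta
            j0 = j1
            if p[j0] == 0:
                break
        while j0:                      # flip matches along the path
            j1 = way[j0]
            p[j0] = p[j1]
            j0 = j1
    return [(p[j] - 1, j - 1) for j in range(1, m + 1) if p[j] != 0]


def associate(track_centers, det_centers, max_dist=0.5):
    """Match tracks to detections; drop pairs farther apart than max_dist.

    The center-distance cost and the gate value are assumptions.
    """
    if not track_centers or not det_centers:
        return []
    cost = [[math.dist(t, d) for d in det_centers] for t in track_centers]
    if len(cost) <= len(cost[0]):
        pairs = hungarian(cost)
    else:  # more tracks than detections: solve the transposed problem
        transposed = [list(col) for col in zip(*cost)]
        pairs = [(t, d) for d, t in hungarian(transposed)]
    return sorted((t, d) for t, d in pairs if cost[t][d] <= max_dist)


tracks = [(0.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
detections = [(2.1, 0.0, 0.0), (0.1, 0.0, 0.0), (5.0, 5.0, 5.0)]
matches = associate(tracks, detections)
print(matches)
```

Detections left unmatched (index 2 above) would typically start new tracks, and a 3D IoU cost could replace the center distance where box overlap is more informative.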

SLAM and VIO in Egocentric Data: Where Long-Horizon Tracking Breaks

Rust robotics

EUPE - Efficient Universal Perception Encoder

Google Gemini ER1.6

  • https://deepmind.google/blog/gemini-robotics-er-1-6/
  • A step toward agentic physical AI
  • An improvement over Gemini 3.0 Flash and Gemini ER1.6
  • This model specializes in reasoning capabilities critical for robotics, including visual and spatial understanding, task planning and success detection. It acts as the high-level reasoning model for a robot, capable of executing tasks by natively calling tools like Google Search to find information, vision-language-action models (VLAs) or any other third-party user-defined functions.
