-
Notifications
You must be signed in to change notification settings - Fork 694
dev agents agent_encode
leshy edited this page Mar 24, 2026
·
9 revisions
It's easy to encode something somehow, but we need strict evals to validate agent understanding. Two pronged approach here
- agent just understands the data that was sent to it.
- agent accesses an easy to inspect object (PointCloud2) and then uses python repl to investigate it
-
Good example
- good example of a natural agent encode would be audio transcription
-
How can we encode flat map for an agent to understand?
- someone worked on this for maps (without success but good convo to understand our eval considerations https://github.com/dimensionalOS/dimos/pull/822)
- how many rooms there are?
- where is the smallest room?
- take a hallway then turn left
- go to second room on your right
- approaches for this: VLM understanding costmap image seems difficult (see the PR above) but maybe numpy array agent can investigate with tools? some goofy ascii encoding? json? idk
-
How can we encode a pointcloud for an agent to understand?
- egocentric probably very important.
- how good is agent at using py repl to investigate a PointCloud2?