Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What's the meaning of modalities in MUJOCO PUSH dataset? #20

Open
mrbeann opened this issue May 26, 2022 · 2 comments
Open

What's the meaning of modalities in MUJOCO PUSH dataset? #20

mrbeann opened this issue May 26, 2022 · 2 comments

Comments

@mrbeann
Copy link
Contributor

mrbeann commented May 26, 2022

Hi, I recently tried the MUJOCO PUSH dataset, but I cannot figure out the concrete meaning of the modalities. The paper mentioned

The multimodal inputs are gray-scaled images (1 × 32 × 32) from an RGB camera, forces (and binary contact information) from a force/torque sensor, and the 3D position of the robot end-effector.

I found the modality in the dataset are "control", "image", "sensor", "pos". What are the correspondences between these modalities and the paper? (i.e. what's the meaning of these modalities?).

@arav-agarwal2
Copy link
Collaborator

Someone else can confirm, but here's how I think of things:
-> The "image" modality refers to the gray-scale images.
-> The "pos" modality refers to the 3d position of the end-effector.
-> The "sensor" refers to the forces/binary contact information.
-> The "control" refers to what the controller is sending the arm itself. ( This one I'm the least sure about ).

@mrbeann
Copy link
Contributor Author

mrbeann commented May 28, 2022

I agree with your ideas, but this does not seem to correspond to the paper? For example, Figure 8.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants