Skip to content

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

confused about the vcoco dataset #60

Closed
ltttpku opened this issue Oct 4, 2022 · 3 comments
Closed

confused about the vcoco dataset #60

ltttpku opened this issue Oct 4, 2022 · 3 comments
Labels
moved to discussion question Awaiting further information

Comments

@ltttpku
Copy link

ltttpku commented Oct 4, 2022

There're some cool properties of VCOCO dataset you implemented:
"object_to_action" gives me the list of actions for each object, i.e. {1: [0, 3, 11, 15], 2: [0, 1, 2, 3, 11], ......}
"objects" return the list of objects, i.e. ['background', 'person', 'bicycle', .......]
"actions" return the list of actions, i.e. ['hold obj', 'sit instr', 'ride instr', .......]

However, I'm confused about the relationships among them:

  1. Which object does the key 1 of "1: [0, 3, 11, 15]", which is the first item of object_to_action, represent?
  2. Which action does the values [0, 3, 11, 15] of "1: [0, 3, 11, 15]" represent?

According to the List of actions and objects, Actions 0, 3, 11, 15 represent hold obj, look obj, carry obj, cut obj respectively while Object 1 represent person, which appears to be weird.

@fredzzhang
Copy link
Owner

Hi @ltttpku,

As the name suggests, object_to_action shows the correspondence between the objects and actions. So, the key 1 is the object index and the list [0, 3, 11, 15] is the valid actions for that object.

It does seem a bit weird to have person as the potential object for cut object. But this list of correspondence was generated from the training data, which means there must be training examples for that particular combination. I will need to take a look at the dataset later.

Fred.

@fredzzhang fredzzhang added the question Awaiting further information label Oct 5, 2022
@fredzzhang
Copy link
Owner

Hi @ltttpku,

I have checked the dataset, and can confirm that all those actions do exist.

In particular, the cutting obj and person pair can happen in a surgery scene or barber's shop. You can use the dataset navigator utility as provided in the repo to check these images. The image indices are 559, 2173, 2363 and 4416.

Fred.

@ltttpku
Copy link
Author

ltttpku commented Oct 10, 2022

That makes sense! Closing the issue :p

@ltttpku ltttpku closed this as completed Oct 10, 2022
Repository owner locked and limited conversation to collaborators Oct 10, 2022
@fredzzhang fredzzhang converted this issue into discussion #61 Oct 10, 2022

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

Labels
moved to discussion question Awaiting further information
Projects
None yet
Development

No branches or pull requests

2 participants