empty dataset and video&description to .npy&csv&json #4

Closed
Tikquuss opened this issue Nov 1, 2020 · 7 comments

Comments

@Tikquuss

Tikquuss commented Nov 1, 2020

When I execute scripts/train.sh yc2 mart, I get this error.
[Screenshots of the error: mart_err2, mart_err]

len(train_dataset) returns 0, even though I followed all the steps before the training stage and they completed successfully.
How can I solve this problem, please? @jayleicn
I am working on Google Colab.

@jayleicn
Owner

jayleicn commented Nov 4, 2020

Maybe the script was not configured with the correct path to the feature directory?

@Tikquuss
Author

Tikquuss commented Nov 5, 2020

I haven't changed much in the existing configuration, except for setting dur_file="/content/mart/densevid_eval/yc2_data/yc2_duration_frame.csv" in train.sh.
Here is my little notebook.
Thank you for your attention.

@jayleicn
Owner

jayleicn commented Nov 5, 2020

Can you try setting the video feature path in the script to your own feature path, and verify that the features actually exist there?
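
A quick way to run that check is to count the feature files under the configured directory. This is only a minimal sketch, not part of the repository: `count_feature_files` and the example path are hypothetical, and it assumes the features are stored as `.npy` files, as mentioned later in this thread.

```python
from pathlib import Path


def count_feature_files(feature_dir, suffix=".npy"):
    """Count files ending in `suffix` under `feature_dir`, searching recursively.

    Returns 0 when the directory does not exist, which is itself a hint
    that the configured path is wrong.
    """
    root = Path(feature_dir)
    if not root.is_dir():
        return 0
    return sum(1 for p in root.rglob(f"*{suffix}") if p.is_file())


if __name__ == "__main__":
    # Hypothetical path: replace with the feature directory set in your script.
    feature_dir = "/path/to/video_features"
    n = count_feature_files(feature_dir)
    print(f"{n} feature files found under {feature_dir}")
```

If the count is 0, a dataset built from that directory would plausibly end up empty as well, which would match the len(train_dataset) == 0 symptom.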

@Tikquuss
Author

Tikquuss commented Nov 5, 2020

Thank you very much.

One last question please:

  • I have my own dataset of videos and their descriptions. How do I convert these videos to .npy features, along with everything that goes with them (duration_frame, json)?
  • I also don't understand the structure of the yc2_duration_frame.csv file. What do the three columns represent?

It would be really useful for users to know how to go from their own videos (and descriptions) to a dataset that this repo can use. I am building an application that takes a video and returns its description.

If I manage to build a clear and simple pipeline, I will publish it.

Tikquuss changed the title from "empty dataset" to "empty dataset and video&description to .npy&csv&json" on Nov 5, 2020
@jayleicn
Owner

jayleicn commented Nov 5, 2020

Thanks @Tikquuss. It would be great if you could help set up a simple pipeline for preparing features!

Hope this helps!
Jie

@Tikquuss
Author

Tikquuss commented Nov 7, 2020

I think this repo can do the job.
Another thing I don't understand is the structure of the json files (val_yc2.json for example).

@jayleicn
Owner

jayleicn commented Nov 7, 2020

{
  "v_xHr8X2Wpmno": {
    "duration": 206.86,
    "timestamps": [
      [47, 60],
      [67, 89],
      [91, 98],
      ...
    ],
    "sentences": [
      "pick the ends off the verdalago",
      "combine lemon juice sumac garlic salt and oil in a bowl",
      "chop lettuce and place it in a bowl",
      ...
    ]
  },
  ...
}

This is the first entry from the file. v_xHr8X2Wpmno is the video name, timestamps lists the segments of the video as [start, end] pairs in seconds, and sentences holds the caption for each segment, in the same order. For example, the sentence "pick the ends off the verdalago" describes seconds 47 to 60 of the video v_xHr8X2Wpmno.
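
To make the layout concrete, here is a small sketch that pairs each segment with its caption and checks that the two lists have the same length. The helper name `iter_segments` is hypothetical (it is not from the repo), and the inline sample contains only the three segments shown above, not the full entry.

```python
import json


def iter_segments(annotations):
    """Yield (video_name, start_sec, end_sec, sentence) for every segment."""
    for video_name, entry in annotations.items():
        timestamps = entry["timestamps"]
        sentences = entry["sentences"]
        # Each timestamp pair should have exactly one caption.
        if len(timestamps) != len(sentences):
            raise ValueError(
                f"{video_name}: {len(timestamps)} timestamps "
                f"but {len(sentences)} sentences"
            )
        for (start, end), sentence in zip(timestamps, sentences):
            yield video_name, start, end, sentence


# Sample limited to the segments quoted in this thread.
sample = json.loads("""
{
  "v_xHr8X2Wpmno": {
    "duration": 206.86,
    "timestamps": [[47, 60], [67, 89], [91, 98]],
    "sentences": [
      "pick the ends off the verdalago",
      "combine lemon juice sumac garlic salt and oil in a bowl",
      "chop lettuce and place it in a bowl"
    ]
  }
}
""")

for name, start, end, sentence in iter_segments(sample):
    print(f"{name} [{start}-{end}s]: {sentence}")
```

For a custom dataset, one option would be to build a dictionary of the same shape (video name mapped to duration, timestamps, and sentences) and write it out with `json.dump`. Whether the repo requires additional fields is not covered in this thread.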

@jayleicn jayleicn closed this as completed Feb 8, 2021