Dataset processing #10

maherr13 · 2022-04-10T23:53:30Z

I have a few questions regarding the dataset processing pipeline,

at generate_clips script, why the start index is 80 ??
why there are 13 records in each clip labeled idle in the train test split file ??
are there any parameters I would need to adjust when creating my own data set??

btw there is an error in 3_2_split_train_val_test.py that you naming the validation samples "val" while the model searches for "dev" labeled records.

The text was updated successfully, but these errors were encountered:

Garfield-Finch · 2022-04-11T04:26:57Z

Thank you for your interest in our work.

In our dataset, there is always a segment of introduction that we want to desert. We will change that part to be a parameter user can set. Thank you for that.
It is because we want to make sure that the training set and test set do not overlap.
As far as we are concerned, all the parameters that should be set by yourself have been indicated as "required=True" in the "argparser". It should be fine if you go by our default setting. If you find anything that is crucial but we did not notice, please follow up on this issue.

Thank you. That is an incompatibility between our naming rules and that of the original dataset from "Speech2Gesture"

maherr13 · 2022-04-11T11:50:40Z

Thank you for the illustration.

I have a question regarding the data collection, from your experiments How much data do I need to collect (in hours) to get good results in general, and as good as Oliver specifically?

ShenhanQian · 2022-04-11T12:08:46Z

Here are the lengths of our training sequences for your reference:

Subject	Length (hours)
Oliver	11
Kubinec	3
Luo	7
Xing	2

Besides the length, the variation and the quality of pose may also highly influence the results. So we suggest collecting videos with expressive gestures like Oliver's and visually checking if the detected keypoints are correct and stable.

maherr13 · 2022-04-11T12:49:10Z

Many Thanks. @ShenhanQian @Garfield-Finch

maherr13 closed this as completed Apr 11, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dataset processing #10

Dataset processing #10

maherr13 commented Apr 10, 2022

Garfield-Finch commented Apr 11, 2022 •

edited

maherr13 commented Apr 11, 2022

ShenhanQian commented Apr 11, 2022

maherr13 commented Apr 11, 2022

Dataset processing #10

Dataset processing #10

Comments

maherr13 commented Apr 10, 2022

Garfield-Finch commented Apr 11, 2022 • edited

maherr13 commented Apr 11, 2022

ShenhanQian commented Apr 11, 2022

maherr13 commented Apr 11, 2022

Garfield-Finch commented Apr 11, 2022 •

edited