Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question About Training Dataset #5

Closed
GrayNiwako opened this issue Nov 17, 2021 · 2 comments
Closed

Question About Training Dataset #5

GrayNiwako opened this issue Nov 17, 2021 · 2 comments

Comments

@GrayNiwako
Copy link

Thanks for your work! It’s very interesting!!
May I ask you some questions?
Did you manually annotate landmarks for the images generated by the TADNE model? And how many images does your training dataset include?

@hysts
Copy link
Owner

hysts commented Nov 17, 2021

Hi, @GrayNiwako

Sorry for the confusion. What I meant to say in the README was that the images in the README demo were generated by the TADNE model, not that the training data were also generated. The training images were collected from the internet, because I thought the variations of pose and appearance of generated images might be limited.

As for the number of landmark annotations, I manually annotated 300 images and split them into 200-100 for training and validation data, making sure to include enough pose and appearance variations in the validation data. The number of annotations is very small, but it still took me days to annotate.
About image selection, it is important to note that I didn't randomly choose images to annotate, but I manually selected relatively difficult images in terms of head pose, illumination, occlusion, etc., because it'd be a waste of time to annotate easy examples. Also, the annotation data were gradually increased in a human-in-the-loop cycle, where I annotated some images, trained models, and then manually checked the predictions to select the next images to annotate. I also tried semi-automatically selecting difficult images by checking consistency of predictions of multiple models and of rotated images.

FYI, the current model is still not good at dealing with occlusions and face with large deformations, etc., but it was good enough for what I wanted to do with this model, so I stopped there. I don't think the current number of training data is enough and I think it's easy to improve the model performance by increasing training data.
I do not plan to release the training data due to copyright issues, but I think pseudo-labels by the current model is good enough to achieve the same or possibly better performance than the current model.

@GrayNiwako
Copy link
Author

@hysts Thank you for your kind reply!!!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants