-
Notifications
You must be signed in to change notification settings - Fork 70
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GAST model with more than 17 keypoints? #28
Comments
Hi~ |
Hi @fabro66 thanks for your quick answer. According to the H3.6 Team, the dataset includes following 32 keypoints:
Here are all important keypoints plotted (without duplicates): As the foot toe keypoints are included, I would like to add them to the 3D pose prediction using GAST-Net too. Which configurations do I need to make to let the model output the foot toe keypoints? Probably i need to train the model from scratch right? |
Hi~
|
Thanks a lot for your help @fabro66! When running inference, do i need to detect the 2D toe keypoints and feed it to the GAST-NET? Alphapose for example has a model with 26 keypoints including the toe keypoints which can be used. Providing the 2D toe keypoints would probably help the uplifting for achieving higher accuracy with predicting the toe keypoints, right? |
Hi~ |
Hi @fabro66 thanks for the clarification. I tried to train the a new model including the toe keypoints as you described (using
So here I changed the
Do you have any idea which adjustments are missing to train it with the 19 keypoints (17 COCO + 2 toe keypoints)? |
Hi~ 0:pevils, 1:right hip, 2:right knee, 3:right ankle, 4:right toe, 5:left hip, 6:left knee, 7:left ankle, 8:left toe, 9:spine You should add some codes to ./model/local_attention.py file:
Please let me know if this problem is solved |
Hi @fabro66 thank you so much for helping again! I added the codes as you suggested:
Unfortunately i still receive the same error as above:
However I checked the GT 2D keypoints of Human3.6M dataset again and i guess they provide 17 keypoints only. For example: |
You need to prepare 2D GT with 19 keypoints according to VideoPose3D data preprocessing tutorial. Then you change part of the code of the following two files respectively.
|
Hi, I also try to use AlphaPose as the 2D detector, but the skeleton style in COCO and Human3.6m is different, SPINE, THORAX, PELVIS and HEAD, this four keypoints is not provided by the AlphaPose, currently I use the rest of 13 keypoints both in training and inference. Do you have any ideas in fine-tuning the model so that the output of the 2D detectors is 17 keypoints with same skeleton style in Human3.6m? Deepest thanks for your reply! BTW, @sebo361 mentioned that you want to add two more keypoints, perhaps you can try to use 17 keypoints in 2D gt and 19 keypoints in 3D gt to train a model since the human3.6 gt only contains 17 keypoints. I tried to use 13 keypoints in 2D gt to train a 17 keypoints and the performance is also acceptable for me. :) |
Hi~ Thank you for your interest in our works! You don't need to retrain a new model with COCO style. Because we have provided a file to convert the skeleton type from COCO to Human3.6M. The output of the HRNet 2D pose detector we used is also the type of COCO, which is also converted in this way. The final 3D estimation accuracy is acceptable.
|
Thanks for your prompt reply! I have find this part of the codes, it is useful! Thanks again! In the VideoPose3D, they fine-tune the 2D skeleton based on the COCO pretrained model so that the output of the 2D detector is the human3.6m style. Have you tried this method? |
Hi @fabro66 thank you for your help! I trained the model with 19 keypoints as you described before and it worked out great! I receive nice training results similar to the ones described in the paper. However I am trying to run the new model with 19 kpts on a video and running into issues again. I am using the Alphapose model with 26 keypoints and reordered the keypoints to the order as you described: 0:pevils, 1:right hip, 2:right knee, 3:right ankle, 4:right toe, 5:left hip, 6:left knee, 7:left ankle, 8:left toe, 9:spine When loading the new GAST model with 19 kpts i receive following error: Is there something wrong with loading the model? Do i need to refactor the SpatioTemporalModel? |
Hi~ Sorry for no prompt reply. |
Are you using a multi-GPU to train GASTNet? If this is the case, you need to change the parameter name when loading the model.
|
Thnak you a lot @fabro66, but know there ist still a mismatch between my trained model and the GastNet architecture:
Do you have an idea which feature of 62 VS 72 is missing here? |
@sebo361 |
Hi @fabro66 wow thats fantastic news! When will you release the code approximately? |
@sebo361 We will release this part of codes next month. |
@fabro66 Fantastic looking forward to it! |
@sebo361 I use whole-body HRNet provided by mmpose to detect 2D keypoints (133 keypoints, including body, foot, hand, and facial keypoints). It is easy to install. |
@sebo361 I have updated GAST-Net for inferring 19-joints 3D human pose. Please check it. |
@fabro66 Awesome thank you so much, it's working great 🥳 |
Hi, thank you for your amayzing work!
Is there a chance to get more than 17 keypoints from the 3D model? I am especially interested in receiving 3D foot keypoints.
Thanks!
The text was updated successfully, but these errors were encountered: